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SOCIOLOGY IN AMERICA: THE DISCIPLINE AND THE PUBLIC* 
_ AMERICAN SOCIOLOGICAL ASSOCIATION, =~ 
1988 PRESIDENTIAL ADDRESS 


HERBERT J. GANS 


When I first began to think about the 
presidential address, I planned to choose one 
of the research areas in which I’ve worked all 
of my professional life. I considered a paper 
on Sociology and the City, urban sociology 
currently being in an exciting intellectual 
transition, and also one on Poverty and 
Inequality, a topic about which sociologists 
have far more to contribute than they now do. 
I would also have liked to discuss Sociology 
and the Mass Media, an ever more significant 
field which still has not received the attention 
and respect’ from the discipline that it 
deserves. 

Instead of writing a paper that might have 
been relevant to only some colleagues, 
however, I chose a topic in which all of us are 
or should be interested, the discipline.! More 
particularly, I want to discuss our relations 
with America’s nonsociologists, the lay 
public: both the very large general public and 
the smaller well-educated one which does 
much of the country’s professional-level 
analytic and creative work. Since the lay 
public includes the country’s entire popula- 
tion, less the approximately 20,000 sociolo- 
gists, my topic is also an intrinsic part of 
Sociology in America. 

Although I shall concentrate on what we 


* Direct all correspondence to Herbert J. Gans, 
Department of Sociology, Columbia University, 
New York, NY 10027. 

I am grateful to many colleagues who made 
helpful comments on the version of this essay 
presented in Atlanta, and to those at the Graduate 
Center of the City University of New York, the 
State University of New York at Albany, and 
Fordham University for allowing me to try out 
early versions.of it on them. My thanks also to 
Anna Karpathakis for library research assistance 
and to Allan Silver for convincing me to use an 
allusion to Alexis de Tocqueville's classic work for 
the title of this paper and the theme of the 1988 
Annual Meeting. 

'[ had, however, made presentations about 
where I thought sociology was going to seminars at 
Columbia University in 1980 and 1985, each time 
before large enough audiences to suggest that there 
was considerable interest in the topic. 


still need to do to serve the lay public and the 
institutions in which it is involved, in many 
respects we are doing better than we have in 
the past. Sociology has established a presence 
in many kinds of policy analysis and is 
moving into large numbers of other so-called 
practice areas, even if our ideas continue to 
be largely absent from the country's political 
thinking. As best I can tell from energetic but 
unsystematic observation, the news media 
pay more attention to us than before, and 
some journalists now want sociological angles 
on feature stories they are covering. Slowly 
but surely they are also becoming interested 
in sociological research. We even show up as 
sympathetic characters in occasional popular 
novels and films, although we continue to 
play villains and fools in high culture. I have 
the impression that the majority of the literary 
community still believes tbat only it can 
analyze society. 

When one talks with publishers of general, 
nonacademic books as well as with editors 
and writers for so-called serious magazines 
and with foundation heads, the picture also 
remains discouraging. Too many people still 
dislike sociology or, worse stil, are not 
interested in it. To be sure, often they react to 
caricatures of sociology, but the very fact that 
they are not motivated to go beyond carica- 
tures is itself depressing. In effect, we play a 
smaller part in the country's intellectual life 
than we should. 

Many sociologists find nothing wrong with 
this state of affairs. For them, sociology is a 
social science with emphasis on the science, 
and reaching out to, or obtaining the attention 
of, the lay public is irrelevant. Others hold a 
stronger version of this point of view; being 
in touch with the laity, except when necessary 
for earning a living, impedes the progress of 
scientific research. Colleagues who feel most 
strongly speak of vulgarizing sociology or 
pandering to the uninformed. 

I believe that these feelings are mistaken. 
Maintaining some relationship with the Amer- 
ican public is part of our responsibility as 
members of society and as recipients of its 
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funds, public or private, whether as tuition 
payments, salaries, grants, or contacts. More- 
over, when members of the lay public feel 
that our work is useful or enlightening or 
both, they have an incentive to give us their 
cultural and political support if we need it— 
when issues like student interest in sociology, 
the allocation of research funds, and freedom 
of research are at stake. The rest of the essay 
will show that paying more attention to lay 
America can be done without pandering. 

This essay has three major parts. The first 
describes some of the research needed to 
analyze sociology’s roles in America, for 
without it we cannot fully understand how we 
can best reach out to the lay public. The 
second part discusses some ways in which we 
can now improve our relations with the 
public. In the last part of the paper I focus on 
sociology itself, offering some ideas on what 
we can do better for ourselves even as we do 
better by the public.? 

Before I start I must define the term “we.” 
I use it broadly, referring to “we the 
discipline” and “we the collectivity,” know- 
ing all the while that the discipline is highly 
diverse while the collectivity is far from a 
functioning sociopolitical entity. “We” is 
therefore mainly a shorthand about how 
numbers of us act or how we should all act, 
but I must apologize to the practitioners that 
my “we” is mostly the academic discipline 
and collectivity, they being what I know best. 


STUDYING SOCIOLOGY IN AMERICA 


My initial topic is researching Sociology in 
America. At one level, I see the topic as a set 
of studies in the sociology of knowledge that 
‘tries to understand where we are coming from 
and going and how we are tied to the main 
structures and hierarchies of American soci- 
ety. In the process, we should identify our 
employers, sponsors, funders, supporters, 
and allies, as well as our clients or constitu- 
- ents—and our possible victims. In short, we 
must understand whose sides we have been 
on, purposely or accidentally (Becker 1967). 

At another level, Sociology in America is 
evaluative, the application of our analytic 





2 Some of what follows was also said by 
presenters at the Atlanta thematic and special 
sessions, but I wrote this essay before reading their 
presentations. 
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tools and our values to understand and assess 
what we are doing for and to the country, as 
well as to all the sectors on which we might 
impinge, from underdogs to top dogs, for 
instance. We need to know whom we help 
and whom we injure and damage, intention- 
ally and unintentionally, so that we can figure 
out what we should be doing and not doing in 
behalf of a better society, however "better" 
may be defined. 

"Sociology in America" is a good title for 
an ASA annual meeting theme, but the topic 
could also be called sociology and society, in 
part to emphasize that it must be cross- 
national and cross-cultural as well (Kohn 
1987). A first priority is conceptualizing the 
basic subject, and many alternatives are 
possible. One can begin by looking for and at 
sociology's contributions, identifying activi- 
ties and institutions in which sociologists 
have participated directly or in which their 
work has been used indirectly. A major 
problem with looking at contributions is that 
we tend to forget the negative ones and the 
ones we fail to make, but this problem can be 
corrected. 

A slightly different approach would be to 
ask what roles sociology has played and is 
playing, adding the evaluative element by 
also asking how well these roles were played, 
and which should be played in the future. 
Some roles are self-evident, but the concept 
allows us to wonder whether, for example, 
we somehow also represent particular interest 
groups, or falling, not to mention rising, 
classes. Or are we mainly one of a set of 
academics whose role it is to add a touch of 
cultural polish and a smidgen of social 
conscience to the socialization of young 
Americans able and willing to go to college? 
Yet how do we fit into the scheme of things 
when we play what I think of as the Martian 
role, distancing ourselves and going to Erving 
Goffman’s backstage—or back of it—to 
report on how society or some of its 
constituent parts operate. 

My own thinking takes me in the direction 
of effects concepts, because what matters 
most is not what we have done but how our 
work has affected others. Somewhat the same 
outcome as a study of effects can be achieved 
by the use of functional analysis, for 
functions are operationalized as conse- 
quences—as long as we always inquire into 
functions and dysfunctions of what for whom, 
and assume the possibility that some of our 
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activities are functional mainly for ourselves. 
Alternatively, one can look at sociology's 
benefits and costs—if these are not treated 
solely as quantitative concepts. We must also 
remember that researchers will not always 
agree on what is beneficial and costly, and 
that the determination of benefits and costs 
must reflect the views of all those who 
actually win and lose. Moreover, we must 
never exclude the possibility that our work 
has neither significant benefits nor costs— nor 
major independent effects. We are, after all, 
only 20,000 in a country of 230 million. 

I am aware of all the methodological 
difficulties of studying effects, functions, and 
benefits and costs, but we must discover what 
impact we have had. Furthermore, any 
properly sociological effects study has to 
examine the agents and processes that have 
shaped sociology to achieve whatever impacts 
it is having. Thus, a study of sociology's 
impact on America must be preceded by 
research on America's impacts on sociology 
(Gouldner 1970; Vidich and Lyman 1985). 
However, if we analyze the roles we have 
played, we must likewise ask who helped us 
play these roles and how we were invited or 
shoeborned into them. 

Needless to say, there are other conceptual 
schemes for looking at sociology in America, 
but whatever the schemes, the questions I 
have raised also have to be answered 
historically. In fact, it may be strategic to 
begin with historical analyses because the 
historical view can give us a better fix on the 
primary theoretical and empirical issues on 
which we must concentrate in order to 
understand the present. 

Although the teaching of sociology has still 
not obtained enough respect from the disci- 
pline, the fact remains that virtually all 
academic sociologists, including those at the 
most elite research universities, earn their 
living by teaching. Consequently, one of the 
first and most important questions to be 
researched concerns the effects, and thus also 
the effectiveness, of our teaching. 

ASA estimates that 75 percent of Ameri- 
ca's sociologists —or 15,000—are still aca- 
demics. If each teaches four courses a year, 
and many unfortunately teach many more, 
that comes to 60,000 courses a year, and of 
these the most frequently taught continue to 
be introductory, marriage and the family, and 
social problems. Although studies have been 
made of the major texts used in these courses, 
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we ought to start finding out what is actually 
being taught in them: not only what kinds of 
sociology, but what descriptions of and 
prescriptions for American society. For exam- 
ple, a multicampus sample of marriage and 
the family courses could be analyzed to 
identify what models of marriage and the 
family sociologists teach, and what postures 
they encourage students to take toward them, 
explicitly or implicitly. To what extent do we 
teach conformity to the culturally dominant 
models, and if we suggest the desirability of 
sociopolitical change, what new or old 
models do we have in mind? 

After that, we ought to begin on the more 
urgent but also more complex task of looking 
at what students learn from these basic 
courses, for their own lives and their citizen 
roles, to see if we can establish findings about 
the effects of their exposure to sociology. 
Since sociology has begun to drift down to 
the high schools, similar research can be done 
there. Schools not being the only teaching 
institutions in America, however, someone 
should also take a look to see whether 
sociology has yet had any visible impact on 
the country’s news and entertainment media. 

Parallel kinds of research can be under- 
taken among sociological practitioners. In- 
deed, now is an ideal time to begin, for 
before-and-after studies, should immediately 
be conducted at some of the many public 
agencies and private companies that are first 
hiring sociologists, so that we can learn what 
early effects they are having. Now that 
sociologists are being employed in market 
research, for example, it would be useful to 
look at a sample of firms to discover what, if 
anything, the sociologists do differently —and 
with what effects—from the previous market 
researchers who have generally been MBA’s 
and psychologists. Do sociological market 
researchers have more empathy for the 
subjects of market research than had their 
predecessors, and what effects does this have 
on their work, the resulting firm policies, and 
the profits? Or are sociologists in big 
organizations more likely to practice what 
their organizations prescribe rather than what 
their discipline has trained them to practice? 
Incidentally, an interesting study of academic 
practitioners, the increasing number of sociol- 
ogists who become deans and provosts of 
their universities, could be done to see what, 
if anything, they do differently because they 
are sociologists. 


The effects studies of the greatest urgency 
are those with potential public policy signifi- 
cance. I will limit myself to two examples. 
One is the roles and functions sociology has 
played in past culture-of-poverty research and 
is now playing in the study of what is 
currently called the underclass. We could 
begin, for example, with the effects the most 
widely-read new sociological book of the last 
two years, William J. Wilson’s The Truly 
Disadvantaged (1987), has had for the public 
understanding of the underclass, and for the 
policies needed to bring it into the country’s 
mainstream. As sociological underclass re- 
search proliferates, however, we must also 
look at what we may be doing against the 
people now assigned to that class. 

The term underclass was first used in 
recent times by Gunnar Myrdal (1963) as an 
economic concept for describing a set of 
people being driven to the edges or out of the 
economy. While most current underclass 
research seems to be in the hands of 
economists, they have generally adopted a 
different definition, perhaps of journalistic 
origin, in which the members of that class are 
also associated with a variety of criminal, 
pathological, or stigmatized activities and are 
generally black or Hispanic. 

No laws prevent us from studying the 
impact of economists alongside of, or in 
comparison to, our own, and many questions 
deserve answering. Do studies using the 
underclass concept call attention to people 
who need economic and other kinds of aid? 
Or are researchers primarily giving scientific 
legitimization to the latest buzzword for the 
undeserving poor and concurrently helping to 
disseminate a new code word for the covert 
expression of racial hostility? More generally, 
what role do researchers play in the emer- 
gence of a new public stereotype, and how 
: can they prevent a social science generaliza- 
tion or an ideal type from being interpreted as 
a stereotype? 

To the extent that underclass studies are 
seen and used by social workers and other 
street-level bureaucrats as well as policy- 
makers, we have to ask whether these studies 
mainly help the people of the underclass or 
help government to control them? Once 
again, what sides are we on, intentionally and 
unintentionally, as we study this newest 
"hot" topic? Perhaps the biggest problem 
: stems from unintentional "putdowns" of poor 
people, because of either lack of researcher 
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reflexivity or the use of data from agencies 
that exist in part to be punitive toward the 
poor. 

I have the impression that sociologists 
doing research among underclass people are 
more likely to be on their side while the 
economists tend to treat them as a dangerous 
class. Even so, sociologists and economists 
play only a small causal part in the tragic 
relationship between the underclass and the 
rest of America. Indeed, the current research 
is itself an effect of public appetites for 
information, scapegoats, and, of course, 
solutions. These appetites have themselves 
emerged for such reasons as the increasing 
fear of crime—and of dark-skinned Ameri- 
cans— the rise of homelessness, the economic 
insecurity created in many parts of the 
population by the Reagan economy, and the 
relentless pressures by the Reagan administra- 
tion on people who cannot afford the values 
of mainstream cultures. 

My second example might serve as a model 
not only of what we have done well.as 
sociological researchers but also of the ways 
in which sociology can be useful, and 
relatively easily. I think here of the large set 
of findings which indicate on the one hand 
that informal groups and related social 
supports have both illness-preventing and 
healing functions, and on the other hand that 
isolation and loneliness as well as alienation 
produced by hostile or distant formal institu- 
tions can breed and worsen physical and 
mental illness. The basic idea goes back to 
19th-century sociology, but since World War 
II many researchers have shown how the 
presence or absence of kin, friends, neigh- 
bors, and other informal groups and net- 
works affect health (Litwak and Messeri 
1988). 

For the study of sociology in America, and 
for the making of health policy, we must 
examine whether and how such findings are, 
or could be, providing competition for purely 
medical models of health and illness. In 
addition, we need to know whether and how 
these findings are leading to changes in 
medical activity, from physician practice to 
national health policy. Conversely, we must 
also study why changes did not take place, so 
that we can try to understand how they could 
take place. Since informal groups should cost 
less than doctors and hospitals, social sup- 
ports would help reduce medical costs and 
might be welcomed for that reason alone— 
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unless hospitals and doctors decide to tum 
them into a medical specialty, and charge 
accordingly. 

Whether the study of Sociology in America 
involves basic, applied, or policy-oriented 
research, we will, in effect, be studying 
ourselves. I need not list the dangers of a 
disciplinary-wide self-study, and in a utopian 
world, another social science would study us 
while we study yet a third. However, in this 
world, we have to do the needed studies and 
we have to learn how to deal with the likely 
conflicts of interest. 

An essential ingredient for self-study is the 
right mixture of deliberate and systematic 
reflexivity and an equally deliberate and 
systematic distancing. Appeals for more 
reflexivity without structural underpinnings 
and instrumental incentives being the material 
of sermons, I am reluctant to go further 
except to hope with Alvin Gouldner that what 
I have in mind here does not become “just 
another topic for panels at professional 
conventions and not just another little stream 
of technical reports” (Gouldner 1970, p. 
489). 

Consequently, as relevant studies are under- 
taken, we have to begin.to think about what 
we will do with the results. Even before we 
know more about our contributions, roles, 
and effects, we must debate how to increase 
sociology’s positive effects and cut back the 
negative ones. We ought also to confront 
once more an old, recently forgotten question: 
what is a good society and how can sociology 
help bring it about? 

I have no illusions about how much we can 
agree on the nature of the good society or how 
much we can do to bring one about, but the 
discussion of these questions will have 
beneficial results for the discipline itself. The 
very innocence of the notion of the good 
Society may be a useful antidote for our too 
frequent tendency toward excessive abstrac- 
tion. Moreover, asking fundamental general 
questions, even the kind that cannot be 
answered easily or completely, forces us to 
address issues of widespread interest in 
America and is, in addition, a way of 
reaching out to the general public. 


SOCIOLOGY AND THE LAY PUBLIC 


The second part of my paper is about 
improving relations with the public and its 
institutions. I begin again on an empirical 
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note, because at least two further topics badly 
need study if we are going to act intelligently 
to improve our relations with both the large 
general and the smaller well-educated public. 

One study seeks to identify lay sociology, 
the generalizations about society and its parts 
that all people— we included — start learning 
as children, long before knowing of the 
existence of professional sociology. True, lay 
people do not label their knowledge about 
Society sociology, but nonetheless it consists 
of ideas and data in all of the fields we study. 
Much lay sociology is learned during the 
process of socialization, yet more is discov- 
ered through the applied participant- 
observation we all do constantly in everyday 
social life, and some comes from nonprofes- 
sional, or so-called pop, sociology: research 
done by nonprofessional sociologists who use 
some of our methods but few of our concepts 
and theories. 

For my purpose, the significant questions 
center around what happens when people's 
lay sociology comes into contact with our 
professional sociology. We have to discover 
what impacts we have on lay sociology, and 
whether and how we add to and change it. 
Perhaps even more to the point, we have to 
find out if and why we are ignored or 
rejected. When the generalizations of lay and 
professional sociology diverge, we generally 
seek to replace the lay kind, and our students 
may fail to learn because they are not 
persuaded that our sociology is more valid 
than theirs. I wonder, for example, what 
happens when working-class and poor stu- 
dents, whose lay sociologies are particularly 
rich in the fields of class and inequality, take 
a course in social stratification which sees 
society solely from a middle-class perspec- 
tive. Although we assume that professional 
sociology is always better than the lay 
version, that assumption also deserves some 
inquiry. 

The other study strikes at the heart of our 
relations with the educated public because we 
need to know in detail how our sociology is 
judged by that public. If, when, and where 
our standing is not as good as it should be, we 
have to identify the reasons and causes. In 
addition, we have to find out what the 
members of this public want from sociology, 
ours and theirs. There is clearly a great 
demand for applied organizational research, 
for the management literature is full of pop 
Sociology on this topic, much of it so poor 
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that every six months yet another new 
analysis becomes a brief best-seller. 
In their nonoccupational reading, however, 
many members of the educated public seem to 
specialize in literary and historical works, 
which is one reason why just about all of the 
important magazines and publishing houses 
catering to this public continue to be run by 
people from literary and historical back- 
grounds. Why the reading public is so fond of 
history and why it ignores—and perhaps 
dislikes—some or much sociology is a 
research topic of fundamental importance, for 
until we have a comprehensive answer our 
work will not get much attention from the 
journals of cultural and political opinion, the 
large circulation “class” magazines such as 
Time, The New Yorker, or The Atlantic, and 
“trade book” publishers who publish nontech- 
nical books in the social sciences. 
Despite the need for these researches, 
many suggestions can be made now for how 
to improve our relations with the lay public, 
but I will limit myself to five I consider 
particularly significant. 
First, I assume the lay public— general and 
educated— will pay more attention to profes- 
sional sociology if and when our research 
addresses salient subjects and issues. Many of 
these center around the family, the economy, 
and health— subjects about which we have 
something to say that can help people's 
understanding, if we can present our ideas 
and findings in plain English. Other lay 
concerns touch on or are set off by current 
events, and we should figure out how we can 
do more studies on significant topics of the 
moment. Many years ago Gladys and Kurt 
Lang proposed "firehouse research" for such 
',Studies, and their proposal is as timely as 
ever. We can also supply useful comments on 
topical issues, especially as debunkers and 
correctors when the early journalistic reports 
and nonprofessional sociology are wrong. In 

_ addition, we can report on trends underlying 
topical subjects and can often provide more 
systematic explanations of events and trends 
than do journalists and pop sociologists. 

An already existing lay interest in our 
sociology’ has to do with the diversity of 
American life. Because of that diversity, 
some members of the lay public want to know 
how other Americans cope with common 
problems such as familial and community 
ones, as well as how they interpret, or 
substitute for, the conventional rules and 
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norms of American life. It is no coincidence 
that the best-known sociological works of the 
last 75 years— Middletown, The Lonely 
Crowd, and Habits of the Heart—respond to 
one or another of these lay inquiries. 

These studies also exhibit what I consider 
one of sociology’s distinctive qualities: they 
are based on research among ordinary Amer- 
icans. While other social sciences concentrate 
on elite decision-makers, exotic subcultures, 
or laboratory subjects, sociology has always 
done much of its work with and among 
typical Americans. This is one reason why 
professional sociology, when properly pre- 
sented, appeals to the lay public. That appeal 
is widened when we use the research methods 
that seem most attractive to this public: the 
depth-interview, in which people have a 
chance to talk and to explain themselves fully; 
and fieldwork, in which sociologists are on 
the scene to hear them on a continuing basis, 
and inside the social structures in which they 
act and interact. 

The ideal study format may be the 
community study, not because I have done a 
few but because it is broad; it allows 
researchers the opportunity to report on a 
variety of people across a wide range of 
institutions and situations. If the communities 
and people studied are reasonably representa- 
tive or thoughtfully chosen deviant cases, the 
sampling is done properly, and the research is 
focused on significant theoretical and substan- 
tive questions, this is the best way to look at 
America, for both the discipline and the lay 
public (Keller 1988). 

Community studies are hard work; they can 
take a long time and, like many qualitative 
studies, do not fit the currently dominant 
definition of science. As a result, funding 
agencies have not been supportive—a serious 
mistake that helps to explain why sociology is 
not as much in the public eye as it should be. 

The second of my five suggestions is a 
corollary of the first that undergraduate 
sociology courses should concentrate, when- 
ever possible, on sociological analyses of 
American institutions and society rather than 
on sociological principles illustrated with 
samples from America. There is nothing like 
an overly concept-filled introductory course 
to turn many students against sociology 
forever. Courses that teach sociology through 
an analysis of American society also require 
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research on topical issues and current events.? 
Unfortunately, even reading a first-class 
newspaper or weekly news magazine with a 
sociological eye is not normally part of the 
graduate school training program. If we 
carried out more analyses of topical issues 
and current events, sociology could make 
more original contributions to understanding 
both: If any sociologists now prepare such 
analyses for their classes, we should find a 
place where the best of them can be published 
for the rest of us. 

My third proposal is that we must recruit 
and encourage talented sociologists who are 
able and eager to report their work so that it is 
salient to both their colleagues and the 
educated lay public. Borrowing Russell Ja- 
coby’s concept of public intellectuals (Jacoby 
1987), they might be called public sociolo- 
gists, and the public sociologist par excel- 
lence that comes at once to mind is David 
Riesman. Public sociologists are not popular- 
izers; they are empirical researchers, analysts, 
or theorists like the rest of us, although often 
their work is particularly thoughtful, imagina- 
tive or original in some respect.^ 

Public sociologists have three further dis- 
tinctive traits. One is their ability to discuss 
even sociological concepts and theories in the 
English of the college-educated reader, prob- 
ably because they enjoy writing as well as 
doing research and may even think of 
themselves as writers. Their second trait is the 
breadth of their sociological interests, which 
covers much of society even if their research 
is restricted to a few fields. That breadth also 
extends to their conception of sociology, 
which extends beyond research reporting to 
commentary and in many cases also to social 


criticism. To put it another way, their work is. 


intellectual as well as scientific.5 A third, not 





? Some time ago I received a blurb for an annual 
review of sociology text for undergraduates and 


discovered that the vast majority of contributions 


were not written by sociologists. 

4 I distinguish public sociologists from visible 
scientists. (Goodell 1977) because the visible 
scientists she describes earned their visibility not 
only as scientists but also as popularizers and as 
commentators on social issues far outside their 
scientific fields. 

5 Jacoby's hopes notwithstanding, public sociol- 
ogists also have to be academics or practitioners, 


there currently being no free-lance writing market ` 


to provide a living for even one sociologist. 
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unrelated, trait is the ability to avoid the 
pitfalls of undue professionalism described by 
earlier ASA presidents (for example, Hughes 
1963, p. 890; Lee 1976, pp. 927-29). 

I do not know how one recruits fledgling or 
mature public sociologists, but I fear that too 
many young people with an interest in society 
get Ph.D.'s in English, literature, or history. 
Consequently, sociology must encourage those 
it does attract, beginning in graduate school. 
It also has to assure them that they can be 
both sociologists and writers and will not be 
discriminated against for this combination of 
Skills. For example, they must be rewarded 
for being writers, and their major sociological 


.writing in nonscholarly publications must be 
‘treated as equivalent to scholarly writing in 


promotion and tenure decisions. We should 
also-find outlets for their writing inside 
sociology so we do not lose all their work to 
other publications. 

I have been around aang enough to 
remember when David Riesman was not 
considered a sociologist in many parts of the 
discipline, although even today some col- 
leagues who hold fervently to a natural 
science conception of sociology reject public 
sociologists. Worse yet, they may dismiss 
them as “journalists,” a term that we should 
never use as a pejorative for yet other reasons 
I will come to shortly. I am told that John 
Kenneth Galbraith, the dean of public econo- 
mists, has never been accepted as an 
economist by many of his colleagues, but 
then economics is a backward social science 
in other respects. 

The fourth suggestion for adding to our 
impact on the lay public requires revitalizing 
an old mode of public sociology: social 
criticism. I oversimplify only slightly to point 
out that American sociology began in part as 
social criticism, and while a handful of 
sociologists have continued this tradition, 
today’s American social criticism is almost 
entirely in the hands of journalists, essayists, 
literary critics, and philosophers. Europe is 
quite different in this respect, because many 
European sociologists and researchers double 
as newspaper or magazine columnists, writing 
regularly the kind of social commentary 
found here in journals of opinion and cultural 
criticism. 

We are not Europeans and we should not 
even imitate America’s current social critics. 
Our task is sociological social criticism. 
Journalistic and humanistic critics too often 


8 


view social ills by what makes them person- 
ally unhappy, and they may also misunder- 


stand the causes of these ills or offer solutions . 


that reflect the values of a single group—be it 
intellectual elite or working class. Partly as a 
result, conventional criticism is frequently 
nostalgic or apocalyptic, with good old days 
being mourned right and left and many 
institutions thought to be in permanent rapid 
decline — headed almost always by the family. 

The sociological social critic can do much 
better! The identification of social ills ought 
to be based both on empirical data about what 
the public or several parts of it feel to be 
wrong, and on the critic'S own concerns. 


Proposed solutions can likewise transcend the | 


perspective of the critic'S own immediate 
circle, and they should draw on systematic 
causal analyses of the problems to be solved. 

Social criticism is not for every sociologist, 
but it should become part of the discipline just 
as social policy research became a part of it in 
the last 20 years, once we were able to move 
beyond the primitive conceptions of value- 
free sociology on which the early disapproval 
of social policy research was based. Sociolog- 
ical social criticism will never grow as large 
as social policy research, however, because it 
cannot, and should not, become a government 
function. 

My fifth and last proposal is particularly 
focused on the general public. Since its major 
contact with professional sociology comes 
from the mass media, we should try to get 
more of the sociological perspective and our 
own studies into these media. Reaching the 
general public requires popularizers, sociolo- 
gists and others who can turn the ideas and 
findings reported in our journals and books 
that should be of general interest into 
everyday English.$ 

Concurrently, we should encourage the 
journalists who also popularize our work: the 
small number of free-lancers who do it from 
time to time, as well as the handful who have 
regular social or behavioral science beats. We 


i: Actually, a number of sociologists are already 
working with ASA's Public Information Commit- 
tee and ASA staff to write popular articles from 
papers in various sociological journals. Now we 
need to find ways to get their work into the media, 
which also requires learning what kinds and 
subjects of sociology will appeal to the general 
public, and the editors who supply their newspa- 
pers and magazines. 
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should assist journalistic popularizers as much 
as we can, for good popularization will 
increase public interest in sociology. At the 
same time, we may be able to head off some 
inaccurate or sensationalized popularization. 

In addition, we should help nonprofes- 
sional researchers who undertake pop sociol- 
ogy, which I described earlier as research 
based on the concepts and ideas of lay 
sociology. We can be particularly helpful 
with advice on methods. After all, the rules of 
sampling, question construction, field work, 
and statistical analysis apply equally to 
professional and pop sociology. True, nonpro- 
fessional sociologists often cannot apply these 
rules as rigorously as we do, for the lay public 
is not interested in professional subtleties and 
qualifications, whether in sociology or in 
physics. Still, our common interests in good 
methodology can make us useful as long as 
we understand and are tolerant about the 
differences between their sociology and ours. 

Good nonprofessional sociology is useful 
to us for the same reasons as good populariza- 
tion. We have a special interest in reducing 
bad pop sociology, however, because its low 
quality can reflect on us directly and quickly 
since the general public may not distinguish 
between professional and nonprofessional 
sociology. 

Professional sociologists should keep an 
eye on pop sociology, if only because it has a 
much larger audience than we do. They 
should also distinguish between good and bad 
pop sociology, but unfortunately too many of 
our colleagues look down on all of it, as they 
do on popularizers of our work. This stance 
can only hurt the discipline, for when some of 
us appear distant and superior, we may turn 
off members of the lay public otherwise ready 
to pay attention to our work. Worse yet, 
wholesale rejection of sociologies other than 
ours may end up by biting the public hand 
that feeds us. 

An ideal solution, allowing us to have our 
cake while eating it, is an ASA-run or 
supervised magazine of high-quality popular- 
ized and pop sociology, but that solution is 
unrealistic since the current lay constituency 
for sociology is too small to support such a 
magazine. Sociology may be inherently less 
newsworthy than, for example, psychology or 
economics, since both give advice about 
everyday life of a kind that we cannot 
supply—or anthropology and psychiatry, 
which can tell more dramatic stories than we. 
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Thus, a Sociology Today modeled on the 


monthly Psychology Today is not in the 
cards.” 


Today’s most significant disseminators of 
our sociology to the general public are 
magazine and newspaper journalists who 
incorporate our work in their stories, occasion- 
ally because they judge a sociological study to 
be newsworthy but increasingly often because 
they want sociological commentary on and in 
their stories. In these cases they may look for 
appropriate sociological findings, a sociolog- 
ical perspective to increase the quality of their 
story, or a quote to provide the story with 
some sociological legitimation. 

These journalists are a crucial resource for 
us, a veritable disciplinary treasure, and they 
should be given our full and immediate 
cooperation (Gans 1988). That we are being 
called more and more often by reporters, 
feature writers, and their equivalents in 
television can only be viewed as a compli- 
ment. I hope it is also a sign that the old days, 
when sociology was good only for a cranky 
feature exaggerating our shortcomings, are 
coming to an end.? Besides, the more we help 
journalists with their stories, the more inter- 
ested they may become in reporting our 
studies. 

We can be helpful further by eliminating 
the mindless attacks on journalism that are 
still heard in the discipline. At one level they 
reflect a disciplinary stereotype that all 
journalism is superficial, but at another level 
they may express unhappiness with the 
competition journalists provide us in the study 
of society. While journalism is often superfi- 
cial, sociology would be superficial equally 
often if it had to report to a diverse and often 
poorly educated lay audience; if it had a two- 
to six-hour deadline for data collection, 
analysis, and writing; and if the research 


? Psychology Today, which was founded as, and 
is once again, a commercially owned magazine, 
was for some years published by the American 
Psychological Association, which lost several 
million dollars in the process and proved that even 
a giant social science organization is not necessar- 
ily commercially adept. 

3 Such features, which criticize us for the use of 
jargon, too many numbers, irrelevance, academic 
restatements of the obvious, as well as for triviality 
and excessive seriousness, still appear from time to 
time, and we should make sure that we do not act 
according to this now-aging caricature. ` 
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report had to be condensed into a few hundred 
words. Journalism has other faults too, but we 


-must learn to distinguish between good and 


bad journalism. Indeed, we should not refrain 
from criticizing bad news stories about our 
work and ideas, as long as we make clear to 
the journalists involved how and why their 
work was inadequate. Conversely, we have 
every right to expect that journalists will learn 
to distinguish between good and bad sociol- 
ogy, to give up their stereotypes of us, and to 
stop thinking of the term sociological as a 
pejorative. 

I end this section of the paper with a 
modest proposal: that the abstracts of our 
journal articles and the summaries of our 


' academic books be written in nontechnical 


English.? Journalists may then become inter- 
ested in our work instead of becoming 
discouraged at the very outset, and while they 
wil probably still have troubles with the 
technical writing in the body of the text, they 
may be motivated to get in touch with the 
author for help in clarifying his or her work. 
They may also wean us away from writing so 
many of our article and book texts in 
*Sociologese." 


SOCIOLOGY FOR AMERICA 


The third of my three topics is the discipline 
itself and what we can do to help as well as 
improve ourselves. I again limit myself to the 
academic side, mainly at the research univer- 
sity level I know best. I will not systemiati- 
cally evaluate that side of the discipline, 
however, and I cannot even go into some 
specific problems that badly need discussing: 
for example, the ways we still often mistreat 
graduate students and part-time instructors, 
which is in part a reflection of long-standing 
inequalities within the discipline. These. 
inequalities are currently worsened by the 
ever-expanding star system and the treatment 
of some colleagues as celebrities. 

Here I want principally to outline what we 
need to do for and in the discipline in order to 
obtain a better. reception from the public, 
particularly the educated one. This goal. 
requires attention to the intellectual level of 


9 Moreover, article abstracts should not be 
repetitions of the first and sometimes the last 


. paragraphs of the article but should supply readers 


with a summary of the article's findings. 
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our work and the imperfections that intellec- 
tual observers and other members of the 
educated public see in that work. I will limit 
myself to two such imperfections. Both can 
also be found in the other social sciences, 
which means that their causes transcend our 
own discipline. However, the imperfections 
of the other social sciences do not excuse 
ours—and besides, we should be the first to 
overcome them, thus leading the way for the 
others. 

The initial imperfection is mindlessness, 
research that is poorly thought through. 
Mindlessness cuts across fields and methods. 
It is the use of proxies or indicators because 
tangentially appropriate quantitative data are 


accessible, even though these proxies have i 


only the most tenuous logical or empirical 
connection to the phenomena under study. 
Mindlessness is grounding the analysis of a 
complicated phenomenon on survey questions 
without any idea of how respondents under- 
stood the questions. Mindless fieldwork 
supplies thick descriptions of what is already 
common knowledge but fails to provide the 
thick analyses that are sorely needed. And 
whatever the research method, there are still 
occasional sociological analyses that, once 
translated into ordinary English, turn out to be 
examples of what we have often been accused 
of: restating the obvious. 

Another kind of mindlessness sacrifices 
substantive validity to a favored analytic 
technique of the moment. That kind of 
mindlessness is part and parcel of our passion 
for methodology, which is actually long- 
standing. Jules Poincaré, who was writing at 
the turn of the century, even then described 
sociology as "the science with the most 
methods and the fewest discoveries.” !? Otis 
Dudley Duncan, whose theme I am here 
repeating, has put it more pointedly: 


Writing on "methodology" cultivated for its 
own sake produces a bifurcation of scientific 
effort that is stultifying. You have on the one 
hand inept researchers who think they have no 
responsibility for the methods they use because 
they can cite the authority of some "methodol- 
ogist" and on the other hand “methodologists” 
whose advice is no good because they do not 


10 I am indebted to Otis Dudley Duncan for this 
quotation. Robert K. Merton has planned to 
include it, with its source, in a collection of social 
science quotation which he and David Sills are 
editing. 
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actually know how to do research (otherwise, 
we must suppose, they would have done some). 
(Duncan 1974, p. 2) 


The second imperfection, also of long 
standing, is what 1 think of as overqualifica- 
tion. I have no quarrel with statistical or 
mathematical analyses per se; they have 
advantages and disadvantages just as the 
various qualitative methods do. However, 
overquantification takes place when the re- 
search problem calls for qualitative analyses 
but quantitative ones are used instead, or 
when the use of such analyses changes the 
research problem. Overquantification occurs 
when elegant statistical analyses are per- 
formed on.sloppily collected data, or on data 
forever made unclean by the covert or overt 
agendas of the collectors. And it takes place 
when quantitative analysis is not preceded — 
or driven—by concept and theory formula- 
tion, when researchers are literally merely 
crunching numbers. Needless to say, equiva- 
lent sins happen on the qualitative side. There 
may be no phrase for qualitative data 
crunching, but it occurs, and fieldwork alone 
is inappropriate when the research problem 
calls mainly for frequency distributions. 

Some unfortunate effects of overquantifica- 
tion result from its ideological character. One 
is the inability of overquantifiers to tolerate 
disagreement, and their resulting stigmatiza- 
tion of and discrimination against qualitative 
research. Perhaps as a result, some advocates 
of qualitative method have also become 
ideologists. Consequently, a scientific disci- 
pline, in which research problems ought to 
determine the methods, and in which many 
problems are best solved by the use of both 
types of methods, is locked into an ideologi- 
cal dispute over a dubious typology — which 
is, moreover, actually about the nature of 
sociology. 

A related effect of overquantification is the 
time and energy academic departments, 
individuals, and the discipline as a whole 
waste in endless battling over the two types of 
methods. Robin Williams was recently quoted 
as calling this a sham battle, adding rightly 
that "energy should be better utilized in 
applying whatever techniques seem to pro- 
duce reliable knowledge" (Hirschman 1987, 
p. 5). However, bv now the crucial battle is 
less over ideas than over "scarce resources 
. . . Jobs, research funds, editorial policies of 
our journals, professional recognition and 
prestige," as Mirra Komarovsky has pointed 
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out (Komarovsky 1987, p. 562). Such battles 
are not sham, and were they to end now, the 
superior resource position of quantitative 
sociology would become permanent. Those of 
us who believe in the virtues of qualitative 
empirical and other sociological work have to 
continue the struggle for equality of re- 
sources. I consider it scandalous, for exam- 
ple, when funding agencies with public 
mandates or tax exemptions nonetheless base 
their grant policies on the power balance 
inside disciplines. 

A final effect of overquantification that 
needs mentioning is its tropism toward 
secondary analysis, which makes it possible 
for sociologists to study society for their 
entire lives without ever leaving their offices 
to talk or listen to the people they study.! 
The reliance on secondary analysis also 
makes us increasingly dependent on officially 
produced data. Worse yet, the resulting 
impersonalization of research is thought to 
make sociology more scientific, whereas in 
fact intensive interviewing and fieldwork are 
generally more scientific because the research- 
ers get to know closely the people and social 
structures they are studying. 

The problems I have described are familiar 
and have been discussed in previous presiden- 
tial addresses (e.g., Coser 1975). Thus, 
nothing is gained by further elaboration. 
What would be useful, however, is more 
sociological research into why sociology and 
the other social sciences have been develop- 
ing what I see as imperfections. If I were 
doing the study, I would want to look 
particularly at three sets of current academic 
arrangements. 

The first of these arrangements might be 
called scholarly insulation and a correlative 
lack of reality checks, which can disconnect 
our work from what is generally referred to as 
the real world. Unlike practitioners, our 
research does not need to be accountable to 
nonsociological kinds of validity, so that, for 
example, we are not open to and thus do not 
receive corrections from the people we have 
studied. We are accountable to funders to 
some extent, but many tend to base their 
judgments on peer reviews and, whatever 


!! David Riesman has pointed out that some 
survey researchers draft their interview questions, 
have others obtain the answers and then analyze 
the data and never leave their offices either 
(personal communication). 
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their other virtues, peer reviewers can be as 
insulated from the nonsociological world as 
other researchers. 

The absence of reality checks, which is 
also one cause of intellectual mindlessness, 
could actually be remedied somewhat by 
instituting such checks as part of our 
empirical procedure at the start and just 
before the end of our research. Basically such 
checks would involve informal . reconnais- 
sances, through the use of informants, 
informal interviewing, and fieldwork, among 
the people or institutions under study, as well 
as the application of independent statistical 
data, already available or newly collected 
from a small sample. Even theoretical papers 
and quantitative secondary analyses can be 
improved by reality checking. 

Another kind of reality checking would 
identify thoughtful nonsociologists to critique 
our work and identify errors of omission and 
commission. Where possible, these must also 
include the people we study. Reality checks 
seem to me to be at least as important as 
literature reviews, and we will be well served 
if we can make them intrinsic parts of our 
research procedures. 

The other two causes of imperfection are 
less easy to remedy. One is scientism, the 
modeling of sociological (and social science) 
research methods on a highly idealized 
version of the methods of the natural 
sciences. Although this modeling began even 
before sociology first became systematically 
empirical, it continues today when we know 
full well, in part from research in the 
sociology of science, that natural scientists do 
not operate according to the idealized concep- 
tion of their method. Indeed, the ideal is 
humanly unworkable; nevertheless we cannot 
let go of it. We also know that social 
structures are not molecules and cannot be 
studied like them, but we cannot seem to let 
go of that analogy either. Nor have we yet 
learned to appreciate Donald McCloskey’s 
lesson that “scientific work is rhetorical” and 
that it is so “even in its stylistic appeal to a 
rhetoric of not having a rhetoric” (McCloskey 
1985, p. 98). 

Idealized natural science is a kind of civil 
religion in modern America, and there may be 
a quasi-religious element both in the ideal and 
the consensus behind it. The ideal also 
continues to justify the search for sociological 
“laws” —the nomothetic approach to sociol- 
ogy—but that search may express the latent 
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hope for power—in an ideal society in which 
these laws—and their formulators—would 
play a central decision-making role. Such a 
society is as millenarian as those of the major 
religions: when salvation has been achieved, 
the Messiah has come, Mohammed has 
returned, or the State has withered away. 

The search for sociological laws is, further- 
more, sufficiently abstract to be "above" 
cultural or political conflicts of the moment. 
It is perhaps no accident that nomothetic 
sociology —like overquantified work—is usu- 
ally noncontroversial, and unlikely to produce 
criticism of economic, political, and cultural 
power holders who are behaving in undemo- 
cratic or unjust ways. 

The third and last cause of imperfection in 
sociology I will discuss concerns that strange 
institution in which academics work and in 
which all academic scholarship is therefore 
embedded. Although we are paid for the 
number of courses we teach, we are promoted 
by how much we publish, and only some- 
times by the quality of our publications as 
well. In effect, our strange institution oper- 
ates like a machine shop in which publica- 
tions are treated like piecework. And like 
employees in any other kind of machine shop 
basically concerned with amount of productiv- 
ity, we may overspecialize to study one part 
of the “social machine.” 

Moreover, again like workers in other 
machine shops, we are periodically greeted 
by new technology oriented to improving 
productivity, most recently of course the 
computer. The virtues of the computer for 
both quantitative and qualitative sociology far 
outnumber the vices, but there are some 
downsides too. Despite its potentials for 
high-quality research, the computer facilitates 
the speedier and thus greater production of 
piecework. It further encourages secondary 
analysis and the use of official, rather than 
self-generated, data. Although creative re- 
searchers can make creative use of the 
computer, the new technology even reduces 
the need to think and analyze once the right 
computer program has been found. Like 
many other industries, we too are becoming 
less labor-intensive. 

These patterns are also symptoms of the 
continuing bureaucratization of research and, 
as often happens, the new technology is 
merely handmaiden to the socioeconomic 
process. In fact the computer nicely fits the 
academic shop routine, for it enables academ- 
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‘ics to do their research during the interstices 


of a full teaching load, and to publish more 
work at a faster clip.1? The computer adds 
further to the impersonalization of research, 
and thus fits in with the worship of the 
idealized natural science method. What comes 
out of the computer is therefore automatically 
judged to be scientific, and insufficient 
attention is frequently paid to what human 
beings put into it. 

The imperfections I have described not 
only stand in the way of a better sociology but 
also damage our discipline and its reputation. 
The public, general and educated, cannot 
understand, or even see the justification for, 
much of what we produce, since in too many 
cases our work appears to have no benefit, 
direct or indirect, for people’s understanding 
of society or for their lives. The reactions of 
the lay public must not determine social 
science policy or shape our research, but they 
cannot be ignored either. Meanwhile, the 
leaders as well as the foot soldiers of today's 
dominant sociologies ought to remember that 
a good deal of the intellectual standing and 
good will our discipline has developed comes 
from the work of public sociologists. They — 
and books like The Lonely Crowd and Habits 
of the Heart—essentially persuade much of 
the lay public and its politicians that sociol- 
ogy ought to be cared about and funded!!? 


OUR SOCIOLOGICAL IDENTITY 


To conclude my highly selective analysis of 
the discipline and to end this essay, I want to 
raise the issue of our identity as sociologists 
in an era of ever-greater specialization of 
fields and subfields within the discipline.!4 


12 Perhaps the current crisis in university library 
finances, brought about in part by the ever- 
increasing number of journals which charge 
ever-increasing subscription rates, will eventually 
put : 8 damper on the publish-or-perish syndrome. 

? Their reasons for supporting sociology could 
Shrink if cultural anthropologists who can no 
longer do fieldwork overseas and who learn to cut 
back on their appetite for exotic U.S. subcultures 
replaces us even further in doing American 
community studies. 

4 Again, I must omit the practice side of 
sociology, but the discipline's most serious 
long-term identity problem is our continuing to 
conceive of identity in academic research terms, as 
I do here. Thus, we neglect the fact that many 
practitioners may have little reason to identify with 
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That identity should concern all of us, to 
further our own well-being and to help us 
make our case for the desirability of sociolog- 
ical knowledge to the lay public. 

Sociology is once again no different from 
the other social sciences, although the degree 
of specialization may be greater than it is 
among our peers because we are the residual 
social science. We are more diverse to begin 
with than economics (even though it is now 
branching out beyond the economy) and 
political science, which is basically still 
concerned with politics only in government. 

In any case, it is worth looking into the 
benefits and costs of further sociological 
specialization. Among the major benefits are 
the intellectual vitality usually found in new 
fields and subfields, as well as the intensive 
personal contact among researchers as long as 
these fields remain small. Indeed, because of 
the vitality that accompanies work in the new 
fields or at the frontiers of research, we ought 
also to be moving deliberately across the 
accepted or imagined boundaries of sociol- 
ogy, and in two ways.!5 

For one thing, we should look more closely 
at other social science disciplines to see what 
we can learn from them as well as to discover 
how we can improve on their work, jointly or 
by ourselves. To mention just a few: social 
history, the study of symbols and symbol 
systems which we share with anthropology, 
and empirical research in and of economic 
institutions can all gain from such a look.!6 
Disciplinary boundaries in the social sciences 
are arbitrary anyway, and they should be 
crossed freely, preferably for substantive, not 
imperialistic, reasons. We should act simi- 
larly toward boundaries beyond the social 
Sciences and take a greater interest in the 
humanities. Among other things, the study of 
the interrelations between culture and social 


an academic discipline, especially if and when they 
are pushed or pulled by industry/agency- and 
job-specific demands for their loyalty. 

15 Both of these boundary-crossing themes were 
considered by the 1988 Program Committee and 
translated into a number of Special Sessions at the 
Atlanta meeting. 

16 The intellectual vacuum created by the 
economists' emphasis on econometrics and model- 
building could and should be filled in part by more 
ethnographic and other institutional studies by 
sociologists of the giant, and the small but 
innovative, firms that currently play a significant 
role in the American economy. 
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structure can benefit from the concepts and 
ideas of literary scholars. These can put some 
of our concepts and ideas to use as well in 
their work on literature and society—for 
instance, what we have learned about the 
roles of audiences in the production of 
culture. i 

Increasing specialization inside sociology 
also carries costs, however. For one thing, the 
more sociologists specialize in particular 
fields, the more are some likely to limit 
themselves to really tiny specialties within the 
discipline as a whole. Moreover, when new 
fields and subfields develop, they quickly 
breed their own technical languages. 


The end result is that [the discipline] looks like a 
wheel. People sit on their own spokes and talk 
less and less to those on the other side. 
Eventually the wheel may become a doughnut, 
with a huge intellectual hole in the middle. 
(Winkler 1986, p. 7) 


The person I quote is geographer Sam 
Hilliard talking about his own discipline, but 
his comment is starting to apply to sociology 
as well, and the challenge is to prevent both 
the wheel and the intellectual hole. The hole 
cannot, however, be filled by pining for one 
approach or theory that will reintegrate 
sociology, for such reintegration is neither 
likely nor desirable in a pluralistic discipline. 

Instead we should ask ourselves what can 
or should bring us together as sociologists. 
One approach may be to identify intellectual 
cores that are common to many of us. These 
can be concepts, frames, theories, methods, 
or other intellectual forms and qualities that 
we continue to share. A related approach is to 
look empirically at some major old and new 
fields and subfields and determine what ideas, 
concepts, and theories are operationally 
similar in the significant research and theoriz- 
ing in them, even if the terminology is 
different.17 Such a project might even in- 
crease the sharing of terms and reduce the 
excessive number of terms in the discipline. 
The more we emphasize elements of sociol- 
ogy that we share in annual meeting sessions, 
other conferences, and various kinds of 
publications, the more we will discover to 


17 Harriet Zuckerman has suggested, in a 
personal communication, that some sociologists’ 
practice of changing fields and the migration of 
problems and approaches from one field to another 
may act as countertendencies to fragmentation. 
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what extent we can remain a single discipline. 
Even my previously mentioned question, 
“What is a good society?" can perform this 
function. Imagine a medical sociologist, an 

. ethnomethodologist, a specialist in gender 
and sex roles, a market researcher, and a 
mathematical sociologist, all with roughly the 
same values, being asked to come up with a 
single answer to this question! 

ldentity is social as well as intellectual; 
consequently, we should also look at social 
mechanisms that can contribute to being and 
feeling a part of a single discipline. ASA does 
what it can along these lines, but only a bare 
majority of all sociologists belong, and much 
too small a number of them are involved in 
ASA as other than receivers of its services. 
Also, the organization still relies excessively 
for its agenda and leadership on academics 
from the major research universities to be 
fully representative. The Annual Meetings 
bring about 3,000 of us together for a hectic 
few days, although the sessions themselves 
increasingly are vehicles for specialization. In 
1988, for example, 43 percent of the regular 
sessions were run by sections, and many other 
regular sessions were on subjects for which 
there are sections. 

Publications could bring us together as 
‘well, but I wonder if they do since our 
journals tend to appeal largely to specialists, 
whatever the editor's hopes. For example, 
ASR, being the flagship journal, is supposed 
to represent the best in sociology. However, 
for this reader and I imagine many others, it is 
also a journal of lengthy research reports on 
specialized topics, only some of which are of 
general interest. In addition, ASR is domi- 
nated by often elegant quantitative research. 
In fact, some have suggested that ASR is 
actually a methodological journal for quanti- 
tative sociologists not able or willing to work 
through the yet more technical articles in 
Sociological Methodology. There are excep- 
tions in ASR’s emphasis, to be sure, and 
recent editors have published more exceptions 
than past ones. On the whole, however, most 
major articles continue to be research reports 
of roughly the same format and from basically 
one kind of sociology. Sometimes one gets 
the impression that ASR is “run” by its 
contributors, the editors functioning primarily 
as quality controllers and traffic cops even if 
they might personally prefer to publish a 
different journal. Like the organizations we 
study, ASR has become institutionalized. 
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CS may be the most general of the journals, 
for it reviews a large proportion of all the 
books sociologists publish. While the reviews 
are classified by sets of specialties, CS 
readers can get a kind of overview of 
sociology by reading all of the reviews. 
Conversely, anecdotal evidence suggests that 
many of ASR’s readers scan the abstracts, 
read an article or two, and leave it at that. 
Over the years many have reported in the 
discipline’s grapevine that they have diffi- 
culty understanding or getting involved in 
many of the articles, and there are regular 
complaints, some published (Wilner 1985), 
that ASR almost never deals with any of the 
severe problems or controversial issues abun- 
dant in American society.!? 

None of these observations are intended as 
criticisms of present or past editors of ASR, 
for they work harder and longer at less 
celebrated tasks than any other active ASA 
members. Furthermore, I do not think ASR 
should be anything else than what it is now: a 
journal of research papers, although it should 
publish more reports of qualitative research 
and theoretical as well as historical papers.!? 

Instead of making basic changes in ASR, 
we need another sociological journal that 
publishes what ASR cannot: articles of general 
interest to sociologists. Although such a 
journal should be published for sociologists 
and not the lay public and should be of high 
intellectual quality, it must not be a technical 
journal. This should also add to its appeal and 
help make it profitable for an academic 
publisher. We would not even be pioneers in 
establishing such a journal, for in 1987 the 
American Economic Association began to 
publish The Journal of Economic Perspec- 
tives, which described itself in its first issue 
as “a scholarly economics journal for the 
general audience of economists” (Stiglitz, 
Shapiro, and Taylor 1987, p. 3). 

The editors of this new journal would have 


15 Despite the high reliability and validity of the 
sociological grapevine, my evidence is anecdotal, 
and we badly need sophisticated readership studies 
of the discipline’s major journals. 

19 In fact, the number of historical papers in 
ASR is now rising and one way to begin to assure 
the publication of qualitative research reports and 
theoretical papers is to submit them in large 
enough numbers and at such high levels of quality 
that ASR cannot want to do other than to publish 
them. 
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to use their intuition, experience, and values 
to decide what their sociologist-readers want 
and need, but I will describe some kinds of 
articles this reader would like to see: 

1. Analyses of general intellectual issues 


in sociology, including, for example, studies 


of the roles and effects of sociology in 
America, the relevance of sociology in 
postindustrial societies, and the relation 
between American sociology and the Ameri- 
can economy. 

2. Extended debates about, and critiques 
of, current theories or trends in theorizing, as 
well as fundamental or controversial issues in 
empirical research, teaching, and practice. 

3. Review articles of sociologically rele- 
vant work in other disciplines, such as 
institutional economics, literary criticism, and 
theories of knowledge. 

4. Nontechnical research reports and An- 
nual Review of Sociology-style articles about 
currently significant or controversial trends in 
American society: for example, downward 
mobility in the middle class, causes of drug 
use and abuse, convergencies of and relations 
between high culture and popular culture. 
This category could also include analyses 
(and corrections) of pop sociology, for 
example, of the decline of the nuclear family, 
the rise of greed and materialism in the 
1980s, and the cultural and economic power 
of “yuppies” and “baby boomers” in Ameri- 
can society. 

5. Sociological analyses of current events 
that have been or should have been in the 
headlines, domestic and foreign, economic, 
political, and cultural. 

6. Long reviews, of New York Review of 
Books quality, of important sociological 
books, well known and unfairly neglected, as 
well as of books of significance to sociology 
but written by nonsociologists. 

7. Articles of professional relevance not 
likely to appear in Footnotes, The American 
Sociologist, or the practice journals: for 
example, analyses of sources of conflicts in 
academic departments, reviews of graduate 
sociology programs from the student perspec- 
tive, and problems of sociological practice in 
profit-making organizations. These articles 
would frequently need to be anonymous. 

8. Sociological biographies of influential 
figures in sociology, not necessarily from the 
past. 

9. Provocative pieces that suggest unusual 
if untested (and even untestable) hypotheses, 
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or offer thoughtful analyses of the discipline 
by relevant outsiders. 

10. Shorter or lighter articles: for example, 
sociological reviews of art, literature, and 
films—highbrow, middlebrow, and lowbrow; 
studies of the depiction of sociologists in 
American novels, films, and television, and 
even cartoons of sociological significance or 


‘relevance. 


A lively journal that speaks to interests we 
share may help a little to bring us together as 
specialization moves us ever further apart. 
Nevertheless, perhaps the best way to add 
some unity to the diversity takes me back to 
the major theme of this essay: our being more 
useful to the public and to its various sectors. 
Being useful, as teachers, researchers, writ- 
ers, practitioners, and as experts, advisers, 
and critics, will make us feel more useful— 
and this will strengthen the commonality of 
purpose among us. Being useful should also 
add to our pride in the discipline, and pride is 
itself a potent social cement. But if we have 
further reasons to be proud of sociology, we 
will surely grow intellectually and in other 
ways in the years to come. 
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THE ELITE VARIABLE IN DEMOCRATIC TRANSITIONS 
AND BREAKDOWNS* 


MICHAEL G. BURTON 
Loyola College in Maryland 


JOHN HIGLEY 
University of Texas at Austin 


Stable democratic regimes depend heavily on the “consensual unity” of national 
elites. So long as elites remain disunified, political regimes are unstable, a 
condition which makes democratic transitions and democratic breakdowns merely 
temporary oscillations in the forms unstable regimes take. Disunity appears to be 
the generic condition of national elites, and disunity strongly tends to persist 
regardless of socioeconomic development and other changes in mass populations. 
The consensually unified elites that are necessary to stable democracies are 
created in only a few ways, two of the most important of which involve distinctive 
elite transformations. After elaborating this argument, we examine the relationship 
between elites and regimes in Western nation-states since they began to consolidate 
after 1500. We show that our approach makes good sense of the Western political 
record, that it does much to clarify prospects for stable democracies in developing 
Societies today, and that it makes the increasingly elite-centered analysis of 


democratic transitions and breakdowns more systematic. 


The unexpected wave of democratic transi- 
tions during the last decade, most notably in 
Latin America and Southern Europe, has 
attracted much scholarly attention (see, inter 
alia, O’Donnell, Schmitter, and Whitehead 
1986; Malloy and Seligson 1987; Baloyra 
1987; Needler 1987). Although this new body 
of work has considerable value, it offers no 
sound theoretical basis for judging the 
survival prospects of newly democratic re- 
gimes. Scholars have focused primarily on 
the antecedents and processes of democratic 
transitions and have avoided the task of 
prediction. Thus, comparative political soci- 
ology today is not much closer to a workable 
theory of stable democracy than it was in the 
1960s and 1970s when many putatively stable 
democracies fell to a wave of authoritarian 
regimes, a wave which was also unanticipated 
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theoretically (see Linz and Stepan 1978; 
Collier 1980). 

In thinking about the determinants of stable 
democracies, however, scholars have made a 
promising shift in causal focus away from 
Social structural and toward political determi- 
nants conceptualized in terms of the behavior 
of powerful actors or elites. This new 
emphasis has in turn introduced a large 
element of indeterminacy. Some scholars now 
suggest that democratic transitions and break- 
downs are ultimately the products of histori- 
cally contingent elite choices (e.g., O'Don- 
nell and Schmitter 1986; Lopez-Pintor 1987; 
Malloy 1987). Although this shift in causal 
focus is a step forward, it may lead to a dead 
end if it is not substantially elaborated. The 
elite concept is fraught with problems, and 
the contingent nature of elite choices may be 
a barrier to theoretical progress. 

We suggest a route out of these and related 
difficulties. Briefly, we argue that democratic 
transitions and breakdowns can best be 
understood by studying basic continuities and 
changes in the internal relations of national 
elites. A disunified national elite, which is the 
most common type, produces a series of 
unstable regimes that tend to oscillate be- 
tween authoritarian and democratic forms 
over varying intervals. A consensually unified 
national elite, which is historically much 
rarer, produces a stable regime that may 
evolve into a modern democracy, as in 
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Sweden, or Britain, or the United States, if 
economic and other facilitative conditions 
permit. Unless regime changes are preceded 
or accompanied by elite transformations — 
from disunity to consensual unity, in cases of 
democratic transitions, or from consensual 
unity to. disunity, in cases of democratic 
breakdowns— —they should be regarded as 
strictly temporary. However, such elite trans- 
formations rarely occur., Once created, each 
national elite type strongly tends to persist, 
with the disunified type being nearly ubiqui- 
tous, both historically in Europe and Latin 
America and today among Latin American 
and non-Western countries. Consequently, 
most regime changes that have been exam- 
ined as democratic transitions or breakdowns 
are more fruitfully viewed as underpinned by 
continuing elite disunity and associated re- 
gime instability. Failure to see this has led 
many scholars to exaggerate the longer-term 
significance of such transitions and break- 
downs and has left them unprepared to 
explain the reversals in regime form that 
typically. follow. 

_-The thrust of our argument is not new. 
Many scholars have shown that the unity of 
national elites is one of the most important 
déterminants of regime forms (e.g., Pareto 
1935; Mosca 1939; Aron 1950; Castles 1974; 
Putnam 1976; Huntington 1984). But this 
idea has not been developed systematically. 
Disunified and consensually unified national 
elites are not well defined, their origins and 
persisterice are largely unexamined, and the 
consequences of their internal organization 
for regime forms are poorly understood. 

The following section outlines a theoretical 
perspective that begins to correct these 
problems. Then our theoretical claims about 
the connections between national elite unity 
and regime stability are examined in a review 
of major Western political changes that have 
occurred since about 1500. We demonstrate 
that the connection between elite disunity and 
regime instability has been more widespread 
and persistent in historical and contemporary 
nation-states than is commonly recognized. 
We conclude by considering how our analysis 
informs current discussions about democratic 
transitions and breakdowns. 


CONCEPTUALIZING ELITE 
VARIATIONS AND REGIME 
CONSEQUENCES 


We first specify our frame of reference and 
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organizing concepts. The unit of analysis is 
the..independent, territorially consolidated 
nation-state, a political entity that is at least 
moderately demarcated territorially and admin- 
istratively centralized on basic matters like 
policing and taxing (see Giddens 1987, pp. 
116-21). Our principal interest is to explain 
how domestic elite interrelations affect re- 
gime stability. Although we recognize that 
elite-regime relationships are sometimes 
changed fundamentally by wars, we do not 
intend to explain such events. And while the 
political importance of location in the world 
economy is undeniable, we think that interna- 
tional economic forces do not normally 
determine elite-regime relationships (see 
Brenner 1977; Smith 1979; Linz and Stepan 
1978; O’Donnell and Schmitter 1986; Malloy 
1987). Finally, we recognize that subnational, 
regionally-based ethnic, religious, linguistic, 
and other cultural conflicts sometimes over- 
ride elite-regime relationships and require 
different but not necessarily contradictory 
concepts and models (e.g., Lijphart 1977). 
Within this frame of reference, we con- 
ceive of national elites as persons who are 
able, by virtue of their authoritative positions 
in powerful organizations and movements of 
whatever kind, to affect national political 
outcomes regularly and substantially (Burton 
and Higley 1987a). Scholars generally agree 
that national elites can be defined as top 
position-holders in the largest or most resource- 
rich political, governmental, economic, mili- 
tary, professional, communications, and cul- 
tural organizations and movements in a 
society (see Putnam 1976; Higley and Moore 
1981; McDonough 1981; Dye 1983; Hoff- 
mann-Lange 1987; Moyser and Wagstaffe 
1987). We think of regimes as basic patterns 
in the organization, exercise, and transfer of 
government decision-making power. Many 
distinctions among regime types can be drawn 
(e.g., democratic, authoritarian, and totalitar- 
ian), but we stress the value of distinguishing 
between stable and unstable regimes. 
Scholars who focus on the variability of 
national elites generally distinguish three 
basic types: (1) the “pluralistic” or “consen- 
sually unified” type that exists in most 
Western societies today and that existed in a 
few of them in earlier times; (2) the 
“totalitarian” or “ideologically unified” type 
in nation-states organized along communist, 
fascist, or theocratic lines; and (3) the 
“divided” or “disunified” elite of many past 
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and contemporary nation-states (Aron 1950; 
Dahrendorf 1967; Putnam 1976; Welsh 1979; 
Burton and Higley 1987a). We focus here 
only on consensually unified and disunified 
national elites because understanding their 
differences and their strong tendency to 
persist clarifies the analysis of democratic 
transitions and breakdowns. 

A national elite is consensually unified 
when its members (1) share a largely tacit 
consensus about rules and codes of political 
conduct amounting to a “restrained partisan- 
ship” (Prewitt and Stone 1973; Di Palma 
1973), and (2) participate in a more or less 
comprehensively integrated structure of inter- 
action that provides them with relatively 
reliable and effective access to each other and 
to the most central decision-makers (Ka- 
dushin 1979; Higley and Moore 1981). This 
combination of tacit consensus on rules of the 
game and comprehensive integration disposes 
elite members to view decisional outcomes as 
a positive-sum or “politics-as-bargaining” 
game, rather than a zero-sum or "politics-as- 
war" game (Sartori 1987, p. 224). With 
agreement on the rules of the political game 
and with decision-making access assured, the 
diverse and heterogeneous members accept 
various decisions they do not especially like 
because they expect to get their way on other 
issues they consider vital. Over time, most 
elites achieve their most basic aims and are 
therefore inclined to view the totality of 
decisional outcomes as positive-sum (Sartori 
1987, p. 229). 

This explains why elite persons and 
factions who regularly take opposing ideolog- 
ical and policy positions in public consistently 
refrain from pushing their differences to the 
point of violent conflict. Typical elite mem- 
bers therefore enjoy considerable personal 
security, in the sense that they do not expect 
to be killed, imprisoned, or otherwise se- 
verely penalized for ending up on the losing 
side of a policy dispute. It follows that, once 
this type of national elite is created, and so 
long as it persists, forcible seizures of 
government power by one or another discon- 
tented faction will not occur. Moreover, to 
accommodate and process the diverse, fre- 
quently opposing interests of the factions in 
such a national elite, political institutions will 
be structured along representative, at least 
proto-democratic lines, though the actual 
extent of representative democracy may 
depend on other, facilitative conditions. 
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By contrast, a national elite is disunified 
when its members (1) share few or no 
understandings about the proprieties of politi- 
cal conduct and (2) engage in only limited 
and sporadic interactions across factional or 
sectoral boundaries. The basic situation of 
persons composing this elite type is one of 
deep insecurity—the fear, usually rooted in 
experience, that all is lost if some other 
person or faction gets the upper hand. 
Accordingly, members of a disunified elite 
routinely take extreme measures to protect 
themselves and their interests: killing, impris- 
oning, or banishing opponents, fomenting 
rebellions against ascendant factions, expro- 
priating opponents' resources, and so on. In 
the context of elite disunity, these actions are 
often the most rational ones available. Recent 
experiences of having punitive measures 
taken against them or their close associates, 
and the strong belief that such' measures will 
be implemented in the future, solidify the 
fears and insecurities of disunified national 
elite members. So entrenched are these fears 
and insecurities that elite disunity can be 
transcended only in extraordinary circum- 
stances. 

We have few direct and comprehensive 
empirical studies of disunified national elites. 
Members seldom cooperate in such research 
because they fear it will be used against them. 
Researchers who persist in studying a disuni- 
fied elite do so at considerable personal peril. 
One direct and relatively extensive study is 
McDonough's (1981) 1972—73 survey of the 
Brazilian national elite, minus the governmen- 
tally dominant military elite, which refused to 
cooperate. McDonough's research portrays an 
elite divided into military-governmental, eco- 
nomic, church, and urban labor factions that 
were polarized over rules of the political 
game (e.g., freedom of political opposition 
and the extent of executive power) and 
isolated from each other. Brown's (1969, p. 
441) more limited survey of French elite 
groups in the mid-1960s— just a few years 
after their conflicts toppled the Fourth Repub- 
lic and the Algerian imbroglio spawned 
considerable intra-elite violence—concluded 
that the outstanding feature of the French 
national elite was “the lack of agreement 
concerning the basic political institutions of 
the nation” and an absence of extensive 
personal contacts among the main elite 
factions (p. 441). In related fashion, Schon- 
feld’s (1981) study of French elites in the 
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early 1970s found extensive personal ties 
among elite members in the main factions but 
lack of ties across factional lines. Impor- 
tantly, each faction was ignorant of and 
disregarded the other. In the same vein, in 
Czudnowski’s (1987) limited survey of the 
Taiwanese national elite, the fact that he had 
to devise complex research stratagems to 
circumvent the respondents’ mutual fears and 
hostilities points to basic elite disunity. 

The origin of national elite disunity appar- 
ently lies in the process of nation-state 
formation. Constructing nation-states out of 
previously disparate and partially autonomous 
territories is typically such a violent and 
conflict-ridden process, involving the repres- 
sion of some elite groups by others, that deep 
and unremitting elite disunity is almost 
inevitable (Coleman 1971, pp. 89-93). Ben- 
dix’s (1978) treatise on nation-state formation 
in the West and Japan amply demonstrates 
that nation-state consolidation everywhere 
resulted in disunified elites. The formation of 
nation-states in Latin America after emancipa- 
tion from Spanish rule in the early 19th 
_ century required repeated efforts to suppress 
local elites by force (Oszlak 1981). Johnson 
(1983) provides a graphic account of the 
altogether similar result of postcolonial nation- 
state consolidations in Black Africa and many 
other newly emerging nations in the 1960s 
and 1970s. 

Thus, the historical record strongly sug- 
gests that elite disunity originates in the 
formation of nation-states. Disunity is, in 
other words, the generic condition of national 
elites. But there are two kinds of exceptions. 
First, the experience of operating “home 
rule” regimes over longer periods under 
relatively benign colonial tutelage and/or of 
orchestrating large and politically complex 
national independence movements sometimes 
results in a consensually unified national elite 
from the date of postcolonial independence. 
The United States, Canada, New Zealand, 
Australia, India, and a few other offshoots of 
the British Empire are the principal examples. 
Second, defeat and occupation of societies 
after wars sometimes results in consensually 
unified elites from the date of postwar 
independence because the previously most 
antagonistic elite factions have been liqui- 
dated (e.g., Austria during and after World 
War ID. But with these exceptions, the 
historical record strongly supports the propo- 
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sition that nation-state formation almost 
always results in a disunified national elite. 
The main political consequence of elite 
disunity is regime instability. As a property of 
regimes, instability has several meanings. 
Sanders (1981) identifies three meanings: (1) 
a high incidence of political violence in the 
form of revolts, riots, strikes, mass demon- 
strations, and individual actions; (2) frequent 
changes in the makeup of governing coali- 
tions and cabinets; (3) the occurrence of 
coups d’etat or other government overthrows. 
In our view, the first two meanings are not 
sufficiently discriminating because nearly all 
regimes at various times would qualify as 
unstable in one or both respects. Only the 
third meaning, suitably elaborated, distin- 
guishes unstable regimes in a clear-cut and 
theoretically useful fashion. Thus, a political 
regime may be said to be unstable whenever 
government executive power is subject to 
irregular seizures, attempted seizures, or 
widely expected seizures by force. Concrete 
indicators of regime instability are revolu- 
tions, uprisings, or coups d’etat aimed at 
changing the control of government executive 
offices and not orchestrated primarily by 
another nation-state. A regime may be 
classified as unstable during periods when 
such seizures occur, are attempted, or are - 


regarded by informed persons as likely 


possibilities. So long as any of these 
indicators of instability obtain, a regime’s 
current mode of functioning, whether “demo- 
cratic” or “authoritarian” or something else, 
is likely to be temporary. 

Irregular, forcible power seizures are suffi- 
ciently frequent and visible, or the expecta- 
tion of them is so palpable, that observers 
usually have little difficulty recognizing a 
regime as unstable. Thus, Malloy (1985, p. 
367) calculated that some 186 irregular 
seizures of government occurred in Bolivia 
since independence in 1825. Veliz (1967, p. 
278) counted 80 successful military coups in 
18 Latin American countries between 1920 
and 1966. Grundy (1968) found 64 irregular 
seizures of power in Black African nations 
between 1963 and 1968, and Macridis (1986, 
p. 225) listed 26 coups in 16 countries of 
Tropical Africa between 1970—1984. Simi- 
larly, it is usually not hard to discern 
expectations of irregular power seizures 
among informed observers of a regime. For 
example, a journalist (Bonner 1988) who 
recently spent several months talking with 


ELITES AND DEMOCRACY 


leaders and opponents of the democratic 
regime led by Alan Garcia in Peru concluded: 
“The soldiers still exercise a de facto veto 
over the actions of civilians, and many 
observers in Peru, both Peruvian and foreign, 
think that the question is not whether the 
military will stage a coup but when. Between 
two and five years is the general estimate” (p. 
58). 

Occasionally, however, there are regimes 
in which no irregular power seizure or 
attempted seizure has recently occurred and in 
which the likely possibility of such seizures is 
debatable among informed observers. Uru- 
guay, after its civil war in 1904 and through 
the years immediately preceding the military 
coup in 1973, is an example. Chile, after the 
overthrow of the Ibanez dictatorship in 1932 
and down to the first year of the Allende 
government in 1970-71, is another example. 
France during the long-lasting Third Repub- 
lic, 1875-1940, might be another. In such 
ambiguous cases, many scholars and policy- 
makers have mistakenly regarded regimes as 
stable. . 

The primary reason for these inaccurate 
assessments has been a failure to comprehend 
the underlying condition of elite disunity. 
Although elites and regimes seem inextricably 
entwined, analytically distinguishing between 
the basic structure of a national elite and the 
characteristics of its political regime allows 
one to postulate a causal relationship between 
elites and regimes, in which elite structure is 
viewed as logically and factually prior to 
regime stability. This leads to the proposition 
that, for a lasting democratic transition to 
occur, the national elite must first be 
transformed from disunity to consensual 
unity. When analyzing regime changes, in 
short, researchers should concentrate on the 
underlying elite structure, seeking recent or 
historical evidence of an elite transformation. 
If no such evidence is found, the analyst 
should presume that the elite remains disuni- 
fied and the regime remains unstable. 

This conclusion presumes that we know 
both what elite transformations from disunity 
to consensual unity look like and the 
circumstances in which they may occur. 
Unfortunately, political sociologists have not 
explored these matters in depth. We (Burton 
and Higley 1987b) recently analyzed one kind 
of elite transformation: the "elite settlement" 
in which warring elite factions suddenly and 
deliberately reorganized their relations by 
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negotiating compromises on their most basic 
disagreements, thereby achieving consensual 
unity and laying the basis for a stable 
democratic regime. The elite settlement 
process is exemplified by political events in 
England in 1688-89, Sweden in 1808-9, and 
Colombia and Venezuela in the late 1950s. 
We suspect that a second kind of transforma- 
tion from disunity to consensual unity occurs 
in two distinct steps. In step one, some of the 
warring factions enter into sustained, peaceful 
collaboration in electoral politics in order to 
mobilize a reliable electoral majority, win 
elections repeatedly, and thereby protect their 
interests by dominating government executive 
power. In step two, the major hostile factions 
opposing this coalition eventually tire of 
losing elections and, seeing no other way to 
gain government power (for example, through 
a coup), gradually abandon their distinct 
ideological and policy stances and adopt 
essentially those of the winning coalition. 
With this development, a consensually uni- 
fied national elite is created and a stable 
democratic regime rapidly emerges. Exam- 
ples include France and Italy during the past 
quarter century: center-right elite coalitions 
formed, the French Gaullists and the Italian 
Christian Democrats plus smaller elite fac- 
tions in each country. These coalitions then 
dominated electoral politics in ways that 
eventually forced radical leftist factions, 
principally the French Socialists and the 
Italian Communists, to moderate their ideolo- 
gies and programs in order to compete 
effectively for executive office (Field and 
Higley 1978). 

So far as we can determine, elite settle- 
ments and these “two-step” elite transforma- 
tions are, to date in modern history, the only 
routes from a disunified to a consensually 
unified national elite. They thus constitute the 
only indigenous bases for changes from 
unstable to stable democratic (or proto- 
democratic) regimes. But, as noted, such 
transformations rarely occur. Conversely, 
elite transformations from consensual unity to ` 
disunity are even rarer. Down to the present 
period at least, consensually unified national 
elites, once formed, have everywhere perpet- 
uated themselves. 

The rarity of elite transformations from 
consensual unity to disunity has implications 
for the analysis of democratic breakdowns. 
The much-discussed breakdowns of interwar 
Europe (see Linz and Stepan 1978) all 
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occurred in nation-states whose elites had 
been patently disunified since the time of 
nation-state formation (e.g., German elites 
after 1871; see Baum 1981; Hamilton 1982). 
What broke down in interwar Europe were 
democratic interludes in wider patterns of 
elite disunity and regime instability. Possibly, 
an elite transformation from consensual unity 
to disunity, and thus a regime change from 
stability to instability, have recently occurred 
in the Philippines, a nation-state that gave 
many indications of having a consensually 
unified national elite and a stable democratic 
regime from the end of American colonial 
tutelage in 1946 until the spread of intraelite 
violence, culminating in several military 
rebellions, during the 1980s. 

Although elite transformations are funda- 
mentally “elite events,” mass variables obvi- 
ously are important. Elites always need mass 
support. Mass conditions and orientations 
thus establish fields of opportunity and 
constraint to which elites must respond (see 
Field and Higley 1980, pp. 18-47). For 
example, two-step elite transformations ap- 
pear possible only where a majority of voters 
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are disposed, by virtue of their socioeconomic 
and other conditions, to support elite electoral 
appeals that essentially defend the status quo. 
Yet, as we suggest below, no discernible 
mass configuration leads inexorably to elite 
transformations. Despite dramatic changes in 
mass conditions and orientations during the 
modern historical period, the. modal pattern 
of Western politics was one of persistent elite 
disunity and resulting regime instability. We 
now briefly describe this modal pattern. 


THE MODAL PATTERN OF 
WESTERN POLITICS 


For most of the period between 1500 and the 
Napoleonic Wars, only eight Western socie- 
ties approached the political independence 
and integration of modern nation-states: 
Denmark, England, France, Portugal, Russia, 
Scotland, Spain, and Sweden (see Table 1). 
During the first half of this period probably 
only England full met the nation-state 
standard in terms of territorial political 
consolidation. Denmark was embroiled in an 
effort to retain control of Sweden, France was 


Table 1. Types and Origins of National Elites in Selected Western Nation-States: A Schematic View 





Nation-State Formation Elite Type(s) Origin of CU Elite 
England (late medieval) DU to 1689 Elite settlement 
CU 1689-1988 1688-1689 
Denmark (late medieval) DU to 1901 2-3tep transformation 
CU 1935-88 1901-35 
Scotland (late medieval) DU to 1707 None; merged w/England 1707 
Portugal (late medieval) DU to 1980s None clearly indicated 
Spain (from 16th cent.) DU to 1977 Elite settlement 
CU 1979-88 1977-79 
Sweden (from 16th cent.) DU to 1809 Elite settlement 
CU 1809-1988 1808-9 
Russia (from 17th cent.) DU to 1917 Ncne; revolutionary transformation, 
TU 1921-88 1917-21 
France (late 17th cent.) DU to 1960 2-step transformation 
CU 1981-88 1960-81 
U.S.A. (from 1789) CU 1789-1988 Colonial “home rule“ and 
independence struggle 
Netherlands (from 1813) CU 1813-1988 Fusion of provincial elites 
Prussia (from 1815 or earlier) DU to 1871 None; merged w/Germany 1871 
Belgium (from 1830) DU to 1890s 2-step transformation 
CU 1961-1988 1900-61 
Switzerland (from 1848) CU 1848-1988 Fusion of cantonal elites 
Italy (from 1870) DU to 1948 2-step transformation 1948—80 
CU 1980-88 
Germany (from 1871) DU to 1933 Revolutionary transformation 1933 
TU 1933-45 2-step transformation 1948-66 
CU 1966-88 
Norway (from 1884) CU 1935-88 2-step transformation 1884-1935 
, Austria (from 1919 or earlier) DU to 1938 Elite settlement 1945-48 
` CU 1948-88 


Abbreviations: CU = Consensually Unified; DU = Disunified; IU = Ideologically Unified. 
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periodically broken up in civil wars over 
religious and social questions until late in the 
17th century, Portugal was subjugated by 
Spain between 1580 and 1640, Russia did not 
achieve national consolidation until the 17th 
century under the Romanovs, Scotland was 
not fully independent from England after 
1650 and it disappeared as a nation-state after 
merging with England in 1707, and Spain was 
initially several distinct countries joined by a 
dynastic union that only eventually led to 
territorial demarcation and administrative 
centralization. A ninth political entity, the 
United Provinces of the Netherlands, threw 
off Spanish domination in the 1580s, but the 
seven provinces, which separately appeared 
to have consensually unified elites from the 
time of independence (see Grever 1982), did 
not achieve nation-state consolidation until 
the Napoleonic Wars. Throughout the 18th 
century, Brandenburg-Prussia emerged as an 
important European power, but it consisted of 
various semiautonomous and noncontiguous 
territories. Finally, the United States did not 
emerge as a consolidated nation-state until the 
very end of the period. 


Disunity as the Generic Condition of 
National Elites 


By about 1600 ali eight original societies— 
cum-nation-states had reached levels of 
urbanization and commercialization suffi- 
ciently complex that many elite positions 
required more or less full-time attention by 
their incumbents. Although small leisured 
classes of influential aristocrats and gentry 
continued to exist, in showdown situations 
the decisive power-wielders tended to be the 
persons and factions commanding the most 
important bureaucratic organizations: mon- 
archs and high state officials, senior military 
officers, high-ranking ecclesiastics, mer- 
chants and entrepreneurs heading important 
commercial enterprises, and leaders of re- 
gional political bodies. 

As national aggregations, these elites were 
clearly disunified in the sense that wide and 
deep struggles for political ascendancy typi- 
fied their relationships. Elite factions associ- 
ated with monarchs and contenders for the 
throne tended to back royal absolutism, while 
factions benefiting from local autonomy 
resisted. The use of military force for 
purposes of aggrandizement and pacification 
bulked large (Finer 1975; Mann 1986), and 
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religious divisions made struggles between 
state-building and state-resisting elites more 
bitter and complex (Bendix 1978; see also 
Tilly 1975). 

Political regimes in all these nation-states 
were traditional monarchies in which govern- 
ment executive power was transferred among 
successive rulers by principles of inheritance. 
But since there seldom was any widespread 
agreement on the concrete application of 
these principles, transfers of executive power 
were subject to intrigues, challenges, and 
usurpations. In Denmark, for example, strug- 
gle over the royal succession exploded in civil 
war during the 1530s, and in England 
succession to the throne was precarious 
throughout the second half of the 16th 
century. Even where transfers of executive 
power among successive monarchs occurred 
peacefully —as during Spain's “golden” 16th 
century —the mutual fear and distrust of elite 
factions guaranteed incessant rivalries, plots, 
and other frequently violent maneuvers aimed 
at dominating the monarch. 

In reality, these traditional monarchies 
simply embodied elite disunity. The attempt 
to centralize political power in one person and 
clique—the monarch and his or her retinue — 
reflected the absence of mutually agreed 
power-sharing arrangements. In Bendix’s 
(1978, pp. 218—43) view, traditional monar- 
chies confronted insoluble dilemmas — irrec- 
oncilable needs to both concentrate and 
delegate royal authority, large and irreducible 
elements of arbitrariness in monarchical 
actions, eminently disputable processes of 
inheritance and succession, pervasive uncer- 
tainty about the extent and limits of royal 
authority—the only consequence of which 
could be regime instability. In these funda- 
mental respects, the persistence of traditional 
monarchies everywhere testified to the exis- 
tence of disunified national elites. 


The First Elite Settlements 


Down to the Napoleonic Wars, the relation- 
ship between elite disunity and regime 
instability was broken only in England and 
Sweden (see Table 1). In England's “Glo- 
rious Revolution" of 1688—89, and in substan- 
tively similar events surrounding Sweden's 
creation. of a constitutional monarchy in 
1808—9, elite settlements occurred in,.which 
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disunity to consensual unity. As we have 
argued (Burton and Higley 1987b), these 
settlements originated in the recent experience 
of costly but inconclusive elite conflict—the 
English civil wars and their Cromwellian 
aftermath plus 30 years of bitter infighting 
between Tory and Whig factions during the 
Restoration period, and in Sweden the 
struggle between Hat and Cap factions 
throughout the 18th century. Immediate 
precipitants were dramatic political crises: the 
birth of James IPs son in 1688, making 
probable a Catholic succession to the throne 
of a Protestant country; the grave defeat 
Sweden suffered at the hands of Russia in 
1808, coupled with economic disarray. The 
settlements depended for their success upon 
the unprecedented forbearance and secret 
collusions of a handful of skilled and 
experienced leaders, representing the major 
elite factions, who rapidly negotiated compro- 
mises on the most dangerous issues. And the 
predominantly unmobilized character of non- 
elite populations apparently facilitated the 
settlements by allowing elites sufficient auton- 
omy to negotiate compromises. 

These settlements established new arrange- 
ments for sharing power, thereby greatly 
reducing elite insecurities. Henceforth, most 
elite persons had reliable access to decision- 
making on issues salient to them, and defeats 
on policy questions no longer carried drastic 
penalties. Almost immediately after the En- 
glish and Swedish elite settlements, a prudent 
and restrained politics came into being, with 
contested elections taking place at short 
intervals and eventually becoming democra- 
tized through universal suffrage, and with 
executive power shifting from monarchs to 
cabinets responsible to elected bodies. There- 
after, even serious political challenges, such 
as England’s Chartist Movement during the 
1830s and radical labor movements in both 
countries at the end of the 19th century, did 
not prevent the peaceful transfer of executive 
power among different elite factions accord- 
ing to the outcomes of regular popular 
elections. 

We sketch the elite settlements in England 
and Sweden because they represent, within 
the frame of reference specified, the only 
route in preindustrial societies from elite 
disunity and regime instability to consensual 
elite unity and a stable, proto-democratic 
political regime. Without a settlement, the 
elite disunity that originates during nation- 
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state formation simply persists, guaranteeing 
that political regimes remain unstable. This 
contention is borne out by the politics of the 
other original Western nation-states down to 
the Napoleonic Wars: in none did anything 
resembling the English and Swedish elite 
settlements occur, and all of their political 
regimes evidenced instability throughout the 
period. 


Persistence of Elite Disunity Elsewhere 


In Denmark, straggles among aristocratic and 
bourgeois elite factions culminated in royal 
absolutism after 1665. A century of rule by a 
few hundred landowners in league with the 
monarchy followed but was upset in the 
1770s and 1780s by palace intrigues and 
power grabs, amounting to successive coups 
d'etat. In France during the 17th and 18th 
centuries, elites in the court, the church, the 
military, and the major towns became polar- 
ized over traditional and rationalist views of 
rank and privilege and over other opposing 
conceptions of the good society. The chasm 
between the elite camps persisted, and in 
1788—89 revolution broke out. In Portugal, 
independence from Spain in 1640 was soon 
followed by court intrigues, the overthrow of 
Alfonso VI in 1668, and the assertion of royal 
absolutism, which, as elsewhere, involved 
constant infighting among elite factions bent 
on dominating the monarchical power. 

Elite disunity was also evident in Russia 
and Spain. In Russia during the 17th century, 
absolutist rule was punctuated by sporadic 
uprisings which the Tsars ruthlessly sup- 
pressed. After 35 years of “enlightened 
despotism” under Peter The Great (1689— 
1725), court intrigues and military interven- 
tions perpetuated regime instability. As Ben- 
dix (1978, p. 597) notes, only two of the 
eight monarchs following Peter during the 
18th century attained the throne without 
military intervention, and both were mur- 
dered. Spanish politics in the 17th century 
were dominated first by revolts against 
Castilian hegemony and then by attempts by 
noble elite factions to dominate the incompe- 
tent Charles II (1665-1700). Early in the 18th 
century, Spain broke up in civil war and 
regional revolts. It was patched together in 
1716, but its politics then paralleled French 
politics —a protracted struggle pitting reaction- 
ary aristocrats and church officials against 
modernizing elite factions spearheaded by 
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state officials and leaders of the small 
bourgeoisie. At century’s end, under another 
incompetent king, Charles IV, court intrigues 
swept Spain into its disastrous French alli- 
ance, which produced an uprising against the 
king, his forced abdication in 1808, and 
Napoleon’s imposition of his brother Joseph 
on the Spanish throne. 

In short, the uniform political pattern in all 
of the earliest Western nation-states was elite 
disunity and regime instability, a pattern 
originating in the process of nation-state 
formation. Except in England and Sweden, 
where extraordinary circumstances facilitated 
sudden and deliberate settlements, elite dis- 
unity and regime instability persisted over 
several hundred years, despite the ebb and 
flow of national fortunes and despite fairly 
steady socioeconomic development. This pat- 
tern was not simply a consequence of 
imperfect nation-state consolidation, tradi- 
tional monarchies, or preindustrialization. It 
persisted after these nation-states fully consol- 
idated, after their monarchies were mostly 
replaced by republican governments, real or 
de facto, and after their economies were 
substantially industrialized. 


Elite Disunity In the 19th Century 


: Conquests by French revolutionary and Napo- 
leonic armies turned many small and precari- 
ously independent German and Italian princi- 
palities into large territorial units. In 1815, 
the Congress of Vienna ratified these changes, 
thus reorganizing the European political map 
(see Table 1). Prussia emerged as a major 
nation-state, while the loosely consolidated 
Habsburg Empire (after 1867 the Austro- 
Hungarian Empire) tried to control territories 
in Italy, Eastern Europe, and the Balkans. As 
a result of war in the 1860s, Prussia excluded 
the Habsburg Empire from German-speaking 
territories outside Austria proper, and in 1871 
Prussia established the German Reich. With 
French and Prussian help, the Kingdom of 
Sardinia-Piedmont expanded, also reducing 
the Habsburg Empire, and emerged in 1870 
as the Italian nation-state. Along with France, 
Spain, and Russia, these countries dominated 
continental Europe, though smaller nation- 
states also emerged—the Netherlands from 
1813, Belgium from 1830, Switzerland from 
1848, Norway from 1884. 

At various points during the 19th century, 
most of these countries, along with Britain, 
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Sweden, and Denmark, reached industrial 
levels of socioeconomic development, and 
their populations came to include large 
categories of manual industrial, bureaucratic, 
and service workers. This spread of an 
industrial occupational configuration created 
new elites commanding labor unions, mass 
political parties, new commercial enterprises, 
professional associations, mass media organi- 
zations, educational institutions, and public 
sector agencies and organizations stemming 
from state expansion. But industrialization 
was not accompanied by any basic change in 
elite structures or in the character of political 
regimes. Elites in the original European 
nation-states, other than England and Swe- 
den, remained disunified. If anything, elite 
disunity was exacerbated by the French 
Revolution’s aftershocks, by new elites enter- 
ing the political arena, and by conflicts 
between population segments resulting from 
industrialization. Moreover, the process of 
forging new nation-states during the 19th 
century nearly everywhere produced disuni- 
fied national elites. Thus, in Prussia, Bel- 
gium, Germany after 1871, and Italy after 
1870, elites disagreed fundamentally about 
the constitutions and institutions on which 
their new nations rested, and they defended or 
subverted governments according to their 
narrow interests. 

Two exceptions, Switzerland and Norway, 
require brief mention. Before entering the 
new Swiss federation in 1848, several, per- 
haps all, previously self-governing, geograph- 
ically isolated Swiss cantons had consensually 
unified elites as outgrowths of late medieval 
“citizen communities.” The Swiss constitu- 
tional settlement of 1848 formed these 
cantonal elites into a consensually unified 
national elite that has operated a stable 
representative regime ever since. Somewhat 
by contrast, Norway was controlled by 
Sweden between 1814 and 1884, with 
Norwegian elites operating a “home rule” 
regime along representative political lines. 
When Sweden relinquished control in 1884, 
Norway emerged with a national elite that 
was halfway through a two-step transforma- 
tion. Between them, conservative and liberal 
factions mobilized an electoral majority and 
consistently excluded ostensibly revolutionary 
socialist factions from government executive 
power until the latter abandoned radicalism 
during the 1930s, allowing full consensual 
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unification (see Higley, Field, and Groholt 
1976). 

But elsewhere in 19th century Europe, elite 
disunity persisted and produced frequent, 
often violent oscillations in regime forms. 
Thus, in France a traditional monarchy was 
restored in 1814-15, a “bourgeois monarchy” 
was installed in an upheaval in 1830, a 
, republican regime was instituted by revolt in 
1848, an autocratic regime was established by 
coup d’etat in 1851, and another republican 
regime was instituted following the suppres- 
sion of the Paris Commune uprising of 1870. 
Elite disunity and regime instability were 
equally conspicuous in Spain: restoration of 
the monarchy in 1814; enforced liberalization 
of the monarchy, involving a military coup, 
between 1820 and 1823; the military-backed 
“succession” of three-year-old Queen Isabel 
ID in 1833 and withdrawal of the former heir 
presumptive to lead a “Carlist” revolt in the 
north; military interventions that shifted 
power between liberal and conservative fac- 
tions in 1835, 1843, 1854, and 1856; Isabel’s 
dethronement in 1868 and an interregnum of 
military rule and civil war followed by her 
son’s acceptance of a constitutional monarchy 
with nominally representative institutions in 
1875. Portugal experienced a similar pattern 
of upheavals: displacement of the monarchy 
in 1820, a failed insurrection in 1824, civil 
war in 1832-34, coups in 1836 and 1851, and 
an attempted uprising in 1890. 

In most other European nation-states, 
however, there were no similar sequences of 
government overthrows. In Russia, Prussia, 
and Germany, conservative elite factions used 
repression to sustain monarchies that were 
reviled by elites leading bourgeois and 
working-class organizations and movements. 
In these countries, and in Denmark, Belgium, 
and Italy, elite disunity was suggested by 
deep ideological divisions between elites, and 
regime instability was reflected in intrigues 
surrounding the monarchical regimes and in 
numerous riots, strikes, and other confronta- 
tions engineered by dissident elites. 


Disunified Elites and Unstable Democracies 


When the 19th century ended, all Western 
nation-states except Russia had representative 
political institutions involving significant prac- 
tice of electoral politics. The material wealth 
produced by industrialization enabled domi- 
nant elites temporarily to placate dissident 
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elites and their followings, and the suffrage 
was still largely restricted to those with a 
stake in the existing socioeconomic order. 
Thus, elections did not immediately become 
merely another way for warring factions to 
undermine and destroy each other. For a 
while, a number of European nation-states 
appeared to have stable democratic regimes, 
even though no basic elite transformation had 
occurred. 

We must examine this situation further 
because it bears directly on our argument that 
democracy is stable only when elites are 
consensually unified. The long-lasting French 
Third Republic may seem to constitute a 
particularly clear exception. For 65 years it 
suffered no forcible seizure of government 
executive power, and it was regarded as one 
of the major democracies of the modern age. 
If space allowed, other seeming exceptions to 
our argument would bear examination: Italy's 
constitutional monarchy from 1870 until 
Mussolini’s takeover in 1922-25; and, further 
afield, the Argentine regime between 1912 
and 1930, the Uruguayan regime between 
1905 and the late 1960s, and the Chilean 
regime between 1933 and Allende's assump- 
tion of the presidency in 1970. In our view 
such governments were interludes, if rather 
long-lasting ones, in a continuing pattern of 
elite disunity and regime instability. Here we 
can only illustrate this interpretation -by 
looking at the French Third Republic. 

Despite certain conditions favoring an elite 
transformation, the creation of the Third 
Republic was not preceded, accompanied, or 
followed by unification of the disunified elites 
so evident in France's earlier history. Among 
the monarchist factions who held a majority 
of National Assembly seats in the early 
1870s, many people favored a British-style 
constitutional monarchy (Anderson 1977, p. 
6). Republicans within the Assembly were 
predominantly moderate, as exemplified by 
their leader, Adolphe Thiers. Monarchists and 
republicans were thus potentially capable of 
compromising. And radical factions in the 
Assembly were in disarray after the Paris 
Commune debacle. In these circumstances, 
an elite settlement was conceivable. A step in 
this direction was the "fusionist" effort by 
leaders of the two monarchist factions to have 
the Legitimist pretender, the childless Comte 
de Chambord, then in his fifties, take the 
throne and be succeeded by the much younger 
Orleanist pretender, the Comte de Paris 
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(Thomson 1969, pp. 80-83). The plan 
foundered on Chambord’s refusal of the 
limited monarchy being offered, a refusal he 
couched in a dispute over the national flag. 
Had Chambord accepted the fusionist plan, it 
might have become a key element in a 
comprehensive elite settlement. 

Thus, the Third Republic was born, not in 
an elite settlement but in an elite stalemate 
(Hoffmann 1963). Established by a one-vote 
margin in the National Assembly in 1875, it 
was never viewed by monarchist, Bonapart- 
ist, or left-wing elite factions as more than a 
temporary expedient (Thomson 1969). These 
several factions mobilized major segments of 
the business class, the church hierarchy, the 
military, the civil service, and the working 
class. Predictably, expectations of forcible 
seizures of government power became a 
regular feature of French politics. During the 
1880s there was considerable support for a 
coup by ex-General Boulanger, but he, balked 
at the last minute. The Ligue des Patriotes 
attempted a coup in 1889. In 1898 and 1899, 
during the divisive Dreyfus affair, two serious 
coup plots were uncovered. In 1933, Action 
Francaise and other protofascist groups con- 
spired to overthrow the Daladier government. 
The last years of the Republic, before, 
during, and after the Popular Front govern- 
ment of 1936-1938, involved severe elite 
' disunity, as evidenced by the alacrity with 
which a major portion of the elite seized the 
opportunity created by military defeat in 1940 
to dismantle the Republic and to establish the 
corporatist, semifascist Vichy regime (Paxton 
1972). 

Similar analyses of continuing elite dis- 
unity and its destabilizing effects on demo- 
cratic regimes apply to most other European 
and all Latin American nation-states during 


the latter part of the 19th century and much of . 


this century. Weimar Germany and the 
Austrian Republic during the interwar period 
before Hitler took over provide two especially 
clear-cut and consequential examples (see 
Hamilton 1982), but so do Italy, Spain, and 
Portugal in the same period. , 

Did this modal pattern of Western politics 
persist after World War II? Following our 
argument, one must examine each country for 
clear evidence of an elite transformation from 
disunity to consensual unity, through either a 
sudden and deliberate settlement or a two-step 
transformation. Apart from Russia and its 
East European satellites, and possibly Portu- 
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gal, all the unstable European countries 
discussed above appear to have experienced 
an elite transformation (see Table 1). In 
Austria, the proporz system and other power- 
Sharing arrangements that elites secretly 
negotiated in the late 1940s probably consti- 
tuted an elite settlément, thus laying the basis 
for the first stable democratic regime in 
Austria's history (see Steiner 1972; Stiefbold 
1974). Similarly, the intensive elite negotia- 
tions carried on in Spain after Franco's death 
in 1975 produced a settlement terminating 
more than four centuries of regime instability 
(see Gunther et al. 1986; Share 1987). 
Elsewhere—in West Germany and Belgium 
during the 1950s and early 1960s, and in 
France and Italy during the 1960s and 
1970s—two-step elite transformations oc- 
curred as the electoral dominance of center- 
right elite coalitions gradually forced left- 
wing factions to abandon socialist orthodoxies 
and join their "bourgeois" opponents in 
defending and óperating liberal democratic 
regimes presiding over capitalist economies. 


Summary 


Table 1 summarizes the patterns underlying 
our contention that, barring special colonial 
legacies such as those in the United States or 
the Netherlands, and barring exceptional 
historical and geographical circumstances 
such as those of Switzerland, elite disunity 
and regime instability have been so wide- 
spread and persistent as to constitute the . 
modal pattern of Western politics. Locating 
the origins of elite disunity in the formations 
of Western nation-states, the table illustrates 
our contention that the relationship between 
elite disunity and regime instability simply 
persists, regardless of changes in regime 
forms or in the configurations of mass 
publics, until one of three elite transforma- 
tions occurs: (1) a sudden and deliberate elite 
settlement, (2) a more gradual two-step elite 
transformation, or (3) a “revolutionary elite 
transformation” to the condition of ideologi- 
cal unity (which because of our focus we have . 
not examined). Finally, Table 1 suggests that 
only after an elite settlement or a two-step 
transformation do previously unstable re- 
gimes become stable along representative 
lines that are conducive to the rapid or 
gradual spread of democratic politics. In 
short, within the frame of reference specified, 
special colonial legacies, elite settlements, 
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and two-step elite transformations have con-. 


stituted the only routes to stable democratic 
political regimes in modern Western history. 
We now suggest some implications of this 
analysis for the current discussion of demo- 
cratic transitions and democratic breakdowns. 


DEMOCRATIC TRANSITIONS AND 
BREAKDOWNS RECONSIDERED 


The distinction between democratic, totalitar- 
ian, and authoritarian regimes has been a 
mainstay of comparative political sociology, 
but success in developing a general theory of 
political continuity and change centering on 
this distinction has been slight. A serious 
impediment is the fuzziness in distinguishing 
between democratic and authoritarian re- 
gimes. While Linz (1964, 1975) and others 
have offered sound distinctions between 
totalitarian and authoritarian regimes, so long 
as the former retained something like their 
pristine Stalinist and Hitlerite forms, the 
analytical boundary between authoritarian and 
democratic regimes has been much less 
convincingly drawn. This is reflected by the 
numerous concepts that seemingly combine 
authoritarian and democratic elements: 
authoritarian-populist, single-party, revolution- 
ary, mobilizational, tutelary, and even post- 
authoritarian regimes. Moreover, political 
changes have made the analytical dichotomy 
into a sieve, as numerous countries have 
oscillated between various authoritarian and 
democratic regime forms. 

Reacting to these regime oscillations, com- 
parative political sociology has developed an 
oscillatory pattern. Thus, during the 1950s and 
, early 1960s, when many newly independent de- 
. veloping countries seemed to be consolidating 

democratic regimes, and when most Latin Amer- 
ican countries seemed to be moving toward de- 
mocracy, comparativists were preoccupied with 
theories about the socioeconomic requisites of 
democratic regimes (e.g., Lipset 1959; Cut- 
right 1963), the political cultures in which they 
are ostensibly rooted (Lipset 1963; Almond and 
Verba 1963), and other aspects of political de- 
velopment or evolution toward democracy (see 
Huntington 1968; Huntington and Dominguez 
1975). Beginning in the mid-1960s, however, 
the fledgling democratic regimes of many de- 
veloping countries were overthrown and re- 
placed by authoritarian regimes. During the 
1970s, comparativists thus became preoccu- 
pied with “democratic breakdowns” (Linz and 
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Stepan 1978), the role of the military in politics 
(Nordlinger 1977; Perlmutter 1977), and the 
nature of “bureaucratic-authoritarian” as well 
as other nondemocratic but also nontotalitarian 
regimes (O’Donnell 1973; Collier 1980). Be- 
ginning in the late 1970s, a number of author- 
itarian regimes, particularly in Latin America 
and Southern Europe, again gave way to more 
democratic ones. Accordingly, scholars are now 
preoccupied with “democratic transitions” or 
“transitions from authoritarian rule” (O'Donnell 
et al. 1986; Malloy and Seligson 1987; Needler 
1987; Baloyra 1987). 

We have outlined an approach to the study 
of regime creation, persistence, and change 
that avoids this oscillatory, reactive pattern. 
Thus, any tendency to accord watershed 
importance to the latest regime changes in 
countries with long records of instability 
should be resisted unless compelling evidence 
demonstrates that the necessary elite transfor- 
mations have occurred. Studies of “demo- 
cratic breakdowns,” such as those collected 
by Linz and Stepan (1978), provide much 
evidence of long-standing regime instability 
rooted in continuing elite disunity. But such 
studies fail to observe that, without some 
fundamental transformation to consensual 
unity, any outcome other than breakdown was 
and is unlikely. Similarly, discussions of 
recent transitions from authoritarian to demo- 
cratic regimes in Latin America and Southern 
Europe generally fail to ask whether the 
regime changes were preceded or accompa- 
nied by shifts from elite disunity to unity. 
Again, without such elite transformations, 
these democratic transitions are likely to be 
short-lived swings in regime instability. 

An important step toward recognizing the sig- 
nificance of elite transformations is the recent 
attention paid to “elite pacts.” O’Donnell and 
Schmitter (1986, p. 37) define an elite pact as 
“an explicit, but not always publicly explicated 
or justified, agreement among a select set of 
actors which seeks to define (or better, to re- 
define) rules governing the exercise of power 
on the basis of mutual guarantees for the ‘vital 
interests’ of those entering into it.” Such an 
elite pact roughly approximates our concept of 
an elite settlement, though the latter is more 
comprehensive and multifaceted. The impor- 
tance of elite pacts has been noted in the dem- 
ocratic transitions of Colombia in 1957-58 
(Wilde 1978; Hartlyn 1984), Venezuela in 1958 
(Karl 1986), and Spain in 1977—79 (Gunther et 
al. 1986). O'Donnell and Schmitter (1986, pp. 
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37-39) doubt that such pacts are necessary pre- 
conditions for transition to stable democracy, 
but they think pacts enhance its probability. 
They observe, for example, that two of the three 
countries that escaped the post-1964 wave of 
democratic breakdowns in Latin America—Co- 
lombia and Venezuela—were “pacted democ- 
racies.” In our view, the third country, Costa 
Rica, escaped breakdown because in 1948 its 
elites entered into a settlement, though not a 
formal pact (see Peeler 1985). Moreover, 
O’Donnell and Schmitter notice that even the 
long-lasting, but “unpacted,” democratic re- 
gimes in Chile after 1933 and Uruguay after 
1904 oscillated to authoritarian forms in 1973. 
As we have suggested, those regimes—like the 
French Third Republic and several other long- 
lasting, more or less democratic Western re- 
gimes— were unstable because they did not orig- 
inate in fundamental elite transformations from 
disunity to consensual unity. They rested in- 
stead on fragile coalitions and stalemates among 
basically disunified elites, and their compar- 
atively long durations probably depended on a 
good measure of luck. Sooner or later they were 
likely to break down in the face of a political 
crisis. 
Although the focus on elite pacts and other 
recent elite-centered analyses are steps for- 
"ward, they introduce a potentially trouble- 
some amount of indeterminacy. Thus, O’Don- 
nell and Schmitter (1986, p. 5) stress “the 
high degree of indeterminacy embedded in 
situations where unexpected events (fortuna), 
insufficient information, hurried and auda- 
cious choices, confusion about motives and 
interests, plasticity, and even indefinition of 
political identities, as well as the talents of 
specific individuals (virtu), are frequently 
decisive in determining the outcomes.” Like 
Hamilton’s (1982) penetrating study of the 
breakdown of democracy in Weimar Ger- 
many, O’Donnell and Schmitter (1986, p. 19) 
adopt a position of “structural indeterminacy” 
when analyzing recent transitions from author- 
itarian rule. Similarly, Malloy’s (1987, p. 
237) analysis of recent regime changes in 
Latin America notes that “there is a key 
voluntary dimension to the process that . . . 
precludes neat deterministic theories based on 
general laws." 
Our effort to specify the basic forms of 
national elites, their origins, and their conse- 
quences provides a way around what may 


otherwise be a theoretical dead end. Although. 


more systematic and explicit in its concepts 
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and claims than much current work, our 
approach nevertheless respects the messiness 
of politics and political history. Our concepts 
steer a middle course between grand theory 
and a retreat to local history. We urge those 
who study democratic breakdowns and demo- 
cratic transitions to look first at elites and to 
investigate basic patterns and transformations 
of elite relationships. This approach is 
decidedly not monocausal, however. We 
recognize that religion, class, ethnicity, 
technology, demography, geography, eco- 
nomic development, and the "demonstration 
effects" of other countries’ politics (Bendix 
1978) máy affect elite relationships and the 
forms of political regimes. But we deny that 
such forces lead inexorably to democratic 
transitions or breakdowns. Instead, we see 
national elites as filtering these forces, with 
each type of elite giving a broadly predictable 
thrust to the functioning of political regimes. 

Finally, our approach implies much caution 
about the prospects for stable democracy in 
contemporary developing nations. The strong 
tendency for disunified elites to persist, 
evident in the West almost up to the present, 
and evident today in most Third-World 
countries, calls for a shift in thinking about 
the mechanisms through which stable democ- 
racies are established. Stable democracies do 
not emerge simply by writing constitutions, 
holding elections, expanding human rights, 
accelerating economic growth, or exterminat- 
ing leftist insurgencies. The vital step is the 
consensual unification of previously disuni- 
fied elites. Except in a few instances growing 
out of international warfare, such elite 
transformations have resulted primarily from 
internal situations and contingencies. This 
strongly suggests that Western countries can 
do little to promote stable democracies where 
they do not now exist. Indeed, it may be that 
Western policies—in particular those of the 
United States—have done more harm than 
good, often exacerbating elite disunity and 
thus actually weakening prospects for the elite 
transformations that alone appear to provide 
the basis for stable democracy. 
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This paper redirects debates over the religion-suicide link away from specific 
empirical quarrels to a consideration of Durkheim’s general proposition regarding 
religion’s protective power. We argue that his proposition must be tailored to 
social and historical contexts and that research must specify the underlying social 
mechanism at work. A consideration of historical trends leads to a more detailed 
specification of religions in analyses of contemporary cases, and more importantly, 
to an inductive elaboration of Durkheim’s theoretical underpinnings. Analysis of 
religion’s effects on United States county group suicide rates in 1970 reveals that 
religion continues to affect suicide rates, with Catholicism and Evangelical 
Protestantism tending to lower rates, and Institutional Protestantism tending to 
increase them. The presence of Jewish adherents produces a small but inconsistent 
protective effect. We attempt to account for these results first by examining a 
variety of standard religious typologies and second by examining evidence on 
whether religious affiliation reflects the operation of network ties. Finding this 
evidence suggestive, we move toward a network reinterpretation to clarify and 


elaborate Durkheim’s theory. 


INTRODUCTION 


Almost a century ago, Durkheim provided 
sociologists with a formidable conceptualiza- 
tion of the link between religion and suicide. 
This idea has been subjected to rigorous 
empirical examination and, occasionally, to 
theoretical scrutiny. The empirical debate 
continues, with findings consistently favoring 
neither proponents nor skeptics of Durk- 
heim’s hypotheses, and with little questioning 
of the basic theoretical scheme. Even in 
theoretical efforts, the social mechanism 
underlying the relationship has not been 
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addressed systematically. Both theoretical 
and empirical studies have focused on Durk- 
heim’s specific ideas without seriously consid- 
ering how historical changes might have 
affected his general propositions and how 
these propositions might be translated to 
consider new social contexts. We shift our 
attention back to central theoretical questions 
of how and why religions protect adherents 
from self-destructive impulses. To do this, we 
draw on recent insights from the sociology of 
religion, particularly analyses of culture, 
cultural change, and community and unite 
these ideas with an overarching network 
perspective. We first review Durkheim’s 
ideas and modify the research questions posed 
in empirical studies. Using the United States 
in 1970, we then examine the influence of 
religion on suicide rates. Finally, we investi- 
gate the ability of a network perspective, as 
well as competing alternatives, to explain the 
complex pattern of religion's effects. 


DURKHEIM, RELIGION, AND 
CULTURAL CHANGE 


Durkheim (1951 [1897]) accepted the finding 
that more Protestants commit suicide than 
others, dismissing the influence of dogma and 
the greater morality of minority religions in 
favor of an explanation that contrasted 
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Protestant free inquiry with Catholic emphasis 
on unquestioning acceptance of beliefs and 
rituals.! He located the key to this difference 
in dramatic societal changes in the. late 
19th-century society. Protestantism developed 
as a religion that responded to “modern” 
society by loosening its hold on members’ 
collective lives, thus forfeiting its ability to 
restrain self-destructive impulses. Durkheim’s 
general proposition conceptualized ex- 
tremes—very weak integration (egoism) or 
overly strong integration (altruism) produces 
suicide (see Smelser and Warner 1976, 
Chapter 10). In Suicide, Durkheim saw 
religion only as integrative but in later works 
(e.g., 1961 [1915]) described religion as 
having regulative aspects. Confusion over the 
relative and independent roles of integration 
and regulation has led some scholars to argue 
that there is no difference between the two 
(e.g., Johnson 1965). To anticipate our later 
discussion, religious or other network ties 
have both regulative and integrative aspects. 
A network perspective allows a dual concep- 
tualization of separate functions of social ties. 

Durkheim’s analysis was located in the 
particular context of rapidly industrializing 
societies and often newly established nation- 
states (e.g., Italy by 1870; Germany in 1871). 
In Germany under the chancellorship of 
Bismarck, in Italy under Cavour before 
World War I, and in France during the early 
years of the Third Republic, state officials 
attempted to reduce the prerogatives of the 
Catholic Church. On its side, the Catholic 
Church adopted a stand against “modernity” 
in all its forms. In the context of great social 
change, the solace and guidance provided by 
Catholicism’s antimodern stance was protec- 
tive. 

Religion no longer enjoys the cultural 


! Early work did show a marked difference in 
suicide rates of Catholic and. Protestant countries in 
Western Europe (e.g., Masaryk 1970 [1881]) as 
did later work on the United States (e.g., 
Porterfeld 1952). During the fifties and early 
sixties, the claim that Catholics commit fewer 
suicides than Protestants was touted as the closest 
sociological approximation to scientific law (e.g., 
Merton 1967). This was called into question on 
theoretical and methodological grounds in the late 
sixties and early seventies (e.g., Pope 1976; Selvin 
1965) and was dismissed empirically in the 
seventies and eighties (e.g., Pope and Danigelis 
1981; Stack 1980, 1981; see Breault 1986; Maris 
1981; Stark et al. 1983 as exceptions). 
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hegemony once forged between Roman Ca- 
tholicism and the medieval state or between 
the Anglican church and the Georgian British 
state (see Blackstone 1962, vol. 4). New 
social circumstances bring to the fore dilem- 
mas of "contextualization" —conscious or 
unconscious alignment of biblical teaching to 
cultural norms. All religions contextualize to 
be relevant —relating invariant "truths" to the 
changing realities of people's lives (Wells 
1984). The rise of new religions or the 
shifting appeal of established ones reflects 
this process (Hunter 1987; Roof and McKin- 
ney 1987). Fundamental shifts have occurred 
in the relationship between religion, culture, 
and the state, particularly in America. Accord- 
ing to Marsden (1980), 1870 America saw 
itself as a Christian (i.e., Protestant) nation. 
By the 1950s, with the defeat of Fundamen- ` 
talists in the 1920 over issues of "creation" 
and "evolution," a broadened Protestant- 
Catholic-Jewish accommodation emerged (Her- 
berg 1960; Hunter 1987). The erosion of 
Protestant cultural hegemony culminated in a 
"seismic shift" (Marty 1979), a "collapse of 
the middle," a “collapse of the bridge 
between private and public faith" (Roof and 
McKinney 1987, pp. 9, 29). Roof (1978) 
contends that the root of this crisis in the 
1960s and 1970s was cultural, not theologi- 
cal, but some religions necessarily suffered 
consequences of their contextualization 
choices. Liberal Protestantism had misinter- 
preted the evangelical defeat in the 1920s and 
became more “culture-affirming.” The evan- 
gelical surge, beginning after World War II, 
represented a response to this religiocultural 
realignment, to the breakdown of established 
patterns and a climate of moral ambiguity, 
and to failures in political, economic, and 
cultural integration (e.g., Douglas 1983; 
Kelly 1972; Wuthnow 1982). 

Catholicism and Judaism, historically out- 
side American cultural hegemony, also re- 
sponded to these cultural shifts. Although 
Herberg (1960) interpreted Catholicism as 
suffering from the results of secularization, 
Greeley (1977) noted the rise of a new, 
self-styled Catholicism with adherents loyal 
to tradition, its immigrant heritage, and its 
ethnic communalism. Yet these “communal” 
Catholics appear to be more like questioning 
Protestants than the blind acceptors of faith 
Durkheim described in Suicide. Contempo- 
rary American Judaism also departs from 
Durkheim's description of its all-embracing 
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way of life. "Secular" Jewish associations 
(e.g., local federations for welfare, education 
and cultural activities) as well as the congru- 
ity of ethnic and religious ties bolster 
Judaism's strength. Yet, as Steinberg (1973) 
suggests, its protective power may not extend 
to third-generation Jews who lack traditional 
ties based on religious heritage. 

In sum, three sociohistorical trends (secu- 
larization, ecumenicalism, and evángelical re- 
vival) have altered the relationship between 
religion and society. Suicide studies failing to 
find religion's protective influence have relied 
on the notion of secularization (e.g., Pope and 
Danigelis 1981; Stack 1980). Secularization 
theorists predicted the inevitable demise of re- 
ligion because of its fundamental incompati- 
bility with modern society and its resulting 
inability to address individual concerns.? An- 
alysts of the ecumenical trend concluded that 
no meaningful differences exist between reli- 
gions. They interpreted the ecumenical move- 
ment of the sixties as "homogenizing" main- 
line Western religions and the cultural outlook 
of Americans (Alba 1981; Herberg 1960). Yet 
Collins (1982, p. 31) noted that the “expec- 
. tation of religion crumbling under the advance 
of rationality has not panned out." Not only 
have mainline religions endured (some even 
showing a resurgence, but “counterculture” 
religious cults have appeared, many showing 
surprising longevity. Moreover, fundamental 
and evangelical Protestant denominations that 
compromised least with the spirit of ecumeni- 
calism have experienced large growth in num- 
bers (Kelly 1979, pp. 338-40; Marsden 1980, 
pp. 202-12). 

These trends suggest that important differ- 
ences have emerged among United States 
Protestant groups and that less diversity exists 
between some Protestant denominations and 
Catholicism or Judaism. If this is the case, 
comparing “Protestants” to Catholics (or 
areas with Protestants and Catholics) holds 
little meaning in the context of the Durk- 
heimian theory. There is little reason to 
suspect that Catholics should differ from 
Protestants in certain denominations. Such a 
finding would not represent a flaw in 
Durkheimian theory; rather it would challenge 





? Sociologists of religion have acknowledged for 
at least a decade that the secularization hypothesis 
was in error; yet the argument continues to be used 
in sociological studies of suicide (Hout and 
Greeley 1987; Hunter 1987; Marsden 1980). 
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the utility of applying theoretical ideas to 
explain religion's impact in one sociohisto- 
rical context to a different era. It would also 
require a reexamination of the reasons 
underlying differences in religions' protective 
or aggravating effects on suicide. l 

With this background, we examine the 
relationship between religious affiliation and 
suicide in this paper. Some deviations from 
traditional findings appear. Catholicism and 
some Protestant denominations exert a protec- 
tive effect, while other Protestant groups 
increase the suicide rate. We attempt to 
account for these departures in terms of 
standard theoretical differences in religions 
(theological conservatism, ecumenicalism, 
church governance, cultural tension). The 
data are reanalyzed accordingly, but standard 
typologies fail to provide adequate empirical 
explanations. In search of a more adequate 
explanation, we focus on the religious 
“community,” specifically the degree to 
which religions vary in terms of adherents’ 
participation and network contacts. Finding 
this direction promising, we recast Durk- 
heim’s theory in network terms. We argue 
that Durkheim’s propositions can be ampli- 
fied and clarified with current structural as 
well as sociopsychological ideas from net- 
work theory. 


DATA, MEASURES, AND METHODS 


The empirical examination that follows from 
these concerns represents a two-step process. 
First, we analyze areal data to demonstrate 
the continued relevance of religion to under- 
standing suicide in the United States and to 
see whether sociocultural changes produce 
different effects for various Protestant denom- 
inations. Second, we use the results of this 
analysis to explore further the theoretical 
meaning of denominational effects. The 
“first-stage” results on religion-suicide net 
effects are used as “data” for a denominational- 
level analysis. We discuss the details of the 
first analysis only, deferring discussion of the 
particulars of this second set of analyses. 

In order to match information from various 
sources and to minimize errors in the data set 
as a whole, we use the county-group as the 
unit of aggregation (N= 404; D.C., Alaska, 
and Hawaii excluded).? Data within a two- 





>The county group is a Census Bureau 
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year time period for all sources were available 
-only for 1970. Suicide rates come from the 
National Center for Health Statistics (1972). 
We limit the analysis to white adults because 
of the unreliability of suicide rates and 
religious affiliation for blacks (Breault 1986). 
We recorded the suicide rate (per 100,000; 
X = 10.19, s.d. =3.37) and matched informa- 
tion from the National Council of Churches 
study using Census Bureau translation codes 
(Johnson, Pican, and Quin 1974). These NCC 
data are considered to be among the most 
comprehensive and reliable religious affilia- 
tion data on the United States. Information on 
Jewish affiliation comes from the American 
Jewish Yearbook (1977). National Jewish 
census figures are recorded by city (for cities 
with more than 500 Jews) and by state. In the 
few cases in which no information was 
available, we used the statewide average as an 
estimate. We were not able to locate 
nationwide disaggregated information on 
“cult” groups, some community religions, or 
atheist groups (e.g., the Unification Church, 
the Amish). The collection of data reflects the 
institutionalization of religious groups, and 
our analysis is so restricted.4 

We retained as much detail in Protestant 
denominations as possible. Because of the 
very small size of some of the 52 denomina- 
tions for which information was available and 
because of coefficient instability in early 
analyses, we combined religious groups with 
fewer than 50,000 members nationwide into 
theologically and theoretically similar denom- 
inations (e.g., the Northern and Southern 





aggregation of adjacent counties into groups of 
250,000 or more population. This choice reduces 
the instability of rates in phenomena of low 
occurrence such as suicide. The complex issue of 
aggregation unit involves an inevitable trade-off. 
As the level of aggregation becomes more remote 
from the individual, the homogeneity assumption 
within aggregation units must be evoked. How- 
ever, as the aggregation unit increases, data are 
thought to be more accurate (i.e., measured with 
less error, Hanushek and Jackson 1977; see 
Pescosolido and Mendelsohn 1986, p. 85, for a 
more detailed discussion). 

* The issue of the “electronic church” (basi- 
cally, t.v. evangelicalism) became important after 
our analytic time frame. Not having cult informa- 
tion would be critical in those few county groups 
(e.g., the county group in Oregon where Wasco 
county and the Rajneeshpuram religious commu- 
nity were located). 
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Provinces of the Moravian Church were 
combined but not all Baptist denominations) 
using a combination of guides from literature 
in the sociology of religion (e.g., Melton 
1978) and informed judgment. The three 
Catholic denominations that represent re- 
gional variants were collapsed, and Jews were 
included as a separate group (no breakdown 
of Reformed, Conservative, and Orthodox 
membership is available).5 This classification 
resulted in 27 religious groups (25 of which 
are Protestant; detail available on coding). 
We operationalized the religion variables 
using the county group population as a base 
of ratio variable (information on alternative 
operationalizations used on request). Follow- 
ing Firebaugh and Gibbs (1985), we use a 1/Z 
(where Z is the county group population) to 
control for possible biases in the use of ratio 
variables. 

In the course of the data analysis, we 
considered whether the geographic and socio- 
economic clustering of some religious groups 
masked or exaggerated religion's influence on 
suicide. Previous suicide research suggests 
that religion-suicide effects may be con- 
founded by factors such as divorce, migration 
or socioeconomic status (Stack 1985). Hoge 
(1979) argues that any effect of types of 
religions simply masks social status differ- 
ences. Based on these considerations, data 
availability, and preliminary analyses, we 
included the following county-level variables 
as controls in the "religious effects" equa- 
tions: average income and education to 
contro] for social status; migration and 
divorce rates to control for confounding 
factors cited above; and the percentage female 
and average age to control for age-gender 
differences in suicide (Gibbs 1961). Social 
constructionists suggest that religion may play 
an important role in the recording of suicide 
rates which, in turn, hinders any attempt to 
study religion’s protective influence. We used 
one strategy suggested by Pescosolido and 
Mendelsohn (1986) to control for possible 
biases induced by recording procedures and 


$ Theoretically, the unavailability of detailed 
Jewish data represents a concern. We would expect 
that Orthodox Judaism (like evangelical Protestant- 
ism) would exert an effect on network ties and 
suicide rates different from the effect of Reform or 
Conservative groups. However, since orthodox 
groups represent a minority, this data limitation is 
unlikely to significantly affect our overall findings. 
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personnel. We focused specifically on organi- 
zational characteristics — the proportion of the 
county group population covered by medical 
examiner systems, holding inquests, having 
toxicology and pathology services available, 
and being served by an elected (as opposed to 
appointed) official. Finally, we considered the 
impact of regional (Northeast, South, Mid- 
west, and Southwest; West omitted) and 
urban/rural density) differences (see e.g., 
Fischer 1982, Maris 1981 on urbanization; 
Marsden 1980; Roof and McKinney 1987 on 
region). For controls, we drew information 
from two sources, the 1970 Public Use 
` Sample (1972) and the Kornblum and Fisher 
(1972) report on medicolegal systems (see 
detail in Appendix Table A). 

To calculate the effect of religion on the 
suicide rate, we used ordinary least squares 
regression of the general form indicated in 
Equation 1: 


S; = ag + b, Religion; + b; Religion... . 
cb Religionz; + c, Control 
+ c2 Control; . . .+ c 12 Control;2 


+ 2 d; Region 
+ fi Statistical Control + ej 


All observations are weighted by the square 
root of the county group population to correct 
for heteroscedasticity (Pescosolido and Men- 
delsohn 1986). Since all available religions as 
a proportion of the county group population 
are included, the proportion of the population 
affiliated with other religions or no religion 
forms the omitted category. Because we 
included all county groups, the calculated 
coefficients represent population parameters 
(not estimates). Tests of statistical signifi- 
cance do not hold their usual import. All 
effects are “real”; significance simply pro- 
vides a conventional way of highlighting 
large or consistent effects across county 
groups. A number of regression models were 
estimated in order to examine coefficient 
sensitivity to different specifications and to 
see if “trimmed” specifications were appro- 
priate. The actual values of the coefficients in 
the trimmed models reported changed little 
from the fuller models. 

Great differences in the presence of reli- 
gions across county groups present a problem. 
The theoretical importance of preserving de- 
tail in Protestant denominations is likely to 
inflate the coefficient values of minority reli- 
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gions. We caution against the metric interpre- 
tation of unadjusted coefficients. Since our re- 
sults using rate or proportion variables as 
independent and dependent variables represent 
point elasticities of Y (suicide rate) with respect 
to X (religious affiliation), meaningful coeffi- 
cient interpretation requires an adjustment (Stol- 
zenberg 1979). We opt to transform coeffi- 
cients in line with their variances, deflating them 
by the religious groups’ proportions in the United 
States population. Adjusted coefficients repre- 
sent the average contribution of the group to the 
suicide rate, that is, religion’s influence if it 
were present in typical share. 


RESULTS 


Table 1 presents the analyses using Catholic, 
Jewish, and detailed Protestant denomina- 
tions. The results include regression analyses 
of models with the religion variables alone and 
with social controls. We report the unstandar- 
dized regression coefficients, their standard er- 
rors (in parentheses), and, recalling these co- 
efficients represent population parameters and 
not estimates, significance tests for conven- 
tion and for locating particularly large or con- 
sistent effects. Table 1 also presents adjusted 
coefficients. The impact of sociodemographic, 
social construction, and region/density con- 
trols is not presented since they are not di- 
rectly relevant to theoretical tasks at hand. In 
all reported results, the inclusion of sociode- 
mographic controls generally did not change. 
the influence of religion coefficients, but they 
did exert important and sizable influences on 
the suicide rate. The effects of these controls 
are in line with the substantive findings pre- 
sented in an earlier analysis (Pescosolido and 
Mendelsohn 1986). Only a few religious co- 
efficients changed dramatically in both sign 
and magnitude with the addition of the region 
control, suggesting that the influence of reli- 
gion operates differently in geographical areas 
of the United States.9 l 





$In sum, the effects of increased divorce, 
income, education, migration raise the suicide rate. 
The density control for urbanization reveals that 
dense county groups have lower, not higher, rates 
of suicide, as Fischer (1982) but not others (Roof 
1978) would predict. The statistical control (1/Z) 
has little effect but the percent female has a 
positive effect, emphasizing ecological data prob- 
lems since individual-level studies consistently 
find that women have lower rates than men. 
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Table 1. Regression Analysis of Religious Variables and Controls on the Suicide Rate for U.S. County Groups 








(N= 404), 1970* 
Without controls With controls* 
Independent Variables b (0 b (0 Adjusted b 
Religious Affiliation = 
Institutional 
Presbyterian 6.63 (.57) 7.78 (.84) .187 
Evangelical 
Presbyterian 90.66 (1.43) 52.8) (.98) .005 
Institutional 
Baptist —12.14 (—.75) —2.15 (—.16) —.019 
Evangelical 
Baptist —6.99* (—2.67) —4.95* (—1.93) —.416 
Institutional 
Lutheran 2.88 (.59) 1.65 (.40) .049 
Evangelical 
Lutheran —2.55 (— .32) 53 (.07) .008 
Institutional 
Methodist — 14.92* (—2.38) 5.95 (1.11) 381 
Evangelical 
Methodist 71.71 (.23) —251.55 (— 1.03) — .075 
Congregational 30.03 (30) 46.93 (.63) .047 
Episcopalian 10.99 (.58) 38.78* (2.36) 582 
Unitarian 51.14 (.37) —25.76 (—.22) — .026 
United Church of Christ —3.98 (—.30) 15.63 (1.49) .187 
Church of God —79.44* (- 1.81) — 24.63 (- .67) — .098 
Reformed Churches — 28.89* (—2.43) —17.21* (— 1.88) —.052 
Brethren Churches 18.69 (.49) —.39 (—-.01) — 001 
Mennonite —74.67 (—1.43) —48.59 (—1.23) —.044 
Moravian 56.87 (.90) 12.44 (.26) .005 
Disciples of Christ 17.49 (.68) —2.90 (-.14) —.017 
Church of Christ 22.23 (.88) 16.46 (.83) .099 
. Nazarene 13.91 (.29) — 98.00" (—2.54) — .490 
Evangelical Covenant 32.18 (.17) — 191.03 (— 1,34) — .076 
Quaker —92.87 (—.86) 59.62 (.72) .060 
Pentecostal —39.97 (—.34) 65.99 (.69) .033 
Salvation Army 37.87 (.38) 72.13 (.95) .144 
S-D Adventists 194.84* (3.44) — 30.28 (— .64) — .090 
Catholic —9.73* (—5.64) —3.33* (—1.96) — .680 
Jewish —3.98 (— 1.00) — 1.90 (-.57 — .036 
Constant 13.01 — 11.76 
R? .21 58 
S.E.E. 3.11 2.32 
Overall F 3.71 11.29 
(a3.01) (a3.01) 


* Coefficients significant at a<s.05, two-tailed test. 
* Coefficients significant at aS.10, two-tailed test. 
* Full list of controls in Appendix, Table A. 


Overali, the results reveal that religion 
continues to influence the suicide rate. Even 
with controls, some religion effects are large 
and retain the same sign. A few religions 
consistently exert a relatively large or consis- 
tent influence. County groups with higher 
percentages of Catholics, Evangelical Bap- 
tists, and Nazarenes report lower suicide 
rates. The protective influence of the Re- 
formed Churches appears to be consistent but 
is relatively small. For the adjusted coeffi- 
cients, Catholicism, on average, exerts the 





greatest protection. Judaism continues to be 
protective but is neither large nor consistent 
across county groups. Equally important 
theoretically, not all religions exert a protec- 
tive influence. The presence of Episcopalians, 
for example, has a large aggravating effect. 
The most important patterning of Protestant 
results comes from rank ordering the adjusted 
b’s. The Protestant denominations whose 
presence increased the county group’s sui- 
cides are mostly “mainline,” “mainstream,” 
or “old-line” (Episcopalian, Institutional Pres- 
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byterian, Institutional Methodist, United 
Church of Christ). In fact, three of these form 
the “heart” of liberal Protestantism (ex- 
cluding Methodists; Roof and McKinney 
1987). Conversely, Protestant denominations 
whose presence exert the most protective 
influences are mostly evangelical: Nazarene, 
Evangelical Baptist, Seventh Day Adventist, 
Church of God. Of the four denominations 
that have split along evangelical/institutional 
lines, the evangelical branch always has a 
more protective (or at least a less harmful) 
influence. Specifically, for Methodists, areas 
with higher percentages of institutional 
branches report higher rates while areas with 
a greater share of evangelical counterparts 
report a lower rate. Any branch of Presbyte- 
rianism or Lutheranism increases the suicide 
rate, but the presence of institutional groups 
has a great aggravating effect. The situation is 
similar for Baptists—areas with higher per- 
centages of either type report lower suicide 
rates, but the coefficient for Evangelical 
Baptists is much larger (i.e., more protective) 
than for Institutional Baptists. In sum, these 
results hold both theoretical and methodolog- 
ical import for studies of suicide and religion. 
They show that religious affiliation is associ- 
ated with suicide rates in contemporary 


American society. And they indicate that the: 


effects are more complex than Durkheim’s 
theory or empirical research derived from 
Durkheimian ideas would suggest. Specifi- 
cally, although Catholicism continues to exert 
a protective influence over suicide rates, 
some Protestant denominations, predomi- 
nantly of the evangelical type, do so as well, 
while many of the Institutional Protestant 
denominations increase suicide rates. Durk- 
heim’s specific hypotheses on the protective 
influence of particular religions are, at best, 
only partially supported. Yet his fundamental 
propositions can provide insight for the 
pattern of results. 


THEORETICAL EXPLANATIONS 


Our work was motivated by the apparent 
connection between Durkheim’s ideas in 
Suicide, current ideas in the sociology of 
religion, and network theory. As Marty 
(1976) claims, what distinguishes religions in 
the United States today is social behavior, 
what people do, not just what they believe. 
Churches are “natural communities” depen- 
dent upon factors such as member participa- 


39 
tion and socialization of initiates by members 
(Gustafson 1961). We propose that network 


theory provides a clue to synthesizing these 
ideas with the dimension of “integration” 


found in Durkheim’s original formulation. 


Durkheim’s notion of the centrality of social 

integration in understanding suicide corre- 

sponds to the primary starting point of 

network theory: the nature of social relations 

(or the social structure) influences individu- 

als’ attitudes, beliefs, and behavior (see . 
White and Boorman 1976, p. 1442). Reli- 

gious solace results from social interaction 
atid depends on the strength of community of 

network ties, the “belonging” aspect of 

religion (Fischer 1982; Roof and McKinney 

1987; Tipton 1982). In Suicide, Durkheim 

places religion only on the integration dimen- 

sion, deemphasizing its role in regulating 

social behavior. We maintain this theoretical 

distinction through a network perspective by 

conceptualizing integration and regulation as 

two aspects of social networks. 

Before we proceed to an explication of this 
network approach, we need to consider 
alternative approaches to understanding the 
differences among Protestant denominations 
and their link to the broader network 
perspective. Sociologists of religion have 
attempted to capture the essence of Protestant 
differences through typologies classifying 
religions along critical theoretical dimei- 
sions. Table 2 presents a reanalysis in which 
each of the original 52 Protestant denomina- 
tions was assigned to a liberal or conservative 
category (and, where possible, to a more 
nuanced conservative, moderate, liberal split; 
coding details on request; see Panels 1 and 
2). As we saw in Table 1, Catholicism and 
Judaism exert a protective influence although 
the influence of Judaism is smaller.” These 
categories appear to capture broadly the 
patterns of the detailed analysis. Areas with 
higher percentages of liberal (and to a lesser 
extent, moderate) Protestant groups report 


higher suicide rates. Those with more conser- 


vative adherents report lower rates. 
If we had concluded the analysis at this 


e hes ambiens de interpreted relative to the 
omitted category. With the exception of Panel 1, 
the makeup of this omitted category changes 
depending on the number of religions that can be 
classified in the religious typology. We would 
expect, then, the effeet of Catholicism and Judaism 
to shift accordingly. i 
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Table 2. Regression Analysis of Religious Categoriza- 
tions on the Suicide Rate for U.S. Country 


Groups, 1970* 
Adjusted 
Religious Categories* b (0 b 
PANEL 1 
Protestant 
Liberal 6.53* (2.87) 1.08 
Conservative ~1.37 (—.78) —.18 
Catholic —.96  (-.71 —.19 
Jewish —2.4  (-.T7T) —.04 
Constant —7.21 
R? .54 
Overall F 21.72 
Significance a<.0l 
PANEL 2 
Protestant 
Liberal 7.77* (1.68) 61 
Moderate 3.70 (.94) 20 
Conservative —1.69 (—.99) —.20 
Catholic —.94 (—.66) —.19 
Jewish —2.8  (-.87) —.05 
Constant — 7.80 
R? 54 
Overall F 20.23 
Significance a«.01 
PANEL 3 
Protestant 
Ecumenical 7.95* (2.99) 1.04 
Nonecumenical ~ 1,53 (—.90) 17 
Catholic —.64  (-.46) 13 
Jewish 2.21 (— .66) .04 
Constant ` —9.88 
R? 54 
Overall F 21.80 
Significance a<.01 
PANEL 4 
Protestant 
Low Tension 9.54* (2.41) .94 
Medium Tension 5.93* (2.00) .38 
Medium High Tension —.89 — (—.42) —.09 
High Tension —.57  (-.23) —.01 
Catholic 31 (—.22) 06 
Jewish ~2.34  (-.67) — .04 
Constant — 7.08 
R? 55 
Overall F 19.87 
Significance a<.0l 
PANEL 5 
Protestant 
Episcopal Policy 8.06 (1.78) 64 
Presbyterian Polity 5.22* (2.09) 36 
Congregation Polity —1.22  (-.54) —.13 
Catholic —.86  (-.60) -.17 
Jewish —3.14  (-.95) — .06 
Constant -7.25 
R? .54 
Overall F 20.51 
Significance a«.01 


* Coefficients significant at a.05, two-tailed test. 








+ Coefficients significant at a.10, two-tailed test. 
* All controls included: see Table A, Appendix. 
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point, we might have been tempted to suggest 
that our findings could be understood with 
reference to differing theological positions 
toward suicide, with “conservative” religions 
being more strict on their prohibitions. Two 
factors have convinced us otherwise. First, 
the positions of "conservative" and "liberal" 
religions are less differentiated than sup- 
posed. We contacted representatives of the 27 
religious bodies in the detailed analysis and 
inquired about official "position" towards 
suicide. Only two groups (Catholics and 
Reformed) claimed to have an official policy 
(the sixth commandment and a recent theolog- 
ical paper on suicide, respectively). All 
others, from a Unitarian minister to a 
Pentecostal pastor, claimed no "official 
statement" other than the same command- 
ment. Some conservative representatives called 
suicide a "sin" (as did the Orthodox rabbi), 
but all called it "forgivable" and suggested 
that dogma matters less than interpretations of 
particular cases (details on request). In short, 
if "conservative" Protestant churches are 
more protective against suicide, the reason is 
not one of dogma. The network approach we 
consider here presents a plausible resolution 
to the arguments about the role of dogma. 
The nature of beliefs and practices associated 
with suicide have changed. What separates 
religious groups is the degree to which 
practices conform to original dogmatic prohi- 
bitions. This depends fundamentally on the 
nature of the religious community and on ties 
that bind individuals to it. Moral rules must 
be backed by a cohesive community to 
endure; without it, beliefs, loyalties, and 
identities erode. We do not dismiss the 
importance of beliefs nor claim a causal 
ordering between beliefs and networks. As 
Roof (1978, pp. 147-50) points out, to 
suggest that beliefs and values are deeply 
rooted in people's social networks is not to 
argue that the former are “epiphenomena,” 
but to stress that social ties play a substantial 
role in their adoption, retention, and modifi- 
cation (see also Pescosolido 1986; Wood 
1983). 
Second, we continued to explore alterna- 
tive, conventional explanations offered by 
sociologists of religion: the ecumenical/non- 
ecumenical split (Kelly 1979); hierarchy in 
church governance— presbyterian, episcopal, 
and congregational (not to be confused with 
denominations of the same name; Quinley 
1974); and tension with the surrounding 
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sociocultural environment—low, medium, 
medium-high, high (Johnson 1963). These 
criteria suggest historical, formal organiza- 
tional or cultural influences as differentiating 
Protestant denominations and suggest alterna- 
tive reasons why suicide rates would vary 
among them. For example, if cultural tension 
picks up the pattern of detailed effects, the 
importance of the greater morality of minority 
religions receives support, Durkheim’s (1951) 
original protestations to the contrary. In Table 
2 (Panels 3, 4, and 5), we find that each 
typology more or less faithfully captures the 
pattern of results. If a religious denomination 
is liberal (or to some extent, moderate) or 
ecumenical or low or medium tension or has 
presbyterian or episcopal governing style, it 
fails to protect. Areas with a greater share of 
their conservative, nonecumenical, high- 
tension counterparts report lower suicide 
rates. 

We seem to be presented with an embar- 
rassment of possible explanations. Each 
typology highlights a different aspect of 
religion as the critical.dimension and provides 
an equally likely candidate for building a 
theoretical explanation. These typologies work 
equally well in capturing the detailed reli- 
gious patterns, in part because they are 
correlated —liberal religions tend to be ecu- 
menical, in agreement with prevailing cultural 
ideas, and organized in more formal style. 


Network theory, we argue, provides a solu- 


tion to these correlated explanations. Each 
conventional typology taps an underlying 
dimension critical to contemporary American 
religion: the nature and strength of social 
bonds. Sociologists of religion have found 
that conservative Protestants are more likely 
to participate in religious activities and to 
name fellow congregation members as best 
friends (Stark and Glock 1973). Mainline, 
liberal, or ecumenical Protestants are often 
referred to as "dormant" because studies find 
that they do not attend church as frequently 
and often do not know one another (e.g., 
Kelly 1972). The hierarchical structure of 
liberal denominations translates into a passive 
role for members (Quinley 1974). These 
scholars contend that organizational structure 
replaces the close community ties of smaller 
religions, drawing less member participation. 
There has been a weakening of socioreligious 
group bonds in liberal churches while conser- 
vative churches remain "primary groups with 
a great deal of face-to-face association" 
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forming "some of the most cohesive non- 
ethnic communities in the United States" 
(Roof and McKinney 1987, p. 94; Tipton 
1982). : 

We can examine suggestive evidence of 
whether religious affiliation reflects the oper- 
ation of network ties. Given suicide's low 
probability and the inability to locate victims 
beforehand, we have little opportunity to 
proceed in a direct fashion.5 Using data from 
a number of social surveys, the adjusted 
coefficients presented above in Table 1 can be 
used in tandem with descriptive information 
on religious participation and networks.? In 
all following analyses, denomination (not 
geographical area) becomes the unit of 
analysis. In analyses not reported here, we 
find that across religious typologies, partici- 
pation systematically varies in religious com- 
munities in a fashion consistent with network 
expectations. These results tend to be consis- 
tent across the variety of data sets available 
and to hold for a range of informal network 
ties. For example, adherents to Judaism or to 
liberal, ecumenical, low-tension Protestant 
religions with episcopal polity style are less 
likely to attend services on a weekly basis 
than are adherents to Catholicism or conser- 
vative, nonecumenical, medium-high or high- 
tension denominations with congregational 
polity style. A more direct indicator, the 
percentage of respondents in each religion 
who report no friends of the same religion, 


* This is a less desirable approach precisely 
because the historical transition described in 
classical theory makes categorical boundaries 
poorer approximations of individual or group 
networks (see White et al., 1976). However, 
suicide removes relevant actors from networks and 
leaves only the possibility of ad hoc reconstruc- 
tions. Further, the network data would need to be 
collected not only about the suicide's ties but from 
the appropriate comparison group—all individuals 
in the population. 

? Our criteria for selecting surveys required a 
survey to sample a general population, record 
Protestant denominational affiliation in 'some 
detail, and include some participation or network 
questions. Five data sets, all available through 
ICPSR, fit these requirements: Laumann's (1966) 
and Reynolds and Schuman's (1976) studies of 
Detroit; Davis and Smith's (1987) pooled sample 
of the General Social Survey; Campbell and 
Converse's (1978) national study of the quality of 
American life; Fischer’s (1977) study of Northern 
California. 
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2% attending weekly 
€ % of same religion 


96 
' Figure 1. Adjusted suicide-religion effects by the percentage of respondents in each religion attending church weekly 
(r = —.75*, —.88* excluding Jews) and having spouse of the same religion (r = —.77*; —.90* 
excluding Jews). 


appears to lend support to this. While only 10 
percent of Catholics report a lack of religiously- 
similar friendship ties, almost half (43.9 to 46 
percent) of adherents to liberal, ecumenical, 
low-tension, or episcopal polity religions do. 
Conservative counterparts are much less 
likely to report no such friends across 
religious typologies (e.g., 29.5 to 32 percent; 
details on request). 

These results help.build an empirical bridge 
between typologies and the network explana- 
tion. À more fine-grained examination of the 
relationship between denominational net- 
works and suicide plots the relationship 
between the religion-suicide coefficient and 
the behavior of individuals in corresponding 
Protestant, Catholic, and Jewish religions.!9 
Across available surveys, religions with the 
greatest attendance by members at services 
have the greatest protective influence on 
suicide. Figure 1 is provided as an illustration 
of general trends. It shows that the protective 


19 We examined a large number of plots, coding 
the religious information in a variety of ways. 
Basically, individuals’ religious affiliation was 
cross-tabulated with participation or network ques- 
tions. For example, the percentage of Episcopa- 
lians, Jews, and Catholics who attend church 
weekly was coded from the cross-tabulation of 
religious affiliation and frequency of church 
attendance. The dependent variable is always the 
adjusted net effect coded from Table I. Across 
surveys we were able to construct over 30 such 
variables. Detail on plots and correlations available 
on request. Because the outlier point most often 
represents Judaism, we ran the correlations with 
and without this group. 


effect, defined as the net adjusted effect of the 
religious denomination on suicide from Table 
1, is greater as more individuals in the 
denomination participate weekly (from Rey- 
nolds and Schumann data). This result is 
consistent across surveys with attendance data 
but ranged in magnitude (particularly in the 
GSS where the smallest religions tended to 
cluster around the zero point of the Y-axis). 
Figure 1 also indicates that denominations 
with greater religious homogamy are most 
likely to show protective influence (from the 
GSS data; in the Laumann data not shown, 
r= —.37, —.52). Similarly, in plots not 
presented here, we find denominations whose 
adherents report a greater average number of 
friends and relatives of the same religion have 
the greatest protective effect (r= —.67+, 
— .89* excluding Jews; from Laumann data). 
Where religion-specific information was pro- 
vided, its importance for forming network ties 
appears to be supported. 

Only informal ties and those specifically 
religiously-based display these effects. Indica- 
tors of more general network ties (e.g., 
density measures, visiting patterns) are rarely 
large or even moderate or consistent in sign. 
Formal network ties, unlike those reported 
above, tend to be correlated with aggravating 
effects. Denominations whose members have 
a higher average number of organizational 
memberships are most likely to report aggra- 
vating effects (r= .52, .70* excluding Jews 
in the Fischer data set; also found in GSS and 
Laumann data). Our findings, then, do not 
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appear to be by-products of general ties.!! 
The distinction between the “churched” and 
the “unchurched” becomes the critical dimen- 
sion. The key issue is not whether individuals 
formally identify themselves as having a 
religious affiliation but whether they actually 
become part of the church or temple commu- 
nity. This participation in networks varies in 
ways consistent with the protective or aggra- 
vating effect of religions on suicide. These 
attachments, rather than church membership 
per se, represent a more subtle but significant 
influence than is often realized (Roof 1978). 


RECASTING THE THEORY: TOWARD A 
NETWORK CONCEPTUALIZATION 


We now return to Durkheim's ideas on reli- 
gion. Religion protects because it is a “so- 
ciety"; Protestantism provides less protection 
because it is a less integrated society (1951, 
pp. 169-70). Durkheim refers not to one so- 
ciety but to many — domestic and political so- 
ciety, for example. But this reliance on “so- 
ciety,” Tilly (1984, pp. 27-28) argues, 
weakens the power of sociological explana- 
tions. He rejects the focus on such “fictitious 
entities" in favor of social relationships, social 
ties and social networks established through 
the sharing of social characteristics. Three the- 
oretical suggestions drawn from the network 
tradition provide greater clarity to Durkheim's 
explanation and to discussions of the utility of 
the reformulated Durkheimian theory in con- 
temporary society. 

First, if we replace “society” with “net- 
works” in the above summary of Durkheim's 
ideas, the notion of the multiplicity of social 
arenas becomes clearer. To borrow from 
Simmel's (1955) network imagery, an individ- 
ual in contemporary society belongs to a 
number of social circles or networks. Abstract 
parts of society— "religion," the "family," 
the *economy" — Fischer (1982) suggests, are 
really the operation of personal networks. The 
critical aspects in these networks centers on 
interaction among members, that is, their 
social ties (1951, p. 210). The potential 
strength of an individual's ties depends, in 


11 These graphs have the added benefit of 
suggesting that we are not making incorrect 
ecological inferences. Rather, the religion-suicide 


effects across county groups actually appears to. 


have something to do with the behavior of 
individuals in those religions. 
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part, on the “knittedness” of the network 
itself: The “hidden payoff” of religion that 
accounts for its continuous appeal, according 
to Collins (1982, p. 41), is the ability of 
religious networks to provide a source of 
collective energy on which individuals can 
draw during difficult times. And strong ties 
provide emotional support and access to 
intangible resources (Wellman 1983). 
Second, with network ideas we can begin 
to resolve the conceptual confusion surround- 
ing Durkheim's two explanatory dimensions 
of "integration" and "regulation." Durkheim 
did not consider religion directly under 
regulation. In later discussion in Suicide 
(e.g., of mixed types, 1951, pp. 325-26), 
however, he did acknowledge their relation- 
ship, and in later works (e.g., 1961 [1915]) 
he dealt explicitly with religion's power to 
"jnhibit" actions and cause "social pres- 
sures." Religion becomes both a source of 
authority and source of solace (Lukes 1972). 
This dual and separate conceptualization 
continues to puzzle researchers. The existence 
of ties has been equated mistakenly with the 
possible provision of functions, such as 
support and guidance. Social support theorists 
(e.g., Barrera 1986) only recently have begun 
to distinguish between having ties and 


“receiving aid from them, and social network 


analysts have begun to delineate the variety of 
functions associated with network ties (Fischer 
1982; Wellman 1983; Wellman, Mosher, 
Rottenberg, and Espinosa 1987). Network 
theory permits differentiating analytically 
between the structure of ties and their 
functions. One potential function of social 
networks is integration or the ability to 
provide social and emotional support. An- 
other is regulation, guiding action through 
advice and behavior monitoring (Umberson 
1987). While integrative and regulative func- 
tions may occur together, they do not always 
do so. The strength of the tie affects the 
ability of the network to carry out either 
function, not simply integration. 

This conceptualization requires an interac- 
tive view. When networks are moderately or 
even strongly integrated and regulated, mem- 
bers are protected from self-destructive im- 
pulses. However, as networks become out of 
balance on the amounts of integration and 
regulation they provide, the potential for an 
individual to act on suicidal impulses in the 
case of crisis becomes greater. For example, 
when individuals are subjected to strong 
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religious regulative authority, but are pre- 
vented from developing strong integrated ties 
to others outside the network or even to 
fellow network members, the probability of 
fatalistic suicide ‘increases. The Jonestown 
mass suicide presents a persuasive current 
example in which ties outside the People’s 
Temple and even bonds of marital intimacy 
within the cult were disrupted by Jones. 
Because of the low level of within-group 
integration, these suicides result from overre- 
gulation and are truly fatalistic (see Coser and 
Coser 1979; Fitzgerald 1986). 

Our third theoretical point is that network 
structures and functions are necessary but not 
sufficient explanations of when suicides 
occur. Networks operate in a larger cultural 
context that can facilitate or inhibit accep- 
tance of general cultural norms and beliefs 
(Pescosolido 1986; White, Boorman, and 
Breiger 1976). Contexts, in turn, constrain 
the formation of religious networks and 
influence their ability to offer a unique source 
of solace to their members. For example, as 
long as the larger society is not in crisis, 
culture-affirming religions provide support. 
However, with cultural crisis, these religions 
are in crisis as well. Currently, some warn 
that evangelical religions may be in danger of 
aligning too closely with larger cultural norms 
(Hunter 1987; Wells 1984). This insight calls 
for a strong cultural analysis to accompany 
and frame a network conceptualization of and 
future work on suicide. 





Low 
Figure 2. Proposed theoretical relationship between potential suicide and network density. 
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Toward a Network Theory of Suicide in the | 
Modern Context 

If we view Durkheim's theory in the light of 
Erikson's (1976) idea of "axes of variation," 
religious denominations generaly can be 
thought of as existing on a continuum of 
network density. Ideal types defining the 
poles are (a) membership in strong collectivity- 
oriented religions and (b) a rejection of all 
affiliations. Figure 2 presents a graphic 
depiction, a heuristic device, of expectations 
under a network perspective. 

Figure 2 shows potentially high suicide 
among cults that require rejection of all 
outside network ties and among atheists. 
Socially or geographically isolated cults 
create a social structure in which decisions are 
societal rather than individual. Cults may 
provide a high degree of integration to 
members facing individual crises, all other 
things being equal. This does not negate their 
power in compelling suicides in the presence 
of community crises. Atheists facing crises 
are offered neither religious support for 
troubles nor strong advice (i.e., regulation) 
against suicide. Associations such as Madalyn 
Murray O’Hair’s Society of Separationists, 
while rejecting religion, nonetheless provide a 
tie centering on religious interests. Such ties 
might provide a source of informal support or 
regulation. 

Durkheimian theory is clear about the 
limits of social protection. Of course, the 
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Durkheimian notion of “optimal” cannot be 
determined a priori, even if one is willing to 


risk normative judgments. However, as net- ' 


work density either increases or decreases 
from extremes, suicide potential should de- 
crease. In individualistic religions, each 
person becomes his or her own religion 
inhibiting the creation of "community" 
(Bellah, Madsen, Sullivan, Swidler, and 
Tipton 1985, p. 235). ' 

Institutional religions provide stronger ties 
but do not provide the protection of evangel- 
ical religions. Judaism is protective but minor 
because historical evidence suggests that 
religious ties have become less critical. 
Catholicism with its ethnic and immigrant 
heritage in the United, States provides a 
stronger network community. Other groups 
that form religious as well as geographic 
communities (e.g., the Amish) provide even 
greater protection from individual crisés. In 
the face of crisis within the religious 
community, however, dense structure could 
be aggravating. 

We have indicated only approximate areas 
on the graph for two reasons. First, each 
religious group occupies an overlapping 
range. Broad groupings mask important 
within-category differences. While the Catho- 
lic church in the United States today appears 
to have relatively dense network structure, 
Charismatic groups or parishes may be more 
so (as might be the case with Orthodox 
Jewish groups). Second, if religious groups 
mark a range of network density on the 
parabola’s inner side, individuals more or less 
involved in religious networks line up on the 
other. For example, some “born-again” Jews 
or orthodox adherents form ties more like 
their evangelical brethren. 


CONCLUSION 


According to Durkheim, what accounted for 
rising suicide rates at the end of the 19th 
century was deterioration of traditional forms 
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of social organization. He failed to explain 
how social organization changed and how it 
influenced religion’s role in people’s lives. 
Our use of network theory helps fill the gap. 
We can recast Durkheim’s position as fol- 
lows: Fundamental changes in social organi- 
zation altered the basis of network formation. 
In medieval society, networks were concen- 
tric circles, with ties based on “geographical 
and psychological factors” (Simmel 1955, p. 
128). People were afforded few choices in 
closed and preordained networks. Religion 
played a central role in organizing networks 
since it was transocietal in organization and 
linked with the polity. In late 19th-century 
Western society, urbanization, industrializa- 
tion, and migration shifted the basis of 
network formation to “individual interests,” 
forming intersecting, open circles. According 
to Simmel, these networks were more ra- 
tional, giving individuals greater freedom but 
more “psychological tensions" and less 
emotional support (1955, pp. 141, 163). 
Faced with disintegrating network ties, indi- 
viduals who were denied both integrative and 
regulative supports committed suicide more 
often. In that era, Catholicism's antimodernist 
stance as well'as its organizational network of 
schools, hospitals, social clubs, and so on 
provided a base of strong and continuing 
network affiliation and, as a result, continued 
as a source of solace and guidance. 

Our recasting of Durkheimian theory within 
the network tradition takes into account three 
critical aspects of networks —their structures, 
function, and operation in larger cultural 
context. Religion's role in suicide, operating 
through a network mechanism, is more 
complex than Dürkheimian theory or contem- 
porary typologies of religion would suggest. 
Network structures create the potential to 
provide members with integrative and regula- 
tive benefits. These functions can exist 
together or out of balance, affecting the 
ability of individuals to face personal, reli- 
gious community, or larger cultural crises. 


46 


AMERICAN SOCIOLOGICAL REVIEW 


Table A. Full Description of Control Variables 





Socio-Demographic Controls 


Age Average age of county group population 

Income Index with respect to poverty level 

Education Average years of education completed for county group population 
Migration rate Percentage growth rate 1960—1970 

Divorce Percentage currently divorced 

Female Percentage of the county group that is female 

Density Persons per square mile 


Social Construction Controls 


. Toxicology facilities 


Medical examiner system 


Percentage of the county group population serviced by medicolegal office with 
state toxicologist facilities available 
Percentage of the county group population covered by a medical examiner 


system 
Percentage of the county group population where official can call an inquest 
Percentage of the county group population with an elected official responsible 


Percentage of the county group populatioa serviced by medicolegal office with 


County groups in New England and Mid-Atlantic 


Inquest used 
Official elected 
for recording cause of death 
Pathologist available 
pathologist available 
Region (Omitted category: West) 
North East 
South County groups in South East and Deep South 
Midwest County groups in Midwest and Plains 
South West County groups in South West 
Statistical Control 
UCounty group population Following Firebaugh and Gibbs (ASR 1985) 
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Theorists and researchers have explicitly or implicitly made use of primary groups. 
to explain a wide range of social behaviors, such as work productivity, mass media 
communication, combat morale, job search, services to elderly, and mortality 
rates. Typically, they have not systematically distinguished primary group from 
formal organization effects. Consequently, it is not known what primary groups 
uniquely contribute to social behavior. An expanded organizational contingency 
theory of group structure is advanced which fills in this gap and shows that 
primary.group theory and organizational contingency theory share a common 
framework. To demonstrate the power of this formulation, national data on 
mortality are analyzed to predict which causes of death can and which cannot be - 


reduced by primary groups. 


Durkheim’s (1951) classic study on suicide 
provided early evidence that social support 
enhances physical and mental well-being. In 
recent decades findings from many studies 
(summarized in Berkman 1985; House and 
Kahn 1985) convincingly demonstrate the 
effect of social support in reducing mortality 
from a broad range of causes of death. The 
Alameda county studies (Berkman and Syme 
1979; Berman and Breslow 1983) showed 
that residents of this county with fewest social 
ties had twice the mortality rate of those with 
the most ties. However, major reviewers in 
this field (House and Kahn 1985; Berkman 
1985) agree there is neither a common 
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theoretical definition of social support nor a 
satisfactory explanation of how it operates. 
Furthermore, there is a notable absence of 
reference to the substantial literature on the 
more general role of primary groups (the key 
social support group) in the achievement of 
non-health-related goals. 

A major aim of this paper is to present a 
theoretical synthesis by showing that organi- 
zational contingency theory can be expanded 
to incorporate such diverse lines of inquiry as 
Parsons' view on ascriptive groups (Mayhew 
1968), Durkheim's work on social bonds and 
suicide (see also Gove 1973), community 
studies on social support and mortality 
(House and Kahn 1985), the role of primary 
groups in work productivity and mass media 
communication (Katz and Lazarsfeld 1955), 
as well as the literature on organizational 
design (Perrow 1967; Drazin and Van der 
Ven 1985). We will first document that prior 
formulations do not provide an explanation 
for the unique role of informal social supports 
as contrasted with formal organizations. 
Next, organizational contingency theory will: 
be reviewed to show how an extension of this 
theory does provide a common framework for 
the analysis of both informal and formal 
structures in health and non-health-related 
areas. This new formulation will be applied to 
the analysis of the 1980 national United States 
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mortality data to show it is possible to 
systematically indicate which causes of death 
are most likely to be reduced by informal 
social supports. 


REVIEW AND REFORMULATION OF 
LITERATURE ON SOCIAL SUPPORTS 
AND MORTALITY 


Despite the fact that contemporary researchers 
have not arrived at a common theoretical 
definition of what constitutes a social support 
group (House and Kahn 1985), virtually all 
empirical studies measure social support as 
ties to primary groups (that is, family, 
friends, neighbors; House and Kahn 1985; 
Berkman 1985). Sometimes investigators also 
include weak tie groups, such as voluntary 
associations. On the rare occasions when 
` contact with staff of formal organizations are 
included, it is their primary group aspects 
(such as providing emotional support) and not 
their official duties that are measured. Conse- 
quently, investigators seeking to explain the 
workings of social supports are implicitly 
contrasting them to formal organizations such 
as doctors and nurses in their professional 
settings, and not necessarily to the absence of 
all support. The concept “social supports” 
should more properly be labeled “informal 
social supports.” 

Investigators rarely take systematic account 
of the role of formal social supports. As a 
result, they have not delineated the unique 
functions of informal social supports. To 
make this point clear, consider the four ways 
social supports are thought to reduce mortal- 
ity, according to House and Kahn (1985) and 
Berkman (1985): (1) by providing instrumen- 
tal help, (2) by providing information, (3) by 
providing advice, and (4) by providing 
emotional bonding that buffers stress and/or 
directly affects physiological functions such 
as blood pressure and the immune system 
(Berkman 1985). Obviously, formal organiza- 
tions provide the first three services. Doctors 
who give medication provide instrumental 
help, their diagnosis provides information, 
and their medical regimen includes advice. 
As a consequence these three functions do not 
uniquely differentiate informal from formal 
supports. 

Emotional sustenance would seem to be 
one form of support that informal groups are 
uniquely able to provide. Even here, explana- 
tions have not explicitly delineated the 
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conditions under which emotional bonds of 
informal groups are better able than formal 
organizations to enhance health. For instance, 
the adverse health outcomes resulting from 
stress of being unemployed can be reduced by 
formal organizations that provide jobs. For- 
mal support from psychiatric professionals 
can reduce stress based on interpersonal 
conflicts, and physicians can prescribe medi- 
cation to control blood pressure and boost 
immune system functioning. Even when it 
comes to emotional support, social support 
theorists do not indicate the type of health 
problems for which emotional support of the 
primary groups may be equal or superior to 
that of formal interventions. 

Gove's (1973) explanation that marital 
units provide a feeling of well-being in 
addition to instrumental help has the same 
problem. Formal organizations can also 
provide states of well-being (e.g., through 
organizations that provide entertainment, dis- 
pense tranquilizers, and offer psychiatric 
care) as well as instrumental help. Conse- 
quently, Gove does not specify the unique 
role that primary groups play in reducing 
mortality. Umberson's (1987) theory of social 
control and health behaviors suffers from 
similar logical shortcomings. Thus, she pro- 
poses that the family utilizes social control 
mechanisms of socialization to internalized 
health norms or sanctions to maintain them. 
However, formal organizations can also 
socialize (e.g., schools, mass media) and 
sanction (e.g., police). 


AN EXPANSION OF ORGANIZATIONAL 
CONTINGENCY THEORY 


To know what the informal group uniquely 
provides, it is necessary to know what the 
formal ones can do. Litwak and Figueira 
(1968) and Litwak (1985) show that an 
extension of organizational contingency the- 
ory (Litwak 1961; Perrow 1967; Drazin and 
Van der Ven 1985) furnishes these answers. 
The basic premise of this formulation is that 
both tasks and group structures vary and 
groups can best manage those tasks that 
match their structure. A brief review of 
organizational theory highlights this point. 
The early organizational theorists, such as 
Weber (1947), implicitly assumed that techni- 
cal knowledge would in most cases enable 
people to solve problems better than knowl- 


. edge obtained through everyday socialization. 
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Weber’s monocratic bureaucracy was a group 
structure that was designed to exploit the 
potential of technical knowledge and task 
simplification more fully than the primary 
group. does. For instance, members of the 
monocratic bureaucracy are chosen on the 
basis of technical ‘training or special job 
experience, while members of the primary 
group are determined by birth and love. 
Members of the monocratic bureaucracy have 
only a limited commitment based on their 
technical competence. They can be removed 
if they become technologically outmoded or 
do not meet minimum standards of perfor- 
mance. By contrast, the primary group 
members have long-term, even lifelong, 
commitments, which means they can not be 
easily removed even if new technology comes 
along that makes their activities obsolete. The 
monocratic organization has detailed division 
of labor which permits individuals to train on 
the job through specialization or alternatively 
permits task simplification (i.e., taking com- 
plicated tasks and breaking them into simpler 
components so they can be carried out faster 
and with fewer errors). By contrast, the 
primary group is small and so does not 
permit a detailed division of labor; instead, 
primary group members assume various roles. 

To protect technical knowledge from the 
inroads of favoritism, the monocratic bureau- 
cracy maintains an impersonal atmosphere 
and motivates its members by impersonal 
economic rewards. In contrast, an internal 
commitment of duty and/or affection and a 
bartering of services motivate primary group 
members. These by definition encourage 
highly personal ties. When Weber (1947) 


“claimed that the monocratic bureaucracy was 


the most rational or most efficient for the 
needs of a modern industrial society, he 
implicitly matched the technical knowledge 
requirement of tasks with the dimensions of 
an organizational structure that promoted 
technical knowledge (e.g., recruitment by 
merit, detailed division of labor). 

The next generation of organizational 
theorists demonstrated, contrary to Weber’s 
expectations, that primarylike groups played 
significant roles in either increasing or 
decreasing work productivity in complex 
organizations (Roethlisberger and Dickson 
1939). One explanation for the persistence of 
primary group influence offered by Blau 
(1955) was that intrinsic to all tasks was 

uncertainty as well as technical knowledge. 


EE 


Dealing with uncertainty required a modicum 
of trust. The organization most consistent 
with trust was that which emphasized internal- 
ized motivations, long-term commitment, and | 
peer rather than hierarchical relations. Thus, 
Blau's reasoning, like Weber's, involved 
implicit matching of dimensions of group 
with his revised definition of tasks. Blau 
(1955) introduced dimensions of structure that 
moved the formal organization towards the 
primary-group-like structure. He did not, 
however, see the need to look systematically 
for variations in task and group structures 
because he made the assumption that uncer- 
tainty would be inherent in all tasks and 
therefore there would be only one type of 
Structure. 3 i 

It was not until the third generation of 
organizational theorists emerged that more 
systematic attention was focused on the need 
to simultaneously classify tasks and organiza- - 
tional structures. The early organizational 
contingency theorists (Litwak 1961; Perrow 
1967; Thompson 1967) recognized that tasks 
can vary in the amount of technical knowl- 
edge and uncertainty. If a task required 
technical knowledge and was highly predict- 
able, then the monocratic model would be 
optimal. If technical knowledge was required 
and predictability was low, then the human 
relations model would be best. Litwak and 
Figueira (1968) extended this formulation to 
its logical conclusion and argued that if little 
technical knowledge was needed and_ there 
was considerable uncertainty, then primary 
groups would be best. It is this extension that 
is central to our analysis.! 

To apply the lessons of the above analysis 
to the literature on social support, it is 
necessary to make explicit the circumstances 
under which technical knowledge is not 


‘useful and then ask why the informal support 


T A task-specific theory (Litwak 1985) augments 
contingency theory further by showing that tasks 
and organizations can be matched on any or all 
dimensions of organization, such as size, length of 
commitment, and communality of life-styles, and 
not just technical resources and uncertainty (Lit- 
wak 1985). This formulation permits a more 
detailed prediction of which group manages which 
task (e.g., which primary group—friends, neigh- 
bors, spouses, or kin—can provide which ser- 
vices). However, for this presentation, it is 
necessary to differentiate only technical resources 
and uncertainty. i 
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system will be more effective than the formal 
organization in such circumstances. There are 
at least two situations in which technical 
knowledge cannot be used effectively (Litwak 
and Figueira 1968). First, there are some 
activities that simply do not require any 
' technical knowledge. For example, a family 
member with skills gained through everyday 
socialization is just as well equipped to 
remove a lit cigarette from the fingers of a 
Sleeping person as is a doctor or fireman. 
Second, technical knowledge is of little use 
. when the situation is so unpredictable, or 
when there are so many contingencies, that an 
expert cannot be brought to bear in time to 
make a difference, but a person with everyday 
knowledge can play some role and is 
available. For instance, if a tornado destroys a 
house, starts a raging fire, and leaves one of 
, the inhabitants senseless with a broken leg, a 
` doctor usually cannot be brought to the scene 
quickly enough to help the victim. A family 
member at the scene, however, can save the 
victim's life by pulling him or her to safety. 
Of course, had a doctor been available, the 
. likelihood of aggravating the leg injury would 
have been reduced. But between the alterna- 
tive of a nonavailable expert and an available 
nonexpert who can provide some help, the 
. latter is more effective. 

Litwak (1985) provides the theoretical 
analysis of group structure that shows why in 
principle primary groups are more effective 
than formal organizations when managing 
nontechnical tasks. The primary group are 
less expensive than formal organizations 
when nontechnical tasks are the mode, 
because they do not have the costs necessary 
for training and recruiting experts (who are no 
. longer needed). In addition, primary groups 
are likely to be faster in performing tasks not 
requiring technical knowledge. The small size 
and diffused ties mean that primary groups 
are not slowed down, as are formal organiza- 
tions, by more extended lines of communica- 
tions needed to coordinate large organizations 
with a detailed division of labor. Finally, 
persons motivated by internalized commit- 
ments of primary groups rather than the 
economic incentives of formal organizations 
.are likely to be more conscientious in 
performing tasks involving minimal technical 
knowledge, particularly when these tasks 
. cannot. be continuously monitored (e.g., 
caring for the bedridden patient). The conse- 
quences of favoritism, to which primary 
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group members are prone, are not likely to 
impair performance of such nontechnical 
tasks. The very structure of the primary 
group, which makes it less effective for 
managing technical tasks, makes it faster, less 
expensive, and a better motivator for tasks 
that require everyday socialization. 


THEORETICAL CONVERGENCE 


If it is recognized that common usage of 
social supports refers to primary group 
structures, then this expansion of organiza- 
tional contingency theory provides the under- 
lying rationale as to why informal social 
supports reduce mortality and morbidity. 
Thus, informal social supports can provide 
instrumental aid, information, and advice 
more effectively than formal organizations for 
all tasks that do not benefit from technical 
resources and occur in unpredictable situa- 
tions. Unlike prior definitions, this uniquely 
differentiates optimal roles of informal and 
formal groups. 

The theory also predicts the circumstances 
in which emotional support from informal 
groups is better able than interventions by 
formal organizations to enhance the body's 
resistance to disease. For example, high 
blood pressure can be produced by social 
stress. Chronic high blood pressure is a 
predictable and measurable form of stress on 
the body that can be successfully controlled 
through medication prescribed by a physician. 
On the other hand, daily hassles (Kanner, 
Coyne, Schaefer, and Lazarus 1981) repre- 
sent idiosyncratic and unpredictable types of 


stressors. The primary group members at _. 


work and at home are far more likely than a ' 
physician or counselor to be available and 
adept in providing emotional succor to buffer 
the stress of everyday hassles as they arise. 
Umberson's (1987) formulation that marital 
units reduce morbidity by exercising social 
control through socialization and sanctions 
would provide a unique definition if she 
limited it to those causes of morbidity to 
which technical resources cannot be applied. 

Durkheim's (1951) discussion of egoistic 
suicide suggests that the key to reduction in 
suicide rates is stronger social bonds. More- 
over, like social support theorists, he opera- 
tionalizes one aspect of social bonds in terms 
of primary groups and weak tie groups (e.g., 
marital.status and religious affiliation). The 
expanded organizational contingency theory 
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applied to Durkheim’s egoistic suicide fills in 
gaps by specifying how in nontechnical areas 
social bonds reduce stress. Gove’s (1973) 
view that marital ties provide states of 
well-being and instrumental activities that 
reduce causes of death would uniquely define 
the role of primary groups if he made explicit 
his assumption that primary groups provided 
such services for tasks requiring little techni- 
cal resources and or having much uncertainty 
attached to them. Gove goes much further 
than Durkheim and other researchers in that 
he sees the need to differentiate tasks (i.e., 
causes of death), but he groups together those 
requiring technical resources with those not 
amenable to any social sources. 

This same contingency theory also explains 
why primary groups are effective in other 
areas of life. For instance, Katz and Lazars- 
feld (1955) attempt to explain the role of 
primary groups in mass media communication 
and work productivity by suggesting that they 
provide instrumental help and “reality” in 
uncertain situations. If their concept of 
“uncertain situations” can be translated to 
state that technical resources cannot be 
readily brought to bear in time to make a 
difference, or that there is no technical 
knowledge, then the expanded contingency 
theory would incorporate their formulation as 
well. Mayhew's (1968) expansion of Parsons' 
theory of ascriptive groups argues that 
primary groups are effective because they are 
cheaper. This is subject to the objection that 
there are clearly times when costs are a minor 
consideration and when formal organizations 
are cheaper. Those objections would disap- 
pear and his framework would be consistent 
with task specific theory if he simply said 
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they were cheaper when dealing with nontech- 
nical tasks and uncertain situations. 

This same contingency theory explains the 
kinds of task formal organizations can best 
manage (Drazin and Van de Ven 1985). As’ 
such it has the virtue of providing a unified 
framework for bringing together many seem- ` 
ingly diverse sociological inquiries. In addi- | 
tion, it suggests new research approaches to 
old problems. An application to monau i 
data will illustrate this virtue. 


REFORMULATION OF HYPOTHESIS 
ON MORTALITY 


The expanded organizational contingency ` 
theory introduces two important conceptual 

distinctions that previous explanations of: 
social support and mortality have either’, 
overlooked or left ill defined. The first: 
distinguishes between social support from  . 
primary groups, social support from formal : 

organizations, and the lack of any social: 

resources. The second differentiates the 
causes of death based upon the types of; 
services groups can provide to reduce risk of, 
mortality from specific causes. . 

In principle, then, persons can be located 
along two continuous dimensions with respect 
to their level of formal and primary group 
resources. In a parallel manner, causes of 
death can be arrayed based upon risk’ 
reduction through group services requiring 
technical knowledge or those gained through 
everyday socialization. For the purposes of 
specifying hypotheses, populations and causes . 
of death are treated as dichotomies to form a. 
matrix as depicted in Table 1. The rows of the 
table represent populations with the four 


Table 1. Hypothesized Rank of Mortality Rates by Formal and Informal Group Resources and Causes of Mortality ` 


(Classified by Services Needed to Reduce Them) 


Group Courses Causes of Death (classified by technical knowledge and everyday socialization) 
High Socializ.* High Socializ. Low Socializ. Low Socializ. 

Primary Pormal cow Technical. Hh Technical- , ae eee 

Group Org. Col. 1 Col. 2 Col. 3 Col. 4 

row 1 high low 1.00° 2.00 3.00 3.00 

row 2 high high 1.00 1.00 1.00 3.00 

row3 low high 3.00 2.00 1.00 3.00 

row 4 low low 3.00 3.00 3.00 3.00, 





= High: goctäliz:/low technical causes are -those an; wineh tie HAE Ot dedib di reduced Dyiservicen gained through.” 


y socialization and not requiring technical services. 


b The. numbers in the cells indicate the extent, if any, mortality rates will be reduced for row populations having 
designated group resources. A 1.00 corresponds to the population with the lowest hypothesized rate of mortality and 


a 3.00 means the highest mortality rate in each column. 


54 


possible combinations of high and low 
primary group and formal organizational 
resources. The columns distinguish causes of 
death according to whether group services, to 
reduce the risk of mortality, require either 
technical knowledge or everyday socialization 
or both. At one extreme are causes for which 
prevention or delay of death depends predom- 
inantly upon everyday socialization (Col. 1, 
Table 1) Deaths caused by falling asleep in 
bed with a cigarette or driving when drunk 
can be prevented by a spouse, who can 
remove the cigarette or do the driving. 
Avoiding these hazards requires no technical 
_training but does require internalized long- 
term commitment and face-to-face contact. 

In contrast, Hodgkin’s Disease (a lym- 
phatic cancer) illustrates a second category of 
causes which can be reduced exclusively by 
technical knowledge (Col. 3, Table 1). It can 
be diagnosed only through biopsy and treated 
only by chemotherapy or radiation. Preven- 
tion of death from a third category of causes 
entails a combination of services involving 
everyday socialization and those requiring 
technical knowledge (Col. 2, Table 1). It is 
illustrated by cardiovascular diseases whose 
major risk factors can be reduced by proper 
diet. Adherence to a low-fat diet generally 
benefits from long-term internalized commit- 
ment and everyday knowledge provided by a 
primary group member. At the same time, 
medication, and even surgery, may be 
necessary to prevent cardiovascular disease 
for those with the most elevated cholesterol 
levels. A fourth category includes causes of 
death for which group services of any type are 
of little value (Col. 4), such as cancer of the 
pancreas. 

With this classification of deaths, the 
expanded organizational contingency theory 
suggests that social support reduces mortality 
whenever available group resources match the 
type of services required to reduce a given 
cause of death. The hypothesized relation- 
ship betweén social supports and mortality 
under this new formulation is depicted in 
Table 1. The numbers in the cells of the table 
represent the hypothesized rankings of cause- 
specific mortality for the four population 
groups (the rows) within each of the four 
causes-of-death categories (the columns) de- 
scribed above. Within any cause-of-death 
category (a column), the theory distinguishes 
at most three levels of mortality. The rank of 
“3” is used to designate, within each 
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cause-of-death category, the group with the 
highest expected mortality. For instance, 
persons high in primary group support or 
resources but not formal organizational ones 
(row 1) will have a reduced risk of dying from 
causes prevented by services requiring only 
everyday socialization (e.g., pulling a ciga- 
rette -out of the hands of a sleeping person) 
which are classified in column 1. Those in 
row 2, who are high on both formal and 
informal resources, will also have the primary 
group resources to reduce this cause of death. 
Therefore, the theory predicts that these two 
groups will have the lowest mortality rates, 
that is, a rank of 1 in rows 1 and 2 of column 
1. Those in the lower two rows are low in 
primary group resources so they would have a 
higher mortality rate for this cause of death, 
that is, a rank of 3 in column 1. 

By contrast, column 2 represents those 
causes of death for which risk reduction 
requires both technical knowledge and every- 
day socialization (e.g., a heart attack at 
home). Since the row 2 population is high in 
contact with both formal organizations and 
primary groups, it is ranked 1. Row 1 and 3 
populations are high in only one type of group 
resource so they are ranked 2, while those in 
row 4 are low in both formal and primary 
group resources and so are ranked 3. Column 
3 represents those causes for which the risk of 
death can be reduced only through group 
services requiring technical knowledge (e.g., 
Hodgkin's disease). In this case those people 
in rows 2 and 3 are most likely to have formal 
organizational resources necessary to avoid 
death from this cause so they would be ranked 
1, while those in row 1 and 4 would be 
ranked 3. For those causes which cannot be 
reduced by any form of group services, 
represented in column 4, no type of social 
support will make a difference, and mortality 
is expected to be about the same rate for all 
populations. Hence, all rows are given the 
same rating of 3. - 

Table 1 can also be used to deduce the 
relationship between social support and mor- 
tality when, as in the case of the data 
presented below, there are no measures of 
formal organizational resources. This situa- 
tion is represented in Table 1 by combining 
rows | and 2 and contrasting them with the 
combination of rows 3 and 4. Table 2 shows 
the average rankings of these combinations 
assuming the rows have equal populations. It 
suggests a monotonic relationship. Persons 
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Table 2. Average Hypothesized Rank of Mortality Rates i Causes of Mortality (Classified by Type of Service 
Necessary to Reduce Them) when Only Informal Group Resources are Identified 


Causes of Death (classified by technical knowledge and everyday socialization) 








High Socializ.* High Socializ. Low Socializ. Low Socializ. 

Low Technical High Technical High Terchnic. Low Technical 
Primary Group _ Col. 1 Col. 2 Col. 3 Col. 4 
Row 1 (1+2)° high 1.00 1.50 2.00 3.00 
Row 2 (3+4) low 3.00 2.50 2.00 3.00 


* High socializ. means this cause of death can be reduced through everyday socialization; low technical means that 
this cause of death can not be reduced through technical knowledge. 
> The figures in row 1 is an average of the numbers in rows 1 and 2 in Table 1; figures in row 2 are an average of 


rows 3 and 4 in Tablé 1. 


high in primary group resources will show 
greatest reduction in mortality relative to 
persons low in primary group resources for 
causes of death that are predominantly 
affected by everyday socialization services 
(Column 1). The ratio is expected to be 
smaller but still greater than 1 where 
coordination between formal organizations 
and primary groups is requiréd to prevent 
death (column 2), and the ratio is expected to 
approach 1 for deaths in which primary 
groups are largely ineffectual (Columns 3 and 
4). 

Table 2 also makes clear how the expanded 
version of contingency theory leads to 
empirical expectations that differ from previ- 
ous explanations. For instance, most other 
investigators assume that there is no system- 
atic variation on the effect of informal social 
support on cause-specific mortality rates 
(House, Robbins, and Metzner 1982; Berk- 
man and Breslow 1983). Conceptually, that 
means all the columns in Table 2 would have 
the same rankings. Alternatively, it is neces- 


. sary to have only one column. The monotonic . 


pattern in Table 2 also differs from Gove’s 
dichotomous distinction between causes of 
death that are and those that are not affected 
by marital status. His formulation would lead 
to combining columns 1 and 2 in one ranking 
and combining columns 3 and 4 in another 
ranking. None of the alternative formulations 
would formulate the hypothesis suggested in 
Table 1. 


METHODS 


An empirical test of the contingency theory 
entails a procedure for classifying causes of 
death. We relied upon expert raters to develop 
an initial assignment of causes of death. The 
theoretical group-service categories in Tables 


1 and 2 were expanded to create a six-point 
operational scale. The two additional catego- 
ries represent intermediate points in the level . 
of each type of group service involved in 
preventing death from a specific cause. In 
order of increasing primary group influence, 
the six group-service categories and their 
numerical ratings are: neither primary group 
or formal organization services (1); overwhelm- 
ingly formal (2); majority formal/minority 
primary group (3); equally formal and pri- 
mary group (4); minority formal/majority 
primary group (5); and overwhelmingly 
primary group (6). This scale represents a 
simultaneous judgment on two independent 
dimensions. One represents the amount of 
formal influence; the other represents the 
amount of primary group influence. The 
categories of the resulting scale have been 
ordered with respect to the increasing impor- 
tance of primary group ties.? 


? The raters could have been asked to make two 
judgments, one on the amount of everyday 
socialization necessary to reduce the given cause of 
death in one on the amount of technical knowledge 
necessary to reduce the cause of death. Because 
resources based on everyday socialization and 
technical knowledge can refer to independent areas 
within a given cause of death, they must both be 
rated separately. Consequently, there are two 
independent scales.. The raters were asked to make 
both judgments simultaneously instead of rating 
each scale separately and joining them statistically. 
The scale represents all possible combinations of 
two continua but for the purpose of this study has 
been scored to represent a single continuum that 
orders deaths based upon the importance of 
primary group affiliations. If we wanted to 
examine only the effects of formal organizations 
on mortality, then category 1 (neither formal nor 
primary group) would have to be scored like 
category 6 (overwhelming primary group) and 
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Two health professionals independently 
assigned 176 of the largest three-digit causes 
of death categories defined in the ninth 
edition of the International Classification of 
Diseases (ICD.9.cm. DHHS, 1980) to the six 
group-service categories.? This list constitutes 
close to 90 percent of all adult deaths in the 
United States in 1980. When the two raters 
disagreed on a rating, the cause of death was 
given the average value of the two assigned 
categories. This resulted in some of the 
causes of death receiving a fractional value 
(ie., 1.5, 2.5,... , 5.5). Ratings for the 75 
largest ICD categories are presented in 
Appendix 1. 

We recognize that there are considerable 
difficulties in attempting to rate causes of 
death by influence of social factors. For 
instance, some causes of death have multiple 
stages with formal and informal groups 


playing different roles at each stage. Some ` 


primary groups (e.g., the low-income .ones) 
may have less knowledge of good health 


practices than others (Berkman and Breslow ` 


1983). Before being overwhelmed by the 
complexities in rating, one should keep in 
mind that they may relate to only a few of the 
causes of death. In fact, there was an 
acceptable level of agreement between the 
two raters. They assigned 79 percent of the 
176 causes to the same or adjacent group- 
services categories. The correlation between 
the two ratings was .55 and their average 
scores (which were those actually used) had a 
Spearman-Brown reliability of .71 (Rosenthal 
1984). This level of agreement implies that 
experts can arrive at a meaningful rating. The 
consequence of rater unreliability is likely to 
bias downward estimates of correlation and 
regressing coefficients (Duncan 1975). There- 
fore, future improvements in reliability will 
likely strengthen the positive associations 
reported below.* 





reverse weights assigned to all of the categories 
(e.g., old category 2 would now become category 
6). 


? The two expert raters were Sheila Gorman, a 
nurse with a Ph.D. in sociomedical sciences, and 
Samuel Wolfe, a physician with a doctorate of 
public health. 

* To examine the sensitivity of our findings to 
unreliability in rating causes of death, we reestima- 
ted the regression equation described below and 
reported in Table 5, excluding observations for the 
35 cause-of-death categories in which raters 
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TEST OF HYPOTHESES WITH 
NATIONAL MORTALITY STATISTICS 


To investigate the validity of both the 
group-service classification scheme and the 
predictions of contingency theory, we have 
updated and expanded Gove’s (1973) use of 
national death data to analyze marital status’s 
influence on mortality. We employed the 
1980 annual National Center For Health 
statistics (1983) public use data tapes contain- 
ing information abstracted from all death 
certificates (close to two million) issued in the 
United States. For this study, we have 
gathered information on age, gender, race, 
and marital status of decedents 25 years or 
older at time of death who died from one of 
the 176 rated causes in 1980. 

Martial status (married/single at time of 
death) is used as the indicator of informal 
social support. Prior work suggests that other 
forms of social support may operate differ- 
ently and, in particular, marriage benefits 
men more than it does women (Gove 1973 
and 1984; Berkman and Syme 1979; Helsing, 
Szklo, and Comstock 1981; House et al. 
1982; Umberson 1987). Separate analysis by 
gender permits some control of this issue. 
Other próblems with these data are the lack of 
a measure of health status and limited 
socioeconomic measures. Thus our data may 
reflect a spurious effect of an unmeasured 
antecedent cause such as poor health. Poor 
health might lessen chances of marriage and 
increase mortality at the same time (Gove 


: 1973; Vaupal, Manton, and Stallard 1979). 


Fortunately, there is strong empirical evi- 
dence indicating a causal component to the 


association between social support and mor- 


tality. At least four prospective studies have 
shown that the effect of social supports on 
reducing mortality persists even when mea- 
sures of health and socioeconomic status are 
introduced (Berkman and Breslow 1983; 
House et al. 1982; Blazer 1982; Schoenbach, 
Kaplan, Fredman, and Kleinbaum 1983). 
Effects of social supports were estimated 
from ratios of single to married mortality rates 





disagreed by two or more rankings. Comparison of 
the two regression equations revealed only minimal 
differences in size of regression coefficients. These 
results lend credence to the use of average ratings 
as a way of resolving disagreements between 
coders as well as the usefulness of the rating 
scheme. 
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for each rated cause of death broken down by 
age-gender-race groupings of decedents. De- 
cedents were divided into seven age groups: 
six ten-year groups between 25 and 84 and an 
open-ended category for all deaths to persons 
85 and older. Race was dichotomized be- 
tween white and black. Decedents of other 
racial groups were excluded from this analy- 
sis. Denominators for the mortality rates were 
based on 1980 United States census counts for 
single and married persons in each of the 
corresponding age-by-gender-by-race sub- 
groups. Mortality ratios greater than 1 
indicate that marriage conferred protection. 
Ratios about 1 indicate that marriage confers 
little benefit, while ratios less than one 
indicate that married individuals were more 
likely to die from a specific cause. 

We excluded from the analysis the least 


reliable estimates of age-gender-race specific . 


mortality ratios, those in which either the 
married or single mortality rate was based on 
five or fewer decedents.5 The elimination of 
these “low-frequency” mortality ratios re- 
duced the number of mortality ratios for 
analysis from 4,400 to 2,940, a reduction of 
one-third. 

The correspondence between the theoreti- 
cal pattern based on the group-service ratings 
and the empirical distribution of the mortality 
ratios was inspected in several ways. We first 
examined average values of the mortality 





5 Preliminary analysis of the data with the 
low-frequency mortality ratios included tended to 
attenuate correlations, consistent with our expecta- 
tions that these mortality ratios introduced greater 
random variations than estimated mortality ratios 
computed from high-frequency ratios. Although 
the exclusion of mortality ratios with five or fewer 
single or married decedents removed a consider- 
able source of random error, sampling error could 
still vary between mortality ratios because there 
were still large differences in the number of deaths 
used in calculating them. The number of decedents 
used in computing the mortality retios varied from 
as few as 12 to over 1,000. We investigated the 
extent of this problem by examining standard 
deviations for mortality ratios grouped by number 
of deaths and residuals from the regression 
equations reported below. In both cases, variability 
in the mortality ratios were not related to the 
number of deaths used in computing the mortality 

os. Mortality ratios based on as few as 12 cases 
had no more random scatter than those based upon 
as many as 1,000 cases. Heteroscedasticity, then, 
did not appear to be a problem requiring further 
action. 
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ratios of the causes assigned to each of the 
group-service categories for each age-gender- 
race groups. To further simplify presentation 
of findings, age-standardized, cause-specific 
mortality ratios were computed for each of the 
four gender-by-race subgroups. The age- 
standardized ratios were computed from 
unweighted averages of the age-specific 
mortality ratios. Because of the exclusion of 
the low-frequency mortality ratios, some of 
the age-standardized ratios were based on 
truncated sets of age-specific ratios. The 
association between the theoretical and empir- 
ical rankings of the mortality ratios was 
examined first through tabular presentation of 
bivariate relationships and then multiple 
regression analysis which allowed for simul- 
taneous tests of interactions between the 
theoretical ranking and the demographic 
variables. We used the log of the mortality 
ratio in computing correlation coefficients and 
the regression equations to eliminate much of 
the skewness in the untransformed variable. 


FINDINGS 


Table 3 shows the average values of cause- 
specific mortality ratios organized by gender, 
race, and age groups; and Table 4 shows the 
average values for age-standardized mortal- 
ity. Consistent with the central hypothesis of 
contingency theory, there is a .64 correlation 
between the theoretical classification of each 
rated ICD cause of death and its logged 
age-standardized mortality ratios for white 
men. For black men it was .532, for white 
women it was .527, and for black women it 
was .372. (All correlations significant at p < 
.001.) These correlations are all the more 
impressive because they undoubtedly underes- 
timate the true level of association because of 
the random classification error due to rater 
unreliability. Correlations computed for mean 
values of the agé- standardized mortality 
ratios for causes of death in each of the 10 
group-service categories would reduce this 
type of error. As would be expected, they are 
substantially increased over the values ob- 
tained for the disaggregated data: .883 for 
white men, .808 for black men, .743 for 


© Only for the oldest blacks do we find any 
vacant group-service categories, and these are for 
the extreme ends of the group-service ratings (see 
Table 3). 
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Table 3. Mortality Ratios by Group-Service Rating, Gender, Race, and Age 





Age Category - 
Group White Males 
Service 
Rating* 25-34 35-44 45-54 55-64 65-74 75-84 85+ 
i 1.263 1.360 1.050 1.057 0.907 0.830 1.026 
1.5 2.027 2.981 2.784 2.067 1.475 1.194 0.917 
2 2.085 1.980 1.988 1.691 1.380 1.241 1.142 
2.5 2.098 2.615 2.785 2.459 1.929 1.587 1.409 
3 1.379 1.772 1.849 1.677 1.455 1.276 1.251 
3.5 2.498 2.338 2.396 2.097 1.805 1.548 1.452 
4 2.876 3.063 2.604 2.153 1.751 1.408 1.327 
4.5 2.219 2.566 2.875 2.715 2.5032 2.040 1.644 
5 3.132 4.909 4.918 3.702 2.350 1.631 1.474 

3.506 4,058 3.878 3.753 3.102 2.404 1.670 
Black Males 
1,163 1.420 0.933 0.377 1.075 1.349 — 

1.5 1.768 2.399 2.096 1.886 1.171 1.251 1.107 
2 1.706 2.362 2.078 1.657 1.419 1.331 1.258 
2.5 2.074 2.677 2.277 2.149 1.644 1.381 1.334 
3 1.569 1.824 1.767 1.620 1.345 1.245 1.347 
3.5 1.828 2.437 2.322 2.007 1.614 1.486 1.456 
4 2.281 2.703 2.386 2.082 1.606 1.387 1.578 
4.5 1.314 1.831 2.063 2.302 1.700 1.337 2.146 
5 2.905 3.647 2.865 2.71 1.783 1.345 1.326 
5.5 1.980 2.236 2.739 2.570 2.370 3.116 1.550 

White Females 
1 1.533 1.129 1.136 0.940 0.899 0.976 0.640 
1.5 1.604 1.806 1.587 1.272 1.218 1.071 1.283 
2 1.501 1.778 1.316 1.311 1.249 1.220 1.172 
2.5 1.891 2.093 2.118 1.747 1.476 1.335 1.446 
3 1.282 1.445 1.501 1.424 1.299 1.244 1.389 
3:5 2.532 1.866 1.614 1.566 1.511 1.459 1.614 
4 2.210 1.900 1.706 1.672 1.496 1.357 1.559 
4.5 1.973 1.439 1.309 1.173 1.106 1.083 1.265 
5.. 3.325 2.998 2.240 1.735 1.456 1.392 1.669 
5.5 2.472 2.837 2.268 1.844 1.605 1.598 1.390 

Black Females 
1 1.147 1.274 1.305 1.426 1.236 — — 
1.5 1.460 1.713 1.378 1.593 1.200 1.287 2.349 
2 1.735 1.416 1.645 1.567 1.489 1.269 1.168 
2.5 1.868 1.561 1.368 1.827 1.474 1.536 2.479 
3 1.608 1.515 1.488 1.403 1.373 1.369 1.189 
3.5 1.683 1.668 1.564 1.690 1.580 1.496 2.055 
4 1.578 1.525 1.841 1.864 1.572 1.522 1.789 
4.5 1.260 1.278 1.346 1.221 ` 1.317 1.223 1.589 
5 1.691 2.125 1.766 1.648 1.523 1.473 1.311 
3.5 1.768 1.828 1.843 1.684 2.106 1.522 — 





* {=neither formal nor informal influence; 2— overwhelmingly formal; 3— majority formal, minority informal; 
4= equally formal and informal; 5 — majority informal, minority formal; 6 = overwhelmingly informal. 


white women, and .424 for black women 
(Table 4). 

When the size of the mortality ratios are 
compared between gender and race, men 
typically have higher mortality ratios than 
women, and white men have higher ratios 
than black men, but white women have no 
higher ratios than black women. This can be 


seen by counting the number of age- and 
group-services categories in which the aver- 
age mortality ratios in Table 3 for one group 
exceed those of a second. Thus, white men 
have higher mortality ratios than white 
women in 91 percent of 70 possible compari- 
sons in Table 3, and black men have higher 
ratios than black women in 76 percent of 67 
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Table 4. Age Standardized Mortality Ratios by Group- 
Service Ratings for Each Race and Gender 


Group 
Age Standardized Ratios 

Group- White Black White Black 

Service Males Males Females Females 
i 1.070 1.053 1.036 1.278 
1.5 1.921 1.668 1.406 1.569 
2 1.644 1.687 1.364 1.470 
2.5 2.135 1.934 1.729 1.730 
3 1.523 1.531 1.369 1.421 
3.5 2.019 1.879 1.737 1.677 
4 2.169 2.003 1.700 1.670 
4.5 2.375 1.813 1.335 1.319 
5 3.159 2.292 2.116 1.648: 
5.5 3.196 2.366 2.002 1.792 

Correlation 

with log ; 

of ratios — R-.883***  .808*** — 743** .425 

** —p«.05. 

*** = p^ 005. 


possible comparisons. White men have mor- 
tality ratios that exceed those of black men in 
74 percent of comparisons, but white women 
and black women are virtually equal. 

. The relationship between the mortality 
ratios and age is more complicated. Age does 
not appear to be an important factor condition- 
ing the relationship between the mortality 
ratio and the group-service rating. Measures 
of association between mortality ratios and 
group-service ratings do not vary greatly by 
age groups. For instance, the rank order 
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correlations for white males in the 25-34 age 
group is .91 and for those in the 85+ itis 
.915. For black men these figures are .515 
and .667, for white women they are .73 and 
.64, while for black females they are .29 and 
21. 

However, age does appear to have a strong 
main effect on the size of the mortality ratios. 
The age-related trends for all gender and 
racial groups are evident in Figures 1 and 2. 
For both men and women there is a 
curvilinear relation, with the youngest and 
older age groups having lower ratios than 
those for middle age groups. There is also a 
modest upswing among the very oldest 
women. 

Regression equations summarizing the in- 
dependent and interactive effects of race, 
gender, age, and the group-service ratings on 
the logged mortality ratios are presented in 
Table 5. Equation 1 includes the main and 
interaction terms needed for corroborating the 
interrelationship of group-service rating ef- 
fects, race, and gender on the^ mortality ratio 
as inferred from the foregoing tabular presen- 
tation of the data. Quadratic and cubic age 
terms are included to model the nonlinearity 
between age and the mortality ratios just 
described (see Figures 1 and 2). To assist in 
the interpretation of equation 1, Table 6 
shows estimates, derived from this equation, 
of the group-service coefficient for each of 


Table 5. Regression Equations Predicting Mortality Ratios for Total Sample and Each Race & Gender Combination 


Total White 
Variables Sample Males 
Intercept 0.462*** 0.869*** 
Age —0.185*** —0.242*** 
Age squared —0.023*** —0.043*** 
Age cubed 0.013*** 0.020*** 
G-S rating 0.029* 0.192***. 
Race 0.058* 
Gender 0.193*** 
G-S x Race 0.090*** 
G-S x Gender 0.071*** 
Race x Gender 0.077* 
N 2940 951 
R-Squared 0.319 0.350 


Black White Black 
Males Females Females 
0.676*** 0.458*** 0.398 
—0.178*** —0:192*** — 0.097 
—0.029*** —0.004 — ,0003 
0.013*** 0.014*** 0.011 
0.093*** 0.113*** 0.050 
607 891 491 
0.238 0.285 0.065 


Note: Age categories are coded consecutively from — 3 to 3. The 55—64 age group was assigned a value of "0". G-S 
rating = group service rating. We subtracted 3 from each group service rating so that the zero point on this scale is the 
third group service category (majority formal/minority informal). Gender, O= female, 1 — male; race, 0=black, 


1 = white. 


* Because the equation includes interaction terms, the coefficient for the group service variable measures the slope 
for the excluded race and gender group which is the black female. This is not significant as shown in the separate 
equations for black females. The group service variable is significant for the other race and gender groups. 


* p<.05. 
** p«.0l. 
*** p«.005. 














Mortality Ratios 





Age Category 


Fig. 1. Mortality Ratios by Age Category for Black and 
White Males Note: Data points are the geometric 
means of the mortality ratios for rated ICD 
causes of death within each age category. 


the race-by-gender groups as well as their 
mortality ratios adjusted for services in age 
and group-service ratings. The direction and 
size of equation regression coefficients and 
the pattern of effects derived in Table 6 are 
largely consistent with the level of associa- 
tions discerned in Tables 3 and 4. That all 
interaction terms are significant and pgsitive 
supports the findings in Tables 3 and 4 that 
the effect of the changes in group-service 
ratings on mortality ratio is stronger for men 
than women and for whites than blacks. The 
main effects of race and gender indicate that 
the mortality ratios tend to be larger for males 
than females and for whites than blacks. The 
regression analysis, contrary to comparisons 
based on the tabular data, indicates that racial 
differences obtain for both genders, although 
the positive and statistically significant inter- 




















65-74 BS+ 
75—84 
Age Category 


. Mortality Ratios by Age Category for Black and 
White Females Note: Data points are the 
geometric means of the mortality ratios for rated 
ICD causes of death within each age category. 
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Table 6. Adjusted Mortality Ratios and Effect of Group 
Service Rating by Gender and Race 





White Black 
Males Females Males Females 
Adjusted Mortality 


Ratios 2.20 1.69 1.92 1.59 


Rating Effect 49. 12 340 03 


Notes: Values are derived from coefficients of 
Equation 1 in Table 5. Adjusted mortality ratio based on 
combination of constant term (.46) main effects of RACE 
(.06) and GENDER (.19) and interaction between 
GENDER and RACE (.08). The 55-64 age group and the 
3rd Group-Service Category are used as the reference age 
and cause of death groups in calculating the adjusted 
mortality ratios. These terms are summed and the 
exponentiated (e*) to yield the expected mortality ratios 
shown in the first row. Group Service Rating Effects are 
based on the coefficient for the G-S RATING main effect 
(.03) and its interaction with RACE (.09) and GENDER 
(.07). These terms can be interpreted as the percentage 
change in the mortality ratio for a unit increase in the 
group-service rating. 


action term (p<.05) between race and gender 
suggests that the racial disparity is larger 
among males. The effect of age on the 
mortality ratio is best examined for regression 
equations estimated separately for each race- 
by-gender group. Significant linear and higher 
order interaction terms in all four equations 
confirm the presence of nonlinearity in the 
age trend. Differences between males and 
females are also evident by the presence of a 
significant quadratic effect for males, but not 
for females. When the age curves of esti- 
mated mortality ratios based on these equa- 
tions are examined (not shown here), they 
capture the depressed levels of the mortality 
ratio of the youngest age group, and the small 
upturn in the mortality ratios among the oldest 
group of women. 


DISCUSSION 


The above findings offer provisional support 
for the claim that the impact of informal 
social supports on reducing mortality from 
different causes varies in a way that is 
generally consistent with the classification 
scheme derived from our application of 
contingency theory. But the theory's explan- 
atory power also rests on the claim that it 
provides a comprehensive framework for 
understanding gender, race, and age differ- 
ences reported in this and previous studies. 
Some of the general principles that may be at 


SOCIAL SUPPORT AND MORTALITY 


work in producing these 


differences are 
discussed below. : 


Gender Effects 


The finding that marriage provides more 
: protection for males than females is consistent 
with Gove's (1973) earlier findings using 
NCHS mortality data from 1969-71, Umber- 
son's (1987) work on health behaviors, as 
well as four prior prospective mortality 
studies (Berkman and Syme 1979; House et 
al. 1982; Helsing et al. 1981; and Schoenbach 
et al. 1986). The expanded contingency 
theory offers a cogent explanation of the 
gender pattern by suggesting that women are 
better able to manage nontechnical aspects of 
health in a family context. It is hypothesized 
that women become more knowledgeable 
about the nontechnical aspects of health 
behavior because of a nonmachismo norm 
which makes it easier to acknowledge physi- 
cal ailments and their role as primary 
socializers for infants and very young chil- 
dren. These early years require mothers to be 
in continual contact with the medical system: 
Consistent with these hypotheses are Umber- 
son's (1987) data and observation that studies 
show women are less likely to engage in 
risk-taking behavior (Waldron 1982), more 
likely to be responsive to health concerns 
(Nathanson 1977), and more likely to engage 
in care-giving related to health (Belle 1982; 
Gove 1984). 

A further factor explaining the weaker 
effect of marital ties for women is that they, 
much more than men, get equal if not greater 
protection from social supports based on kin 
and friends (Berkman and Syme 1983). 
Unmarried women, then, may be less isolated 
than unmarried men. 


Race Effects 


The weaker protective effect of marriage for 
blacks may in part be a consequence of 
black-white differences in socioeconomic 
position in United States society. To begin 
with, blacks are likely to receive inferior 





7 The lower correlation for blacks may also be 
caused by including in the married category 
persons who are separated at time of death. This 
classification error is likely to be larger for married 
blacks than whites because the former are much 
more likely to be separated. 
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formal medical services because of their 
lower socioeconomic standing. The expanded 
contingency theory suggests that ties to 
primary groups without access to good formal 
resources may reduce primary group influ- 
ence on those causes of death that require 
services performed by both formal and 
informal groups. Specifically, this refers to 
deaths assigned to group-service categories 
3.5 to 4.5. If this reasoning is correct, the 
most substantial differences in the mortality 
ratios between blacks and whites should occur 
in these categories. However, an inspection 
of the totals in Table 4 shows that the greatest 
differences occur in the extreme categories, 
that is, causes of death in which primary 
groups play the major or overwhelming role 
(i.e., 5 and 5.5 in Table 4). This pattern is 
more consistent with a second line of 
reasoning emphasizing how the impoverished 
circumstances of many blacks tend to weaken 
the marital bond (Liebow 1973) so its 
members can not manage typical nontechnical 
tasks. Even blacks with viable marital ties 
may benefit less because the poorer and less 
educated groups in our society are generally 
less well informed about recent health promo- 
tion information (Berkman and Breslow 
1983). A strong marital tie may have less 
effect on mortality if marriage partners are 
unaware of and consequently discourage 
healthy behaviors and life-styles. The task- 
specific theory, by highlighting the relation 
between group structure and tasks, illustrates 
how “race” may be conceived as an indicator 
for structural variation in informal social 
supports rather than a mere control variable of 
little theoretical interest. 


Age Effects 


Two possible explanations may be advanced 
to explain the curvilinear relationship. First, 
in extremely old age, biological-physical 
factors may become so overwhelming that 
social supports may play a very diminished 
role. This will be referred to as the “ceiling 
effect.” “Selective mortality” may also 
contribute to the dimension of marriage in old 
age (Vaupal et al. 1979). According to this 
hypothesis, only the very hardiest persons 
survive to very old age without a support 
group. By contrast, the very strength of the 
support group makes it possible to keep 
individuals alive with chronic ailments who 
might otherwise have died. When both of 
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these groups reach very old age, the fact that 
those with support groups systematically have 
a population with poorer health than the 
single people, will lead to elevated death rates 
among the married couples. 

To explain the low ratios among younger 
people, the argument is advanced that it takes 
time for a marital tie to develop. In addition, 
younger people-are more likely to exhibit the 
“male machismo” disregard for health and to 
engage in unhealthy or hazardous behaviors, 
such as reckless demonstration of physical 
prowess, drinking and driving, and driving 
too fast (Langlie, 1977). Therefore, wives in 
the first years of marriage may be unwilling 
or unable to provide the life-sustaining 
services for the sorts of risky activities their 
young husbands might engage in. Marital ties 
become more salient sources of social sup- 
ports as husbands gain maturity, and the 
advent of children makes both spouses more 
committed to preserving each other's health 
as well as more knowledgeable about health 
issues (Gove 1973; Durkheim 1951; Umber- 
son 1987). This delayed commitment to 
enhancing health among the youth, coupled 
with the ceiling effects for the elderly, 
explains the curvilinear relationship. 

There still remains to be explained the rise 
in mortality ratios for women in the extremely 
old group (Schoenbach et al. 1986). Why 
should women suddenly benefit from mar- 
riage in this oldest group? One explanation 
might be that the oldest group of women, 
because of physical frailties, for the first time 
in their marriages may require help to manage 
the everyday health-related household activi- 
ties. This frailty may prompt a request for 
help from their children—unthinkable at a 
younger age. Insofar as children provide help, 
the marital status is spuriously related to 
reduction in mortality. It may also be the case 
that the husband, upon retirement, undergoes 
a form of adult socialization in which sharing 
health-promoting household tasks become 
increasingly legitimated (e.g., becoming 
knowledgeable about cooking nutritious meals, 
learning basic first aid, keeping up with the 
news on health). What is being argued is that 
the marital unit, once a limited social support 
group for women, becomes a more legitimate 
social support group among those of very 
advanced age. 
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General Conceptual Issues-Buffering and 
Social Networks 


The expanded contingency theory also has 
implications for more general conceptual 
problems dealing with social support. One 
question frequently asked is, “Do social 
supports provide a buffering effect or a direct 
one?” (Berkman 1985). The buffering hypoth- 
esis argues that social supports help individu- 
als cope with stressful situations as they arise. 
Otherwise, social supports have little effect. 
The direct effect hypothesis states that social 
support works during noncrisis times as well. 
Indeed they may prevent the development of 
crises. Expanded contingency theory states 
that primary groups can motivate their 
members better and be less costly and faster 
than formal organizations in managing non- 
technical tasks. Insofar as managing nontech- 
nical tasks can prevent crises from happening 
(e.g., providing nutritious meals so people do 
not get sick, providing emotional support in 
daily hassles so they do not build up to a 
crisis situation), from a theoretical perspec- 
tive informal social supports have direct 
effects. On the other hand, in crisis situations 
in which unique forms of nontechnical help 
are necessary, the primary group would also 
be the most efficient group for delivering 
help, so they would have buffering effects as 
well. In short, expanded contingency theory 
provides a unified conceptual framework for 
Berkman’s (1985) important observation that 
social supports in reducing mortality may 
have both buffering and direct effects. 

To make these points clear, consider the 
stressful situation of the death of a spouse, 
which also represents a rupture of one of the 
most important sources of social support for 
most married individuals (Thoits 1982; Berk- 
man 1985). When this event occurs, other 
primary group members may act to reduce the 
resulting stress in one of two ways. First, they 
may provide services that were supplied by 
the departed spouse. For example, a recent 
widower may experience stress because he 
cannot manage everyday problems— such as 
shopping, cooking, and laundry —previously 
managed by his wife. The effect of children's 
social support for their widowed father is 
likely to look like a buffering effect. This is 
the case because children's attempt to supply 
this kind of help while the spouse is alive and 
well would generally be superfluous or even 
create friction. However, this particular form 
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of the buffering effect always implies a prior 
direct effect. 

A second kind of buffering effect occurs 
when the primary group provides a unique 
form of service never supplied by the spouse. 
For instance, the widower might have an 
enormous sense of grief when the spouse dies 
and the children provide him with succor 
against this grief. The spouse could never 
have supplied this service. This kind of 
buffering effect does not assume a prior direct 
effect. 

What is important to note is that the 
concepts of buffering and direct effects are 
low-level theoretical observations. They pro- 
vide little theoretical guidance for stating 
which groups can substitute for which activi- 
ties of the shattered primary group or which 
can provide stress reduction services to an 
ongoing primary group. An elaborated contin- 
gency theory does both of these things 
(Litwak 1985). 

Yet another distinction made by social 
support researchers (Berkman 1985; Thoits 
1982) is that between networks and social 
supports. It is recognized that strong primary 
group structures may sometimes lead to 
greater mortality (Horwitz 1977). This has led 
past investigators to distinguish between the 
structure of the group, which they call 
networks, and the group’s reduction of 
mortality, which they call social supports. 
But no one has yet presented a theoretical 
analysis to suggest the interrelationship be- 
tween the structure of primary groups and the 
provision of social support. For instance, 
explanations emphasizing that social supports 
operate mostly through some form of behav- 
ioral immunology must treat the nonsuppor- 
tive primary group as exceptional cases 
outside of the theoretical framework. These 
immunology theories argue that primary 
groups provide emotional support, and emo- 
tional support directly strengthens the im- 
mune system through its effects on enzymes 
in the blood (Cassel 1976; Berkman 1985). 
Implied is ‘that all primary groups should 
reduce mortality because by definition they 
provide emotional support. 

In contrast, contingency theory permits an 
explanation as to why strong primary groups 
may either decrease or increase mortality. Its 
basic premise is that primary groups will have 
substantial impact on those deaths that can be 
reduced through activities based on everyday 
socialization. A primary group will increase 
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mortality if it seeks to manage causes of death 
that require technical knowledge or the 
resources of formal organizations. For in- 
stance, if primary groups, out of love and 
affection, feel that they can best manage the 
health problems of a member, thereby 
delaying bringing the sick member to a 
hospital, and if the problem is an infected 
appendix that requires surgery, the primary 
group would actually increase the member’s 
risk of dying (Horwitz 1977; McKinlay 
1973). Primary groups will also increase 
mortality if members, out of ignorance or 
group norms, utilize their everyday resources 
to engage in unhealthy risky behavior. For 
instance, 1930s mothers may have had an 
adverse influence on the health of their 
children in at least one respect. Following the 
commonly accepted nutrition guidelines of 
the day, she would have prepared meals 
having ample servings of eggs, whole milk, 
butter, and red meat, all high in cholesterol. 

The reason that past investigators have 
been able to show strong relations between 
social supports and reduction of mortality 
without considering the extent to which 
primary groups inappropriately seek to man- 
age technical matters or utilize everyday 
socialization injurious to its members health 
is that most primary groups have norms which 
protect the health of their members. As a 
result, they generally try to keep current on 
medical advances, and they try to link up with 
health professionals for technical matters. 
This expanded contingency theory makes 
clear what factors one must take into account 
to understand when primary groups reduce 
mortality—that is, provide social support— 
and when they may increase mortality or have 
no effects. 

In conclusion, a wide range of theories 
makes implicit or explicit use of primary 
groups (i.e., social support and mortality, 
Mayhew and Parsons’ views on ascriptive 
groups in social evolution, Durkheim’s views 
on social bonds and egoistic suicides, Katz 
and Lazarsfeld’s “opinion leaders” in mass 
media, and many other studies on the role of 
primary groups in combat, work productivity, 
job search, education, and services to older 
people). None of them systematically contrast 
primary groups with formal organizations to 
show the unique services each contributes. 
The expanded contingency theory provides a 
common framework for seeing the underlying 
unity of these varied endeavors as well as 
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relating these alternative formulations to 
organizational theory. When applied to the 
study of mortality, this contingency theory 
leads to a new classification of causes of 
death and provides a theoretical rationale for 
predicting which causes of death will most 
likely be reduced by informal social supports. 
Finally, this formulation suggests new lines 
of inquiry such as the investigation of the 
joint effects of informal and formal supports 
on mortality. 


APPENDIX 1 
Group-Service Ratings on 75 of the 
Largest ICD’s 


1=Neither formal nor informal, 2= Overwhelm- 
ingly formal, 3 = Majority formal, minority infor- 
mal, 4= Equally formal and informal, 5 = Majority 
informal, minority formal, 6=Overwhelmingly 
informal. Where the raters disagree, an average 
rank was used, usually giving a midpoint score. 


Causes of Death Ranking 
Diseases of heart: 
394 Diseases of mitral valve 3:5 
396 Diseases of mitral and aortic valves 3.5 
402 Hypertensive heart disease 5.0 
404 Hypertensive heart and renal disease 5.0 
410 Acute myocardial infarction 3.0 
411 Other acute and subacute forms of 

ischemic heart disease 3.0 
414 Chronic ischemic heart disease 3.5 
415 Acute pulmonary heart disease 3.0 
425 Cardiomyopathy 1.5 
428 Heart failure 3.0 
Malignant neoplasms: 
141 Malignant neoplasm of tongue 4.5 
146 Malignant neoplasm of oropharynx 4.0 
150 Malignant neoplasm of esophagus 4.0 
151 Malignant neoplasm of stomach 3.0 
153 Malignant neoplasm of colon 3.5 
154 Malignant neoplasm of rectum, 

rectosigmoid junction and anus 3.0 
155 Malignant neoplasm of liver and 

intrahepatic bile ducts 3.0 
156 Malignant neoplasm of gallbladder 2.0 
157 Malignant neoplasm of pancreas 1.0 
161 Malignant neoplasm of larynx 3.0 
162 Malignant neoplasm of trachea, 

bronchus and lung 4.0 
170 Malignant neoplasm of bone and 

articular cartilage 3.0 
171 Malignant neoplasm of connective 

and other soft tissue 3.0 
174 Malignant neoplasm of female breast — 4.5 
175 Malignant neoplasm of male breast 4.0 
179 Malignant neoplasm of uterus, ' 

part unspecified — — 3.0 
180 Malignant neoplasm of cervix uteri 3.0 
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Causes of Death Ranking 
183 Malignant neoplasm of ovary and 
other uterine adnexa 3.0 
185 Malignant neoplasm of prostate 3.0 
188 Malignant neoplasm of bladder 3.0 
189 Malignant neoplasm of kidney and 
other unspecified urinary organs 3.0 
191 Malignant neoplasm of brain 1.0 
192 Malignant neoplasm of other and 
unspecified parts of nervous system 1.0 
193 Malignant neoplasm of thyroid gland — 3.0 
197 Malignant neoplasm of digestive 
systems 3.0 
198 Secondary malignant neoplasm of other 
specified sites 1.0 
200 Lymphosarcoma and reticulosarcoma 3.0 
201 Hodgkins’s disease 3.0 
202 Other malignant neoplasm of lymphoid 
and histiocytic tissue 3.0 
203 Multiple myeloma and immuno- 
proliferative neoplasms 3.0 
204 Lymphoid leukemia 3.0 
205 Myeloid leukemia 3.0 
Cerebrovascular diseases: 
430 Subarachnoid hemorrhage 2.0 
431 Intracerebral hemorrhage 1.5 
433 Occlusion and stenosis of precerebral 
arteries 4.0 
436 Acute but ill-defined cerebrovascular 
disease 4.0 
437 Other and ill-defined cerebrovascular 
disease 4.0 
438 Late effects of cerebrovascular disease 5.0 
Accidents and adverse effects: 
E812 Other motor vehicle traffic accident 
involving collision with motor 
vehicle 4.5 
E814 Motor vehicle traffic accident 
involving collision with pedestrian 4.5 
E815 Other motor vehicle traffic accident 
involving collision on the highway 4.5 
E816 Motor vehicle traffic accident due to 
loss of control, without collision 4.5 
E819 Motor vehicle traffic accident of 
unspecified nature 4.5 
E841 Accident to powered aircraft 5.0 
E880 Fall on or from stairs or steps 2.5 
E884 Fall from one level to another 5.5 
E890 Conflagration in private dwelling 5.5 
E910 Accidental drowning and submersion 5.5 
E911 Inhalation and ingestion of food 
causing obstruction of respiratory 
tract or suffocation 5.0 
E912 Inhalation and ingestion of other 
object causing obstruction of 
respiratory tract or suffocation 5.5 
E916 Struck accidentally by falling object 5.5 
E922 Accident caused by firearm missile — 5.5 
Chronic obstructive pulmonary disease: 
49] Chronic bronchitis 4.0 
492 Emphysema 4.0 
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Causes of Death Ranking 
493 Asthma 5.0 
Pneumonia and influenza: : 
480 Viral pneumonia 3.5 
481 Pneumococcal pneumonia 3.0 
482 Other bacterial pneumonia 3.0 
485 Bronchopneumonia, organism 

unspecified 3.5 
486 Pneumonia, organism unspecified 3.5 
487 Influenza 3.5 
250 Diabetes Mellitus 5.0 
571 Chronic liver disease and cirrhosis 5.0 
440 Atherosclerosis 5.0 
E950- 
E958 Suicide 5.5 


Inter rater reliability R=.71. 
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INTERRACIAL FRIENDSHIP CHOICES IN 


SECONDARY SCHOOLS* 
MAUREEN T. HALLINAN RICHARD A. WILLIAMS | 


University of Notre Dame University of Notre Dame : 

We examine processes that govern the selection of cross-race and same-race 
friends among high school students. Relying on social psychological theories of 
interpersonal attraction, we predict that individual- and school-level factors that 
affect the bases of attraction. will influence the friendship choices. We test 
hypotheses using data from the sophomore and senior cohorts of the High School 
and Beyond Survey. Its large sample, national representativeness, and detailed 
information permit a deeper examination of the determinants of adolescent 
friendship formation than has been previously possible. Students are only one-sixth 
as likely to choose a cross-race than a same-race peer as a friend, and controls for 
school and individual variables account for only a third of this differential. While 
personal characteristics of individual students and pairs of students have the 
strongest effects on friendship choices, organizational characteristics of the 
school, such as tracking, are also important. 


One goal of the massive effort to desegregate 
the American public school system was to 
counter racism by promoting positive atti- 
tudes and behaviors between black and white 
students. A considerable amount of social 
Science research has been conducted aiming 
to understand this process and identify the 
factors that influence student interracial atti- 
tudes and behaviors (See Carithers 1970; 
Cohen 1975; St. John 1975; Amir 1975; 
Stephan 1978; and Schofield 1978, 1982). 
The research on interracial attitudes is 
inconclusive, partly because of methodologi- 
cal flaws in many studies and because of 
noncomparable research designs and analy- 
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ses. Research results on intergroup behavior 
are more consistent, showing that opportuni- 
ties for cross-race interaction influence inter- 
racial sociability and friendship (Schofield 
1978; Patchen 1982; Hallinan and Williams 
1987; Hallinan and Teixeira 1987). Neverthe- 
less, considerable variance appears in the 
frequency of interracial interaction and social 
ties across schools and across classrooms 
within schools. These apparently result from 
student choices and organizational con- 
straints, suggesting that factors other than 
opportunity for interaction also influence 
these social processes. 

This paper examines the interracial friend- 
ship patterns of students in a large national 
sample of secondary schools. The incidence 
and determinants of friendship choices will be 
compared. Characteristics of student dyads 
and of school environments will be related to 
friendship choices. The results will provide 
information on the factors that promote 
cross-race friendships and on personal and 
school characteristics that affect interracial 
sociability. l 


DETERMINANTS OF 
INTERRACIAL ATTRACTION 


Both the theoretical and empirical literature in 
social psychology have identified several 
bases of interpersonal attraction, including 
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propinquity, similarity, complementarity, sta- 
tus, and reciprocity. Among these, propin- 
quity and similarity are believed to have the 
greatest impact on friendship formation. 
Individuals are more likely to establish 
friendship ties with those with whom they 
have opportunities to interact. Given propin- 
quity, individuals are more likely to become 
friends with those who are similar to 
themselves in attitudes, values, and behaviors 
than those who differ. Hypotheses about the 
effects of these two factors, propinquity and 
similarity, on interracial friendship will be 
derived here. 

Propinquity is determined both by chance 
and by choice. The probability of chance 
encounters is influenced by organizational 
factors. Among these are the size and 
composition of the formal groups to which 
students belong. Pupils generally are assigned 
to a nested set of groups within a school for 
instruction. These are grades, tracks, classes, 
and, often, groups within classes. Those who 
are assigned to the same organizational unit 
have more opportunities to interact than those 
in different units. 

The size of the group to which a student is 
assigned determines the number of peers with 
whom within-group chance interactions can 
occur. The larger the group, the greater the 
number of students available for contact. 
Students in large schools, grades, classes, or 
ability groups have a wider range of peer 
contacts than those in smaller organizational 
units. This is true regardless of the race of the 
students. 

While the size of a group determines the 
number of peers available for interaction, the 
composition of the group defines the charac- 
teristics of those peers with whom a student 
has chance encounters. Within-organizational 
unit interracial contact can occur only when 
students are assigned to racially mixed units. 
By its direct effect on the likelihood of 
cross-race interaction, the assignment process 
becomes a mechanism which either promotes 
desegregation or creates resegregation within 
a school. 

Propinquity also is determined by student 
choice, within the constraints of organiza- 
tional decisions. To some degree, students 
can choose to take the same classes as 
selected peers, to participate in the same 
extracurricular and cocurricular activities, or 
to play truant together. But assignment 
decisions generally supersede students' pref- 
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erences about the classes or groups to which 
they would like to be assigned. Moreover, 
academic schedules limit opportunities to take 
cocurricular and extracurricular activities, 
again affecting the probability that a student's 
preferences about class or group selection will 
be taken into account. 

Since interaction, whether by chance or by 
choice, generally leads to positive sentiment 
(Homans 1950), students who are assigned or 
choose to belong to the same instructional 
groups or participate in joint cocurricular and 
extracurricular activities are more likely to 
become friends than those who are in 
different groups. Empirical support for this 
proposition may be found in the literature on 
student social relations. Schofield (1979) 
found an increase in the number of cross-race 
friendship choices of junior high school 
students over a school year when they were 
assigned to racially mixed ability groups in an 
atmosphere that supported positive race rela- 
tions. Hallinan and Sorensen (1985) observed 
the same result for the same-race and 
cross-race friendship choices of elementary 
School children assigned to the same ability 
groups for instruction. 

We are concerned here with the effect of 
organizational characteristics of a secondary 
school on the interracial friendship choices of 
students. Since the organization of instruction 
within a school influences the chance and 
choice contacts of students, we expect it to 
affect the probability of friendship choice. 
Since school activities also affect contact, we 
expect black and white students who partici- 
pate in the same school activities to be more 


' likely to become friends than those who do 


not. 

Once students are in proximity, other 
characteristics begin to influence friendship 
choices. When students have the opportunity 
to observe each other, they can identify 
similarities and differences in personal char- 
acteristics, attitudes, values, and behaviors 
among peers. Observed similarities and/or 
status differences in salient characteristics are 
expected to become a basis of friendship 
formation. Gender differences are a barrier to 
friendship for elementary school students. 
While boys and girls obviously become 
romantically attracted to each other in secon- 
dary school, gender differences can remain a 
barrier to friendship. Moreover, the pervasive- 
ness of the negative effect of gender on 
youth's friendships makes it likely that it 
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transcends race and other background charac- 
teristics. Consequently, we expect to find a 
negative effect of difference in sex on 
friendship formation regardless of the race of 
the chooser or the student being chosen. 

Another student characteristic readily ob- 
servable to students, and generally promoted 
by schools as important, is a student's 
academic standing in the class. Academic 
achievement can be a basis of status in a 
class. If teachers stress the importance of 
good grades, students may assign status to 
their peers based on scholastic performance. 
Consequently, we hypothesize that, regard- 
less of race, a student will be more likely to 
choose a friend whose grades are higher than 
the student's than one whose grades are 
lower. This prediction, of course, rests on the 
assumption that students agree with teachers 
about the importance of grades. 

When students belong to the same instruc- 
tional or activity groups, not only can they 
readily observe existing salient similarities, 
but they also develop new similarities. 
Students establish similar attitudes and values 
through engagement in common activities. 
Students assigned to the same track in school, 
for example, tend to develop similar educa- 
tional aspirations (Alwin and Otto 1977). 
These new similarities promote interpersonal 
attraction. This argument lends further sup- 
port to our prediction that students assigned to 
the same instructional unit are more likely to 
become friends. 

Another factor affecting the attractiveness 
of a peer as a potential friend is whether 
positive sentiment is mutual. À norm of 
reciprocity (Gouldner 1960) suggests that a 
student is more likely to select a peer as a 
friend if the peer regards the student as a 
friend. The positive effect of reciprocity is 
likely to. occur regardless of the personal 
characteristics of the members of a dyad. 
Consequently, we expect a student to be more 
likely to make a cross-race friendship choice 
if the person chosen regards the student as a 
friend. The tendency to reciprocate friendship 
is believed to be so strong that we include it 
as a control variable in our analyses of the 
determinants of friendship choice. 

In this paper, we will test the several 
hypotheses stated above about the effects of 
organizational characteristics of schools and 
characteristics of student dyads on interracial 
friendship choice. These hypotheses have 
been tested frequently on samples of same- 
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race students but rarely on racially mixed 
samples, particularly at the secondary school 
level. Those studies that are available are 
small surveys (e.g., Patchen's [1982] study of 
interracial attitudes and behaviors of students 
in twelve high schools) or case studies (e.g., 
Schofield's [1979] study of race relations in 
an innovative junior high school) Some 
evidence of proximity and similarity effects 
on the interracial friendship choices of 
younger students is reported in a survey of 10 
desegregated elementary schools (Hallinan 
and Sorensen 1985; Hallinan and Teixeira 
1987). What has been missing in the literature 
and what will be provided by this study is an 
analysis of the determinants of interracial 
friendship choice in a large survey of high 
school students. The data for our analysis are 
obtained from the 1980 wave of the High 
School and Beyond survey. Our study is 
believed to be the first multivariate analysis of 
the friend file of this large data bank. 


PROCEDURES 


High School and Beyond is a national, 
longitudinal survey of over 58,000 sopho- 
mores and seniors from over 1,000 public and 
private schools in the United States. The 
sample was drawn according to a two-stage 
probability design with schools within a 
stratum selected with a probability propor- 
tional to their size. In the second stage, up to 
36 sophomores and 36 seniors were randomly 
selected from each school. A detailed descrip- 
tion of the 1980 wave of the HSB survey, 
including a discussion of weighting proce- 
dures and missing data, is found in Coleman, 
Hoffer, and Kilgore (1982) 

In this study we include only the regular 
public and Catholic schools, eliminating the 
various special samples of HSB. We examine 
only those schools that contain at least 10 
percent of each sex. Moreover, we include 
onl]y black and white students, eliminating 
those with other ethnic backgrounds. Cases 
with missing data are removed listwise. 

Information on organizational characteris- 
tics of the schools in the sample (class size, 
tracking system, racial composition) was 
obtained from school personnel. Data on 
student characteristics, (sex, race, grades, 
track, school activities) were obtained from 
student questionnaires. In addition, in the 
student questionnaire, the respondents were 
asked to name three friends who were in their 
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grade (i.e., sophomores or seniors). If a 
student named a friend who was also in the 
HSB sample, the friend’s ID was recorded. In 
over 20,000 instances, a student in our 
sample could be matched with one or more of 
his or her friends. 

Organizing data on student friendships is 
not methodologically straightforward, and a 
number of strategies have been employed by 
different researchers. One common approach 
(e.g., Patchen 1982) is to use the individual 
as the unit of analysis, with the dependent 
variable being whether or not the individual 
has any cross-race friends. Such studies 
typically have sociometric data from all the 
students in the school, making near-perfect 
measurement of the dependent variable possi- 
ble. Unfortunately, individual-level analysis 
is problematic with the HSB data. Only the 
races of friends who were also members of 
the sample are known. Hence, the fact that a 
student was paired only with same race 
friends does not necessarily mean that he or 
she does not have any cross-race friends. It 
just means that if there are any such friends, 
they were not included in the sample. 

Further, individual-level analysis which 
simply looks at whether any friendships are 
interracial does not necessarily give a clear 
picture of how race affects the probability of 
friendship formation. For example, Patchen 
(1982) reports that 34 percent of all black 
students and 31 percent of all white students 
in the eleven high schools in his sample either 
named another-race student as one of their 
best friends or said that they belonged to an 
informal group which was biracial. While this 
information is valuable it is also limited, since 
it does not indicate how much more likely 
individuals are to become friends with 
members of their own race tban with 
members of other races. For example, 
suppose a school were evenly divided be- 
tween black and white students; it would 
make a difference whether half the friend- 
ships of the typical student were interracial or 
whether only one of many friendships was 
interracial. If a study looked only at whether 
students had any interracial friendships, the 
clear differences between these two situations 
could be missed. 

Another common practice in friendship 
studies (see, for example, Hallinan and 
Teixeira 1987) is to use the dyad as the unit of 
analysis and to have the dependent variable be 
whether or not a friendship exists between 
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two peers. This approach makes it possible to 
determine how much (if at all) the race of a 
peer affects the likelihood that he or she will 
be chosen as a friend. Besides the substantive 
advantages of such a strategy, this approach is 
better suited for the HSB data. Even though it 
is not possible, with HSB, to determine with 
certainty whether an individual has any 
cross-race friends, it is possible to determine 
whether a friendship exists between any given 
pair of students, and how much more likely 
friendships are to exist when both members of 
the dyad are of the same race than when they 
are different races. 

In order to examine determinants of 
cross-race friendship choices, it is not suffi- 
cient to look simply at those pairs of students 
in which one individual has named the other 
as a friend. It is also necessary to examine 
dyadic relationships in which friendships do 
not exist. To do this, it was necessary to 
create a dyadic level file that contained 
information on every pair of potential friends 
in the sample, matching each respondent with 
every other respondent in that school and 
grade. After eliminating missing data, we had 
about 450,000 sophomore dyads and 433,000 
senior dyads in the sample. Since this number 
of dyads is too large to analyze efficiently, we 
selected a subsample through the following 
procedure. First, we automatically selected 
those dyads in which one student named the 
other as a friend. This included over 18,000 
dyads in which P named O as a friend 
(regardless of whether O reciprocated the 
choice), and an additional 7,000 dyads in 
which O named P as a friend but P did not 
reciprocate. We then took a 5 percent 
subsample of the remaining dyads, yielding 
43,000 dyads. This led to a total sample size 
of 34,000 sophomores and 34,000 seniors. 
Note that each dyad actually appears twice, 
with the roles of P (“chooser”) and O (the 
potential "choosee") reversed; that is, for 
every dyad in which P is paired with O, there 
is another dyad in which O is paired with P. 
This is because friendship and nonfriendship 
choices need not be reciprocated. Table 1 
presents the total number of dyads sampled 
and the number of friendship dyads, broken 
down by year in school and the racial 
composition of the dyad. 

Determining appropriate weights for the 
selected dyads was not a straightforward 
matter. The original multistage HSB sampling 
scheme gave individuals differing probabili- 


INTERRACIAL FRIENDSHIP CHOICES 
Table 1. Dyads, by Race* 


Sophomores Seniors 
Friend- Friend- 
Dyads ships Dyads  — ships 
Type N) (N) (N) N) 
White-White 29,291 7,912 29,160 8,267 
Black-Black 2,231 748 2,483 891 
White—Black 1,410 88 1,293 78 
Black-White 1,387 86 1,269 94 
Same-Race 31,522 8,660 31,643. 9,158 
Cross-Race 2,797 174 2,562 172 
Total 34,319 8,834 34,2205 9,330 


* Unweighted N's are reported. All other analyses use 
the weighting procedures described in the text. 


ties of selection. The designers of the study 
computed weights to reflect these differences, 
but these weights alone are not adequate in a 
dyadic-level analysis. The main problem is 
that the number of dyads in a school increases 
geometrically rather than arithmetically with 
increases in school size. For example, in a 
School with 100 students, there are 9,900 
dyadic relationships. In a school with 300 
students, there are 89,700 dyadic relation- 
ships— more than nine times as many. Hence, 
weights assigned must take into account the 
probability of an individual's being included 
in the HSB sample, the number of dyads in 
the school and in the sample, and the 
probability that a dyad from the full sample 
would be included in our subsample. We 
therefore assigned weights as follows: 

1. The nonfriendship dyads (which had a 5 
percent probability of inclusion in the subsam- 
ple) were weighted 20 times as heavily as 
friendship dyads (which were selected with 
certainty). 

2. The number of dyads sampled from the 
school was inflated to equal the number of 
dyads in the school. For example, if 1,000 
dyads from a school were included in the 
sample, and there were 10,000 dyads in the 
school, each sampled dyad was given a 
weight of 10. 

3. The HSB data include a school weight, 
which indicates how many comparably sized 
schools in the united states each school in the 
sample represented. Each dyad was weighted 
by this factor. 

4. Weights for all the cases were then 
divided by a constant factor, so that the 
weighted number of dyads equalled the true 
number of dyads in the sample. This keeps 
the standard errors in the latter parts of the 
analysis from being inflated. 


"A 


5. Finally, since each dyad appears twice, 
an additional weighting factor of .5 was used 
for all cases. 

This led to a final weighted N of 17,022 for 
the sophomores and 17,030 for the seniors.! 


Variables 


The dependent variable in the analysis is P's 
choice of O, coded as unity if P chooses O as 
a friend and zero otherwise, The independent 
variables include both organizational charac- 
teristics of the school and characteristics of 
the student dyads. The school characteristics 
include class size (CLASS SIZE). A transfor- 
mation of the class size variable (LCLAS- 
SIZE) is also sometimes used because it is 
more mathematically appropriate for parts of 
the analysis. It is measured as the logarithm 
of the inverse of class size adjusted for the 
number of friends named. 

Several dyadic level independent variables 
are included in the analysis. The sex of the 
dyad members (DIFF SEX) is coded as unity 
if the two members are of different sex and 
zero if they are the same sex. The racial 
composition of the dyad (CROSS-RACE) is 
coded as unity if one dyad member is black 
and the other white and zero if they are -both 
black or both white. On the student question- 
naires, the students were asked whether they 
were in the academic, vocational, or general 
track in school. The dyadic variable (DIFF 
TRACK) is coded as unity if the two dyad 
members are in different tracks and zero if 
they are in the same track. 

The students were also asked to report 
whether they received mostly A's (coded 7), 
about half A's and half B's, mostly B's, 


about half B's and half C's, mostly C's, about 


! Several other weighting schemes were also tried. 
One approach ignored school size and the dispro- 
portionate sampling of the original HSB data col- 
lection. Other approaches incorporated various de- 
sign weights that are calculated by the collectors of 
the HSB data. Different weights produced modest 
differences in the descriptive statistics; in particu- 
lar, the friendship rate varied from 1.3 percent to 2 
percent, and there were also minor differences in 
the racial compositions of the weighted samples. 
These differences primarily reflect the fact that dy- 
ads from larger schools constitute a larger part of 
the sample when school size is uscd,as-a weighting 
factor. However, except for the-i ercept Ji, pa- 
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half C’s and half D’s, mostly D’s, or mostly 
below D. These responses were coded 7 
through 0 respectively. Two variables were 
constructed to measure difference in grades. 
The first, P GRADES > O, is coded as the 
numeric difference between P’s and O's 
grades if the difference is positive (i.e., P’s 
grades are higher than O's), and zero 
otherwise. The second, O GRADES > P, is 
coded as the difference between P’s and O’s 
grades if the difference is negative and zero 
otherwise. Measuring difference in grades in 
this way has the advantage of showing 
whether the relatively higher academic status 
of a peer has a different effect on a student’s 
friendship choice than a relatively lower 
status. The number of school activities 
(sports, cocurricular activities, etc.) to which 
both members of a dyad belong (# SAME 
ACTIVITIES) is coded as a continuous 
variable. Finally, reciprocity (O CHOOSE P) 
is a dichotomous variable, coded as unity if O 
chooses P as friend and zero otherwise 
(regardless of whether P named O). Other 
variables, including differences in SES, were 
also examined but were shown to have no 
effect on friendship choice and consequently 
were eliminated from the final analyses.? 


Methods of Analysis 


We will report both descriptive and inferential 
analyses of the data. The descriptive statistics 
will reveal characteristics of this important 
data set that have not yet been reported. They 
will show the incidence of interracial friend- 
ship choice and determine whether patterns of 
cross-race friendships are similar across 
grade. In the inferential analysis, we will 
estimate a multivariate logistic model of the 
determinants of cross-race and same-race 
friendship choice. A logistic model is pre- 
ferred because the dependent variable is 
dichotomous. The model will be estimated for 
both the total sophomore and the total senior 
dyadic samples, for the cross-race and 
same-race subsamples, and finally for the 
black-white, white-black, black-black, and 
white-white subsamples of dyads. 

Before we proceed, some implications and 
limitations of the HSB data and our strategy 
for analyzing them should be made clear. 





? Additional details on how the dyadic-level data 
set was constructed are available from the authors. 
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First, the HSB friendship measures have a 
number of weaknesses. Students are asked to 
name three, and only three, friends. Almost 
all do so. Hallinan (1974) and Holland and 
Leinhardt (1975) have identified problems 
with such measures. An individual may want 
to make more than three choices but be unable 
to do so. Conversely, he or she may want to 
name fewer but name three anyway. Hence, 
some individuals may name borderline ac- 
quaintances who do not reciprocate the 
choice, while others may want to name a 
friend but be prevented from doing so because 
they have named three friends already. The 
inclusion of unimportant relationships or the 
mislabeling of important ones may affect the 
validity of the results. In addition, the 
three-friend limit means that friendship rates 
will be low: while students may have 
hundreds of classmates to choose from, they 
can name only three as their friends. 

Second, even though there are several 
thousand dyads in the sample, only a few 
hundred of these involve interracial friend- 
ships (see Table 1). Clearly, compared to 
same-race friendships, interracial friendships 
are somewhat rare in American high schools. 
These small numbers make it difficult to 
identify significant determinants of interracial 
friendships, and they also increase the 
likelihood that apparent differences between 
same-race and cross-race friendships could be 
due to sampling variability. Analyses will 
therefore be presented for several different 
subgroups to provide the clearest possible 
picture of interracial friendship formation. 

Third, it is important to keep the unit of 
analysis in mind when interpreting these 
results and comparing them with findings 
from other studies. For example, Patchen’s 
(1982) finding that many individuals have at 
least one cross-race friend is not inconsistent 
with our finding that cross-race friendships 
are much less common than same-race 
friendships.? 


FINDINGS 


In Table 2, results are presented separately for 
sophomores and seniors and for those dyads 





? Another factor contributing to Patchen's high 
interracial friendship rates is that respondents in his 
study were allowed to name five best friends, 
while HSB respondents were allowed to name only 
three. 
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Table 2. Means and Standard Deviations of Grade-Level and Dyadic-Level Variables for Sophomore and Senior 
Samples, by Friendship Status 


Sophomores Seniors 

(N 17,022) (N 17,030) 
Variable Mean ` S.D. Mean S.D. 
CLASSIZE (in 100s) 

All dyads 5.70 2.86 4.53 2.12 
Nonfriends 5.72 2.86 4.54 2.12 
Friends 4.76 3.09 3.66 2.20 

LCLASSIZE 

All dyads ~ 514 62 —4,92 60 
Nonfriends —5.15 62 —4.93 59 
Friends —4.84 79 —4.58 79 

O CHOOSE P (*100) 

All dyads 1.24 11.08 1.34 11.50 
Nonfriends .48 6.94 54 7.35 
Friends 60.78 48.94 59.23 49.25 

CROSS-RACE (*100) 

All dyads 14.80 35.51 11.92 32.40 
Nonfriends 14.96 35.66 12.05 32.55 
Friends 2.83 16.61 2.38 15.28 

DIFF SEX (*100) 

All dyads 49.60 50.00 48.29 49.97 
Nonfriends 50.14 50.00 48.78 49.99 
Friends 7.73 26.77 12.35 32.98 

DIFF TRACK (*100) 

All dyads 61.87 48.57 64.84 47.75 
Nonfriends 62.09 48.52 65.06 47.68 
Friends 44.39 49.80 48.82 50.09 

P GRADES > O 

All dyads 87 1.30 ` TI 1.15 
Nonfriends . 88 1.31 7] 1.15 
Friends 63 1.03 51 .93 

O GRADES > P . 

AII dyads —.85 1.29 ~.75 1.13 
Nonfriends — 85 1.29 -.15 1.13 
Friends —.69 1.06 — .64 1.00 

# SAME ACTIVITIES 

All dyads 66 .88 .83 1.13 
Nonfriends .66 .88 .82 1.12 
Friends 1.06 1.10 1.54 1.56 





in which P did and did not name O as a 
friend. The average sophomore in the sample 
has 570 classmates; the average senior has 
453. The average class size is smaller for the 
friend dyads than for the nonfriend dyads. 
This does not mean that students from smaller 
schools are friendlier. Rather, because all 
students were limited to three friendship 
choices, the larger the school, the less likely 
it is that any particular pair of individuals will 
be friends. For example, if a class has 10 
people and an individual names three of them 
as friends, then he or she is friends with 30 
percent of the class. If, on the other hand, the 
class has 100 people, then he or she is friends 
with only 3 percent of the class.* 


^ Differences in school size account for a small 


The mean for the reciprocation variable 
reveals that, in about 1.3 percent of the 


part of the other differences that are observed 
between the friend and non-friend dyads. For 
example, separate analyses (not shown) reveal that 
there is a very small correlation between the racial 
composition of a school and its size. Schools that 
are highly segregated are somewhat smaller, on the 
average, than schools that have a more even racial 
balance (and hence have more cross-race dyads). 
Even if friendship choices were totally unaffected 
by race, these size differences would cause the 
same-race and cross-race friendship rates to differ, 
since the larger the school, the less likely it is that 
any pair of students ‘will be friends. However, as 
the multivariate analyses reveal, differences in 
school size alone do not account for the large racial 
differences in friendship rates. 
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dyads, O names P as a friend. (Since each 
dyad appears twice with the roles of P and O 
reversed, this also means that P names O 1.3 
percent of the time.) Again, this low number is 
not surprising, since students can name only 
three friends even though many have several 
hundred choices. However, when P names O 
as a friend, the choice is reciprocated about 
60 percent of the time. We will examine this 
finding in greater detail shortly.5 

About 15 percent of the sophomore dyads 
and 12 percent of the senior dyads are 
composed of students of different races. 
However, less than 3 percent of the friendship 
dyads are interracial. That is, the friendship 
dyads have only one-fifth as many cross-race 
members as are found in the general student 
population. Results are similar for the gender 
composition of the dyad. As would be 
expected, half the dyads involve boy-girl or 
girl-boy combinations. However, among 
friends, only 8 percent of the sophomore 
dyads and 12 percent of the senior dyads are 
composed of members of a different sex. 

Over 60 percent of the dyad members are 
in different academic tracks. Among friends, 


5 The friendship rates are actually higher than 
they should be. Because students could name only 
three friends, and because the average sophomore 
class had 570 students and the average senior class 
had 457, the friendship rates should be 3/570 = 
.53 percent for sophomores and 3/457 = .66 
percent for seniors. The fact that the observed rates 
are greater than this implies that students were 
more likely to name as friends classmates who 
were also in the sample than classmates who were 
not included. We suspect this occurred because the 
questionnaire usually was administered to all 
respondents at the same time and in the same 
room; when asked to name three friends, students 
gave preference to those who happened to be 
within eyesight. We do not believe this is 
particularly problematic, however. Students can 
have more than three friends, so the friendship 
choices made can still represent true friendships. 
Students were selected randomly, so any biasing 
effect should be the same across racial categories; 
hence, comparisons will not be distorted. Most 
importantly, even though this bias should cause the 
magnitude of the intercept term in the logistic 
regression to be too large, effect of other variables 
should be unaffected. That is, since respondents 
were chosen randomly, the tendency to name other 
sample members as friends should be reflected in 
inflated intercept terms (because the overall 
friendship rate is higher), not in distortions of other 
parameters. 
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however, the figure is about one-fourth 
lower. The means for the grade difference 
variables reveal that friends tend to be 
somewhat closer in academic ability than are 
randomly paired classmates. (Note that, for 
both the positive and negative differences, the 
means for friends are closer to zero than are 
the means for the nonfriends.) Overall, 
dyadic members have fewer than one extra- 
curricular activity in common, while friends 
have more shared activities. 

Table 3 presents the percentage of dyads in 
which P chooses O as a friend for the 
Sophomore and senior samples. Table 3 
shows that cross-race friendship choices occur 
far less frequently than same-race choices in 
both the sophomore and senior samples. 
Friendships are eight times as common in 
sophomore black-black dyads as they are in 
sophomore black-white dyads. The ratio is - 
slightly higher for black seniors. Whites are 
almost six times as likely to be friends with 
any randomly chosen white as they are with 
any randomly chosen black. 

Data in table 3 also show that the 
friendship reciprocation rates are quite high, 
given the low percentage of friendship 
choices overall. For sophomores, there are no 
differences in reciprocation rates for cross- 
race and same-race friendship, and only 
modest differences exist among seniors. 
Given the much lower incidence of cross-race 
compared to same-race friendships, this lack 
of major differences in reciprocation rates 
across racial categories is somewhat surpris- 
ing. This provides strong support for a norm 
of reciprocity governing both cross-race and 
same-race friendships. The effect of recipro- 
cation on the likelihood of cross-race and 
same-race friendship choices will be reexam- 
ined in the multivariate analysis where 
appropriate controls can be included. 


Table 3. Friendship Choices and Friendship Choices 
Reciprocated, by Racial Category 


Sophomores Seniors 
Friend-  Recipro- Friend- Recipro- 

ships cated ships cated 

Dyad Type (%) (96) (96) (96) 
White-White — 1.36 61.5 1.45 59.4 
Black-Black 2.11 57.1 2.41 60.1 
White-Black 0.23 62.1 0.27 50.2 
Black-White 0.25 59.4 0.27 50.7 
Same-Race 1.44 60.8 1.51 59.4 
Cross-Race 0.24 60.7 0.27 50.4 
Total 1.26 60.8 1.36 $9.2 
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The low incidence of interracial friendships 
appears to dramatically illustrate racial cleav- 
age among high school students. However, 
we have suggested several factors that 
influence friendship formation and some of 
these may account for at least some of the 
differences between the races. We now 
examine the likelihood of a cross-race friend- 
ship choice in a multivariate analysis in which 
all relevant variables are controlled simulta- 
neously. 

In tables 4 and 5 logistic regression models 
for both sophomores and seniors are esti- 
mated for the total sample and for each racial 
category of dyad separately. Several interest- 
ing and important results emerge. Even after 
controlling for several variables simulta- 
neously, we find that students are much less 
likely to name cross-race than same-race 
students as their friends. This is illustrated by 
the significant coefficient for CROSS RACE 
(about — 1.1) in the model for the total 
sample. When CROSS RACE alone is 
included in the model (not shown), its 
coefficient is about — 1.8 for both sopho- 
mores and seniors. Hence, the different 
variables examined account for only about à 
third of the difference in the likelihood of 
cross-race and same-race friendship choices. 

O's naming P as a friend (O CHOOSE P) 
strongly affects the likelihood that P will 
choose O as a friend in all racial categories. 
The effect of reciprocation is strongest for 
interracial friendships. Since interracial friend- 
ships are not common, students may be 


75 


especially unwilling to make such choices 
unless the feelings of friendship are likely to 
become mutual. i 

In all groups, P is much less likely to name 
O as a friend when O is of the opposite sex. 
The effects of gender differences are gener- 
ally smaller for seniors than sophomores, 
however, suggesting that gender differences 
may become less important with age or that 
romantic attachments overcome the gender 
barrier to friendship. 

Being in a different track tends to reduce 
the probability that a friendship choice will be 
made. Note that here (and for several other 
variables) effects are not statistically signifi- 
cant across all racial categories. However, the 
signs and magnitudes of effects are similar 
across groups. This suggests that differing 
levels of statistical significance reflect differ- 
ences in sample size rather than differences in 
the importance of tracking. 

Interestingly, participation in the same 
activities has a small effect on same-race 
friendships: but does not affect interracial 
friendship choice. The absence of a predicted 
effect here may be due to the small number of 
activities that students participate in, as seen 
in Table 2. It could be that part-time 
employment and bus schedules limit the 
number of school activities to which students 
belong. 

O's having higher grades than P does not 
significantly affect the likelihood that P will 
name O as a friend in any racial category, nor 
is there any consistent pattern of effects. 


Table 4. Logistic Regression Models for Sophomores, by Racial Category (Standard Errors in Parentheses) 


Variable TOTAL . WH-WH BL-BL WH-BL BL-WH SAME CROSS 
INTERCEPT -2.62* -231* -3.2 —3.28 —4.03 —2.64* —3.69 
(.68) (74) (1.95) (6.32) (6.04) (.68) (4.40) 
LCLASSIZE 35" 42* . 59 31 34* 43 
(.13) (14) (.35) (1.17) (1.11) C13) (.81) 
O CHOOSE P 5.12* 5.14* 4.49* 6.81* 6.84* 5.07* 6.80* 
(.19) (.21) (.47) (1.62) (1.63) (.19) (1.11) 
DIFF SEX -172* —1.76* —1.47* —1.62 —2.24 -171* — 1.86 
(27 (31) (.64) (2.29) (2.62) (.28) (1.70) 
DIFF TRACK —.36* —.34 —.54 —.55 — 44 —.35* —.53 
(18) (.20) (.45) (1.52) (1.51) (.18) (1.06) 
P GRADES > O -47 —.8 —.14 —.02 —15 —.18* —.09 
(.09) (.10) (.23) (.70) (.81) (.09) (.52) 
O GRADES > P .09 .08 : -.31 20 09 ~.04 
(.09) (.09) (.23) (.65) (.68) (.09) (46) 
SAME ACTIV .17* .16 28 —.10 dg .19* -.11 
(.09) (.10) (.20) (.88) (.81) (.09) (.60) 
CROSS RACE —1.16* 
(.45) 


* significant at the .05 level. 
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Variable TOTAL WH-WH  BL-BL 
INTERCEPT -2.37* -233*. | —231 
(.63) (.67) (2.19) 
LCLASSIZE 43" 45* 32 
(.12) (13) (.43) 
O CHOOSE P 5.02* 5.00* 4.68* 
(.18) (.19) (.57) 
DIFF SEX — 1.30* —1.30* —1.18 
(.22) (.24) (.70) 
DIFF TRACK -.39* -.33 -27 
(.17) (.18) (.56) 
P GRADES > O —.20* —.20* —.24 
(.09) (.10) (32) 
O GRADES » P .07 .07 .18 
(.09) (.09) (.31) 
SAME ACTIV .19* .21* Bel 
(.06) (.06) (.18) 
CROSS RACE —1.08* 
(46) 


* = significant at the .05 level. 


However, P is generally less likely to name O 
as a friend when P's grades are higher. The 
effect is smaller and not statistically signifi- 
cant for interracial friendships. It has fre- 
quently been noted that individuals tend to 
choose as friends those who have higher 
abilities than themselves. Our findings pro- 
vide modest evidence that for same-race 
friendships this may occur not so much 
because individuals respect their higher- 
achieving peers but because they look down 
on those with lower achievement. Students 
may care when others are less capable than 
themselves but be less aware or concerned 
when they themselves are less capable than 
others. The absence of a status effect for 
cross-race friendship choices is consistent 
with Patchen's findings that status differences 
had little effect on friendly interracial 
relations.® 


$ We also examined the effect of school racial 
composition on friendship choices. We looked at 
proportion black, the proportion of cross-race 
dyads in the school, and various nonlinear effects 
of school racial composition. No significant effects 
could be found. This might seem to suggest that 
racial integration has absolutely no effect on 
interracial friendships, perhaps because racial 
barriers to friendship are too strong, or perhaps 
because tracking policies resegregate desegregated 
schools. We believe, however, that effects of 
racial integration are not totally captured by the 
present analytical approach. Recall that we are 
using the dyad, and not the individual, as our unit 
of analysis. Factors that affect the likelihood that 
any two individuals will be friends are examined. 


WH-BL . BL-WH SAME CROSS 
—4.90 —1.86 —2,.35* —3.30 
(6.28) (5.72) (.64) (4.19) 
15 93 ,44* 54 
(1.19) (1.14) (.12) (.81) 
6.34* 6.48* 4.99* 6.27* 
(1.65) (1.73) (.18) (1.14) 
—2.33 —.98 —1.29* —1.59 
(2.46) (1.81) (.02) (1.42) 
—.81 - 23 -32 —.54 
(1.46) (1.50) CIN (1.03) 
-.07 —.10 —.21* - 07 
(.63) (.96) (.10) (.52) 
-42 ~ 23 .08 — 28 
C70) (.54) (.09) (.43) 
.00 .10 20* 06 
(.62) (.59) (.06) (.42) 
DISCUSSION 


The analyses in this paper illustrate how much 
less common interracial friendships are than 
same-race friendships in American high 
schools. Even though 59,000 students were 
interviewed in the High School and Beyond 
Survey, and almost a million dyads were 
available for the present analysis, only a few 
hundred interracial friendships could be 
identified. Controls for a number of dyadic 
and school-level variables could account for 
only about a third of the differential in the 
likelihood of same-race versus cross-race 
friendships. 

The analyses also suggest a number of 
mechanisms through which dyadic and school 
characteristics affect the social relationships 
of black and white students. The social 
psychological processes that govern interra- 
cial friendships appear to be the same as those 
influencing same-race friendships. Our analy- 
ses of the HSB data indicate that the 
well-known bases of interpersonal attrac- 


Hence, the analysis tells us that racial composition 
has no effect on the likelihood that any particular 
white will be friends with any particular black. 
However, this does not mean that school racial 
composition has no effect on the likelihood that a 
white will have any black friends at all, or vice 
versa. For example, whites who attend heavily 
black schools may be more likely to have at least 
one black friend than whites who attend schools 
that have fewer blacks. To address this possibility, 
we plan to conduct future analyses employing 
other units of analysis. 
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tion— proximity, similarity, status, and reci- 
procity —are important determinants of both 
cross-race and same-race friendship choices. 
The race and sex cleavages that frequently 
have been documented for elementary school 
students continue through secondary school. 
This is not to suggest that strong cross-sex 
and possibly cross-race friendship ties do not 
occur between high school students, but 
rather that students typically form friendships 
within their own race and gender categories. 

The small but consistent effects of tracking 
suggest that school organizational characteris- 
tics can also affect interracial friendship 
formation. The distribution of students across 
organizational units can effectively integrate a 
school population by ensuring a reasonable 
mix of black and white students within 
instructional groups. When this occurs, inter- 
racial interactions take place by chance as 
well as by choice. Students have the opportu- 
nity to get to know cross-race peers better, to 
observe existing similarities, and to develop 
new similarities that may transcend the racial 
barrier to positive effect. 

On the other band, the assignment of 
students to tracks can effectively resegregate 
a desegregated school. Because the average 
academic performance of whites generally has 
been superior to that of blacks, a tracking 
system that is based solely on the distribution 
of achievement across a grade usually results 
in a disproportionate number of whites in the 
academic track and of blacks in the general or 
vocational track. When this occurs, opportu- 
nities for interracial interaction within track 
are seriously limited. Black and white stu- 
dents are denied the chance to recognize 
existing similarities between them and to 
develop new ones that would foster positive 
interracial sentiment. 

Contact theory suggests another reason 
why tracking policies may be important. 
Contact theory claims that contact alone is 
insufficient to improve race relations: contract 
in a context of status equality (Allport 1954) 
or status reversal (Cohen 1975) is required. 
The assignment of students to tracks locates 
them on a status hierarchy in the school 
population, with students placed in the same 
track generally having equal academic status. 
When tracks are racially mixed, black and 
white students in the same track interact in a 
situation of status equality, at least with 
respect to academic performance. 

Finally, the most powerful determinant of 
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the friendships of both black and white 
students is reciprocity. Therefore, it is crucial 
that students recognize when a classmate 
regards them as friends. The more opportuni- 
ties students have to interact, the more likely 
this is to occur. Again, the organization of 
students in school emerges as a factor in 
affecting interracial friendships through its 
influence on opportunities for interaction. 

The more fundamental question, of course, 
is how to encourage cross-race friendship 
choices in the first place, since once a choice 
has been made and recognized, it is likely to 
be reciprocated. The analyses reported here 
indicate that this is no simple task. Interracial 
friendships are far less common than same- 
race friendships, and the dyadic and school- 
level variables examined can account for only 
about a third of this differential. Further, 
student characteristics beyond the control of 
school administrators have the strongest 
effects on friendship choices. Nevertheless, 
schools are not totally powerless to act in this 
area. Our analysis suggests that the assign- 
ment of students to tracks or other instruc- 
tional units is an organizational factor that 
directly affects cross-race friendship choices. 
The effects of achievement differences on 
friendship choices suggest that classroom 
policies that decrease status differences and 
give all students a chance to win the esteem of 
their peers may enhance the likelihood that 
interracial friendships will form. Since these 
factors can be modified by school personnel, 
they represent possible means for influencing 
the social relations between black and white 
students and for promoting interracial friend- 
ship. 
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MALE AND FEMALE TASK GROUPS* 


CECILIA RIDGEWAY DaviD DEKEMA 
l University of lowa 


To clarify the impact of dominance behavior on status in task groups, the formation 
of status hierarchies must be viewed as the collective product of joint interaction 
among the entire network of group members rather than as an aggregate of 
independent dyadic encounters. A network-collective analysis indicates that the 
impact of a dyadic dominance contest on status depends critically on the reaction 
of group members who are bystanders to it. Structural conditions induced by the 
group’s task encourage bystanders to intervene against members who claim status 
by dominance behavior, limiting its role in status allocation. In an experiment with 
four-person same-sex decision groups, two bystanders intervened to attack the 
dominance behavior of another confederate. Moreover, the dominant confederate 
was disliked and was no more influential in the group than a neutral confederate. 
These results held for both male and female groups, although levels of dominance 


DOMINANCE AND COLLECTIVE HIERARCHY FORMATION IN 


behavior were higher in male groups. 


Several writers have suggested that domi- 
nance behavior may play an important role in 
the formation of status hierarchies in face-to- 
face groups (Keating 1985; Mazur 1985; Lee 
and Ofshe 1981; Ellyson and Dovidio 1985). 
-Yet neither the precise nature nor the 
significance of this role has been ascertained, 
especially in task-oriented groups. We argue 
that this issue has been difficult to resolve 
because most analysts take a dyadic approach 
to the status formation process. Clarifying the 
role of dominance behavior in task groups 
forces us to consider basic questions about the 
nature of status hierarchy formation. 

Early researchers such as Bales (1950) and 
Homans (1950) viewed a face-to-face status 
order as an emergent collective property of 
the conjoint behavior of all group members. 
However, subsequent theory and research has 
tended to focus on outcomes of dyadic 
encounters (e.g., Strongman and Champness 
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1968; Rosa and Mazur 1979; Mazur, Rosa, 
Faupel, Heller, Leen, and Thurman 1980; 
Berger, Conner, and Fisek 1974). This has 
been particularly true of dominance-oriented 
approaches to status (e.g., Jones 1983; Lamb 
1986). These researchers apparently assume 
that hierarchies in groups of three or more 
simply reflect the aggregate results of inde- 
pendent dyadic encounters. 

Following Chase (1974, 1980) and Fararo 
and Skvoretz (1986), we argue against this 
assumption and posit that if status emergence 
is analyzed in terms of the entire network of 
group members, the role of dominance 
behavior in task groups can be clarified. In 
this collective approach, the impact of a given 
dominance contest on status formation de- 
pends critically on the reaction of group 
members who are bystanders to the pairwise 
dominance encounters. In task groups, it 
suggests that bystanders will intervene to 
block efforts to claim status by dominance 
behavior alone. This limits the role of 
dominance behavior in the formation of status 
hierarchies in task groups of three or more, 
whatever its role in dyads or nontask groups. 
This paper reports an experimental test of this 
argument. 


DYADS, BYSTANDERS, AND 
HIERARCHIES 


Evidence suggests that hierarchy formation in 
groups of three or more does, in fact, depend 
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on the larger network of interactants within 
which dyadic encounters are embedded. In 
independent dyadic encounters, individuals 
have only their own personal attributes 
through which to establish rank over another 
since no involved third parties can lend 
support to either contestant or respond to the 
outcome of the contest.! Consequently, there 
should be a high correlation between an 
individual’s personal status attributes and his 
or her rank in status hierarchies if the 
dyadic-aggregate approach is an accurate 
description of hierarchy formation (Chase 
1974). Chase (1974, 1980), however, has 
shown that correlations between individual 
differences in such attributes and rank in 
status hierarchies are not high enough among 
people or animals to allow this dyadic- 
aggregate approach to account for the near 
linear or transitive nature of most status 
hierarchies. 

Chase (1980) further investigated the struc- 
tural dynamics of hierarchy formation by 
examining animal groups in which rank is 
achieved by the dominance attack of one 
individual on another. He demonstrates that 
for a nearly transitive hierarchy to emerge 
from a series of such dyadic dominance 
contests, bystanders to a given pairwise 
encounter must behave in specified ways. 
Most importantly, they must not intervene in 
a given dominance encounter by showing 
support for or coming to the aid of the 
attacked member. When defeated in a contest 
themselves, they must react by attacking 
another or withdrawing rather than by joining 
with others in a coalition against the domina- 
tor. Some animals do generally react in this 
way when in a bystander role, resulting in a 
transitive hierarchy in which rank is deter- 
mined by the outcome of interdependent sets 
of dyadic dominance contests. 

Both Chase (1980) and Mazur (1985), 
however, noted that people sometimes inter- 
vene in one member’s dominance attack on 
another and often form coalitions which affect 
the outcome of dominance contests.? If a 





! These personal attributes can include learning 
based on prior personal experience as well as other 
characteristics that the individual possesses. 

? Since it involves effective coalitional behav- 
ior, rather than mere contingency in response by 
dyadic contests, bystander intervention differs 
from the bystander behavior analyzed by Chase 
(1980) and Fararo and Skvoretz (1986) based on 
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coalition in a small group refuses to acknowl- 
edge the status claim of a lone member, then 
that claim generally fails. When coalitions 
occur, then, they are likely to substantially 
determine the status ranking of group mem- 
bers. If bystanders intervene in a dominance 
contest, they in effect form a coalition to 
support either the dominator or the attacked 
member against the dominator’s claims. . 
When the effective coalition is with the 
attacked member, bystanders do not necessar- 
ily grant higher status to that member but 
rather oppose the dominator’s efforts to gain 
rank over him or her. When intervention 
occurs, the rank achieved by the dominator 
will be more strongly determined by by- 
stander support or resistance than by the 
dominator’s ability to extract submissive 
signs from the challenged member. With such 
intervention, then, the process of status 
formation becomes collective, based on 
coalitional support rather than purely individ- 
ual competition. 


A NETWORK-COLLECTIVE APPROACH 
TO TASK GROUPS 


To understand why bystander interventions 
might occur, we must examine status forma- 
tion in terms of the entire network of group 
members. We argue that the nature of the 
interdependence among members induced by 
a cooperative task makes such interventions 
especially likely in task groups. This analysis 
assumes three or more members since with 
two there are no bystanders to pairwise 
encounters, and the dyadic-aggregate and 
network-collective approach to status forma- 
tion are conflated. 

In human task groups, high-status members 
exercise greater control over task activities 
than low-status members and, consequently, 
have a disproportionate impact on the group’s 
task accomplishment (Mazur 1973). As a 


Chase’s data for chickens. Bystander intervention 
and coalitional behavior are common among higher 
primates as well as humans (Chase 1980; Mitchell 
and Maple 1985), causing some ethologists to 
question the adequacy of dyadic dominance 
approaches to explain their hierarchies as well. 
Current evidence indicates that an individual 
animal’s rank in such primate hierarchies seems to 
be determined by social skills such as the ability to 
form alliances as well as the ability to manipulate 
others (Mitchell and Maple 1985; Goodall 1987). 
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result, when one group member defers to. 


another, thereby raising the other's status, the 
first member has effectively exchanged defer- 
ence for additional task contributions from the 
second member. When members come to- 
gether with a mutual interest in accomplishing 
a given cooperative task and a knowledge that 
high-status members will disproportionately 
affect task accomplishment (as they do in task 
groups), this effective exchange exists as part 
of the structure of their group situation. In a 
group of three or more, it gives rise to a 
positively connected exchange network among 
the members in which deference granted by 
one member (Mj) to a second member (M2) 
results in task services performed by the 
second member which yield benefits (or 
losses) not only to the first member but to all 
other members (M4—M,). Thus, the ex- 
change of benefits between Mz and M3 is 
dependent on the exchange of deference 
between M, and M2, or vice versa, creating a 
positively connected network (Emerson 1981; 
Cook and Emerson 1978).? 

This positively connected exchange net- 
work has two related effects. First, it creates a 
Structure of potential coalitions that makes 
members’ inferred task abilities, not their 
degree of dominance behavior, the normative 
basis for status allocation in task groups. 
Second, it creates a structural likelihood. of 
bystander intervention against efforts to attain 
status by dominance behavior alone. 

Status theorists have often noted that, given 
the cooperative interdependence induced by a 
shared task, it is in the rational self-interest of 
each member to defer to others on the basis of 
relative expectations for task performance 
(Bales 1950; Homans 1961; Berger et al. 
1974). On the other hand, it can also be 
argued that a member's interest in personal 
dominance over others may sometimes out- 
weigh shared interest in task accomplishment 
and limit willingness to defer to others with 
greater inferred task capacity. Given these 


? Emerson (1962, 1981) Distinguishes between 
networks and groups, arguing that networks are 
transformed into groups when norms develop. 
Although we are discussing a group in its 
formative stages before its normative structure has 
developed, we assume that, even after this occurs, 
the exchange network remains as a structural 
potential that sets certain bounds on normative 
relations. This assumption differs somewhat from 
Emerson's view. 


81 
conflicting possibilities, it is instructive to 
consider the impact of the positively con- 
nected network on members' interests. Such a 
network makes it in the interests of each 
member that all other members grant defer- 
ence on the basis of inferred task capacity 
rather than another criterion such as domi- 
nance. That is to say, it is in each member's 
interest that others' status be given on the 
basis of inferred task capacity, whatever they 
each want for themselves. This means that, if 
group members have higher expectations for 
M,’s task performance than Mo's, they are 
likely to pressure Mo to defer to M;, in effect 
forming a coalition with M; against Mb, 
regardless of M2’s personal desire to defer. 
As a consequence, even if each member seeks 
personal dominance, each will be faced with 
an implicit coalition of others willing to grant 
deference only in proportion to relative 
expectations for task contributions. This - 
creates a power structure that enforces 
inferred task capacity as the appropriate basis 
for status allocation. 

Emerson (1962, 1972) has noted that group 
norms are properly viewed as the collective 
“voice” of a coalition that would stand 
against a member who violates them. Since 
the positively connected network creates the 
structural likelihood of coalitions forming in 
support of deference to signs of task capacity, 
these potential coalitions in effect create 
norms defining expectations for task contribu- 
tions as the legitimate basis for status in the 
group. By effectively creating these legitimat- 
ing norms, this positively connected network 
also creates a structural likelihood that 
bystanders will enforce these norms by 
intervening against a member's efforts to 
claim status over another on a basis other than 
task capacity. Legitimacy allows members to 
act with group support in censuring such 
norm violations. Consequently, it empowers 
members to legitimately direct dominance 
behavior toward illegitimate challenges to the 
status system as a means of controlling and 
censuring them. Such social control is likely 
to be the major use of dominance behavior in 
task groups.* 


` 4 It is not necessary that individual bystanders 
consciousty calculate their intervention on the basis 
suggested. Because of prior learning and experi- 
ences in collectively oriented task groups, general 
expectations about legitimate and illegitimate 
behaviors in task groups may be immediately 


82 


DOMINANCE AND BYSTANDER 
INTERVENTION 


Scholars do not completely agree on a 
definition of human dominance behavior 
(Strayer 1980; Harper 1985). Strayer (1981) 
and Jones (1983) focus on dyadic conflicts 
that involve physical aggression, the threat of 
aggression, or struggles for objects or physi- 
cal position. Chase (1980) views dominance 
behavior in terms of attacks without specify- 
ing the nature of these attacks. Mazur (1985) 
defines dominance acts as stress-eliciting 
behaviors, noting that they have this effect 
because they are perceived as threatening by 
their recipient. 

Drawing on what is common in these 
definitions, we use dominance here to refer to 
behaviors directed towards the control of 
another through threat. The nature and 
credibility of the threat implied by a domi- 
nance act vary with dominator and social 
context. Some characteristics of the domina- 
tor, such as size, may suggest the physical 
ability to coerce a challenged member. 
However, in most task groups in which the 
use of physical force is proscribed, the 
dominator's ability to credibly threaten an- 
other depends largely on social power. This in 
turn depends heavily on the extent to which 
the dominator has the backing of bystander 
members (Ridgeway 1984). Consequently, 
dominance behavior used legitimately to 
enforce collective norms is likely to be more 
effective than dominance behavior used to 
illegitimately claim status. Common exam- 
ples of dominance behaviors are shouting at, 
interrupting, commanding, or angrily address- 
ing another. 

Dyadic dominance contests occur when one 
member directs dominance behavior toward 
another to gain deference and rank. In task 
groups such contests involve the interests of 
all members since if the dominator wins, he 
or she will have a greater impact on the 
collective task effort. If dominance behavior 
gives the impression of greater task ability, 
then bystanders should support the domina- 
tor's claims to status. But if it does not, all 
other things being equal, bystanders are likely 
to censure such claims by threatening or 
attacking the dominator because he or she has 


activated. The cognitive processes involved may 
be outside of conscious awareness and may occur 
quite rapidly. 


AMERICAN SOCIOLOGICAL REVIEW 


violated the legitimating norms maintained by 
the positively connected network. 

Since dominance behavior claims status on 
the basis of a capacity to tbreaten rather 
than make a case for the dominator's task 
contributions, it alone should do little to 
enhance bystanders' assessments of the dom- 
inator's task capacity (Ridgeway 1984). 
Evidence from a recent experiment supports 
this. A videotaped confederate in female 
three-person decision-making groups engaged 
in nonverbal dominance while presenting a 
task argument (Ridgeway 1987). Although 
both observers and fellow group members 
saw the confederate in this condition as highly 
dominant and threatening, they rated her as 
no more competent at the task than they did in 
another condition when she did not display 
dominance behaviors. These results and 
others (Mohr 1986) suggest that dominance 
behavior is unlikely to enhance the domina- 
tor’s appearance of task competence and win 
bystander support on that basis. 

This discussion of bystander reactions to 
dominance contests assumes that other factors 
that would increase the dominator’s appear- 
ance of task ability are held constant. External 
status characteristics (Berger et al. 1977) and 
preestablished positions in the group affect 
the dominator’s appearance of task compe- 
tence and, consequently, likelihood of by- 
stander support, independent of his or her 
dominance behavior. If the dominator has an 
external status or positional advantage over 
the attacked member, bystanders need not 
intervene since the dominator has a legitimate 
competence-based right to deference 
(Ridgeway 1984). If the dominator has an 
external status or positional disadvantage, 
bystanders are likely to intervene on grounds 
of apparent lower task ability and violation of 
legitimate norms. In both these situations, 
however, dominance behavior and status- 
competence differences are conflated, making 
it more difficult to clarify the independent 
effects of dominance on bystander reactions 
and status formation. As others have noted 
(Rosa and Mazur 1979; Mazur 1985), the 
independent impact of dominance behavior 
can be seen most clearly when the contestants 
are external status and positional equals. 

The following hypotheses are suggested by 
this network-collective analysis of dominance 
and status formation. In a task group in which 
one member directs dominance behavior 
toward another in an effort to claim higher 
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status, all other things being equal, members 
who are bystanders to the dominance chal- 
lenge will intervene to resist and control the 
dominator by directing dominance behavior 
back toward the dominator and, possibly, by 
imposing other negative sanctions such as 
expressions of hostility or rejection. Further, 
they will not grant the dominator enhanced 
influence over group decisions.5 

The question remains whether these rela- 
tionships will hold equally in male and female 
groups. Nothing about the structural argument 
presented suggests that the predicted pro- 
cesses should interact with gender. On the 
other hand, dominance behavior is often 
considered more stereotypically appropriate 
for males than females, although the implica- 
tions of this for bystander behavior are not 
clear. Perhaps bystanders will be more likely 
to intervene against (and use dominance to 
contro a dominator in female groups if 
dominance claims to status are particularly 
unacceptable there. However, it is also 
possible that the assumed greater inappropri- 
ateness for women to exercise dominance will 
inhibit female bystanders from intervening 
and threatening a dominator. Given the 
conflicting implications of gender stereotypes 
and our argument that suggests no inherent 
basis for an interaction with gender, we 


5 Some may want to argue that bystander 
intervention to dominance attacks is due to the 
violation of more general societal norms about 
appropriate behavior in task groups (i.e., one 
should not shout, interrupt, insult, etc.) rather than 
the violation of specific task group norms about 
status allocation. Our argument suggests that to the 
extent such general norms exist, they do so 
because of structural pressures in collectively 
oriented task groups that create norms regarding 


status allocation. If these general norms were. 


independent of specific norms of status allocation 
in task groups, one would expect reactions of 
resistance to dominance behavior to be similar 
regardless of the external status, positional advan- 
tage, or established competence of the dominator. 
Our theory, in contrast, predicts differences in 
reaction and resistance depending on the relative 
status of the dominator. 

6 Ridgeway's (1987) results do not clarify the 
matter since, while she found that bystanders 
would not grant enhanced influence to a video- 
taped dominator, she studied only female groups. 
Furthermore, the use of videotapes in that study 
did not allow bystanders the opportunity to rm 
intervene against the dominator. 
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hypothesize that the predicted relationship 
will hold equally in male and female groups. 


METHODS 
Design and Procedures 


An experiment was designed to assess the 
likelihood of bystander intervention in a 
group decision-making context in which a 
confederate did or did not direct dominance 
behavior toward a second member while 
arguing for a particular position. The experi- 
ment consisted of four conditions crossing sex 
of the group (all male vs. all female) with a 
dominant or neutral confederate. Because, as 
we noted, the independent effects of domi- 
nance are clearest when members are external 
status equals, subjects and confederates in a 
given group were peers: all were undergradu- 
ates of the same sex and of approximately the 
same age (18-25). Each group consisted of 
four members: two confederates and two 
naive subjects. The main confederate directed 
an initial dominant or neutral argument 
toward the second confederate and continued 
to argue his or her position in a dominant or 
neutral manner throughout the group discus- 
sion. To avoid artificially encouraging by- 
stander intervention, the second confederate 
reacted neutrally to the initial and any 
subsequent dominance encounters with the 
main confederate.” The two naive subjects 
were bystanders to this initial and. any 
subsequent dominance attempts. Two bystand- 
ers were included in each group to provide 
multiple indicators of bystander reaction in 
every group. 

The decision-making task, adapted from 
Nemeth and Wachtler (1974) and Lee and 
Ofshe (1981), required group members to 
decide on a proper financial award level in a 
jury case involving an insurance settlement. 
Subjects read the case individually and made 
an initial award decision using a scale from 
$0—$25,000 with $1,000 intervals. They also 
indicated the highest and lowest award they 
would agree to for the sake of unanimity in 
the group. They then were escorted into 
another room to discuss the case as a group. 
The four members sat in a semicircle around a 
coffee table with a confederate on each end 


7 All confederate roles were played by multiple 
confederates in all conditions. 
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and the naive, subjects in the middle. The 
group was instructed to listen carefully to 
each member’s arguments and to take these 
into account in order to make the best 
possible final decision. Their final judgments 
would be combined into a group award 
decision and evaluated against superior, 
average, and poor ranges of award suppos- 
edly preestablished by a group of legal 
experts. An extra 50 cents would be earned 
for awards falling in the superior range. These 
instructions were designed to encourage 
subjects to take the task seriously and to 
foster a collective orientation. Following the 
group discussion, subjects indicated individu- 
ally their final award judgments as well as 
their perceptions of each member’s behavior 
in the group. 

Throughout the group discussion and in all 
conditions, the main confederate argued for a 
distinctively low award ($0), while the 
second confederate argued for a moderately 
high ($15,000) award (based on the research 
by Nemeth and Wachtler 1974). The main 
confederate made a point of speaking first in 
the group discussion, suggesting that the 
group begin with each member announcing 
his or her initial award choice, starting with 
the second confederate. In all conditions the 
initial statement of the main confederate was 
the same and subsequent arguments were 
similar. 

In neutral conditions the initial opening 
statement was phrased as a suggestion. The 
second confederate and bystanders were each 
allowed to state their award decisions without 
interruption, followed by the main confeder- 
ate. The main confederate then participated 
normally but actively in the discussion, 
presenting his or her arguments for an award 
of $0 in a confident, firm voice and manner 
but avoiding aggressive, hostile, or challeng- 
ing behaviors. 

In dominant conditions the opening state- 
ment was presented as a command that 
concluded with the main confederate pointing 
aggressively at the second confederate. When 
the second confederate responded, the main 
confederate interrupted him or her with an 
interrogating "Why?" Why give him $___?” 
Bystander members were then allowed to 
state their awards, followed by the dominant 
confederate. After this, the dominant confed- 
erate presented arguments for a $0 award and 
attacked the second confederate’s arguments. 
The main confederate never initiated attacks 
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on the bystanders but did respond dominantly 
to challenges received from them. Through- 
out the interaction, the dominant confederate 
maintained an aggressive, threatening man- 
ner, discussing the second confederate’s 
arguments in a dismissive or contemptuous 
way, interrupting, maintaining a tense, for- 
ward leaning posture, pointing, speaking with 
a raised or commanding voice, and asking 
interrogating questions. These dominance 
behaviors were selected on the basis of 
previous research indicating that they success- 
fully convey an intent to control through 
threat (Ridgeway 1984, 1987; Mazur 1985; 
Leffler, Gillespie, and Conaty 1982). 

The groups worked together on the task for 
12 minutes and all interactions were video- 
taped. Twelve groups were run in each 
condition for a total of 48 groups and 96 naive 
subjects. Four groups were excluded from our 
final analysis because of suspicion of the 
main confederate, and two groups were 
excluded for insufficient dominance cues in 
the main confederate’s performance. This left 
42 groups and 84 naive subjects in our final 
analysis: 10 male groups with a dominant 
main confederate; 11 male groups with a 
neutral main confederate; 11 female groups 
with a dominant main confederate; and 10 
female groups with a neutral main confeder- 
ate. 


Measures 


The primary dependent variable, bystander 
resistance to the main confederate’s domi- 
nance attempts, was measured by scoring 
videotapes of bystander behavior. Two raters 
watched each videotape and jointly deter- 
mined the beginning and end of each 
bystander speech act as well as the group 
member to whom it was directed. They then 
independently rated each of these speech acts 
for instances of dominance and resistance 
behaviors: Cues coded as dominance behav- 
iors included dismissal of another’s argu- 
ments, addressing another in a dismissive or 
contemptuous tone of voice, raised or angry 
voice, interruptions, pointing or other aggres- 
sive gestures, hostile posturing such as 
leaning forward in an intrusive manner. Acts 
coded as resistance included counterarguing 
against the main confederate, negative affect 
toward the main confederate, physical with- 
drawal or disassociation from the main 
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confederate.2 A single speech act could 
contain none of these behaviors or multiple 
instances of such behavior. 

The correlation between coders for the total 
instances of bystander dominance behaviors 
was r=.98 and for total instances of 
resistance behavior was r=.97. The lowest 
percentage of agreement between coders on a 
category of dominance behavior was 89.7 
percent for dismissive arguments. The lowest 
percentage of agreement among behaviors 
coded as resistance to the main confederate 
was 85.5 percent on counterarguing behav- 
iors. 

In addition to raw totals of dominance and 
resistance behaviors, we constructed a mea- 
sure of dominant acts, defined as bystander 
speech acts containing at least one instance of 
dominance behavior. Dominant acts were 
further categorized by whether they were 
addressed to the main confederate or to 
others. This provided our primary measure of 
bystander intervention against the confeder- 
ate. : 
To ensure that the main confederate 
displayed substantially more dominance be- 
haviors in dominant than in neutral condi- 
tions, videotapes of all confederate perfor- 
mances were scored for total instances of 
dominance by raters blind to the conditions 
being scored. The behaviors coded as domi- 
nance were the same as those coded as 
dominance in the bystanders' behavior. Since 
the number of speech acts was recorded for 
the main confederate, a measure of total 
number of dominance cues per speech act was 
also available. Total time the confederate 
talked during the group discussion was also 
scored. 

Two coders scored 10 randomly chosen 
tapes to ensure that independent coders rated 
confederate behaviors similarly. For total 





5 Counterarguing was coded as any argument 
made by a bystander that explicitly contradicted or 
refuted a point made by the dominant confederate. 
Negative affect was coded as any verbal tone or 
behavioral gesture indicating dismissiveness, con- 
temptuousness, or anger. Physical withdrawal or 
disassociation was coded as behaviors on the part 
of the bystanders that indicated attempts to 
disassociate themselves from the dominant confed- 
erate, such as turning away, pushing one's chair 
away from the confederate, nodding one's head in 
disagreement with the confederate's statements, 
and rolling one's eyes. ; 
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instances of dominance, the correlation be- 
tween coders was r= .99. The lowest percent- 
age of agreement between coders was 91.1 
percent on cues coded as dismissive argu- 
ments. 

The primary measure of the main confeder- 
ate's status was the influence he or she 
attained in the group. Since the main 
confederate argued for a distinctively low: 
($0) award while the second confederate 
argued for a moderately high ($15,000) 
award, the main confederate's influence 
should be indicated by the extent to which 
subjects lowered their awards after the group 
discussion (initial award—final award). Equiv- 
alent differences on highest and lowest 
acceptable awards provided secondary influ- 
ence measures. If dominance is an ineffective 
means of status attainment in task-oriented 
groups, we should see no differences in 
influence between the neutral and dominant 
main confederate. 

The main confederate’s apparent task 
capacity, dominance, likability, and group- 
vs. self-motivation were measured by a series 
of seven- and nine-point Likert and semantic 
differential items on the postexperimental 
questionnaire. Subjects rated the main confed- 
erate’s task capacity with items measuring 
how persuasively he or she argued, how 
knowledgeable he or she was about the case, 
how much he or she contributed to the group 
decision, and the usefulness and quality of his 
or her ideas. Dominance-related items as- 
sessed the main confederate as dominant or 
submissive, aggression or unaggressive, and 
intimidating or fearful. Socioemotional items 
rated the main confederate’s pleasantness and 
likability. Group- vs. self-motivation items 
asked subjects to rate the main confederate’s 
cooperativeness and helpfulness. Finally, 
subjects rated the main confederate as tense or 
relaxed and on perceived degree of influence. 


RESULTS: 
Confederate Behavior 


Our first task was to ascertain that the main 
confederates in the dominant conditions did in 
fact engage in substantially more dominance 
behavior than in the neutral conditions. As 
Table 1 indicates, analysis of variance 
revealed dramatic differences in both total 
dominance cues and dominance cues per 
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Condition 
Male Male Female Female Anova Results 
dominance Neutral Dominance Neutral Gender 
(n=9) (n 11) (nz 11) (n9) Gender Type x Type 
Total X-2145.22 16.64 111.59 2.11 F= 11.53 281.80 1.82 
Dominance min= 87.0 3.0 74.0 0.0 p= .002 .000 186 
Behaviors max = 218.0 37.0 157.0 5.0 
Dominance X= 2.28 0.47 1.771 0.10 F=18.91 296.735 .413 
per act min= 1.771 0.145 1.121 .000 p= .000 .000 .52 
max= 2.725 1.194 2.553 263 
Total seconds X2350.3 254.59 267.20 295.56 F= 1.01 2.59 8.78 
speaking pe .321 .116 .005 
Total Speech X= 63.44 35.82 63.73 24.00 F= 4.00 136.37 4.40 
Acts ; p= .053 .000 .043 
Seconds per X= 5.56 7.37 4.29 13.21 F=11.07 61.01 26.75 
speech act p= .002 .000 .000 
speech act.? For both males and females, the Bystander Behavior 


lowest score for a dominant confederate on 
both measures was substantially above the 
highest score for a neutral confederate. In 
addition, male confederates were more domi- 
nant in both dominant and neutral conditions 
than were female confederates, an effect we 
will discuss in greater detail later. However, 
since gender did not interact with group type 
(i.e., dominant or neutral), these differences 
do not reduce the validity of our dominance 
manipulation. Because participation is often 
related to influence, it is also important to 
note that there were no main effects for group 
type or gender on the total time the 
confederate talked. However, an interaction 
effect indicates that dominant male confeder- 
ates did talk more than neutral male confed- 
erates (p<.01) while there were no signifi- 
cant differences between female dominant 
and neutral confederates (p.40). Although 
the differences in participation between male 
dominant and neutral confederates are less 
than ideal, they work against rather than for 
our hypothesis that the dominant confederate 
will be no more influential than the neutral 
confederate. 


° Technical problems with the confederate 
videotapes in one male dominant group and one 
female neutral group prevented a full coding of 
confederate dominance behavior, so these two 
groups are not represented in Table 1. However, a 
review of audio recordings of the confederates’ 
behavior in these two groups did indicate that both 
confederates’ levels of dominance fell within the 
range appropriate for their respective conditions. 


To test our principal hypothesis that bystand- 
ers will be more likely to intervene against the 
main confederate in the dominant compared 
to neutral condition, we examined the bystand- 
ers’ dominance and resistance behavior (Table 
2). Since the group is our primary unit of 
analysis, we analyzed these and other by- 
stander data by treating the two bystanders as 
repeated measures of bystander reaction in a 
given group, employing a repeated measures 
analysis of variance. 

Results indicated substantially more raw 
instances of bystander dominance and other 
resistance behaviors when the confederate 
was dominant rather than neutral. However, 
the hypothesis requires that bystanders direct 
dominance behavior toward the dominant 
confederate rather than other group members. 
Thus the measure of dominant acts (i.e., 
speech acts containing at least one dominance 
behavior) directed toward the main confeder- 
ate provides the key test of the hypothesis 
(Table 2). Because it is calibrated in speech 
acts, this measure differs from the raw total of 
dominance behaviors since the latter may 
count multiple dominance behaviors in a 
given speech act. 

As Table 2 shows, 100 percent of bystand- 
ers’ dominant acts were directed toward the 
main confederate in all conditions except 
female dominance in which the percentage 
was 96.7. In other words, almost no domi- 
nance behavior was directed toward anyone 
except the main confederate in any condition, 
including the dominant ones. Further, the 
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Table 2. Bystander Dominance and Resistance Behavior 
Condition Means 
Male Male Female Female Anova Results 
Dominance Neutral Dominance Neutral . Gender 
i (nz 10) (nz11) (nz 11) (n= 10) Gender Type x Type 
Total Dominance 27.28 2.25 14.07 .58 F=3,82 25.58 2.29 
Behaviors* : p- .058 .000 .138 
Total Resistance 19.03 6.16 12.23 3.25 F=9.59 48.57 1.54 
Behaviors p= .004 .000 222 
Dominant Acts? 
Directed at 12.60 1.59 7.36 .45 F=4.43 34.97 1.83 
Confederate = .042 .000 .185 
Dominant Acts" 
Directed at .00 .00 .182 .00 = ,905 .905 .905 
Others p= .348 348 348 
Total Speech 27.15 22.77 23.41 14.45 Fx6.52 7.97 941 
Acts p= .015 .008 .338 
Total Seconds 178.55 185.46 148.96 139.95 F=5.58 004 .250 
Speaking p= .023 .948 .620 
Seconds per 6.79 8.41 6.41 9.885 Fs 382 8.64 1.13 
Speech Act p= .540 006 294 
Dominant acts 
toward the 
Confederate as 
a percent of 41.25 6.30 31.20 3.40 F=2.11 49.13 .64 
total acts p= .154 .000 429 


* There may be multiple dominance behaviors in a given speech act. 
> Dominant acts are speech acts containing at least one dominance behavior. 


number of dominant acts directed toward the 
main confederate was substantial in dominant 
conditions, but quite low in neutral condi- 
tions, producing a highly significant effect for 
group type. In fact, in dominant conditions, 
41 percent of male and 31.5 percent of female 
bystanders’ speech acts were dominance- 
containing acts directed at the main confeder- 
ate. In contrast, only 6.5 percent of male and 
3 percent of female bystanders’ speech acts in 
neutral conditions contained dominance di- 
rected at the confederate. 

These results strongly support the principal 
hypothesis. They clearly show that, in both 
male and female groups, when the main 
confederate engaged in dominance attacks on 
the secondary confederate, bystanders inter- 
vened to oppose him or her with dominance 
and other resistance behaviors. They did this 
despite the fact that the confederate never 
initiated a dominance attack on the bystanders 
themselves. The results further indicate that 
the confederate’s dominance behavior did not 
merely ignite a dominance war of all against 
all since bystanders almost never attacked 
each other or the secondary confederate, 
confining their attacks instead to the dominant 
confederate. Furthermore, in neutral condi- 


tions in which the confederate was not 
dominant, bystanders displayed almost no 
dominance behavior. Taken together, these 
results suggest that bystanders engaged in 
dominance attacks in an effort to censure the 
dominant confederate but did not rely on 
dominance behavior to gain status them- 
selves, supporting our argument about the 
role of dominance behavior in task groups. 

As predicted by our secondary hypothesis, 
bystander reactions to confederate dominance 
were proportionately the same in male and 
female groups. This is clear from the failure 
of gender to interact with group type on any 
of the measures of bystander dominance or 
resistance to the confederate (Table 2). These 
results suggest that the nature of the role 
played by dominance behavior in task groups 
is essentially the same whether these groups 
are composed of males or females. 

There were, however, significant main 
effects for gender on total dominance behav- 
iors and for dominance acts directed at the 
confederate, although there were no gender 
differences in the latter as a percent of a 
bystander’s total speech acts. To interpret 
these differences, note that while bystander 
speech acts were similar in length for males 
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and females (averaging 7.60 and 8.13 sec- 
onds respectively, n.s.), male bystanders 
talked more and so had more total speech acts 
than the female bystanders. This is interesting 
in itself since, because male and female 
confederates did not differ in talking time, it 
suggest that there were either fewer silences 
or more members talking at the same time in 
male groups. However, in this context, these 
results make it clear that males had more 
dominance attacks on the confederate because 
males had more speech acts, not because 
there were any differences between males and 
females in the proportion of their speech acts 
that constituted such attacks on the confeder- 
ate. 

A comparison of the frequencies with 
which male and female bystanders used 
different types of dominance (e.g., interrupt- 
ing vs. a dismissive argument) showed no 
differences whatsoever (X?— 1.99, df=10, 
p>.995). This means that the “styles” or 
patterns of dominance behavior used by male 
and female bystanders were indistinguishable. 
From the borderline differences in raw totals 
of dominance, however, it appears that there 
was a tendency for males to incorporate more 
types of dominance behaviors (e.g., both a 
contemptuous tone and pointing) into a given 
dominant act directed at the confederate than 
did females. Dominance behavior was used in 
a similar way in male and female groups, 
then, but the overall level of dominance 
behavior was slightly higher among males. 
This may have been a response to the fact that 
bystanders in male groups were confronted 
with confederates displaying a higher level of 
dominance behavior than were bystanders in 
female groups. 

Interaction in dominant condition groups 
differed from that in neutral groups in other 
ways as well. While the bystanders’ total 
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participation time did not differ in dominant 
and neutral conditions, the way they distrib- 
uted that time over speech acts did. In 
dominant conditions, bystanders engaged in 
significantly more, significantly shorter speech 
acts (Table 2). Comparing this with Table 1, 
we see that confederates in dominant condi- 
tions showed the same pattern as bystanders, 
engaging in more and shorter speech acts, 
although for confederates, but not bystanders, 
this pattern of difference was more extreme 
for females. These results indicate that the 
aggressive quality of dominance behavior led 
to more rapid-fire exchanges in dominant 
groups than in neutral groups. 


Confederate Influence 


We predicted not only that bystanders would 
intervene and attack the dominant confeder- 
ate, but also that dominance behavior would 
not enhance the confederate’s influence over 
bystanders’ decisions and, thus, the confeder- 
ate’s status in the group. If this is the case, we 
should find no difference in the main 
confederate’s influence in dominant and 
neutral conditions because confederates in all 
conditions presented the same arguments in 
support of the same award level. Our primary 
measure of influence was the extent to which 
bystanders changed their final awards in the 
direction advocated by the confederate. Since 
the confederate argued for a distinctively low 
award, a negative award change indicates 
confederate influence. 

As Table 3 shows, bystanders lowered their 
final awards in all conditions, perhaps in re- 
sponse to the confederates’ well-developed ar- 
guments. However, confirming our hypothe- 
sis, bystanders were no more influenced in 
this way by dominant confederates than by 
neutral confederates, and this was true in both 


Condition Anova Results 
Male Male Female Female Gender 
Dominance Neutral Dominance Neutral Gender Type — x Type 
Mean Change: 
Dollar — $3,389 — $4,143 — $2,227 — $1,833 F=2.232 .000 .134 
Award (n= 18) (n221) (n=22) (n= 18) = 144 .987 .716 
Highest —$1470.59 — $3666.67 — $2363.64 —$2611.11 = 009 1.107 .712 
Acceptable (n 17) (n= 18) (n=22) (n= 18) = .925 301 405 
Lowest — $472.22 ~ $2194.44 — $1204.55 — $500.00 = .172 197 1.38 
Acceptable (n= 18) (n= 18) (n=22) (n= 18) p= .681 .660 .249 
Mean Perceived 4.47 5.30 4.36 6.00 = .320 3.206 .795 
Influence (nx 17) (n= 20) (1 =22) (n= 18) p= .576 .083 .379 
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male and female groups. There was a similar 
pattern for our secondary measures of influ- 
ence. There, too, the extent to which bystand- 
ers lowered the highest and lowest awards they 
would accept was no different for dominant 
than neutral confederates. There were also no 
main or interaction effects of gender on will- 
ingness to accept influence from the dominant 
confederate. It is significant that this failure of 
the dominant confederate to attain greater in- 
fluence occurred despite the fact that the male 
dominant confederate actually talked more than 
the male neutral confederate. 

In addition to these measures of actual 
influence, we also had a measure of perceived 
influence. On the postexperimental question- 
naire bystanders indicated on a nine-point 
scale where each member ranked in influence 
over the group decision. As Table 3 shows, 
dominant confederates were actually per- 
ceived as less influential than neutral confed- 
erates although the differences were only of 
borderline significance. Since the confeder- 
ates were similarly competent in that’ they 
gave identical arguments, this difference in 
perceived influence may indicate some resent- 
ment or “backlash” against the dominant 
confederate. Once again, there were no 
gender effects in perceived influence. 


Perceptions of the Confederate 


To further explore the bystanders’ perceptions 
of the dominant and neutral confederates, we 
constructed scales measuring dominance- 
submission, perceived task capacity, group ori- 
entation, and likability from bystanders’ rat- 
ings of the confederates on postexperimental 
questionnaire items. To control for measure- 
ment error in constructing these scales, we 
employed confirmatory factor analysis, using 
the LISREL program (Joreskog and Sorbom 
1986).1° Table 4 indicates the standardized 
loadings yielded by the measurement model. 
Bystanders’ raw scores on scale indicators were 
standardized, weighted by these loadings, and 
summed within each scale to produce final 
scale scores (Table 5). 


10 Our data evidenced nine such correlated 
errors which this technique allowed us to incorpo- 
rate into the measurement model. Although 
likability and group orientation were measured by 
only two indicators each, our measurement model 
treated them as separate but correlated constructs, 
avoiding the problem of underidentification. 
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Table 4. Scale Loadings for LISREL Confirmatory 
Factor Analysis 
Standardized 
Loadi 
Factor 1: Task Capacity Items 
high quality ideas 9224 
knowledgeable .7943 
competent .7369 
persuasive .7303 - 
useful ideas .6893 
contributed to group decision .6362 
Factor 2: Dominance Items 
dominance 1.0 
aggressive .5216 
intimidating .3437 
Factor 3: Likability Items* 
likable . .9656 
pleasant 93 
Factor 4: Group Orientation* . 
helpful .8555 
cooperative 79 


* Factors 3 and 4 were treated as separate but 
correlated constructs, avoiding underidentification. 


Repeated measures analysis of variance on 
scale scores showed, as expected, that the 
dominant confederate was perceived as signif- 
icantly more dominant than the neutral 
confederate.!! Dominant confederates’ were 
also seen as tenser than neutral confederates, 
supporting Ridgeway's (1987) view that 
threat gives dominance-submission behaviors 
a tense, emotional quality, distinguishing 
them from more emotionally neutral: task- 
oriented behaviors. Interestingly, there were 
no gender differences in how dominant the 
confederates were seen to be, despite the fact 
that the male confederates were behaviorally 
more dominant than the female ones. This 
might indicate that it did not require as high a 
level of dominance behavior to be perceived 
as highly dominant among females as it did 
among males.!? 


!! One bystander in one group did not complete 
the dominance items and one bystander in each of 
the four groups did not complete all the task 
capacity items. In these cases, rather than delete 
the entire group from the analysis, we used the 
dominance and task scale scores of the bystander in 
the group who did complete the items to estimate 
the scores of the bystander who did not complete 
them. Doing so did not change the direction or 
significance of any effects. 

12 Bystanders might use'the dominance scale 
items relatively, defining the end points in terms of 
the degrees of dominance actually seen in the 
group. However, since there is no evidence that 
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Condition Means Anova Results 

Male Male Female Female Gender 

Dominance Neutral Dominance Neutral Gender Type X Type 
Task — 1.203 .430 —.991 2.895 F= 2.015 8.551 1.425 
Capacity p- .164 .006 240 
Dominance .481 —.90 .669 — .285 F= 1.099 9.264 .312 
p= .301 .004 .580 
Likability — 1.240 .270 —.692 1.937 F= 11.967 41.84 3.067 
p= .001 000 088 
Group — .237 275 —.667 .923 F= .100 9.208 2.426 
Orientation ^ p= .753 .004 .128 
Tense 4.94 3.41 3.46 1.89 F= 13.181 13.989 .001 
p= .001 .001 971 


Since we argued that dominance behaviors 
in themselves do not enhance an actor's 
appearance of task ability, we expected 
dominant confederates to be perceived as no 
more competent at the task than neutral 
confederates. In fact, dominant confederates 
were seen as significantly less competent than 
the neutral confederates. This is striking 
because they both presented the same task 
arguments and were similar in being peers to 
the bystanders (and so, lacked any external 
status advantage). These results are compara- 
ble to those of Ridgeway (1987) whose 
subjects also rated a dominant confederate as 
less skilled at the task than a nondominant 
confederate despite their identical task sugges- 
tions. As with perceived influence, these 
results suggest a possible backlash against 
dominant confederates that made bystanders 
unwilling to credit them with as much task 
skill as the neutral confederates. 

The impression that bystanders reacted 
against or rejected the dominant confederates 
is supported by the fact that these confeder- 
ates were perceived as significantly less 
likable and less group-oriented (i.e., cooper- 
ative and helpful) than neutral confederates 
(Table 5), another finding replicating Ridge- 
way's (1987). Such results support our 
argument that the use of dominance behavior 
to gain status violates the legitimating norms 
of status allocation in task groups and, as a 
consequence, elicits disapproval and sanc- 
tions from bystander members. 

There were no gender differences in the 
ratings of confederates' group orientation, but 
women rated both dominant and neutral 


they used other scale items in such a relative 
manner, this seems less likely. 


confederates as more likable than men did. 
Since female confederates were perceived to 
be just as dominant as male confederates by 
their respective bystander members, these 
differences may reflect gender norms about 
reporting liking for others. The fact that there 
were no interactions between these variables 
and group type, however, indicates that male 


~and female groups reacted similarly to the 


dominant confederate compared to the neutral 
confederate, again supporting our argument 
that the role played by dominance in task 
groups is essentially the same in male and 
female groups. 


CONCLUSION 


The data clearly supported the hypotheses, 
and this result in turn supports the network- 
collective analysis from which the hypotheses 
were derived. When a confederate engaged in 
a dominance attack on another, bystanders 
consistently intervened against that confeder- 
ate with dominance and other resistance 
behaviors. Just as importantly, dominant 
confederates achieved no more influence over 
bystanders' task decisions than did neutral 
confederates and were perceived by the 
bystanders to have been less influential than 
were neutral confederates. As expected, 
dominance behavior did not enhance the 
confederates' appearance of task skill and 
apparently produced some resistance since, 
despite similar task contributions, dominant 
confederates were rated as lower in task 
capacity than neutral confederates. As further 
evidence of resistance to and censuring of the 
dominant confederates, they were perceived 
as less likable and less group-oriented than 
neutral confederates. Finally, as predicted, 
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these relationships held in both male and 
female groups, indicating that the nature of 
the role played by dominance in task groups 
is the same for botli sexes. 

These results demonstrate the importance 
of taking the entire group network into 
account rather than aggregating dyads to 
study status formation in task groups. A 
dyadic approach cannot explain why bystand- 
ers consistently intervened in the confeder- 
ates’ dominance contest. The network ap- 
proach is similarly much more successful in 
explaining why the dominant confederate 
failed to achieve enhanced influence. As this 
analysis predicts, bystanders did indeed act as 
if the confederate’s efforts to gain status 
through dominance were not only not in their 
interests but normatively inappropriate in a 
task group. 

A collective approach clarifies the role of 
dominance behavior in the formation of status 
hierarchies in task groups. Our results and 
those of an earlier study (Ridgeway 1987) 
indicate that that role is limited, whatever it 
may be in other groups. Dominance seems to 
be more effective as a means of enforcing 
performance-based norms in such groups 
(e.g., the bystanders’ attacks on the dominant 
confederate) than as a means to achieve 
status. l 

A more traditional, dyadic competition 
approach to dominance dynamics might argue 
that bystanders merely enjoined dominant 
confederates in dominance contests of their 
own that the confederates did not win, 
accounting for the confederates’ failure to 
gain influence. (Why the confederates consis- 
tently failed to win would remain unclear, 
however.) This ‘approach might further argue 
that bystanders did this to win status for 
themselves through dominance, not to cen- 
sure the confederates. 

For this analysis to be true, however, the 
two bystanders in each group, both of whom 
attacked the dominant confederate, must 
eventually turn to compete for dominance 
with one another as well. This must occur for 
a transitive hierarchy to be established 
through pairwise dominance competitions. 
Yet dominance attacks by one bystander on 
another literally never happened in male 
groups, and in female groups only a single 
dominant act was ever directed at a fellow 
bystander. Furthermore, if bystanders in 
dominant conditions were themselves seeking 
status through dominance, it is not clear why 
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they did not also try to gain status in this way 
in neutral conditions. Thus, our results are not 
very compatible with a traditional, pairwise 
competition approach to dominance dynamics 
in task groups.?? 

The failure to find interactions between 
bystander reactions and gender supports our 
assumption that a network-collective analysis 
of dominance in task groups applies equally 
to male and female groups. However, female 
groups did generally display lower levels of 
dominance behavior in both dominant and 
neutral conditions than did male groups. 
There is also some suggestion in the data that 
lower levels of confederate dominance in 
female groups produced bystander reactions 
as negative as that produced by higher levels 
of dominance in male groups. It is also not 
clear why female confederates, who received 
the same training as male confederates, 
consistently found it difficult in the context of 
group interaction to be as dominant as male 
confederates were. Taken together, these 
points could indicate that gender norms 
operated to make high levels of dominance 
behavior particularly unacceptable in female . 
groups (Martin 1985; Ridgeway 1988). How- 
ever, because we do not know whether males 
would have reacted as strongly to lower levels 
of dominance as females did, had they been 
exposed to such levels by male confederates, 
we must be cautious about accepting such a 
conclusion without further research. 

The network-collective approach espoused 
here returns to an older tradition in sociology 
(e.g., Bales 1950; Homans 1950) that viewed 
status hierarchies as the emergent product of 
the conjoint behavior of all group members, 
not as statistical aggregates of dyadic encoun- 
ters. Taking such an approach to task groups 
not only clarifies the role of dominance 
behavior, but- also suggests a deeper explana- 
tion for something that dyadic status analyses 
must assume: that perceived contributions to 
the collective task are the basis of such 
groups' status hierarchies. The network anal- 
ysis reveals that a shared task induces 
effective coalitions that enforce this collective, 
interest over individual desires for promi- 
nence. An important future task will be to 


13 It is not clear that our results contradict more 
complex approaches such as Mazur's (1985) that 
recognize that status allocation is usually coopera- : 
tive and that dominance may not win status in 
some contexts. 
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discover whether this collective, coalitional 
form of status formation is unique to task 
groups or, as is more likely, characteristic of 
all face-to-face groups. 
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SPECIFYING SPECIFIC DETERRENCE: 
THE INFLUENCE OF ARREST ON 
FUTURE CRIMINAL ACTIVITY* 
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Using data on police contacts for a cohort of males born in 1949 and followed until 
age 25, this paper examines the influence of arrest on future criminal activity. Two 
questions are examined. Does arrest amplify or deter the future criminal activity of 
those arrested? Does the influence of being arrested on future offending vary by 
where arrest occurs in the sequence of police contacts: for example, does being 
arrested have more influence on the future criminal activity of novice or 
experienced offenders? Drawing on the “criminal career” perspective, we 
partition future offending into distinct components, such as rate of future 
offending, duration of a criminally active period, and desistence from future 
offending. Evidence emerges that is more consistent with a specific deterrence 


Position than a labeling perspective. 


INTRODUCTION 


Research over the last two decades reveals 
that a substantial percentage of police con- 
tacts with suspected offenders do not result in 
arrest (Black and Reiss 1970; Lundman, 
Sykes, and Clark 1978; Smith and Visher 
1981). While considerable energy has been 
directed toward understanding why police 
arrest some suspected offenders and not 
others, very little is known about the 
consequences of these decisions. One poten- 
tially important consequence is the influence 
of police decisions to arrest suspected offend- 
ers on their future criminal activity. This 
paper examines this issue with data on police 
` contacts and dispositions for a cohort of males 
born in 1949 and followed until age 25. 

Two concerns motivate the present study. 
The first is to examine whether arrest, as a 
form of punishment, reduces or increases 
subsequent criminal activity. From a specific 
deterrence perspective, being arrested should 
increase the perceived risk of future criminal- 


* Financial support for this project was provided 
by the National Institute of Justice under award 
(OJP-87-M-274). The views and opinions ex- 
pressed in this paper are those of the authors and 
do not necessarily reflect the views of the 
sponsoring agency. We thank Professor Lyle 
Shannon for supplemental data on sanctions and 
for answers to our queries about the data. Finally, 
we appreciate the helpful criticisms of Christy 
Visher, Charles Wellford, and Charles Tittle on an 
earlier version of the paper. 


ity for those who are arrested and thus reduce 
their future criminal activity (see Bridges and 
Stone 1986; Zimring and Hawkins 1973). 
Conversely, a labeling position argues that 
legal punishment may increase future offend- 
ing among those who are sanctioned (Becker 
1963; Lemert 1972). These opposing behav- 
ioral predictions of the consequences of 
punishment will be examined in this paper. 

Second, these data are relevant to under- 
standing the influence of arrest on different 
dimensions of future offending. Blumstein, 
Cohen, Roth, and Visher (1986, p. 12) have 
recently articulated a "criminal career" per- 
spective for the study of crime, in which a 
criminal career is defined as a "longitudinal 
sequence of crimes committed by an individ- 
ual offender." Thus, criminal careers have 
distinct components. For example, individu- | 
als may terminate their careers with the first 
offense. Those who continue to commit 
offenses— often labeled persisters—may do 
so at different rates or remain criminally 
active for variable lengths of time. 

One potential contribution of the criminal 
career perspective is that distinguishing differ- 
ent dimensions of offending (rate of offend- 
ing, duration, and termination) "provides a 
finer resolution in the search for factors 
associated with crime than do other ap- 
proaches" (Blumstein et al. 1986, p. 13).! 


! The criminal career perspective advanced by 
Blumstein and his colleagues is concerned with the 
parameters of an active offending period. Their 
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The research reported in this paper offers a 
preliminary assessment of whether being 
arrested influences different dimensions of 
future offending, and whether these relation- 
ships vary by where arrest occurs in a 
developmental sequence of offending. For 
example, does the influence of arrest on 
future offending differ for those just begin- 
ning to commit crimes and those who have a 
history of prior criminal activity? 


A BRIEF REVIEW OF THE ISSUES 


One of the most enduring debates in criminol- 
ogy concerns the effects of punishment on 
future criminal conduct. A primary issue is 
whether legal sanctions reduce or amplify 
future criminal behavior. Early utilitarianists 
argued that punishment deterred repeated 
criminality because people responded to a 
subjective pleasure/pain calculus (Bentham 
[1843] 1962). Over time, however, the utility 
of punishment was increasingly questioned. 
In commenting on 19th-century utilitarian 
justifications for punishment, Glueck (1928) 
emphasized the distinction between the deter- 
rent and preventive value of punishment. To 
Glueck, the preventive value of punishment 
was the influence of punishment on the future 
conduct of ‘those punished—subsequently 
referred to as specific or special deterrence. 
He argued that, while it could not be seriously 
denied that punishment influenced future 
behavior, punishment did not always prevent 
future crime. For some, punishment was the 
“worst possible method of treatment, result- 
ing as it so often does in . . . future 
misconduct" (1928, p. 460). A decade later, 
Tannenbaum (1938) further questioned the 
preventive utility of punishment by arguing 
that punishment contributed to the crystalliza- 
tion of a delinquent self-image which in- 
creased the future criminal activity of those 
punished. 

The belief that punishment, and especially 
incarceration, made those who were punished 
more criminal prevailed until the 1970s. 
Correctional efforts, especially with regard to 








work does not consider desistence to be a 
parameter of a criminal career. In the research 
reported in this paper, our focus is primarily on 
how arrest influences dimensions of future criminal 
activity. Within this framework, termination from 
future offending is a potentially important conse- 
quence of being arrested. 
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juveniles, emphasized rehabilitation, diver- 
sion, and treatment of offenders rather than 
punishment. However, in the early 1970s the 
effectiveness of these approaches was seri- 
ously challenged by Lipton and his associates 
(1975). Their review of research from 1945 to 
1967 on the effectiveness of correctional 


' treatment, was cited to show that, with few 


exceptions, rehabilitative efforts had gener- 
ally failed to reduce future criminality. 

Attention to the effects of punishment on 
future offending increased with the decline of 
the rehabilitative ideal. Several studies com- 
pared measures of future offending among 
groups of individuals who had been subjected 
to different degrees of punishment (Empey ` 
and Erickson 1972; Murray and Cox 1979). 
These studies and a host of others (see 
Farrington 1983 for a review of experimental 
evidence on specific deterrence) produced 
inconsistent results. Some claimed that more 
severe punishment significantly reduced rates 
of future offending, while numerous others 
reported that more severe punishment either 
had no effect or actually increased subsequent 
criminal activity. 

This contradictory evidence on the specific 
deterrent value of more severe punishment 
offers little insight into the deterrent efficacy 
of punishment per se. Packer (1968, p. 46) 
has pointed out that many studies of specific 
deterrence confuse the severity of punishment 
with punishment itself and suggests that “it 
would be hard to imagine that offenders who 
escape arrest or detection would be less likely 
to repeat an offense than those who are 
processed through the legal system." Tittle 
and Logan (1973) agree that many studies of 
specific deterrence assess the influence of. 
punishment only among those who have 
received some type of punishment, and 
question the appropriateness of these studies 
to assess the specific deterrence hypothesis. 
Both they and Gibbs (1975, p. 236) argue that 
valid tests of specific deterrence must com- 
pare the future offending of those punished 
with the future criminal activity of those who 
are offending but not punished. 

Despite these concerns, few researchers 
have examined the future offending of those 
punished relative to those who evade punish- 
ment, and the few studies that do address this 
question provide contradictory findings. For - 
example, Gold and Williams (1972) com-' 
pared the delinquent involvement of 35 
matched pairs of juveniles who self-reported ` 
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delinquent activity. For each pair, one 
member had been apprehended by police 
while the other had not. In 57 percent of these 
pairs the apprehended youths reported more 
subsequent delinquency than their counter- 
parts who had not been caught. Gold and 
Williams argued that these results indicate 
that apprehension by police increases future 
delinquency. Sherman and Berk (1984) exam- 
ined the specific deterrent effect of arrest in 
domestic assault cases. Their research in- 
volved random assignment of three police 
dispositions (arrest, mediation, and separa- 
tion) to cases of misdemeanor spouse assault. 
Thus, among the sample of offenders one 
group was punished (those arrested) and two 
groups were not.? In comparing the rates of 
repeated assault between these three groups, 
Sherman and Berk reported that punishment 
(arrest) reduced subsequent domestic violence 
significantly more than did other police 
responses. 

Our research will assess whether arrest 
reduces subsequent offending for a wide 
range of offenses. Additionally, we are 
interested in specifying the conditions under 
which arrest influences future offending. In 
addressing this issue, we will examine 
„whether the specific deterrent efficacy of 
arrest varies with the extent of past involve- 
ment in criminal activity among those who 
are sanctioned. Some previous research sug- 
gests that it might. 

In an early study of shoplifting, Cameron 
(1964) noted that apprehension of the novice 
shoplifter usually terminated the offender’s 
shoplifting career. This observation suggests 
that arrest may have a greater deterrent effect 
on individuals who are just beginning crimi- 
nal activity. This point has also been raised 
by Thorsel and Klemke (1972) and Packer 
(1968) who argued that sanctions may be less 
effective in reducing the criminal activity of 
experienced offenders. 

Klein’s (1974) study based on aggregate 


? It could be argued that the mere presence of 
police is a form of punishment. We do not take 
exception to this. However, an arrest subjects the 
offender to possible legal sanctions, while a 
decision not to arrest the offender essentially 
eliminates the possibility of any legal sanction for 
the current offense. Thus, the distinction between 
being arrested and not being arrested is essentially 
a distinction between being punished and not being 
punished. 


AMERICAN SOCIOLOGICAL REVIEW 


rates of recidivism across police agencies 
suggests that rates of offending among 
experienced offenders may be influenced by 
police decisions to refer juvenile offenders for 
further processing. His research classified 13 
police agencies in California as either high- or 
low-diversion departments, based on the 
percent of cases they released. Recidivism 
data were collected on 100 juvenile offenders 
processed by each police agency in 1969, and 
two-year recidivism rates were calculated for 
these offenders. When all offenders were 
considered simultaneously, little association 
appeared between diversion practices and 
recidivism. However, when the analysis was 
repeated for first and multiple offenders 
separately, those who had been arrested 
previously had significantly fewer subsequent 
arrests in departments that referred more 
cases for further processing compared to 
departments that diverted a larger percentage 
of cases. These findings suggest that arrest 
and further processing may decrease the rate 
of future offending among more experienced 
offenders. 

In light of these findings, several hypothe- 
ses are plausible. First, an arrest may have a 
greater effect on the future criminal activity of 
novice offenders than on that of experienced 
offenders. To some extent, punishment sets 
the cost of crime. Once police make contact 
with a suspected offender, any action less 
than arrest might reduce the perceived risk 
associated with future criminal activity. 
Conversely, being arrested may increase 
individuals’ estimated cost of future criminal 
activity. Moreover, reactions to early offenses 
might factor more heavily into individuals' 
perceptions of the cost of committing another 
offense. This hypothesis is consistent with 
recent evidence that "punishment is associ- 
ated with higher levels of perceived threat 
among naive offenders but has no apparent 
relationship with perceived threat among 
experienced offenders" (Bridges and Stone 
1986, p. 228). However, other research on 
this issue has failed to discover evidence that 
the relationship between punishment and 
perceived risk of future offending differs for 
novice and experienced offenders (Thomas 
and Bishop 1984). This suggests that the 
effect of punishment on future offending will 
be similar for offenders with different crimi- 
nal histories. 

But punishment may increase the future 
criminality of novice offenders more than that 
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of experienced offenders. Jensen (1969) 
found that punishment early in an offending 
career was more likely to produce a deviant 
identity than sanctions applied to individuals 
who had already established a pattern of 
repeated delinquent behavior. However, sub- 
sequent research on this issue by Hepburn 
(1977) found no evidence that punishment 
had a greater impact on the delinquent 
self-identification of novice offenders than on 
that of more experienced offenders. Thus, 
perhaps the only conclusion we can draw at 
this point is that the relationship between 
punishment and future offending among both 
novice and experienced offenders is problem- 
atic. 

A second focus of the present research is to 
determine whether police arrest decisions 
influence parameters of criminal careers. In 
this sense arrest is a potential modifier of 
future criminal activity which may influence 
different dimensions of subsequent offending. 
For example, offenders who commit a k’th 
offense may be more or less likely to commit 
another offense depending on whether they 
were arrested for the k’th offense. However, 


looking at whether those arrested on their k’th : 


offense are more or less likely to commit 
another offense than those who are not 
arrested is only one possible measure of the 
influence of arrest on future offending. 
Moreover, since the probability of commit- 
ting another crime increases with the number 
of crimes already committed, cessation from 
future. criminal activity may be a less viable 
measure of the impact of arrest as offenders 
continue their criminal careers (Barnett and 
Lofaso 1985; Barnett, Blumstein, and Farring- 
ton 1987; Blumstein, Farrington, and Moitra 
1985; Shannon, 1980; Wolfgang, Figlio, and 
Sellin 1972). 

Within the criminal careers framework, 
cessation of future criminal activity is only 
one dimension of a criminal career. À sample 
of persons who commit an offense can be 
divided into two subsamples — desisters (those 
who refrain from future offending) and 
persisters (those who commit at least one 
additional offense). Farrington (1987, p. 57) 
has recently noted the importance of this 
distinction by observing that the "commission 
of one recorded offense during a follow-up 
period does not necessarily show the failure 
of rehabilitative or deterrent measures be- 
cause the measure may have caused an 
offender to commit fewer offenses." More- 
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over, persisters may differ in the rate at which 
they commit future crimes. Thus, research 
reported in this paper will also assess whether 
arrest influences the future rate of offending 
among those who remain criminally active. 

Finally, the influence of arrest on two other 
dimensions of criminal careers will be 
examined. First, does being arrested extend 
or reduce the subsequent length of time that 
an individual remains criminally active? 
Second, some research indicates that the time 
intervals between successive offenses de- 
crease as offenders progress further into 
criminal careers (Barnett et al. 1987). In this 
paper we examine whether being arrested is 
related to the time until the next offense for 
those who continue to commit crimes. 

To summarize, this research will examine 
the influence of arrest on four different 
dimensions of subsequent criminal activity: 


` termination from future criminal activity, 


future offending rate, duration of the crimi- 
nally active period, and time until the next 
offense. Separate consideration of these 
components recognizes the possibility that 
being arrested may have different effects on 
the various dimensions of criminal careers. 
Moreover, the following analysis is con- 
cerned with whether the influence of arrest on 
future criminal activity varies by where arrest 
occurs in the criminal career. 


DATA AND VARIABLES 


To address these issues, we use data on the 
police contact histories of all males born in 
Racine, Wisconsin, in 1949 who maintained 
continuous residence in Racine until age 25.? 
Our original sample consists of all males with 
at least one recorded police contact for a 
misdemeanor or felony. Contacts for status 
offenses, traffic violations, and suspicion are 
not included in the sample. Given these 
criteria, 325 males had at least one police . 
contact. 

For each person's initial police contact, the 
following variables were created. First, the 
disposition of each police contact was coded 
as referred or not referred. Referral covers 
instances in which police arrested the of- 





? These data are available from the Inter- 
University Consortium for Political] and Social 
Research at the University of Michigan under 
study number 8163. 
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Table 1. Sample Statistics for Police Contacts 1 through 6 


Propor- Propor- Prior Future 
. tion tion Contact Contact Future 
Contact (N) Age Serious Arrested Desist* Rate? Rate Duration? Contacts" 
1 325 15.3 .28 22 .33 — .81 52.1? 4.4 
2 219 16.8 27 .28 31 — 1.15 65.3 5.3 
3 151 17.4 24 .30 30 1.15 1.63 66.2 6.8 
4 105 17.4 25 31 19 1.28 1.51 66.5 7.5 
5 85 17.7 .20 43 .20 1.12 1.81 68.5 8.3 
6 68 17.9 34 46 .10 1.23 1.95 63.6 8.2 


* Based on all cases with a k'th contact. 
b 5 percent trimmed mean based on all cases with a kth contact. 


* 5 percent trimmed mean based on cases who continue 
d [n months. 


fenders and referred them to eitber juvenile 
probation or the state's attorney.^ All other 
cases in which police did not arrest but 
instead counseled, warned, or released the 
"offender were coded as not referred. In these 
data, 22 percent of offenders were arrested on 
their first police contact. Second, the serious- 
ness of the contact was measured by whether 
the offense was an index offense or not; 28 
percent of offenders' first police contacts 
were for index offenses. Third, offenders' age 
was measured as the number of days from 
birth until their first police contact divided by 
365. The average age of persons on their first _ 
police contact is 15.3 years. ‘Fe 
Individual’s future police contacts were 
used to operationalize different dimensions of 
criminal careers. If an individual did not have 
another police contact before age 25, he was 
classified as desisting after his first contact. In 
these data, one-third of offenders did not have 
subsequent police contacts after their first 
contact. For those who continued to have 
police contacts, the duration of their active 
offending period was measured by the time 
from their first to their last police contact.5 
Future police contacts were also used to 
calculate an annual rate of police contacts. 


* In this paper the terms arrest and referred are 
used synonymously. 

_ > We thank professor Lyle Shannon for provid- 
ing us with additional data on the dispositions of 
those arrested which allowed us to make a crude 
adjustment for time incarcerated. Information was 
available on the length of sentence received and we 
estimated time served as one-third of the sentence 
received. These estimates of time incarcerated 
were subtracted from exposure time since individ- 
uals were not at risk of having a police contact 
during the period of incarceration. In the sample of 
325 males used in this research, 36 were 
incarcerated for some length of time. ] 


to have police contacts after the k’th contact. 


Following Blumstein and Cohen (1979), we 
computed future police contact rates on the 
subsample of individuals who had at least one 
additional police contact. Thus, of the 325 
persons with an initial police contact, 219 had 
at least one additional contact. For these 219 
a future contact rate was computed by 
dividing the number of subsequent police 
contacts by the duration of their active 
offending period. These rates were then 
annualized. Because we are interested in 
estimating the effect of being arrested on 
future offending at different stages of criminal 
careers, we also computed the variables 
defined above for the second through sixth 
police contacts. Thus, the following analysis 
will be conducted on six separate though 
related samples. Analysis based on the second 
through sixth police contacts will include two 
additional independent variables: the number 
of arrests prior to the current contact and the 
number of prior contacts for index offenses. 
Finally, analyses conducted on samples with 
at least three police contacts will include the 
offender's prior frequency rate of police 
contacts. 


FINDINGS 


Data in Table 1 reveal that, while 325 
individuals had at least one police contact, 
only 68 persons had six or more police 
contacts. Consistent with other cohort data, 
the proportion of serious contacts did not vary 
systematically with the number of contacts 
(e.g., Barnett and Lafaso 1985). However, 
the proportion of persons arrested did increase 
with the number of contacts. While police 


$ Following Cohen (1986) the first police 
contact is excluded from the numerator of prior 
frequency rate. 


SPECIFIC DETERRENCE 


arrested 22 percent of persons on their first 
contact, they arrested 46 percent of those on 
their sixth contact. These data are consistent 
with recent research (e.g., Smith 1987) 
indicating that prior contact with suspected 
offenders increases the probability that police 
will make an arrest. 

These data also indicate that the proportion 
of offenders who desist declines in a step 
function with the number of contacts. On the 
first through the third contacts, approximately 
30 percent of offenders have no further 
contacts. For those with a fourth or fifth 
contact, however, 20 percent are desisters, 
and only 10 percent of those with a sixth 
contact have no further contacts. A similar 
pattern of career termination was noted by 
Blumstein and Moitra (1980) in the Philadel- 
phia cohort data. 

Table 1 also provides information on the 
parameters of criminal careers for those who 
continue to have police contacts subsequent to 
a k'th contact. For example, the average 
future contact rate for the 219 individuals who 
continue to have police contacts after their 
initial contact is .81.7 By the sixth contact, 
the future contact rate among  persisters 
increases to almost two per year. In general, 
these data indicate that an offender's future 
contact rate increases with the number of 
prior contacts. This trend also appears in 
other cohort data on offending (Barnett and 
Lafaso 1985). Thus, as offenders progress 
further into criminal careers their average rate 
of offending increases. Additionally, data in 
Table 1 indicate that average residual career 
length, or the duration of a criminally active 
period, remains fairly stable after the second 
police contact. Stable residual career length 
coupled with an increasing rate of police 
contacts results in an increase in the average 
number of future police contacts. Thus, 
offenders who remain active beyond their first , 
police contact have an average of 4.4 


7 Future contact rates are estimated by the 
number of future police contacts divided by the 
time at risk from the current contact until the last 
police contact. These rates are then annualized. 
The means reported in Tables 1 and 2 for prior 
contact rate, future contact rate, duration, and 
number of future contacts are 5 percent trimmed 
means (Hoaglin et al. 1983). These means are 
more robust but less efficient than conventional 
averages and minimize the influence of a few 
outlying cases on the estimated sample means. 
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additional contacts until age 25, compared 
with 8.2 additional contacts for offenders who 
remain active after their sixth police contact. 
_ While patterns of desistence, seriousness, 
and future contact rates for the first through 
the sixth police contacts are consistent with 
other cohort studies, our interest is in 
determining how these dimensions of criminal 
careers ‘vary for those who are arrested 
relative to those who are not arrested for a 
specific contact. Preliminary data to assess 
this issue appear in Table 2 which presents 
data comparable to Table 1 for those arrested 
and those released. 

Important differences emerge in the subse- 
quent criminal activity of those arrested 
relative to those who are simply released by 
the police. For example, half of the 72 
offenders arrested on their first contact did not 
have any subsequent police contacts through 
age 25, compared to only 28 percent of those 
who were released by police at their first 
contact. At the second contact a similar 
differential in the probability of desistence is 
evident between those arrested and those 
released by police. However, the influence of 
arrest on desistence is diminished at the third 
and fourth contacts and appears to actually 
decrease the likelihood of ending criminal 
activity by the fifth contact. Thus, the 
influence of arrest on terminating future 
offending appears to vary with where the 
arrest occurs in the development of a criminal 
career. 

Among those who continue to have police 
contacts, the influence of arrest on the rate of 
future contacts also appears contingent on 
when the arrest occurs. At the first and 
second police contacts, future contact rates do 
not differ markedly for those arrested and 
those released. However, at the third and later 
contacts, those arrested have considerably 
lower future contact rates than those released. 
For example, in the sample of persons who 
continue to have police contacts after the 
fourth contact, those arrested on the fourth 
police contact have an average annual future 
contact rate of 1.15 compared to 1.98 for- 
those not arrested by police. 

The duration of criminal careers and the 
total number of future police contacts also 


appear to vary with arrest. Those arrested on 


their first through third police contacts 
continue to offend for shorter periods of time 
than those who are not arrested. Additionally, 
across the first six police contacts, those 
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Table 2. Sample Statistics for Police Contacts 1 through 6 by Police Disposition 











Propor- Propor- Prior Future 
tion tion Contact Contact Future 
Contact Disposition (N) Age Serious Desist* Rate? Rate? Duration? Contacts? 

1 Released 253 14.8 .26 .28 — .87 80.42 4.6 
Arrested 72 17.2 33 50 — 1.03 55.2 3.6 

2 Released 155. 16.1 23 .26 — 1.17 68.4 5.9 
Arrested 63 18.1 38 .44 — 1.08 50.4 3.8 

3 Released 105 17.1 .16 31 1.05 1.76 72.7 7.9 
Arrested 46 18.3 41 .30 1.27 1.27 52.3 43 

4 Released 73 17.2 15 AS 1.37 1,98 65.6 8.4 
Arrested 32 17.9 47 .28 1.14 1.15 68.4 5.0 

5 Released 48 17.8 23 33 1.18 2.20 79.9 11.0 
Arrested 37 17.7 .16 .16 1.07 1.43 55.6 5.6 

6 Released 39 18.0 .26 A5 1.19 2.30 67.1 9.3 
Arrested 29 17.7 31 .03 1.29 1.60 59.8 7.4 





* Based on all cases with a k'th contact. 


> 5 percent trimmed mean based on all cases with a k'th contact. 
* 5 percent trimmed mean based on cases who continue to have police contacts after the k’th contact. 


d In months. 


arrested who have future police contacts, 
have fewer future contacts than those not 
arrested. 

These data are suggestive of the possible 
influence of arrest on different dimensions of 
future offending. However, a more compel- 
ling analysis would control for the effects of 
other relevant variables such as age and 
seriousness of offense. For example, persons 
arrested on their first police contact are 
considerably older (17.2 years) than persons 
released (14.8 years). Thus, the average time 
at risk after the first contact differs for these 
two groups because the police contact records 
were available only up to age 25. This may 
affect the relationship between arrest and 
time-dependent variables such as the duration 
of a criminally active period subsequent to the 
first arrest. Specifically, persons released by 
police on their initial contact, who had at least 
one additional contact, were criminally active 
about 2.1 years longer than those arrested on 
their first contact. However, those released 


were also 2.4 years younger, on average, than 
those arrested. Thus, when differences in 
time at risk are taken into account, arrest may 
have little or no influence on the subsequent 
duration of criminal careers. 


Multivariate Models of 
Subsequent Offending 


To address this concern, a series of multivar- 
iate models that examine the influence of 
arrest on different dimensions of subsequent 
criminal activity appear in Table 3. The 
coefficients under the column labeled desis- 
tence are estimated probit coefficients of the 
effect of being arrested on the probability of 
having no future police contacts, after we 
have controlled for whether the current 
contact resulted from an index offense, and 
for time at risk subsequent to the current 
contact. For the second through the sixth 
contact, the number of prior police arrests and 
the number of past index contacts are also 


Table 3. Effects of Arrest on Desistence, Future Police Contact Rates, Duration of Active Offending Period, and 
Time to Next Contact for Police Contacts 1 through 6 








Future Contact Time to 
Contact N) Desistence* Rate^ Duration? Next Contact? 
1 325 .32 (1.77) — .06 (—.51) .01 (.02) 17 (.54) 
2 219 37 (1.65) —.17  (—-1.38) .19 (1.08) .22 (.65) 
3 151 .09 (.30) —.38 (—2.53) .12 (.53) .63 (1.55) 
4 105 51 (1.32) — 35 (—2.07) 46 (2,10) .88 (1.90) 
5 85 31 (.70) —.45  (-—2.80) .15 (.67) .76 (1.70) 
6 68 —.53  (-.78) —.9  (—124) 17 (73) —.43  (—.86) 


* Maximum likelihood probit coefficient and t-ratio. 

> OLS estimate and t-ratio. Dependent variable is log of annual future contact rate among persisters. 

* OLS estimate and t-ratio. Dependent variable is log of months of active offending period among persisters. 
d OLS estimate and t-ratio. Dependent variable is log of months until next police contact among persisters. 


SPECIFIC DETERRENCE 


included in the equation as controls. Finally, 
for the third through sixth contact, the 
offender’s annualized prior police contact rate 
is also included. The complete equations for 
all six contacts are given in Appendix A. 

When other potentially confounding factors 
are taken into account, being arrested is 
marginally associated with an increased 
probability of desistence among novice offend- 
ers (those with one or two police contacts). 
However, the magnitude of this relationship 
is not as strong as the bivariate relationship in 
Table 2. Consistent with earlier results, as 
offenders acquire more police contacts the 
influence of arrest on whether they continue 
to have police contacts diminishes. Thus, the 
hypothesis that being arrested terminates an 
offender’s criminal career only appears appli- 
cable to individuals at early stages of 
involvement in criminal activity. 

Results reported in Table 3 under the 
column future contact rate address the 
question of whether being arrested influences 
the future rate of police contacts once other 
potentially confounding factors are taken into 
account. Reported coefficients are least squares 
estimates of the effect of being arrested on the 
log of the subsequent contact rate (complete 
results of these six equations are provided in 
Appendix B). This analysis indicates that 
arrest is consistently related to reduced rates 
of future police contacts and that this 
relationship becomes more pronounced as 
offenders progress further into criminal ca- 
reers. These results provide support for the 
hypothesis that, among  persisters, being 
arrested reduces the rate for future criminal 
activity. 
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The final results in Table 3 concern the 
effects of arrest on the duration of a criminal 
career and the fime between the k’th contact 
and the next police contact. Reported coeffi- 
cients are least squares estimates and both 
dependent variables are natural logarithms of 
the number of months from the current 
contact to the next and last contacts respec- 
tively (complete results of these models are 
given in Appendices C and D). Both sets of 
results reveal generally consistent though not 
statistically significant relationships. For the 
sample of offenders who remain criminally 
active after a k'th contact, arrest appears to 
extend the duration of their criminal career; 
with one exception (k = 4) this relationship is 
not significant. Additionally, being arrested 
generally increases the time until the next 
police contact among those who continue to 
have police contacts. 

Our final concern is to assess the influence 
of arrest on the number of future police 
contacts. The number of police contacts 
subsequent to a given contact reflects the 
combined effects of arrest on desistence, 
frequency, and duration. For example, Table 
3 showed that offenders arrested for their 
second police contact were somewhat more 
likely to terminate their criminal career than 
offenders who were not arrested. Among 
persisters, those arrested on their second 
contact had lower rates of future offending 
but remained criminally active for longer 
periods of time. Thus, it is unclear how arrest 
may influence the total number of future 
police contacts. This issue is examined next. 

Table 4 reports results of a series of models 
of the effects of arrest on future number of 


Table 4. Effects of Arrest on Future Number of Police Contacts 1 through 6 


Independent Variable I 2 
Arrest —2.55* —3.50 
(—2.05) (-2.28) 
Index contact 1.98 3.31 
(1.87) (2.31) 
Time at risk? 8.05 8.00 
(6.80) (5.51) 
Prior arrests —.26 
(—.15) 
Prior index contacts 2.24 
(1.67) 
Prior contact rate 
Constant — 16.53 — 14.83 
Sigma .. 7.98 8.45 


3 4 5 6 
—1.59 —3.69 —4.96 —1.03 
(—.86) (—1.66) (—2.38) (—.46) 

— 28 -] —1.26 -.41 
(7.14) (—.30) (—.50) (~.17) 
7.98 9.01 11.00 9.38 
(4.69) (4.32) (4.41) (3.28) 
~1.21 —1.56 -1L17 — 1.68 
(—.82) (—1.29) (—1.06) (— 1.40) 
1.53 1.18 .26 54 
(1.22) (1.09) (.25) (.59) 
2.49 —.18 —.07 —2.56 
(2.33) (-.12) (—.03) (—1.22) 
~ 13.40 — 10.76 — 11.80 -6.7 
8.79 8.73 8.70 8.49 





* Coefficients are maximum likelihood Tobit estimates (t-values in parentheses). 
b Log of time remaining, in years, from the current contact until age 25. 
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police contacts. These models are estimated 
using a maximum likelihood tobit estimator 
(see Amemiya 1985, p. 360—408) which is 
appropriate for the censored distribution of 
future contacts in the current data. 

The data reveal that, when other relevant 
variables are taken into account, arrest has a 
consistently negative effect on the number of 
future police contacts, which is independent 
of the seriousness of the current offense, the 
number of prior arrests and index contacts, 
the prior rate of offending, and the remaining 
time at risk.? Thus, a randomly selected 
offender who was arrested on his k’th contact 
would, on average, have fewer subsequent 
police contacts than a randomly selected 
offender who was similar in other respects but 
was not arrested on his k’th contact. This 
difference is most pronounced at the first, 
second, fourth, and fifth police contacts. 
Thus, while being arrested affects specific 
parameters of criminal careers differently, the 
combined effect is a reduction in the future 
criminal activity of those offenders whom 
police arrest relative to those who are 
contacted but not arrested. 


CONCLUSIONS 
This paper has provided a preliminary empir- 


* For all persons who have a k’th contact some 
will have future police contacts while others will 
not. Thus, the distribution of future contacts 
contains two types of observations: those cases 
with no future contacts and those who have some 
positive number of future contacts. Among the 
latter group, there may be wide variation in the 
number of future contacts. The fact that observa- 
tions on the dependent variable have a lower bound 
of zero makes the use of estimators based on 
continuous densities, such as Ordinary Least 
Square, inappropriate. Additionally, the fact that 
the dependent variable is bounded at zero and that 
several observations on the dependent variable are 
zero makes the linearity assumption of Ordinary 
Least Squares problematic. Moreover, Amemiya 

-has shown that OLS estimates of regression 
parameters using all observations (both the zero 
and nonzero values of the dependent variable) are 
biased and inconsistent (1985, pp. 366-68). 
Because of these concerns, we use a tobit estimator 
which is more consistent with the distribution of 
subsequent police contacts in these data. 

? Time at risk is the log of (age 25—age at the 
current contact — estimated time incarcerated sub- 
sequent to the current contact). 
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ical assessment of the influence of arrest on 
different dimensions of subsequent criminal 
activity. Some evidence emerges that sup- 
ports a specific deterrence hypothesis. How- 
ever, the specific deterrent effect of arrest 
operates differently for novice and experi- 
enced offenders. Among novice offenders, 
arrest is somewhat more likely to terminate 
their criminal careers. Among more experi- 
enced offenders, arrest significantly reduces - 
future rates of offending. 

However, several issues remain unre- 
solved. Central to the conclusions noted 
above is our ability to make reasonable 
inferences about offending based on an 
observed sequence of police contacts. This 
concern is not new or unique to this study. 
Almost all research that has examined the 
specific deterrent effect of sanctions has 
measured future offending by some official 
measure, such as the number of subsequent 
arrests or convictions. The inherent problem 
is that not all future offenses result in 
detection. But among the domain of official 
measures, police contacts provide a closer 
approximation to the true level offending than 
arrests or convictions. 

Nevertheless, police contacts are only a 
sample of all crimes committed by offenders, 
and subsequent police contact rates underesti- 
mate true rates of offending. The important 
issue for our purpose is whether the probabil- 
ity of a subsequent police contact, given the 
commission of another offense, varies by 
whether the offender was arrested or not for 
the current contact. If the probability of a 
future police contact, given the commission 
of another offense, did not vary significantly 
by whether the person was arrested for the 
k’th contact, then differences in the subse- 
quent contact rates of those arrested and those 
released would reflect differences in the 
underlying offending rates between these two 
groups. Given this assumption, an inference 
from a sequence of police contacts to an 
underlying rate of offending appears reason- 
able. 

Of course the two other possibilities are 
that being arrested either increases or de- 
creases the probability of a police contact for 
a future offense. The scenario that would be 
most damaging to our finding that arrest 
reduces the rate of subsequent offending 
would be if being arrested for a K'th contact 
reduced the probability of a police contact 
given the commission of another offense. 
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This seems to be the least plausible possibil- 
ity, but it is clearly an open empirical 
question. i 

Several other questions also remain open. 
First, a larger sample would have permitted 
us to disaggregate police contacts by offense 
type and estimate models separately for 
different types of crimes or for felonies 
relative to misdemeanors. Second, not all 
arrests result in the same degree of punish- 
ment. Indeed, a large percentage of arrests 
that are referred to the state’s attorney or 
juvenile probation are probably dismissed 
with no further criminal sanction. Distinguish- 
ing between referrals that were subsequently 
dismissed and those that resulted in some 
sanction is important since we found no 
evidence that arrest increases the future 
criminality of those arrested. This lack of a 
labeling effect is not inconsistent with the 
extensive literature showing criminogenic 
effects of incarceration relative to milder 
punishment such as probation or community 


103 


placement. The important distinction is that 
the current study compares the future criminal 
activity of those who are offending and 
punished (arrested) with that of those who are 
offending but not punished (released by 
police without arrest). In contrast, the typical 
study of specific deterrence compares the 
future criminal activity of two or more groups 
that differ, not in terms of whether they are 
punished or not, but in terms of how severely 
they are punished. It is possible that punish- 
ment will reduce future offending relative to 
no punishment at all, but that among those 
who are punished, more severe punishment 
may lead to increased future criminal activity. 

Perhaps this hypothesis is best illustrated in 
previous research by Lyle Shannon (1980, 
1985), using these same data, which has shown 
that severe sanctions, such as incarceration, 
lead to increased future criminality. Thus, 
within the same data, evidence indicates that 
arrest has some specific deterrent value even 
when more severe sanctions do not. 


APPENDIX A. Probit Models of Desistence for Police Contacts 1 through 6 


Independent Variable 1 2 
Arrest .32* .37 
(1.77) (1.65) 
Index contact —.14 —.24 
(—.79) (—1.01) 
Age 15 14 
(6.33) (4.43) 
Prior arrests — .08 
(-.33) 
Prior index contacts —.18 
(— .84) 

Prior contact rate 

Constant —2.79 — 2.83 


3 4 5 6 
.09 51 3i -53 
(.30) (1.32) (.70) (—.78) 
—.01 41 16 53 
(—.03) (.84) (.30) (65) 
18 29 .26 23 
(4.07) (3.82) (2.48) (1.70) 
— 26 .00 —.13 -.25 
(-1.11) (.01) (—.55) (—.97) 
02 —.08 38 .07 
(.12) (—.40) (1.47) (27) 
FM .58 22.13 —1.83 
(—2.45) (1.79) (—2.18) (—1.38) 
—3.54 —6.74 —5.84 —4.95 


* Maximum likelihood probit coefficients (t-ratios in parentheses). 
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Independent Variable 1 2 
Arrest — .06* —-.17 
: (—.51) (— 1.38) 
Index contact .14 .20 
(1.35) (1.80) 
Age .02 01 
(1.40) (.66) 
Prior arrests 23 
(1.62) 
Prior index contacts .02 
(.23) 
Active period —.48 -.32 
(— 7.83) (4.50) 
Prior contact rate 
Constant 1.35 1.02 


* OLS coefficients (r-ratios in parentheses). 
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APPENDIX B. OLS Models of Subsequent Police Contacts Rates for Police Contacts 1 through 6 





APPENDIX C. OLS Models of Duration of Contact Period for Police Contacts 1 through 6 


Independent Variable 1 2 

Arrest .01* .19 
(.02) (1.08) 

Index contact —.01 01 
(-.02) (.07) 

Age —.17 —.18 
(—8.57) (—8.31) 

Prior arrests —.16 
(-.86) 

Prior index contacts .07 
(.52) 


Prior contact rate 


Constant 6.45 6.67 
* OLS coefficients (f-ratios in parentheses). 





APPENDIX D. OLS Models of Time to the Next Police Contact among Persisters 





Independent Variable 1 2 

Arrest .17* 222 
(.54) (.54) 

Index contact —.39 -.37 
(—1.58) (~ 1.17) 

Age —.12 —.01 
(—3.57) (—.24) 

Prior arrests -.19 
(—2.14) 

Prior index contacts ~. 
(— 1.78) 


Prior contact rate 


Constant 4.37 2.73 


3 4 5 6 
—.38 —.35 — 45 —.19 
(—2.50) (—2.07) (—2.80) (- 1.23) 
—.02 02 ~ 01 03 
(~.16) (11) (.05) (.19) 
—.02 — 01 —.08 —.06 
(7.60) (—.40) (-.67) (—1.64) 
—.04 —.07 —.16 —.16 
(—.36) (—.68) (—1.67) (— 1.88) 
01 —.03 — 02 03 
(.13) (—.32) (—.19) (AD) 
—,59 — 46 ~ 65 — 64 
(—5.90) (—3.75) (—4.35) (—4.86) 
.06 .05 -.M — 26 
(.72) (41) (—.81) (-1.61) 
1.96 1.75 2.69 3.21 
3 4 5 6 
12 46 A5 7 
(.53) (2.10) (.67) (.54) 
~ 21 12 ~.17 ~.01 
(-.83) (.51) (—.60) (—.01) 
— 24 —.19 -.23 —.15 
(~6.60) (—5.78) (—5.17) (-3.26) 
01 ~.01 18 —.05 
(.03) (—.04) (1.41) (—.37) 
AS —.03 08 08 
(.92) (—.29) (.69) (.82) 
—.09 24 24 26 
(~.71) (1.59) (1.04) (1.06) 
7. 6.89 7.37 6.25 
3 4 5 6 
63 88 76 — 43 
(1.55) (1.90) (1.70) (—.86) 
-.32 —.10 20 -.61 
(-.78) (—.22) (.36) (— 1.08) 
-08 -10 — ,08 —.08 
(—1.33) (— 1.33) (—1.02) (—.81) 
13 — .04 47 13 
(-.41) (~.15) (1.86) (.46) 
-.13 — 08 —.01 25 
(—.50) (—.35) (.02) (1.20) 
—.18 9 - 45 -.31 
(—.80) (.58) (—.99) (-.57) 
2.91 2.91 2.43 2.64 





* OLS coefficients (f-ratios in parentheses). 
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This paper examines the relationship between employment conditions and 
property-crime arrest rates of male juveniles and young adults, using age-specific 
state-level data from 1977-1980, compiled from raw arrest data of the FBI’s 
Uniform Crime Reports and from the Census Bureau’s annual March Current 
Population Survey. The analysis is disaggregated by age and utilizes dimensions of 
underemployment to provide more sensitive indicators for labor market conditions, 
including measures of job availability (such as unemployment) and job quality 
(such as low hours and low wages). Controls are included for criminal opportunity 
and other variables related to crime and the labor market. Labor market effects on 
arrest rates differ for juveniles and young adults. Availability of employment 
produces strong effects on juvenile arrest rates—full-time employment is associated 
with low arrest rates, unemployment with high arrest rates. Low quality of 
employment (e.g., inadequate pay and hours) is associated with high arrest rates 
for young adults. We discuss theoretical and policy implications of our findings. 


INTRODUCTION 


This paper examines the relationship between 
employment conditions and property crime 
arrest rates of juveniles and young adults. The 
cross-sectional research reported here em- 
ploys a number of strategies to investigate the 
employment/crime relationship in greater depth 
than previous studies. First, both dependent 
and independent variables are age-stratified — 
a crucial feature, given the strong age- 
dependency of both employment and crime. 
Second, the independent variable (underem- 
ployment) includes not only unemployment— 
a traditional measure of the availability of 
employment—but also aspects of job quality 
that may have structural effects on the level of 
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crime. Third, controls are included for other 
factors thought to affect the labor market/ 
crime relationship, including criminal oppor- 
tunity, percent minority population, mobility, 
and deterrent effects of the justice system. 
The state-level data utilized in the study are 
compiled from the Federal Bureau of Investi- 
gation's Uniform Crime Reports (UCR) arrest 
data and from the Census Bureau's annual 
March Current Population Survey (CPS). 
Covering the years 1977-1980, the data are 
age-specific for males aged 13 to 17 and 18 to 
24—the two age-groups responsible for the 
majority of index offense arrests. Results are 
analyzed separately for the four property 
crimes included in the UCR Crime Index: 
robbery, burglary, larceny, and auto theft. 
The importance of understanding the crime/ 
labor market relationship among young peo- 
ple is highlighted by recent debate concerning 
the strong relationship of age to crime. 
Whether one concludes that the nature of that 
relationship is invariant (Hirschi and Gottfred- 
son 1983), or whether one believes that ' 
patterns differ significantly across time and 
offense type (Steffensmeier, Allan, Harer, 
and Streifel 1989; Greenberg 1977), there is 
no question that rates of offending for index 
crimes are highest for juveniles and young 
adults—age groups for whom rates of unem- 
ployment and other labor market problems are 
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also high. The fact that both labor market and 
crime indicators are strongly related to age 
could lead to spurious results in aggregate 
analyses based on totals for all ages. 


BACKGROUND 


The relationship between employment condi- 
tions and crime has long interested social 
scientists. The employment/crime link is 
predicted by traditional theories of strain, as 
well as by a variety of theoretical perspectives 
on crime, including neoclassical models of 
economic choice, neo-Marxist models of 
economic constraint, and reduced controls. 

At the level of individual behavior, the 
common view is that poor labor market 
conditions (e.g., high unemployment) (a) 
create a stressful state that renders individuals 
susceptible to criminal behavior in order to 
overcome their economic problems (Bonger 
1916; Brenner 1977), and/or (b) make 
illegitimate pursuits an attractive alternative 
to scarce or poorly paid work (Block and 
Heineke 1975). At the aggregate level, 
concern is not so much with whether 
unemployed individuals are motivated by 
economic need to commit crime, but with 
whether deleterious labor force conditions 
have “milieu” effects that are associated with 
high crime rates (Chiricos 1987; Logan and 
Messner 1987). One common structural 
approach suggests that poor employment 
prospects may have a demoralizing impact 
that creates an anomic climate with crimino- 
genic consequences for those both in and out 
of the labor force (Merton 1956). Although 
only direct labor market effects are explored 
here, there may also be important indirect 
effects through attenuation of informal and 
formal community controls under depressed 
employment conditions (Sampson 1987; Mc- 
Gahey 1986). 

Our focus in this report is on the structural 
relationship between employment conditions 
and levels of crime for different ecological 
units. In this regard, few studies have focused 
specifically on the impact of labor market 
conditions on crime at the aggregate level, 
although a substantial number of studies using 
either cross-sectional or time-series data have 
included unemployment as an exogenous 
variable. The results of these studies are 
mixed. Indeed, several recent reviews of 
aggregate research on the issue have called 
into question the strength (Orsagh 1980), the 
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significance (Long and Witte 1981), the 
direction of the unemployment/crime relation- 
ship (Cantor and Land 1985), and even its 
very existence (Fox 1978). 

Nevertheless, in the most thorough review 
to date of the aggregate research, Chiricos 
(1987) strongly disagrees with the above 
claims, which he suggests have come to 
constitute a "consensus of doubt" about the 
unemployment/crime relationship. After re- 
viewing the findings from 63 studies that 
report some measure of the unemployment/ 
crime relationship, he argues that the "con- 
sensus of doubt" is premature, concluding 
instead that the preponderance of evidence 
from both cross-sectional and time-series 
studies "does favor the existence of a 
positive, frequently significant U-C relation- 
ship" (p. 203). 

For our purposes, the significance of the 
reviews by Chiricos and other writers is that 
they underscore how little we really know 
about the issue and how weak the research to 
date has been. First, contrary to the argument 
of some that this is a "now well-plowed 
terrain" (Wilson and Herrnstein 1985, p. 
313), there are in fact very few studies that 
have focused specifically on the impact of 
employment conditions on crime. Instead, 
most of the published findings are derived 
from research that includes unemployment 
only as an exogenous variable of interest. 

Second, even those studies designed explic- 
itly to test the employment/crime relationship 
suffer from statistical or measurement flaws: 
for example, collinearity among independent 
variables (Fleisher 1966); inappropriate parti- 
tioning of data on a variable, such as poverty, 
that is clearly related to the variable of 
interest, such as unemployment (Fleisher 
1966; Singell 1976; see Blalock 1982 for a 
discussion of inappropriate partitioning of 
data); and/or failure to include controls for 
socioeconomic or law enforcement variables 
that may suppress or enhance the job 
involvement/crime relationship (Glaser and 
Rice 1959; Danser and Laub 1981; Thorn- 
berry and Christenson 1984). Also, the use of 
national-level data by some studies may not 
allow for meaningful variation between units 
essential for assessing the employment/crime 
relationship (Glaser and Rice 1959; Fleisher 
1966; Danser and Laub 1981; Thornberry and 
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Christenson 1984; Phillips, Votey, and Max- 
well 1972). 

Third, most researchers have not heeded 
Radzinowicz’s (1939) early cautions that the 
relationship between employment conditions 
and crime will probably differ across types of 
crime and social groups. For example, as 
reported in the Chiricos review, the relative 
frequency of positive and significant findings 
is highest for property crimes (particularly 
burglary and larceny) and lowest for violent 
crimes. Even more significant is the manner 
in which the employment-crime relationship 
may differ among age-groups, since age is 
strongly related to both crime and the labor 
market. Both rates of offending and unemploy- 
ment are highest for juveniles and young 
adults, so that for the total population the U-C 
relationship may be artifactually suppressed 
when the youth cohort is small relative to 
total population, and artifactually enhanced 
when the youth cohort is large. 

Some age-specific research exists (see 
Allan 1985 for a review of age-specific 
research), although for the most part juveniles 
are the only age-group represented. Although 
these studies generally report a positive 
' association between depressed labor market 
conditions (unemployment is usually the sole 
measure) and crime, the findings are not 
conclusive, and some weak or negative 
associations are reported for particular crimes 
or for particular ages. These studies fail to 
provide a conclusive test of the employment 
crime relationship for specific age groups 
because they generally do not provide age- 
specific measures of the independent variable, 
or because they do not include controls for 
other important factors related to both crime 
and the labor market. 

Fourth, the most serious shortcoming of 
prior research is the inadequate conceptualiza- 
tion and operationalization of labor market 
conditions. The most common indicator has 
been unemployment. However, recent studies 
suggest that there may be other dimensions of 
the labor market—i.e., other than official 
measures of unemployment— which are 
equally critical (or perhaps even more critical) 
in shaping socioeconomic climates that may 
be conformogenic or criminogenic. The 
traditional assumption has been that unemploy- 
ment is an indicator of economic need, which 
some may satisfy by resorting to property 
crime. But the modern welfare state buffers 
many of the unemployed against economic 
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need through unemployment compensation, 
welfare, and other programs. Furthermore, 
the official measure of. unemployment does 
not actualy tap joblessness per se, but 
job-seeking; and job-seeking may reflect 
commitment to work rather than to illegiti- 
mate pursuits. 

Using officially measured unemployment 
as a sole indicator for economic need also 
neglects other important aspects of the labor 
market that may be criminogenic at the 
ecological level. Dimensions of employment 
itself may encourage (or fail to inhibit) the 
growth of illegitimate alternatives. Where 
jobs are insecure, with low pay, few benefits, 
and minimal opportunities for advancement, 
work may provide fewer incentives for young 
people to form lasting commitments to 
conventional lifestyles, and such conditions 
may help to create and sustain deviant 
subcultures at the community level while 
eroding norms and social controls. 

Recent studies have suggested approaches 
to the labor market that provide more 


.sensitive views of the dimensions of eco- 


nomic risk, including underemployment (T. 
Sullivan 1978; Clogg 1979), the segmented 
labor market approach (e.g., Baron and 
Bielby 1984), and working class stratification 
(Form 1985). Each of these approaches 
illuminates aspects of economic marginality 
that may enhance the attractiveness of illegit- 
imate alternatives to the labor market; yet 
none has been exploited in attempts to explore 
the relationship between the labor market and 
crime. 

Thus, despite the longstanding interest of 
social scientists in the problem, theoretical 
and empirical analyses of the employment/ 
crime relationship are weak, especially on the 
extent to which the relationship differs by age 
and by type of labor market dimension. The 
present study addresses the limitations of 
previous ‘research by conducting separate 
analyses of age-specific data for juveniles 
aged 13 to 17 and for young adults aged 18 to 
24, and by analyzing the independent effects 
not just of unemployment but also of three 
other dimensions of underemployment: low 
wages, low hours, and discouraged workers. 
Other independent variables identified in 
previous research as major correlates of crime 
are included in the analysis as controls. 

On theoretical grounds, it is reasonable to 
assume that factors salient to criminal involve- 
ment will vary by age and that the effects of 
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employment opportunities on crime will also 
vary by age. The two age-groups most at risk 
for both crime and labor market problems are 
juveniles and young adults. In many respects 
juveniles and young adults live in two 
different worlds, both at the individual and at 
the ecological level. While they are not 
subject to the same pressures as adults to 
succeed in the work world (Glaser and Rice 
1959; Gibbs 1966), juveniles are subject to 
pressures to consume, to seek expression of 
their masculinity in the absence of adequate 
access to the labor market, and to make the 
transition. from adolescence to adulthood 
(Greenberg 1977). These pressures are aggra- 
vated by the lack of culturally sanctioned rites 
of passage that exist in some societies. Work 
may be one outlet for these pressures, 
delinquency another. 

The types of delinquency engaged in by 
juveniles are typically exploratory and low- 
yield, unlikely to provide more than pocket 
money— much like the minimum wage ser- 
" vice jobs available to teenagers (Steffens- 
meier et al., forthcoming). If suitable work is 
plentiful, low-yield high-risk property crime 
:may be unappealing as a source of spending 
money or consumer objects. For juveniles, 
therefore, the availability of work (as mea- 
sured by the traditional employment/unemploy- 
ment distinction) may be a structural determi- 
nant of the attractiveness of delinquent 
alternatives. Furthermore, the availability of 
opportunities to participate in work may 
inhibit the power and development of peer 
subcultures, foster normative expectations of 
working rather than stealing to obtain pocket 
money, and mitigate the otherwise difficult 
transition into the adult world. Ecologically, 
the greater the availability of work, the 
smaller the proportion of the juvenile popula- 
: tion that will choose delinquent alternatives. 

For young adults the choice of legitimate 
over illegitimate pursuits may be influenced 
more by the quality of employment than by 
mere availability. Where rewards are low in 
comparison to those that may be obtained 
from illegitimate pursuits, rejection of the 
latter may be delayed, particularly if adoles- 
cent opportunities for work have been too 
limited to develop strong commitment. Low- 
wage and/or part-time employment that may 
have seemed acceptable or even status- 
enhancing to juvenile first-job holders still 
living at home becomes constraining and 
ünsatisfactory to young adults seeking greater 
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independence. At the same time, young 
adults may not yet have developed the types 
of family commitments or other stakes in 
conformity that serve to inhibit the choice of 
illegitimate alternatives among older age- 
groups. Furthermore, lack of suitable employ- 
ment may contribute to a climate of moral 
cynicism and alienation that attenuates the 
effectiveness of social controls. Thus, the 
more an area is dominated by marginal 
employment, the greater the proportion of the 
young adult population at risk for crime. 
However, if the labor market is characterized 
by a good supply of jobs with adequate hours 
and pay, crime is likely to be less attractive, 
the payoff of low-yield exploratory crimes 
diminishes, and the costs of criminal justice 
sanctions increase appreciably. 

So. far our discussion has focused on 
motivational types of mechanisms that can 
affect individual choices in such a way as to 
add up to effects at the ecological level. In 
this respect our model resembles the rational 
choice model used in econometric studies of 
crime. It is difficult to discuss the issue of the 
labor market and crime without discussing 
factors that are relevant to individual choices. 
However, it does not follow that an ecological 
relationship between underemployment (for 
example) and crime implies that underem- 
ployed individuals are necessarily the ones 
committing crimes or being arrested. Rather, 
we see the labor market "climate" of a region 
as establishing conditions that may be condu- 
cive to the growth of conformist or criminal 
behavior in the population at large. For 
example, domination of a regional labor 
market by very marginal sorts of employment 
may operate to motivate people who are not 
in the labor force (and therefore not underem- 
ployed) to remain out of the labor market and 
to seek illegitimate alternatives; or persons 
who are either unemployed or employed in 
marginal jobs may be motivated to take 
illegitimate risks because there is little hope 
of anything better. In regions with healthy 
labor markets, those in and out of the labor 
force may be motivated to avoid risks because 
there are clear opportunities that one would 
not want to jeopardize. Thus, like Blau and 
Blau (1982, p. 115) the focus of this study is 
“not what kind of individuals tend to commit 
[crimes], but what social conditions make it 
likely that many people commit them." 
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SOURCES OF DATA AND 
MEASUREMENT OF VARIABLES 


The unit of analysis in this study is state-level 
data for male juveniles aged 14—17 and young 
adults aged 18-24, for the years 1977—1980. 
The data (aggregated by state, year, and 
age-group) were drawn from data tapes of the 
FBI (1977-1980) and the Census Bureau 
(1977—1980). The data on arrests for robbery, 
burglary, larceny, and auto. theft were aggre- 
gated from raw data made available by the 
FBI in the form of arrest counts by crime type 
and age subgroup for each police jurisdiction 
of the 50 states for the years 1977-1980. The 
measures for labor market variables and 
control variables were obtained from the 
Census Bureau’s March Current Population 
Surveys for 1977-1980 and from publications 
of the U.S. Department of Justice 
(1977-1981). The various FBI, CPS, and 
other files were aggregated by state and by 
age-group (where appropriate) and merged.! 
For each variable of interest, there are data 
from 50 states for each of the four years in the 
period 1977-1980. For each age-group, N = 
200 (50 states X 4 years).? 


! States were the smallest practicable unit of 
aggregation for this study. While metropolitan 
areas would be a preferable unit of analysis of 
labor market conditions, the use of SMSAs was 
problematic: underemployment variables are avail- 
able only in the CPS, and CPS data can be 
aggregated only for the 35 or 40 largest SMSAs, 


making for a smaller N; SMSA codes used by the ` 


FBI are different from those used by any other 
agency, raising problems for merging the different 
data sets. The use of the state as a unit of analysis 
has been supported by Linsky and Straus (1986) in 
a comparison of social stress indicators across 
states, metropolitan areas, and nonmetropolitan 
areas. Furthermore, because both legislation and 
corrections policies, as well as many labor market 
(e.g., "right to work laws") and law enforcement 
policies, are developed and implemented at the 
state level, the state may in fact constitute an 
appropriate level of analysis. Nevertheless, the 
authors are reexamining the feasibility of aggregat- 
ing and analyzing data for SMSAs. 

The use of the rectangular data set (states X 
years) yields a gain in efficiency through pooling 
(Hannan and Young 1977). Pooling the data 
assumes invariance of hypothesized relationships 
over the years included in the analysis. This was 
confirmed by preliminary analysis of each year and 
period (Allan 1985). 


Measurement of Arrest Rates 


The dependent variable i is the xn loi df 
the arrest rates per 100,000 for the crime 
index property offenses (robbery, burglary, 
larceny, and auto theft), corrected for actual 
urban-rural population distribution. Age- 


Cre 


? The logarithmic transformation reduces nonad- 
ditivity and increases the homogeneity ofi, the 
variances. It also minimizes reliability problems 


. stemming from use of population estimates rather 


than census counts in calculating. the rates. We 
think of the rate as r; = (x/nj), where rj =the 
"true" rate, x; = arrest volume, and. n, 31.5 true" 
population. However, n, is estimated, rather, than 
known. The estimated rates must be calculated as 
r* = — (x*ini*). Comparison „of ..1980... CPS 
estimates with 1980 census data Suggests that the 
ERS estimates tend to vary proportionally from. the 
Census data. The observed nj* may be thought, of 
as consisting of the “true” n;, disturbed by random 
ee error, e^ “i: nt. =, ne. “i. Thus, 77% 
= (x/ne "i and, ri* = (ue"ilnj. (Taking is 

logarithm of the r;* yields: log r;* log r; + 
which shows that the true log-arrest-rate i is pra 
the observed log-arrest-rate plus a measurement 
error (uj). If we consider linear models with log r; 
as the dependent variable, and predictors Zli Z2b 

. . , %q the foregoing arguments suggest the 
following specification: 


log TQ; a Bo + Bizid;i so 
+ Bau + (e — u) 
= Pu + u* 


Note that the disturbance in this equation (u;* = e; 
— uj) includes two parts: measurement error (uj) 
and left-out causes (captured by ej. We are 
grateful to Clifford Clogg for suggesting this line 
of reasoning. 

^ Two problems arise from the aggregation of 
arrest data at the state level that prior research has 
failed to consider. First, the UCR population 
coverage for a particular state may vary from year 
to year, particularly when a major city fails to 
report its crime data for a certain year, as happens 
occasionally even for some of the largest cities in 
the country. Therefore, in this study we include 
only agencies reporting arrest data for all four 
years, 1977-80. 

Second, the population coverage of agencies 
participating in the UCR program may not 
resemble the population distribution of the state as 
a whole. Since arrest rates in cities tend to be much 
higher than in the rural areas, statewide rates may 
be drastically affected if the UCR coverage differs 
significantly from the actual rural-urban popula- 


fion. In states where center city populations are 


overrepresented in the UCR, statewide rates will 
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specific population estimates used in calculat- 
ing the rates for inter-censal years were 
obtained from the CPS.5 All four index 
property offenses are included, since a 
measure of total property index offenses 
would be heavily weighted by larceny arrests. 
By analyzing all four, we provide a useful 
cross-check on whether effects (if any) are 
consistent across offense categories; alterna- 
tively, interesting substantive differences may 
be revealed. 

Official statistics such as the UCR have 
been severely criticized; but, despite its 
shortcomings, criminologists generally agree 
that the UCR furnishes fairly valid estimates 
of comparative frequencies of serious crimes, 
though not of their absolute frequencies 
(Gove, Hughes, and Geerken 1985). With 
respect to the use of official arrest data for the 
estimation of age-specific offending rates, the 
body of research comparing the UCR index 
arrest rates with age-specific offending rates 
estimated from surveys and self-reports has 
found close agreement (Steffensmeier et al., 
forthcoming). The use of arrest data is also 
supported by the large body of research on 
police-citizen encounters, which has found 
that the seriousness of the crime is the 
strongest predictor of arrests (Gove et al. 


be exaggerated, and the reverse will be true for 
states where rural populations are overrepresented. 

For these reasons, the crime rates have been 
adjusted to reflect more accurately the actual 
population composition of each, using the follow- 
ing method. First, rates are calculated separately 
for central cities, for the metropolitan balance, and 
for nonmetropolitan areas, using arrest volumes 
and population estimates aggregated separately for 
each of those geographic segments of the state. 
Second, region-specific CPS population estimates 
are multiplied by the region-specific arrest rates to 
obtain estimates of the "actual" volume of arrests 
for that geographic segment. Third, these esti- 
mated arrest volumes for the three segments are 
summed to obtain a corrected volume of arrests for 
the state as a whole, given the CPS estimates of the 
actual rural-urban distribution of the population. 
Finally, corrected total rates are calculated for each 
state by substituting the corrected total arrest 
volume into the crime rate equation. 

5 The weights assigned to each individual and 
family record permit aggregations to higher levels 
(e.g., state levels) which provide reasonably 
accurate estimates for the total population at those 
levels. The accuracy of these estimates was 
verified by regression of the CPS estimates for 
1980 on actual census counts for 1980. 
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1985); arrest data for index crimes may 
therefore be representative of serious property 
offenses. 


Measurement of Labor Market Conditions 


In previous studies unemployment has often 
been used as the sole indicator for labor 
market disruption. But more diverse indica- 
tors are needed for a full understanding of the 
employment/crime relationship. The underem- 
ployment approach of Sullivan (1978) and 
Clogg (1979) identifies an underclass that 
bears much of the economic risk in society, 
and ease of measurement provides an impor- 
tant additional advantage. Therefore, four 
dimensions of underemployment are used 
here as the principal measures of labor market 
conditions. Unemployment (UNEMP), the 
percent of the labor force without work but 
looking for work, is one of the dimensions of 
underemployment. The other three dimen- 
sions are: subunemployment (SUBUNEMP), 
a proxy for discouraged workers (people who 
have been without work for a long time and 
have given up looking for work); low hours 
(LOWHOUR), the percent of the labor force 
employed part-time for economic reasons 
(unable to find full-time work); and low 
wages (LOWW AGB), the percent of the labor 
force with sub-poverty-level wages.$ 

Each of the dimensions of underemploy- 
ment represents a different aspect of eco- 
nomic risk. Both the unemployed (UNEMP) 
and the subunemployed (SUBUNEMP) are 
without jobs but wanting work. Those 
underemployed by reason of low hours 
(LOWHOUR) or low wages (LOWWAGE) 
represent classes of workers who are em- 
ployed but in very marginal sorts of jobs. 
Among young people, the proportion of the 
population in the labor market varies consid- 
erably from state to state, and an earlier phase 
of the research (see Allan 1985) established 
that population-based measures provide more 
accurate estimates of the groups at risk for 


$ The operationalization of the LOWWAGE 
measure differs slightly from Clogg (1979) in that 
it excludes the voluntary part-time workers of the 
previous year, and prorates the wage calculation to 
take part-time work (of those working part-time 
involuntarily) into account. Early stages of analysis 
revealed that the LOWWAGE category included a 
substantial number of workers whose wages fell 
below the poverty line because they worked only 
part-time. 
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report, therefore, each indicator is operation- 
alized as the percent of the population for the 
respective age-group falling into each under- 
employment category. Except for the use of 
the total age-specific population as the base 
(rather than the ‘labor force), we have 
followed procedures established by Clogg 
(1979), Among juveniles, effects for UNEMP 
and SUBUNEMP are expected to be positive 
and strong; indicators for job quality, such as 
LOWHOUR and LOWWAGE, are expected 
to be weak and/or negative. Among young 
adults, positive effects are expected for all 
four variables, with effects for job quality 
(LOWHOUR and LOWWAGE) expected to 
be at least as strong as the effects for job 
availability. 

For purposes of comparison with previous 
research, our analysis also looks at the effects 
of both age-specific and total unemployment 
alone, excluding the other underemployment 
variables (Ehrlich 1973; Brenner 1977), and 
at the effects of lagged and contemporaneous 
unemployment (Cantor and Land 1985; Parker 
and Horwitz 1986). 


Measurement of Control Variables 


The control variables used in this study are 
household activities index, minority popula- 
tion, residential mobility, geographic region, 
new prison commitments per 100 index crime 
arrests, and police per 1000 population. 
Since underemployment varies inversely 
with economic conditions while criminal 
opportunity varies directly with economic 
conditions, failure to control for the latter 
could suppress the underemployment/crime 
relationship. The household activities index 
(HHACTRAT), following Cohen and Felson 
(1979), is included to control for illegitimate 
opportunity. MINORITY, or percent minority 
population (black and hispanic), is included 
since both levels of serious crime and levels 
of underemployment are disproportionately 
high among blacks and hispanics (Curtis 
1975; Wilson 1987; Currie 1985). Residential 
mobility (MOBILITY) is included as an 
indicator for breakdown in informal mecha- 
nisms of social integration and control (Loftin 
1980) and is operationalized as the percent of 
the population not living in the same 
residence in 1980.as in 1975. SOUTH is a 
dummy variable distinguishing states located 
in the South (South Atlantic, East South 
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Central, or West South Central) from those 
located in other regions. Since criminal 
justice policy is thought to have an important 
influence on overall levels of crime, we 
included the rate of prison commitments per 
100 index crime arrests as a rough indicator 
for probability of punishment (PRISON). 
Since the number of police available to. make 
arrests is an important determinant of arrest 
volumes, the number of police per 1000 


‘population (POLICE) is included to control 


for law enforcement activity (Liska 1987). 
Finally, a trend variable, YEAR, is included 
to control for trending, to minimize the 
likelihood of confounding longitudinal with 
cross-sectional effects. The control variables 
are included in all regressions. 

Standard multiple regression techniques are 
the chief analytical tool. Because substantial 
error variances were discovered in the course 
of state-by-state inspection of data for the 
early years of the study period, weighted least 
squares are used to provide more efficient 
estimates of regression coefficients (see, Neter 
and Wasserman 1974). Appropriate weights 
are adopted from Census Bureau (1978) 
documentation accompanying the CPS tapes. 
Regressions are run separately for the two 
age-groups, for each of the four property 
offenses. 


RESULTS 


As displayed in Table 1, juvenile and young 
adult differences in arrest rates and underem- 
ployment are generally as expected. Except 
for robbery, arrest rates are higher for 
juveniles. Unemployment rates are high for 
both: age-groups, with the labor-force-based 
rate for juveniles about one-third higher than 
the rate for young adults. On the other hand, 
the population-based unemployment measure 
is higher for young adults (because labor 
market participation is so much lower among 
juveniles). The rate of subunemployment 
(discouraged workers) is also higher for 
juveniles when based on the labor force, but 
not when measured against the entire popula- 
tion. The rates of low-hour and low-wage 
employment are much higher for young 
adults, regardless of whether the variables are 
measured on the labor force or the population. 

The very low juvenile rates for LOW- 
HOUR. and LOWWAGE (each less than one 
percent of the juvenile population) are less 
surprising when it is recalled that most 
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Table 1. Means and Standard Deviations For Arrest Rates and Labor Market Variables, by Age 
Age-Group 
14-17 18-24 

Variable Mean (s.d.) Mean (s.d.) 
Arrest Rates: 
ROBBERY 

Adj. Rate* 299.23 (261.96) 312.38 (173.36) 

Log (Rat)? , 5.39 (.91) 5.57 (.66) 
BURGLARY 

Adj. Rate 2439.63 (977.45) 1146.66 (446.74) 

Log (Rate) 7.72 (6.97) 42 (.40) 
LARCENY 

Adj. Rate 4161.16 (2122.67) 1880.69 (763.50) 

Log (Rate) 8.23 (.46) 7.60 (.33) 
AUTO THEFT 

Adj. Rate 781.86 (358.51) 290.80 (150.27) 

Log (Rate) 6.55 (.50) 5.57 (.44) 
LABOR MARKET VARIABLES 
UNEMP (LMF 19.86 (11.33) 12.12 (5.30) 
UNEMP (POP)? 6.67 (3.33) 9.77 (4.17) 
SUBUNEMP (LM) 3.34 (3.84) 1.98 (2.13) 
SUBUNEMP (POP) 1.10 (1.26) 1.57 (1.58) 
LOWHOUR (LM) 2.73 (3.08) 5.02 (2.58) 
LOWHOUR (POP) 0.94 (1.10) 3.96 (2.02) 
LOWWAGE (LM) 1.25 (2.05) 3.72 (2.16) 
LOWWAGE (POP) 0.43 (0.73) 3.02 (1.77) 


* These rates are adjusted to reflect actual distribution of the population among center cities, the metropolitan 
balance, and nonmetropolitan areas. See text for method used. 

> Natural log of the adjusted rate. The logs were used in the regressions reported here. 

* (LM) represents the variables measured as a percent of the labor force for the age-group. 

4 (POP) represents the variables measured as a percent of the total population for the age-group. The 


population-based measures were used in the regressions. 


juveniles in the labor force are voluntary 
part-time workers. Only a relatively small 
proportion of juveniles seek full-time employ- 
ment (the low-hour measure includes persons 
who would prefer full-time employment, and 
the low-wage measure is virtually restricted to 
full-time employees). Finally, standard devi- 
ations for all variables are fairly high in 
reJation to the means, reflecting considerable 
variation among the states both in levels of 
offending and in labor inarket conditions. 
Regression results for the four underemploy- 
ment measures in Table 2 are. quite different 
for juveniles and young adults. For juveniles, 
the strongest effects are found for unemploy- 
ment (associated with high arrest rates) and 
for low-wage employment (associated with 
low arrest rates): all coefficients for these two 
variables are statistically significant." In terms 
of size of effects, a 1 percent difference in 
unemployment rates (which vary within a 
range of 3 to 6 percent of the juvenile 


7 Unless otherwise stated, results reported as 
“statistically significant” have a p-value of .05 or 
less. 


population) is associated with a proportional 
difference in arrest rates of about 114 to 6 
percent.? A 1 percent difference in low-wage 
employment is negatively associated with 
proportional differences in juvenile property 
arrest rates that vary from 11 to 15 percent. 
Effects for the other two underemployment 
variables -LOWHOUR and SUBUNEMP— 
are inconsistent, frequently weak and/or 
negative. 

The strongly negative effects of LOW- 
WAGE on juvenile property arrests do not 
conform to the expectation of weak or mixed 
results and deserve some attention. An 
interpretation that low wages for juveniles 
leads to low juvenile crime does not seem 
plausible. Collinearity between UNEMP and 
LOWWAGE is ruled out as an explanation, 


* [n the discussion here, a change from 15 to 16 
percent in the juvenile unemployment rate would 
be a one percent change. With arrest rates, a 
change from 100 (Zn — 4.605) per 100,000 to 105 
(1n 4.654) would represent approximately a one 
percent proportionate change, as would a change 
from 1000 to 1070, or a.change from 10 to 10.2. 
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Table 2. Regression Coefficients for Underemployment Model 
Arrest Rates for Juveniles Arrest Rates for Young Adults 
ROB- BUR- LAR- AUTO ROB- BUR- LAR- AUTO 
BERY  GLARY CENY THEFT BERY  GLARY CENY THEFT 
Underemployment Variables 
UNEMP 0.058*** = 0.018**  0.017** — 0.015* 0.011 0.002 0.010* 0.019 
SUBUNEMP —0.052" -—0.011  -—0.023' 0.030' 0.032  —0,.060*** -0.014  —0.012 
LOWHOUR 0.042 0.038' 0.066** —0.042 0.053***  0.018* 0.030***  0.030** 
LOWWAGE —0.140** —0.112*** —0.149*** —0.156*** 0.067*** 0.009 0.048***  0.024* 
Control Variables 
HHACTRAT 0.003 0.027*** 0.018**  0.037*** — —0.003 0.013* 0.009” 0.024*** 
MINORITY 0.012* 0.010*** 0.004’  —0.007* 0.021*** 0.011*** 0.008*** 0.002 
MOBILITY —0.016** — 0.003 0.008**  0.019***  —0.005 0.001 0.006** 0.011*** 
PRISON —0.060*** —0.038*** —0.052*** —0.049*** — —0.017*  —0.015** —0.031*** —0.055*** 
POLICE 0.075*** 0.115"  —0.089'  —0.007 0.574*** 0.157** —0.054 0.381*** 
SOUTH 0.382** 0.002  -—0.276*** —0.338*** 0.265** — 0.337**"* 0,139* 0.396 
YEAR 0.060' 0.056* 0.049*  —0.060** . —0.006 0.079*** 0.061*** 0.019 
R-SQUARE .66 .67 .65 .67 .63 .59 51 59 
ADJ R-SQ 64 65 .63 65 61 57 48 56 
D.F.* 191 191 191 191 191 191 191 191 
* Missing data for one or more variables caused some attrition in sample size. 
! pz .20 or less. 
* p=.10 or less. * p=.05 or less. 
** p= .01 or less. 


*** p= .001 or less. 


since the zero-order correlation is only 
modest (.27), and collinearity diagnostics 
reveal no problems. It seems more likely that 
these results derive from the fact that, among 
juveniles, the proportion of full-time workers 
with wages above the poverty line is small. 
The LOWW AGE measure is therefore a close 
proxy for the percent of juveniles employed 
full-time.? If this is the case, then the strong 
negative coefficients for LOWWAGE may 
reflect the effects of availability of full-time 
employment for those juveniles wishing it, 
rather than the effects of low-wage employ- 
ment per se. 

For young adults, marginal employment is 
associated with high arrest rates: strong 
positive effects are found for both LOW- 
HOUR and LOWWAGE. All but one of the 
coefficients for these two underemployment 
variables are statistically significant. One 
percent changes in the incidence of these 
forms of underemployment are associated 
with proportionate differences in arrest rates 
that vary from about 2 percent to over 6 
percent. Unemployment effects are positive, 
but much weaker than for juveniles. The 


? To verify whether the negative effects of 
LOWWAGE on juvenile offending are correctly 
attributed to the availability of full-time work, a 
more direct proxy for full-time work should be 
used in future research. 


effects of SUBUNEMP (discouraged worker 
proxy) are weak and predominantly negative 
(the coefficient for SUBUNEMP in relation to 
robbery is negative and statistically signifi- 
cant). Among both juveniles and young 
adults, it is possible that the SUBUNEMP 
measure picks up many who are without work 
voluntarily .1° 

Looking at results across offense catego- 
ries, we see no clear differences in patterns of 
effects for the underemployment variables. 
For juveniles, the effects of UNEMP (posi- 
tive) and LOWWAGE (negative) are quite 
consistent for the four property offenses. And 
for young adults, both LOWHOUR and 
LOWWAGE produce consistently positive 
effects for all four categories. Variations 
occur mainly for SUBUNEMP and LOW- 
HOUR in the case of juveniles, and for 
SUBUNEMP in the case of young adults 
(UNEMP effects are consistently positive but 
weak). . 

Taken together, these findings are gener- 
ally consistent with our expectations, outlined 


10 Since the March CPS does not include a 
measure designed explicitly to reveal discouraged 
workers, the SUB measure assumes a desire for 
work from the previous year's job history, which 
for juveniles and young adults may include 
summer work preceding a return to school in the 
fall. 
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Table 3. Contribution of Underemployment Variables to Total R-Square 


Juveniles Young Adults 

ROB- BUR- LAR- AUTO ROB- BUR- LAR- AUTO 

BERY  GLARY CENY THEFT BERY  GLARY CENY THEFT 
Net Contribution of Underemployment Variables to Total R-Square: 
R-SQUARE .097***  .052*** — .0g3*** — 122^ .082*** 011" -143*** — O51*** 
ADJ R-SQ .093 .047 .078 .119 .077 .002 .167 .068 
Percent of Explained Variance Accounted for by Underemployment Variables: 
R-SQUARE 14.8% 7.896 12.7% 18.1% 13.0% 1.9% 27.8% 8.6% 

14.5% 7.2% 12.4% 18.296 12.6% 4% 32.5% 11.6% 


ADJ R-SQ 


earlier. Among juveniles, the strong positive 
effects for UNEMP, coupled with the strorg 
negative effects for LOWWAGE, imply that 
the availability of suitable employment is a 
more important predictor of arrest rates for 
this age-group. 

' Regression results for the various control 
variables conform fairly well to expectations. 
Criminal opportunity, as measured by the 
household activities index (HHACTRAT), is 
positively related to arrest rates for property 
crimes for all age-offense categories except 
robbery arrest rates of young adults; five cf 
the seven positive coefficients are statistically 
significant. The percent minority populatioa 
(MINORITY) has strong, positive coeffi 
cients for robbery and burglary for both 
juvenile and young adult arrest rates, with 
mixed results for the larceny and auto theft 
offense categories. The percent not living in 
the same place in 1980 as in 1975 (MO- 
BILITY) has strong positive effects on 
juvenile and young adult arrest rates for 
larceny and auto theft, but weak and/o- 
negative effects for robbery and burglary. The 
coefficients for PRISON (the number of new 
prison commitments per 100 index offense 
arrests) are uniformly negative and statisti- 
cally significant.!! Coefficients for POLICE 
(number of police per 1000 population) are 
positive and statistically significant for rob- 
bery arrest rates for juveniles, and for three of 
four offense categories for young adults, a 
result that probably reflects the likelihood that 
more police lead to more arrests (Liska 1987). 
The inconsistent results for the remaining four 
age/offense categories may reflect differences 
in arrest patterns among states with varying 
urban/rural population patterns. Strong but 





' H The results for PRISON may be an artifact of 
the higher arrest rates for juveniles and for less 
serious offenses, compared to greater probabilities 
of imprisonment for older arrestees with more 
serious offenses. 


inconsistent effects are found for the regional 
variable, SOUTH. Coefficients are positive 
for SOUTH in six of the eight equations and 
statistically significant in four; coefficients 
are negative and statistically significant in the 
remaining two equations (juvenile rates for 
larceny and auto theft). Finally, coefficients 
for the trend variable YEAR reveal that 
changes in arrest rates over the four-year 
period of the data are generally positive, net 
of the effects of the other control variables. 

After taking into account the effects of 
criminal opportunity, percent minority popu- 
lation, geographic mobility, probability of 
imprisonment, and the other control vari- 
ables, the net direct effect of the underemploy- 
ment variables is statistically significant for 
most age/offense categories. The proportion 
of variance in the dependent variable that is 
explained collectively by the underemploy- 
ment variables ranges from less than .01 to 
.14 or more, averaging around .08 (between 
10 and 15 percent of total explained variance 
for most equations) (see Table 3). The 
underemployment variables make the least 
contribution to explaining variation in bur- 
glary arrest rates, particularly among young 
adults—the only equation for which the 
contribution of the labor market variables is 
not statistically significant. 

So far we have examined the varying 
effects of the underemployment variables on 
property crime rates for juveniles and young 
adults; and, after we control for other 
important factors related to crime, the under- 
employment variables collectively make a 
statistically significant contribution to ex- 
plained variance in all but one instance. To 
evaluate how these effects fare in comparison 
with variables used in earlier studies, a 
variety of models employing traditional unem- 
ployment indicators were assessed: age- 
specific unemployment alone (excluding the 
other underemployment variables), total un- 
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Table 4. Regression Coefficients for Alternative Labor Market Measures 
UNEMP (age-spec.) TOTALUN LAGUNEMP UNEMP 
(Excl. other under- (Excl. age-spec. (Current UNEMP (Lagged UNEMP 
employment vars.) Underemp. vars.) also in model) also in model) 
14-17 18-24 14-17 18-24 14-17 18-24 14-17 18-24 
ROBBERY 0.073*** 0.003 0.123***  0.044* 0.020 0.031**  0.062***  —0.012 
BURGLARY 0.010’ 0.002 0.027* 0.021* 0.008 0.014* 0.012' — 0.003 
LARCENY 0.023*** 0.005 0.040** — 0.046***  —0.002 0.012” 0.013’ — 0.003 
AUTO THEFT  0.024** 0.020** — —0.004 0.048***  —0.023* 0.019* 0.011’ 0.012' 


employment, and lagged versus contempora- 
neous unemployment.!? Unemployment is the 
labor market variable most commonly used in 
previous studies, while in the present study it 
is but one of several labor market indicators. 
The first two columns in Table 4 display the 
results obtained for age-specific unemploy- 
ment when SUBUNEMP, LOWHOUR, and 
LOWWAGE are excluded from the equation. 
The results show little difference overall for 
UNEMP coefficients, but reductions both in 
total R-square and in adjusted R-square range 
from less than .01 to .15 or more. This 
reflects the reduction in explanatory power 
caused by loss of the other underemployment 
dimensions. Our use of the full array of 
underemployment variables makes a greater 
contribution (than age-specific unemployment 
alone) to clarification of the crime/labor 
market relationship as it is patterned by age. 
(F-tests on the differences were statistically 
significant for 11 of the 12 equations.) 
Results are somewhat stronger for total 
unemployment (TOTALUN—unemployment 
for all age-groups), displayed in columns 3 
and 4 of Table 4. Seven of the eight 
coefficients are positive and statistically 
significant. For young adults the results for 
TOTALUN are much stronger than the results 


12 Measures used in this study differ in impor- 
tant ways from unemployment measures used in 
prior research. Most studies have used annual 
summary statistics on unemployment, while the 
labor market data here are drawn from a single 
month, March. Taken in the middle of the school 
year, these data may well provide more sensitive 
indices of economic marginality for young people 
than is possible with annual statistics which 
include the summer employment and unemploy- 
ment of groups that may be more economically 
advantaged than those employed or seeking 
employment during the school year. The measures 
used here also differ by virtue of being based on 
the entire age-specific population, rather than just 
on the labor force (results are weaker for juveniles 
when labor-force-based measures are substituted). 


found for age-specific unemployment. 13 How- 
ever, as is the case with UNEMP, exclusion 
of the other underemployment measures from 
the equation results in substantial reductions 
in R-square (statistically significant in all but 
one case). 

The results for lagged unemployment 
(LAGUNEMP) and current unemployment 
(UNEMP) are displayed in the last four 
columns of Table 4 (the other underemploy- 
ment variables were excluded from the 
equations, but the control variables were 
retained). Cantor and Land (1985) make a 
strong case for including both lagged and 
contemporaneous unemployment in the model, 
arguing that unemployment may initially act 
to suppress crime by reducing criminal 
opportunity (more people at home to serve as 
effective guardians, reduced demand for 
illegal as well as legal goods and services), 
and that this suppressor effect of unemploy- 
ment on criminal opportunity is likely to be 
contemporaneous. Conversely, they argue 
that effects on criminal motivation are likely 
to be lagged, until unemployment compensa- 
tion and other sources of support no longer 
serve to mitigate the economic effects of 
unemployment. 

Following Cantor and Land, the regres- 
sions were run including lagged unemploy- 
ment (LAGUNEMP) along with contempora- 
neous unemployment (UNEMP), but excluding 
the other underemployment variables.!^ The 
findings provide only limited support for the 
Cantor/Land argument. If contemporaneous 
unemployment is an indicator of criminal 


15 The age-specific measure for young adults 
may include much unemployment that is at least 
semivoluntary (see Osterman 1980 for a discussion 
of “moratorium behavior" in this age-group). 

14 Results obtained when the other underemploy- 
ment variables are also included do not differ 
greatly from the results reported here (the positive 
effects for LAGUNEMP are somewhat attenu- 
ated). 
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Me eee ex nas fes 
opportunity, and.lagged unemployment an 
indicator of criminal motivation, then nega- 
tive: coefficients would be expected for 
UNEMP, . and positive coefficients would be 
expected for. LAGUNEMP. 

Results , come closest to the predicted 
pattern : for your adults. For this age-group, 
three : of the, four UNEMP coefficients are 
negative: but weak. The coefficients for 
LAG jUNEMP . are uniformly positive and 
- statistically significant for three of the four. 
Fort the. offense categories of larceny and auto 
theft, the contemporaneous/lagged unemploy- 
ment..r model, produces a slight increase in 
R-Square: (.01 to .03) in comparison to the 
model using.the four underemployment vari- 
ables, but.a comparable reduction in R-Square 
for- the, categories of robbery and burglary. 
For. juveniles, the results are contrary to the 
Cantor/Land argument: UNEMP coefficients 
uiid positive (although weak in 


Ga 


AC coefficients are negative; and 
Ey is. Ieduced for all four offense 
categories (by: .02 to. 10). 

To sucum ummarize,. results of our analysis show 
dee, that, the. effects of labor force margin- 
ality on rates, of offending among juveniles 
and young adults are more complex than can 
be captured in any single labor market 
indicator such as total yearly unemployment. '5 
Past analyses of the effects of unemployment 
on crime have produced inconclusive results 
subject to varying interpretation (Freeman 
1983). Our use of four dimensions of 
underemployment has provided a richer 
framework for exploring the effects of labor 
force marginality on crime rates and the way 
in which these effects differ by age. First, we 
found that the availability of employment has 
strong aggregate effects on rates of juvenile 
offending: states with higher levels of full- 
time work for juveniles have lower delin- 
quency rates for property crimes, while states 
with high levels of juvenile unemployment 
have higher delinquency rates. Second, we 


15 One yearly summary measure investigated in 
this research (the percent experiencing “any 
unemployment” during the year) produced very 
weak and contradictory results. Yearly unemploy- 
ment for juvenile and young adult age-groups 
includes summer job-seeking by relatively affluent 
groups, which may well obscure the employment 
problems of groups at greater risk. 
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found thatthe, quality of employment has 
strong aggregate effects on young adult 
offending: .states in which a high proportion 
of young-adult employment is characterized 
by Jow;hours and/or low pay have high rates 
of young adult property crime. 


ia 
IMPLICATIONS 


Our findings present thorny problems for 
theory and research on the employment/crime 
relationship. Most importantly, the relation- 
ship of employment variables to arrest rates 
for property crimes varies by age-group,- by 
type of crime, and by type of labor. market 
indicator. These findings are consistent with 
our expectations (as expressed above), and 
they also reinforce the speculation of some 
writers (e.g., Radzinowicz 1939; Chiricos 
1987) concerning subgroup variation in the 
relationship between labor force marginality 
and crime. 

It is, important to understand how the 
effects. vary for juveniles and young adults. 
Since these two age-groups have the highest 
rates of offending for index crimes, factors 
that affect rates for these ages will have an 
important impact on total rates. For juveniles, 
the results reported here suggest that struc- 
tural factors are quite important in "tipping 
the balance," with widespread unemployment 
heightening the inducements for a larger 
proportion of the juvenile population to lean 
toward delinquency, and the availability of 
full-time employment increasing the induce- 
ments toward conformity. 

Over the life-course, it is important to 
understand how labor market experiences of 
juveniles affect their life-chances as adults, 
and how the availability of work at this stage 
in life may affect the ease of transition to 
adulthood. DiPrete (1981) reports that early 
employment instability has lifelong conse- 
quences for employment quality and employ- 
ment tenure. Ethnographic research of Sulli- 
van (1983) confirms the central role of 
juvenile employment/unemployment in urban 
areas. His work with street gangs indicates 
that as youths enter their late teens, the 
availability of employment is one of the 
important factors assisting youths to “mature 
out" of delinquency, while the unavailability 
of work may increase the pressures toward 
illegitimate alternatives. 

In terms of the ecological relationship 
between labor markets and crime, the more 
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the youth labor market is characterized by 
adequate opportunities for work, the smaller 
the proportion of young people that may be 
susceptible to peer subcultures and to the 
attractions of illegitimate alternatives to the 
labor market. As young people move into 
adulthood, the larger the proportion who have 
had delinquent experiences and have been 
officially sanctioned, the larger the proportion 
that will have added handicaps in the 
legitimate world of work, coupled with added 
opportunities and inducements toward illegit- 
imate markets, particularly if labor market 
opportunities for young adults are character- 
ized by low pay, low hours, and limited 
benefits and chances for advancement. Thus, 
possible reciprocal effects of crime on the 
labor market should also be explored (see 
Thornberry and Christenson 1984). 

For the young adult age-group, our data 
suggest that the quality of employment 
becomes more important than the employ- 
ment/unemployment ratio which has such 
strong effects for juveniles. This is reflected 
in the strong positive effects for LOWHOUR 
and LOWWAGE and in the relatively weak 
results for UNEMP and SUBUNEMP. For 
adult males the marginal character of low- 
wage and involuntary part-time employment 
may be singularly lacking in satisfaction. 
Such employment is likely to be inadequate 
for self-support, let alone support of a family. 
Furthermore, such employment seldom pro- 
vides health insurance, retirement funds, or 
other fringe benefits, or even unemployment 
compensation in case of layoff. Adult males 
forced to rely on such employment are likely 
to be among the “hard core” individuals 
difficult to employ at all. Such employment is 
less likely to nurture the sort of “stakes in 
conformity” that one would expect to be 
associated with most forms of employment. 
Thus, at the aggregate level, when labor 
market opportunities are dominated by very 
marginal sorts of employment, a smaller 
proportion of these young adults with records 
of delinquency will find incentives to “put 
away childish things” (including delinquent 
behavior), and a larger proportion may find 
incentives to progress further into a lifestyle 
that deviates increasingly from the main- 
stream. 

Our findings for juveniles and young adults 
are consistent with a number of theoretical 
interpretations. Anomie theory has been the 
perspective most commonly brought to bear 
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upon the employment/crime issue. One inter- 
pretation consistent with the anomie perspec- 
tive is that underemployment produces frus- 
tration and thereby promotes the emergence 
of criminal motivations. Such an etiology 
suggests that underemployment can be ex- 
pected to produce direct effects upon crime 
rates, such as the effects reported here using 
arrest rates as the dependent variable. A 
second interpretation can also be derived from 
the anomie perspective: high levels of under- 
employment, and the accompanying high 
levels of anomie, may undermine normative 
structures and thus weaken the capacity of 
communities both to guide the behaviors of 
young people and to mobilize themselves 
against crime. Such an interpretation suggests 
that underemployment may also have indirect 
effects on crime rates by reducing formal and 
informal community controls (McGahey 
1986), by undermining family . stability 
(Sampson 1987), by fostering subcultural 
adaptations which promote deviant lifestyles 
(Cloward and Ohlin 1960), and by altering 
patterns of criminal opportunity (Cantor and 
Land 1985) and routine activities (Cohen and 
Felson 1979). Subsequent studies therefore 
need to examine not just the direct effects of 
underemployment on youth crime but also its 
indirect effects and the reciprocal effects of 
crime on underemployment. 

The present investigation should also be 
extended in two other important ways: (1) the 
linkage between employment marginality and 
crime should be examined among other 
population subgroups and among smaller or 
more homogeneous ecological units; (2) the 
search needs to identify those measures of 
labor market risk that can best predict crime 
and other dysfunctional consequences ' of 
economic marginality. 

First, differentials in economic marginality 
may hold clues to differences in crime rates 
among demographic subgroups. High rates of 
underemployment may help to explain why 
black rates of crime are comparatively high, 
or why urban rates of crime are higher than 
those of suburbs. Research on race differ- 
ences in underemployment and crime will 
need to examine not only direct effects but 
indirect effects of underemployment on fam- 
ily instability and on diminished community 
controls more generally. Fully one-third of 
inner-city blacks are jobless, unable to find 
full-time jobs, or unable to earn enough 
money to raise themselves significantly above 
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the poverty threshold (Lichter 1988; Wilson 
1987). The deleterious consequences of such 
extensive economic hardship—including 
crime — need to be fully explored.!¢ 

Second, this study documents the need for 
more sensitive labor market indicators and a 
wider array of such indicators. Future re- 
search should explore still other aspects of the 
labor market, such as duration of employ- 
ment/unemployment, indicators of the exis- 
tence of segmented labor markets (Baron and 
Bielby 1984), and working-class stratification 
(Form 1985). The segmented labor market 
approach would appear to offer especially 
fruitful insights for structural approaches. 
particularly in view of the fact that many of 
those demographic subgroups relegated to the 
secondary sector are also subgroups at higher 
risk for crime. 

With respect to research methods, an 
important implication is that age-specific 
research on labor market effects is not only 
necessary. (Chiricos 1987) but practicable. 
Research that looks only at total societal rates 
will be vulnerable to artifactual results 
dependent upon age composition. This may 
explain why Chiricos (1987), in comparison 
to earlier research, finds stronger relation- 
ships between unemployment and crime 
reported in studies dated 1970 and more 
recently —a period when the youth cohort has 
been large relative to the total population. 
Simply including a control for age composi- 
tion (e.g., percent aged 15-24) will be 
inadequate to permit full explication of labor 
market effects on crime, since the nature of 
those effects varies by age. 

The results of this study also suggest that 
longitudinal research should not necessarily 
be the method of choice in investigating labor 
market effects on crime. If the focus is on the 
Structural determinants of crime, cross- 
sectional research may actually be superior in 
terms of revealing the ecological components. 
Regional variations in levels of both property 
crime and personal crime are well known and 
have persisted over many years. The extent to 
which these variations are linked to persistent 


1$ An earlier phase of the research (Allan 1985) 
explored the effects of race-specific measures and 
generally found stronger effects for minority (black 
+ hispanic) underemployment than for anglo 
underemployment. However, use of race-specific 
dependent variables is needed in order to explore 
this issue more fully. 
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regional differences in labor markets may be 
obscured by longitudinal research based upon 
societal totals or upon single regions. At the 
very least, cohort analyses and panel analyses 
that follow particular populations across time 
should make allowances for age-linked differ- 
ences in etiology. Pooled cross-sectional data, 
such as used in this study, are amenable to 
examination of longitudinal variation through 
cross-lagged design. 

Finally, this study has a number of public 
policy implications. One implication is that 
the unemployment rate— particularly the yearly 
average—is only a crude indicator of labor 
market hardship. For the period of this study, 
unemployment represented only a little over 
half of the total underemployment of young 
people. Moreover, the movement to a service 
economy is apparently increasing various 
dimensions of underemployment among juve- 
niles and young adults (Lichter 1988), 
providing little basis for optimism about the 
control or reduction of property crime among 
these age-groups in the foreseeable future. 
Certainly our findings are consistent with 
Currie’s (1985, p. 140) observation “that the 
grim prospects for decent work that afflict the 
marginal poor in America produce a potent 
breeding-ground for crime; that as long as 
large numbers of people remain trapped in the 
dead end of the shattered labor markets of the 
cities, we will have great difficulty in keeping 
them out of crime—or at work.” 

Indeed, as reported here, the age- 
specification of the effects of employment 
marginality on juvenile and on young adult 
rates of property crime suggests that there are 
no “quick” or “easy” fixes for policy. The 
negative association between low-wage em- 
ployment and juvenile arrest rates might 
appear to imply that a substandard wage for 
juveniles could reduce juvenile crime, since 
for this age-group it is the availability of jobs 
that is most important. However, increased 
availability of cheaper juvenile labor might 
well reduce the supply of adequate jobs for 
young adults, thereby increasing crimino- 
genic forces on an age-group that is involved 
in fewer but more serious offenses than 
juveniles. Apparently, the most viable policy 
is one that produces more jobs for teenagers 
while simultaneously providing better-paying 
jobs for young adults. The importance of job 
quality for young adults should be taken into 
consideration in planning employment and/or 
training programs, and the concentration on 
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menial, low-paying employment that has 
typified such programs in the past may 
account in part for the absence of dramatic 
results for programs targeted at offenders 
(Piliavin and Gartner 1984), addicts (Skid- 
more 1984), and disadvantaged youths (May- 
nard 1984). 


CONCLUSION 


The findings of this study suggest that the 
effects of labor market conditions on arrest 
rates vary in important ways by age, by type 
of crime, and by type of employment 
indicator. Among juveniles, the presence or 
absence of employment opportunities for 
those juveniles at risk has an impact on arrest 
rates. For young adults, the quality of 
employment opportunities is more important. 

The employment/crime issue is an exciting 
one not only for sociologically oriented 
criminologists but also for sociologists inter- 
ested more generally in work, in stratifica- 
tion, and in the consequences of urbanization. 
First, work and crime are the principal 
alternatives by which most people generate an 
income. While the value of work and the 
value of crime are independently affected by a 
variety of factors, an understanding of their 
relative interdependence as alternative choices 
has clear theoretical as well as policy 
implications. Second, the labor market/crime 
issue as outlined here represents a long- 
standing tradition in sociology —the tradition 
of focusing on the conditions of ecological 
areas associated with social phenomena such 
as crime. The general thesis of the structural 
approach is that aggregate characteristics of 
areas (e.g., labor market opportunity, commu- 
nity controls, age or race composition) have 
independent effects on rates of behavior that 


are not attributable to the characteristics of. 


individuals. Third, in recent years there has 
been an increase in the availability of data for 
conducting aggregate research, along with 
considerable advances in the statistical manip- 
ulation of large data sets, so that there are 
expanding opportunities for research not only 
on the labor market/crime question but also 
on related ecological issues that are at the 
core of the sociological enterprise. 
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SOCIOLOGY: 
PROSCIENCE OR ANTISCIENCE?* 


RANDALL COLLINS 
University of California-Riverside 


Criticisms of the scientific status of sociology possess some validity when applied 
against narrowly positivist interpretations of sociological methods and metatheory, 
but do not undermine the scientific project of formulating generalized explanatory 
models. (1) Critics allege that sociology has made no lawful findings; but valid 
general principles exist in many areas. (2) Situational interpretation, subjectivity, 
reflexivity, and emergence are alleged to undermine explanatory sociology, but 
these topics themselves can be explained by a widened conception of science that 
allows informal procedures in theorizing aimed at maximizing explanatory 
coherence. (3) The fact that intellectual discourse itself is a historically changeable 
social product does not invalidate cbjective explanatory knowledge. (4) The 
historicist claim that there can be no principles that hold across particular times 
and places is invalid and rests upon a confusion of underlying generative principles 
with the complexities of the empirical surface of history. (5) Metatheoretical 
criticism of the concept of causality does not undermine a sophisticated conception 
of scientific sociology. Sociological knowledge can and does advance, but it 
depends upon building the coherence of theoretical conceptions across different 


areas and methods of research. 


In recent years there have been a variety of 
assaults on the conception of sociology as a 
science. These include the following themes. 
(1) Sociology has failed to produce valid 
findings or lawful generalizations. (2) Deter- 
ministic laws do not exist because social 
action consists of situational interpretation, 
based on human subjectivity, reflexivity, and 
creativeness. (3) We are locked in a world of 
discourse; society itself is a kind of text that 
we read in different ways at different times. 
(4) The foregoing position is often connected 
to historicism, the claim that only historical 
particulars exist, and no general laws can be 
found that apply at all times and places. (5) 
Finally, there are various technical criticisms 
of scientific methods and metatheory, espe- 
cially of the conception of causality. The 
philosophy of science today is in a postposi- 
tivist mode; and a scientifically oriented 
sociology, it is said, is intellectually out of 
date. 

The various criticisms are not necessarily 
united. Some of them include important 
points that contribute to the widening of 





*I am indebted to Paul DiMaggio, Richard 
Campbell, Robert Hanneman, Arthur Stinch- 
combe, and Jonathan Turner for comments on an 
earlier draft of this paper. 
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sociological knowledge. But the overall thrust 
of the criticisms, that sociology has and can 
have no scientific validity, I believe is wrong. 
To be sure, science is not the only valid mode 
of discourse, or of knowledge. Sociology, 
like many other intellectual disciplines, can 
be concerned with empirical descriptions 
including both contemporary social condi- 
tions and historical sequences; it can discuss 
moral issues, propose or excoriate policies for 
practical action, and compare existing condi- 
tions against ideals; its can discuss founda- 
tional, methodological, and other metatheo- 
retical issues. But the core activity that gives 
the field of sociology its intellectual justifica- 
tion is the formulation of generalized explan- 
atory principles, organized into models of the 
underlying processes that generate the social 
world. It is these that determine how 
particular conditions result in particular kinds 
of outcomes. It is these generalized explana- 
tory modes that constitute a science. 

I will attempt to show that none of the 
difficulties raised by the antiscience argu- 
ments prevent sociology from formulating 
valid generalized explanatory modes. We 
have the basic structures of several such 
models already, in areas ranging from micro- 
sociology, through formal organizations, to 
macrosociology: It is not inevitable that 
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sociology must be a scientific failure, nor has 
it failed. Those who attack scientific sociol- 
ogy typically fight against a caricature of 
“positivism” at its narrowest. On the other 
hand, many practitioners of allegedly scien- 
tific sociology have assumed just those 
narrow conceptions of method and substance 
that make them vulnerable to the antipositivist 
attack. ` 

In what follows, I will deal in turn with the 
major criticisms of sociological science but 
also suggest what we can learn from the 
antiscience critiques. The flux of situational 
interactions, human subjectivity and reflexive- 
ness, the dynamism and episodic movement 
of macrostructures, are all part of sociology’s 
subject matter. It has been the merit of some 
of the antiscience positions to bring these to 
our attention, and even to explore their 
pattern; but this exploration has made it 
possible to formulate these processes more 
generally and hence to widen the realm of 
explanatory models which are the core of 
scientific sociology. 


THE ALLEGED FAILURE OF 
SOCIOLOGICAL RESEARCH 


One line of attack dismisses sociology 
because we have no findings. After almost 
100 years of research, we still have come up 
with no valid generalizations, no laws of 
sociology. This criticism is often made by 
outsiders to our discipline; for instance, 
Alasdair MacIntyre (1984) uses it as grounds 
for his argument that there can be no secular, 
nontraditional basis of morality; Alexander 
Rosenberg (1980) argues that since the social 
sciences have, and can have, no laws, any 
sociological explanation must come from a 
sociobiological level of determinism. Sociol- 
ogists themselves sometimes also make the 
same dismissal, usually in the context of a 
discussion of alternative metatheories (e.g., 
Spencer 1987). l 

But the charge that sociology knows 
nothing, that we have no valid generaliza- 
tions, is patently untrue. Let us review a few 
of these, starting from the microlevel and 
moving up to the macrolevel. 

(i.) The longer, more intensely, and more 
exclusively persons interact with each other, 
the more that they will identify with one 
another as a group, and the more pressure 
they will exert and feel for conforming to 
local patterns of behavior and belief, pro- 
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vided that they are not unequals in power or 
competitors for scarce resources. Variations 
on this principle have been formulated 
numerous times, based on a wide range of 
research. Homans (1950) stated the basic 
point, drawing upon studies of informal 
groups in industrial settings, as well as 
anthropological research and experimental 
small groups. Durkheim ([1912] 1954), in 
analyzing the central dynamic of religious 
rituals, seized on the similar principle that 
intensely focused interaction produces moral 
solidarity and conformity to the group’s 
symbols; Goffman (1967 pp. 1-136) extended 
this model to sociable conversations. The 
famous Asch (1951) experiments demon- 
strated the effect of group cohesion upon 
pressures for conformity even in visual 
perception. Symbolic interactionist theory 
converges on the same point: if a person’s 
concepts are derived from the stance of a 
generalized other based on his or her social 
experience, then what individuals think must 
be influenced by their patterns of interaction. 
Research on self-conceptions (M. Rosenberg 
1979; R.’Turner 1978) can be regarded as a 
variant on this principle, demonstrating the 
influence of group membership and solidarity 
upon beliefs about oneself; in another appli- 
cation, expectation states research (Berger, 
Wagner, and Zelditch 1983) shows the effects 
of group pressures upon task performance (as 
did W.F. Whyte’s famous Street Corner 
Society, 1943). Studies of networks (Bott 
1971) provide the equivalent formulation: 
network cohesion results in homogeneous 
attitudes. The coherence among these various 
kinds of theory and research constitutes 
strong evidence that the interaction- 
density/solidarity/conformity principles are 
true. 

Gi.) Human cognitive capacity is limited; . 
accordingly, the more complex or uncertain a 
situation, the more that participants fall back 
upon a taken-for-granted routine and focus 
on the particular area that presents the most 
dramatic problems. There has been a great 
deal of convergence on this principle from 
very different points of view. Herbert Simon 
presented this as the principle of “bounded 
rationality,” which explains why members of 
organizations engage in “satisficing” in most 
areas while troubleshooting the most pressing 
problem at any given time (Simon 1957; 
March and Simon 1958, pp. 173-71). From a 
very different angle, Garfinkel’s (1967) 
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ethnomethodological experiments demon- 
strate’ that individuals cannot cope with the 
full complexity of social arrangements and 
their justifications (especially since questions 
about these justifications are in principle 
endless); accordingly, people actively resist 
whenever they are forced to question more 
than a very few taken-for-granted routines at 
once. Corroborating evidence comes from 
experimental research on judgments under 
uncertainty (Kahneman, Slovic, and Tversky 
1982). 

There are many ramifications of this 
general insight regarding human strategies for 
operating with bounded rationality in a 
complex world; one might say it is one of the 
common themes of the social sciences in the 
late 20th century. It affects our efforts to 
construct models of human cognitive process- 
ing. It has implications for understanding the 
nature of organizations and of the social 
shaping of markets (Williamson 1975; White 
1981). It explains why a source of power 
within organizations and occupations is occu- 
pying a position that has access to a crucial 
area of uncertainty, whose occupants are able 
to define to the rest of the organization what 
kind of nonroutine reality they are facing 
(Crozier 1964; Wilensky 1964).! The cogni- 
tive limitation principle also implies that 
change on the macrolevel would follow a 
pattern of long periods of routinization broken 
by sudden episodes of restructuring. I would 
suggest, in this light, that the microprinciple 
of. cognitive limitations is implicated in 
Perrow’s (1967, 1984) macromodel of orga- 
nizational systems, in which the combination 
of nonlinearity of organizational processes 
and tight coupling among them results in 
episodic “system accidents.” 

Gii.) On the macrolevel of the state, an 
important principle is: A political crisis arises 


! Power can also be based on resource depen- 
dence in a network structure (Cook, Emerson, 
Gillmore, and Yamagishi 1983; Willer 1987); and 
on coercion, exercised with varying degrees of 
effectiveness in different network structures (Willer 
1987; Schelling 1962). Power also depends on the 
organizational distribution of control resources 
(Etzioni 1975) and on the conditions of mobiliza- 
tion and conflict among opposing groups, both 
civilian (Tilly 1978) and military (Collins 1988b). 
Power- is a complex phenomenon; we have made 
progress on a series of partial theories, without yet 
pulling them all together. 
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when a state's apparatus of military control is 
broken down by internal dissension among 
elites: this breakdown is especially likely 
when there is military defeat and/or the 
economic strain of long-term military ex- 
penses beyond the organizational capacity of 
the state to collect revenues. This principle, 
as stated is limited: it tells us when a revolt 
will break out, not who will win, nor what 
kind of social transformation, if any, will 
follow. This pattern has many variations: 
when the factors causing breakdown occur in 
a centralized state and coincide with popular 
class conflict mobilized by changing property 
relations, the result is a major social transfor- 
mation, a "revolution" in the fullest sense of 
the term (Skocpol 1979). The interaction of 
demographic growth, money supply, end 
inflation can be a prior determinant of the 
state's fiscal crisis (Goldstone 1986, 1987); 
the position of a state within the world system 
affects its ability to bring in revenues ahead of 
its military expenses (Wallerstein 1974, pp. 
133-47); geopolitical patterns determine which 
states will become overextended and hence 
unable to sustain themselves militarily (Col- 
lins 1981b, 1986, pp. 145—209). In particular 
kinds of military/state organization (such as 
most premodern empires), the result of 
geopolitical or fiscal crisis is disintegration 
into smaller coercive units. A more complete 
theory of state crises, both revolutionary and 
nonrevolutionary, would have to take into 
account these kinds of considerations.? But I 
believe we can accept with confidence the 
basic principle of state military/fiscal crisis 
leading to disintegration of the apparatus of 
coercive control, and that in turn leading to a 
revolt of subordinates. 

My purpose in citing these principles is 
merely to disprove the argument that sociol- 
ogy knows nothing, and hence that a social 
science is impossible. I have not attempted to 
pick out our most important principles, to set 





? A fiscal and/or military crisis is not the only 
route to the dissension among elites that leads to 
the disintegration of the coercive apparatus. The 
fiscal/military crisis theory is not a complete theory 
of all revolutions and other revolts; but it appears 
to be true, as far as it goes, and it covers a very 
important portion of events. As Paul DiMaggio 
points out (personal communication), this theory is 
related to a more abstract explanatory principle, 
applicable in many contexts, about the disintegra- 
tion of an organizational system. 
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forth a systematic theory, or to assess the 
overall state of our knowledge in sociology 
(an effort to do so is in Collins 1988). Hence 
these principles may look eclectic, lacking the 
elegance of an overarching image of the 
social world. But I have given some hints 
how these principles, although they are 
chosen almost at random from different parts 
of the field, may be coherent with one 
another, and I have suggested that such 
principles are not trivial but lead to sociolog- 
ical insights into a wide range of important 
questions. There are a good many other 
principles of this sort, especially in organiza- 
tion theory but also in other areas of 
sociology. There is of course plenty of room 
for improvement in our precision and in our 
understanding of the scope of these theories, 
but enough evidence has built up that we can 
be confident we have good approximations of 
how some important processes work. Many 
different social scientists have contributed to 
this knowledge; we do have a core we can 
build upon, and a discipline we can be proud 
of. 

Is there anything salutary in the criticism 
that sociology knows nothing? It is not true, 
but it should serve to remind us that sociology 
has a serious problem in professional self- 
presentation. And within our own ranks, we 
need to pay more attention to the explicit 
cumulation of what we do know. 


SITUATIONAL AND REFLEXIVE 
INDETERMINISM 


It is sometimes held that deterministic expla- 
nations are impossible because social action 
consists of situational interpretation, subjec- 
tivity, reflexivity, and emergence. This is an 
old criticism, going back at least to Dilthey’s 
distinction between the Geisteswissenschaften 
and the Naturwissenschaften, and ultimately 
to the German Idealists’ revolt against the 
Enlightenment. In recent years this line of 
criticism has become very prominent, so 
much so that one might characterize the late 
20th century as a time of neo-Idealist revival. 

It is important to recognize that subjectiv- 
istic and interpretive schools of thought in 
recent sociology have made positive contribu- 
tions to sociological knowledge. On the 
methodological side, these approaches have 
fostered microresearch in naturalistic settings, 
empathetically entering into the processes, 
feelings, and thoughts of real people as they 
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enact society. Such work includes the partic- 
ipant observation practiced by symbolic 
interactionists, as well as Goffman’s efforts to 
map out the nature of everyday life. Without 
such research, we would be left studying the 
fundamental realities of sociology’s subject 
matter only indirectly, at methodological 
arm’s length. In the last few decades, there 
have been further innovations in microre- 
search, including ethnomethodologists’ breach- 
ing experiments, and culminating in perhaps 
the closest empirical analysis ever done in the 
social sciences, using audio and video 
recordings of natural interactions as a basis 
for developing formal models in conversation 
analysis. 

Most of this work would have been ruled 
out by textbook canons of research 30 years 
ago. The sense of alienation form the 
sociological “Establishment” felt by many 
interpretive sociologists is no doubt due to 
memories of having lived through that time. 
The vituperation against “positivism” is 
partly the expression of an oppressed intellec- 
tual minority against their long-standing 
oppressors after finally gaining a foothold in 
respectability. 

But we need not assume that all connection 
between interpretive and scientific sociology 
is now severed, and that microresearch with 
interpretive methods should be institutional- 
ized as a kind of “separate but equal” 
enclave. On the contrary, the achievement of 
the interpretive microsociologists should 
broaden our sense of acceptable methods for 
sociological science. Clearly, a scientific 
method for our field cannot rule out studies of 
the subjective; sociological science cannot be 
founded on an exclusionary behaviorism 
(although we should avoid going to the 
opposite extreme of ruling out the importance 
of behavior, including unconscious behavior). 
A science does not have to be built out of 
“hard data” in the narrow sense. What makes 
it scientific is its ability to explain the 
conditions under which one kind of pattern 
holds rather than another, in whatever realm 
those patterns may be found. 

Similarly, sociological science cannot be 
equated to a rigid operationalization of all of 
its concepts. Not only is it legitimate for 
explanatory theories to include nonopera- 
tional concepts at some levels; even a very 
positivistic model must include general orient- 
ing concepts within which its specific hypoth- 
eses and operationalized variables are located. 
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We always need a model of what the world is 
like, a mental picture of what are the 
fundamental processes and entities and how 
they link together (see J. Turner 1988; Willer 
1987). Specific hypotheses make sense only 
in terms of some such background assump- 
tions about what kind of world we are dealing 
with. The narrower traditional forms of 
positivism, in demanding total operationaliza- 
tion of all concepts, merely took for granted 
an unconscious conception of the world in 
which its explicit hypotheses were lodged. 
Such researchers could easily lock themselves 
into commonsensical or ideological assump- 
tions, which their focus on the foreground of 
research techniques kept them from seeing. 
The interpretive sociologies do us a service in 
forcing the issue, so that we are made 
explicitly aware of those background models 
and thus able to theorize them. 


The Place of Informal Concepts and 
Intuitions in Theory 


The notion of a complete and rigid formaliza- 
tion, operationalization, and measurement of 
everything in a scientific theory is a chimera. 
There are informal concepts and intuitive 
leaps at several points. There is always a 
metatheoretical stance about what we are 
doing intellectually in the first place. Scien- 
tific theory sketches a model of the aspect of 
the world under consideration; hypotheses are 
derived from this, by a process of derivation 
that itself involves intuitive leaps. When 
operationalizing concepts for empirical test, 
we always make another intuitive leap in 
deciding that particular measurements or 
other observations actually bear upon the 
theory. These intuitive or informal leaps are 
areas in which theoretical discussions can 
take place (or, in many cases, ought to take 
place). But they are not illegitimate. That is 
simply the way the world is. They do not 
undermine our ability to have a science, for 
all sciences have these places where there are 
intuitive leaps. If physical scientists some- 
times forget this and talk in crude positivistic 
terms as if they report “nothing but the 
facts,” that is because they have been 
successful at making the right intuitive leaps 
as their scientific procedures have cumulated, 
so that they now have workable models that 
they know intuitively how to apply to most of 
the things they study. 

Everything whatsoever has “fuzzy edges,” 


AMERICAN SOCIOLOGICAL REVIEW 


so to speak; even numbers and logical 
relations have some areas of indeterminacy. 
We encounter this when number systems are 
extended to infinity or to infinitesimality, and 
in nonconverging algebraic series. Many 
systems of equations are unsolvable mathemat- 
ically. Even rather restricted formal systems 
of logic always encounter Gódelian incom- 
pleteness; more complex systems of multi- 
valued, modal, and other nonclassical logics 
have even greater areas of disagreement in 
interpretation (Lewis 1986). I have expressed 
this elsewhere (Collins 1988, Appendix A) in 
the following formula: mathematics is always 
embedded in words. But notice what conclu- 
sion follows: not that mathematics and 
mathematical science is impossible; on the 
contrary, a successful science is possible even 
incorporating areas of fundamental uncer- 
tainty, dealt with by tacit and informal 
understandings. Tacit knowledge is knowl- 
edge too, as long as it works. 

Whatever our kinds of explanatory models, 
we still need to be concerned about validating 
our theories. The fact that we are always 
involved in interpretations (and at many 
levels) does not mean that we can accept 
every interpretation offered at face value. 
Typically we cannot decide these issues by a 
simple operationalization, measurement, and 
one-shot test. But the physical sciences face 
most of these same problems, and their 
success in many areas shows that some 
research programs and theoretical models do 
prove themselves in the long run over rival 
ones; there can be convergence on models 
that work, that capture the central ways in 
which the world is, even if these scientific 
models are inevitably vague and cluttered 
around the edges. Successful heuristics and 
intuitions are possible, and unsuccessful ones 
which lead us into blind alleys can be ruled 
out. 

The crucial criterion is that the best theory 
(with its ancillary assumptions and heuristics) 
is that which maximizes coherence; it brings 
together the most successful explanatory 
models into a consistent overall picture of 
how the world operates. Methodologically, 
empiricism can be part of the coherence 
criterion; the best validated theory is the one 
maximally grounded to the empirical world 
via the various explanatory submodels it 
incorporates. An extreme, all-or-nothing em- 
piricism is impossible; but a flexible empiri- 
cism, working with imprecisions and intuitive 
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concepts where necessary, and making a great 
deal of room for theoretical work that ties 
things together, is a central part of science. 
One needs to work nonpositivistically, so to 
speak, to be a successful positivist. 

It is in just this vein that the interpretive 
schools have contributed substantively impor- 
tant theories. Among these are the symbolic 
interactionist theory of the self (part of which 
is coherent with well-established principle [i.] 
mentioned above); the ethnomethodological 
theory of everyday rationality (which is 
coherent with the limited cognition principle, 
[ii.] above); as well as other existing and 
potential contributions to sociological knowl- 
edge. Goffman’s (1959) model of dramaturgy 
in everyday life is a model, in the sense of 
“what the world is fundamentally like” that I 
mentioned above; from this base, one can go 
on to develop specific explanatory principles. 
I have argued, for instance, that it provides a 
base for understanding the differences in class 
cultures, between those wielding power and 
those subjected to power (Collins 1988, pp. 
203-14). 

Many sociologists in the interpretive camp, 
however, claim that their major substantive 
finding is the impossibility of deterministic 
theories (e.g., Blumer 1969, p. 60). In their 
empirical investigations, they discover, above 
all, emergence, unpredictability, situational- 
ity, human capabilities for subjectively reflect- 
ing upon and changing social conditions. 
Here we have an argument about what kind of 
model we arrive at, not about whether it is 
possible to have any model at ail. 

But is it true that the main feature of the 
social world is unpredictability, overwhelm- 
ing any determinate processes? I suggest that 
it is not true, and that this perception comes 
from selectively focusing on a limited portion 
of the social world. Although much (but not 
all) content of sociology consists of human 
subjectivity, it does not necessarily follow 
that this cognition and feeling are indetermi- 
nate. Without pursuing this point into the 
status of theories of cognition and emotion, 
let us think of Goffman, the acknowledged 
genius of micro-interpretive sociology. Goff- 
man used soft methods, but he believed the 
world he was studying is hard. His social 
theory of language (Goffman 1981) grounds 
cognition in the social ecology of interaction. 
The complexity and reflexiveness of human 
subjective worlds come from the many 
possible "reframings" that actors can carry 
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out (Goffman 1974), but Goffman did not 
regard this framing activity as free-floating, 
and he rejected the suggestion that it reduced 
the world to a kind of psychedelic fantasy. 
Transformative reinterpretations of subjective 
reality are linked together into orderly trans- 
formations, among “adjacent” frames, so to 
speak. For Goffman, the bottom level frame, 
out of which all the others arise, is the 
physical interaction of human animal bodies, 
an ecological baseline that links Goffman 
theoretically to Durkheimian theory of the 
ritual basis of solidarity and of symbol- 
construction (for elaboration, see Collins 
1988, pp. 188-203, 291-97, 320-34). 

It is possible, then, to have a structured 
understanding of subjectivity. In that perspec- 
tive, the explorations of the subjective side of 
human life in the last few decades, notwith- 
standing their sometimes extreme pronounce- 
ments, have contributed elements towards a 
much more sophisticated explanatory theory 
of mind than would have been possible 
before. 

How Unpredictable is the Social World? 
Let us confront the issue of unpredictability 
head on. How much of the social world is 
unpredictable? A great many things are highly 
predictable. It is the pattern of people going 
to work over and over again to the same jobs 
that makes up much of formal organizations; 
repetitive patterns make up households and 
families; networks of friends and acquaint- 
ances similarly are constituted out of behav- 
iors, cognitions, emotions, and communica- 
tions that are heavily patterned. For regularity 
and predictability to exist, it need not even be 
the same persons who repeatedly interact. 
Most stores have only episodic relationships 
with particular customers, but it is the 
predictability of a certain number of people 
coming in to shop that allows businesses to 
stay open at all. Although microsociology is 
the theoretical bastion of indeterminacy, it 
should be apparent from these examples of 
everyday life that the microlevel has a high 
degree of predictability. 

The theory of indeterminacy seems to rest 
upon two suppositions. One is that this kind of 
predictability is banal. It is true, but it is too 
boring for sociologists to pay attention to it, 
we should focus on something that everyone 
does not already know. So one might say there 
is a built-in bias toward studying the dramatic 
and unpredictable. But I would deny that what 
is banal from a participant’s point of view is 
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necessarily banal for an explanatory theory. 
On the microlevel, Garfinkel took the banality 
of everyday life as a product to be explained, 
and uncovered cognitive mechanisms that al- 
low it to happen—and that show us the pres- 
sure points when these mechanisms are inter- 
fered with. On the macro- and mesolevels, 
good sociological work consists in reframing 
the banality of taken-for-granted patterns. Al- 
though it seems natural to a particular person 
that she or he does a job and chats with friends 
every day, there is much that sociologists have 
uncovered as to why jobs are structured in this 
way rather than that, why these persons are 
friends rather than others, and so forth; these 
are the contents of, for example, organiza- 
tional theory, exchange and network theories, 
and stratification theory. 

The other supposition leading us toward 
theoretical indeterminacy is more valid. It is 
the recognition that situations can sometimes 
change very rapidly: that there are negotia- 
tions, conflicts, sudden insights, decisions, 
and, on the macrolevel, movements, revolts, 
and revolutions. All this is true. But do we 
take this as the end of analysis, or as a 
beginning point, a challenge to develop 
theories to explain when such sudden shifts 
will occur? I have already noted that on the 
macrolevel, we know some of the crucial 
features that make revolutions predictable. On 
the microlevel, indeterminacy is typically 
grounded in some version of the Thomas 
Theorem. But if situations are determined by 
subjective definitions, we can still ask what 
determines what the definition of the situation 
will be.? What sometimes makes situations 
seem to have an unpredictable, emergent 
quality is that one looks at them from the 
point of view of a single actor who knows 
only his or her own intentions. But if we 
know enough about all the actors in the 
situation, and the structure of their interac- 


? The findings of ethnomethodological research 
do not support the notion of a great deal of sudden 
emergence and reinterpretation. Clegg (1975), for 
instance, who set out to study a construction firm 
in microdetail armed with tape recorder, soon 
found that the banality of everyday repetitiveness 
was overwhelming, and he had to shift to conflict 
points in management to find more dramatic 
material. Ethnomethodological theory proposes 
that making everyday life into a routine is the basic 
process, and that people attempt to avoid and gloss 
over disruptions as much as possible. 
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tions, the emergent events often look highly 
patterned. The belief in indeterminism here is 
a by-product of individual reductionism. If 
we go to a genuinely interactive level, it is 
possible to specify the combination of ingre- 
dients that go into the situation and determine 
its outcomes. (Such models of chains of 
interactions are proposed by Heise 1979, 
1987, and by Collins 1981a.) 

Does Knowing a Sociological Law Reflex- 
ively Overturn It? Finally, there is the argu- 
ment that whatever laws sociologists may dis- 
cover will be reflexively overturned. As soon 
as people know what these laws are, they can 
act to make them false. But although this 
sounds plausible in the abstract, it is difficult 
to think of many cases in which it actually 
applies. Perhaps the Thomas Theorem itself is 
the prime example of a principle that can be 
reflexively undermined. We have been most 
interested in this as a theory of prejudice, of 
the self-perpetuating nature of biases against 
particular categories of persons. By becoming 
aware of these biases, the liberal public in the 
United States has made efforts to counteract 
the bias. But does this actually violate the 
Thomas Theorem? Instead, we seem to be at- 
tempting to avoid the circumstances under 
which the theorem gets started in a negative 
direction; we avoid giving negative definitions 
of persons, in the hopes that the consequences 
will be a positive self-fulfilling prophecy in- 
stead of a negative one. 

Determinative principles on the macrolevel 
look rather difficult to evade. For instance, if 
military defeat or fiscal crisis leading to 
breakdown of the coercive apparatus fosters 
revolutionary conflict, one can hardly prevent 
this simply by being aware of it; the best a 
government could do would be to try to avoid 
this principle's coming into operation, by 
avoiding circumstances that lead to military 
or fiscal crisis. Reflexivity may enable people 
to try to change the distribution of indepen- 
dent variables, but not the relationships 
between independent and dependent vari- 
ables. Similarly, principles of formal organi- 
zational structure give information as to what 
people can work around, but hardly what they 
can fly in the teeth of. Even on the 
microlevel, where an individual would seem 
most capable of reflexively changing an 
outcome, I would suggest that when individ- 
uals actually control outcomes they are doing 
it by applying microsociological laws, not 
going against them. For instance, when 
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people deliberately go into an encounter 
group or similar group-dynamics situation, 
they are implicitly making use of the 
microprinciple ([i], above) regarding how 
group solidarity is generated, because they 
want its emotional payoff. Their mistake 
typically is to overestimate how long such 
solidarity and emotional energy will last, after 
a temporary group of this kind is disassem- 
bled. Knowing the principle does not under- 
mine it.^ 

It would be rash to predict that sociological 
science will explain everything. There may 
well be a considerable residue of indetermin- 
ism even if sociology is successful far into the 
future. But our intellectual impetus comes 
from pushing back the frontier. To preach 
indeterminism and nothing further seems to 
me a parasitical strategy, since it is intellectu- 
ally of interest only if there is some body of 
theory already existing that one wishes to 
counter. The constructive job is to build as 
good an explanatory theory as we can. 

I have tried to establish that a science can 
operate flexibly, and on all kinds of subject 
matters. Against this backdrop, I will com- 
ment more briefly on the remaining criticisms 
of sociological science. 


SOCIETY AS DISCOURSE 


Another criticism goes as follows. We are 
locked in a world of discourse; society itself 
is nothing more than a kind of text that we 
read in different ways at different times. This 
is a popular theme now, deriving from French 
structuralism and its offshoots. It has created 
a veritable revolution in the world of literary 
criticism. This is understandable as a profes- 
sional ideology, elevating literary theorists’ 
own field by the assertion that all reality is a 
piece of literature. “Discourse” has thereby 
won a wide influence in the larger intellectual 
work (including the publishing business). It 
fits well especially with the particularistic, 
descriptive themes of anthropology and also 





^]t is sometimes said that knowing too much 
about how social relationships operate makes them 
go flat. Can an exchange theorist or someone who 
applies Durkheimian and Goffmanian ritual theory 
fall in love? Does the theoretical self- 
consciousness destroy the situation? I can attest to 
you it does not; robust social processes have a 
quite wonderful power, overriding a weaker 
process like momentary reflectiveness. 
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makes inroads into the cosmopolitan side of 
sociology (e.g., Brown 1987; Gottdiener 
1985). 

But the upsurge of sociological research on 
culture is not always merely relativistic. It has 
also been carried forward in a deterministic 
mode: we have rather good researchers and 
theories about the material and organizational 
base of the production of culture, both as it is 
distributed among social classes, as well as in 
more specialized culture-producing institu- 
tions (Bourdieu 1984; DiMaggio and Useem 
1982; Coser, Kadushin, and Powell 1982). 
Culture does not simple organize itself; it is 
organized by social processes. 

Foucault (1969), the most significant of the 
French "discourse" school, has documented 
the social bases of ideas in such practical 
fields as psychiatry and law. But Foucauit 
does not attempt to undermine the validity of 
his own discourse as a historian. Moreover, 
the historical patterns that he emphasizes as 
determinative of the field of discourse —the 
bureaucratization and specialization of social 
control agencies, the shift in public/private 
boundaries, the move from ritual punishments 
to a reflexively conscious self—are highly 
congruent with Weberian (Weber [1915]1946) 
and Durkheimian/Maussian (Carrithers, Col- 
lins, and Lukes 1985) theories of modernity. 
The best of this European work reinforces 
rather than departs from the central cumula- 
tive traditions of sociological knowledge. 

The intellectual popularity of “discourse” 
as a master worldview is bolstered by the 
success of the sociology of science in 
showing how knowledge is a social construct. 
This success is indeed something to cheer 
about. But we should not forget that the 
sociology of science is an empirical research 
discipline, which has made great strides in the 
last 30 years in cumulating sociological 
models of what determines the kind of 
knowledge produced by particular kinds of 
organizational conditions (see Whitley 1984, 
for recent summary and synthesis). Notice 
what this does for the claim of undermining 
scientific knowledge. There is a determinism 
at the very heart of this alleged indetermin- 
ism; sociology of science itself is becoming a 
scientific success. 

This raises some interesting issues of 
reflexive self-consciousness. Some sociolo- 
gists of science (e.g., the British school 
around Michael Mulkay, Harry Collins, Steve 
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Woolgar, and others; see Knorr-Cetina and 
Mulkay 1983) go so far as to argue that 
science is merely a set of competing power 
claims. The only democratic path is to allow 
no single voice to have a privileged position; 
hence, Mulkay (1985) and others have taken 
to writing and presenting papers in a “New 
Literary Form,” in which the author steps 
aside and lets many competing voices appear. 
The result is entertaining, but it is not clear 
why reflexiveness should prevent scientific 
knowledge. Bloor (1976, 1983), who power- 
fully applies a Durkheimian theory, argues in 
his “strong program” that a sociology of 
science can and should explain not only false 
knowledge claims but also true knowledge. 

From an organizational perspective, the 
power claims in scientific discourse that 
Mulkay and others document are themselves 
part of fairly predictable patterns. Different 
kinds of intellectual discourse (i.e., particular 
scientific disciplines) are embedded in organi- 
zations, and they themselves can be under- 
stood as organizational forms. Hence, organi- 
zational theory (especially the theory of how 
various kinds of task uncertainty and of 
resource dependence affect organizational 
behavior and structure; see Whitley 1984; 
Fuchs and Turner 1986) shows scientific 
discourse is not a free-floating construction 
but appears in predictable ways under given 
circumstances. Furthermore, we know that 
organizational structures are at least partly 
determined by the task environments in which 
they work (Collins 1988, pp. 467-85). This 
means that the objective nature of the subject 
matter is one of the determinants of the social 
activity (including the discourse) that makes 
up science. 

Arguments that exclusively emphasize dis- 
course are one-sided; although there is ja 
culturally constructed component in any 
knowledge, it also can be knowledge ) f 
something. Indeed, any argument about the 
social basis of knowledge is self-undermining 
if it doesn't have some external truth- 
reference as well—otherwise why should we 
believe that this social basis itself exists? W 
need to get beyond polemically one-sided 
epistemologies, of either the subjectivist or 
the objectivist sort; a multidimensional episte- 
mology can take account of the way we live 
in a cultural tunnel of our own history, but 
still we can cumulate objective knowledge 
about the world. 
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Historicism is the claim that only historical 
particulars exist, and no general laws can be 
found that apply at all times and places. This 
argument arises to some extent in conjunction 
with other antipositivist criticisms, as a kind 
of oppositional united front. But it also has its 
autonomous basis in the actual practices of . 
historical sociology. It is a price we are 
paying for a very good thing. The last 20 
years, since the publications of Barrington 
Moore and Charles Tilly in the 1960s, have 
been a golden age for historical sociology. 
We have learned a great deal about macropro- 
cesses by looking at historical materials with 
a sociological eye and by making compari- 
Sons across societies and times. This, for 
instance, is how Moore (1966) showed the 
connection between the forms of capitalist 
agriculture and the differing political struc- 
tures of modern states. But although this 
shows us something about particular states 
(17th-century England, 19th-century United 
States, etc.), the underlying theoretical con- 
ceptions have a more universal application; it 
is for this reason that a family of models 
related to Moore's (Paige 1975; Skocpol 
1979; see also Stinchcombe 1961; Weber 
[1923] 1961, pp. 81-94) have been power- 
fully applied to other times and places. 
Historical sociologists are under two sorts 
of pressure to announce themselves as 
historicists. One is that they come into a good 
deal of contact with historians. Historicism is 
a kind of professional ideology for historians; 
they make their living by describing particu- 
lars, and furthermore the pressures of intellec- 
tual competition within their field lead them 
to specialize and to resent intruders on their 
turf. Hence, historians tend to dislike any 
suggestion that there are general processes, 
and particularly that such processes might be 
uncovered by comparing across times and 
places (i.e., across the boundaries of their 
research specialties). Historians often espouse 


an ideology that would make it impossible to 


consciously develop a general explanatory 
theory; and many historical sociologists 
respond to criticism from historians by giving 
in to their ideology. 

But historians' claims are not consistent. In 
interpreting their particular cases, they implic- 
itly draw upon some ideas of what general 
structures are and how patterns of social 
motivation and change operate. The analysis 
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of historical reality can hardly be approached 
with a tabula rasa; historians have theories 
whether they know it or not. What makes 
someone a great historian, one whose work 
commands widespread intellectual attention, 
is typically his or her ability to create a 
theory, to show the more general skeleton 
underlying the narrative of particulars. Lesser 
historians are usually those who operate with 
naive, taken-for-granted ‘conceptions, or with 
old theories that have passed into common 
discourse. Historians at their best have been 
building sociological theory, although they 
have not always discussed it as such, and 
have typically interwoven it with their 
particular historical description, sometimes 
with considerable artistry and dramatic style. 

I have no sympathy with the blanket claim 
that historical particularism is all that is 
possible; ori the contrary, we cannot even see 
particulars without general concepts. But 
there is a more valid reason why historical 
sociologists tend to work at a low level of 
generality with theories embedded in the 
understanding of a particular range of cases. 
Even if one's aim’ is to develop general 
theory, the macromaterials of long-term 
history are extremely complex. Although we 
may know something of rudimentary pro- 
' cesses, getting any very abstract picture of the 
overall combination of conditions operating in 
historical change is very difficult. Theoreti- 
cally oriented historical sociologists have 
worked with intermediate approximations to a 
level of explanatory generality. For instance, 
Weber's massive historical comparisons of 
the conditions involved in the rise of 
rationalized capitalism have yielded many 
general analytical points, but embedded in an 
account of certain concrete sequences of 
world history. This same halfway combina- 
tion is found in modern work such as that of 
Mann (1986) on the conditions .determining 
the history of social power, of Goldstone 
(1986, 1987) on the state crises and social 
transformations; and I will admit.that my own 
work (Collins 1986), on such matters as the 
extension of Weberian theories of capitalism 
or of sexual stratification, is also: quasi- 
embedded in accounts of particular historical 
developments. Such works are a challenge for 
theorists to attempt to departicularize what we 
have learned, to pull out the more fundamen- 
tal theoretical structures underlying . these 
accounts. 

We wil always have two levels: a 
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theoretical level of abstract and universal 
explanatory principles, and a level of histori- 
cal:particulars. Insofar as our theories are 
successful, we will become better and better 
at showing how the myriad arrangements of 
historical particulars are generated by particu- 
lar combinations of variables in theoretical 
modes. There will always’ be tasks for 
historians, and for historical sociologists, in 
making this kind of concrete interpretation. 
At the same time, exploring concrete history 
is one of the main ways that we make 
progress in building and validating our 
general models — although such theory is built 
and validated by its coherence with all kinds 
of sociological research, V E as 
well as historical. 

It is not true that there are no explanatory 
principles that hold. generally across history: 
The three examples that I gave at the begin- 
ning of this paper are completely coherent, as 
far as I know, with evidence from any histor- : 
ical epoch, and there are many more such prin- 
ciples. Of course, some principles may not be 
applicable because the independent variable. 
does not exist in a given historical situation. 
One cannot predict either the existence or the 
nonexistence of revolutionary crises if there is 
no state. But there is doubtless a more abstract. 
formulation related to principle (iii.) that would 
apply to sources of crises in political power in 
stateless societies. Macroprinciples in general . 
are likely to be more complex than microprin- 
ciples, involving combinations of many pro- 
cesses. But we have at least rudimentary out- 
lines of promising principles on the macrolevel 
as well. It is a mistake in criticizing: the limi- 
tations of particular theories (e.g., in widening 
the reference from allegedly independent socie- 
ties to a world system, or overthrowing unili- . 
near evolutionism or development theory or 
functionalism), to go to conclude that general 
theory is impossible: The result of this critical 
development is not no theory at all, but im-. 
Pes theory. 


THE METATHEORETICAL ATTACK 
ON CAUSALITY. 


Critics of explanatory sociology like to point : 
out that the consensus in the philosophy of 
science has changed since the heyday of 
logical positivism..It is generally acknowl- 
edged that programs such as that. of Carnap, 
which attempted to construct all scientific: 
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knowledge from sensory experience orga- 
nized into statements of formal logic and 
mathematics, have failed. There is no consen- 
sus now on an alternative epistemology for 
science, although most philosophers make 
room for theoretical preconceptions and 
programs, and for pragmatics in both theory 
formulation and research (Quine 1969; Dum- 
mett 1978; Putnam 1983). It would be 
generally agreed that mathematics and formal 
logic are not self-grounding, and there is 
considerable recognition of the role of 
nonformalized statements in any body of 
knowledge. Along with this, there is a 
loosening up in the conception of what 
constitutes knowledge: not merely the ideal of 
classical physics, but widened to include the 
rather different scope of knowledge in the 
biological and earth sciences, history, extend- 
ing perhaps even as far as alternative forms of 
knowledge embodied in art (Goodman 1978). 

What does this mean for sociology? I 
would suggest that it puts sociological science 
more on an even footing, epistemologically 
speaking, with the established natural sci- 
ences. For they too operate under the same 
sort of epistemological imprecisions. Sociol- 
ogy will never be a science fitting the old 
logical positivist ideal, but none of the natural 
Sciences fits that ideal either. We are not 
aiming at the impossible; if we can reach the 
degree of approximate and pragmatic success 
the natural sciences have achieved, that 
would be plenty. It is true that some 
sociologists may continue to uphold a meth- 
odological ideal that is closer to the crude 
induction-plus-mathematical-formalization 
model of science. I would suggest that this is 
particularly likely in the applied end of our 
field, where purely descriptive information 
(e.g., on the success of desegregation pro- 
grams) has some immediate use, and hence 
straightforward induction is more likely to be 
pursued. But this does not affect the larger 
issue of the methods appropriate for building 
a general explanatory science. 

Modern philosophy of science does not de- 
stroy sociological science; it does not say that 
science is impossible, but gives us a more flex- 
ible picture of what a science is. This is all to 
the good in consolidating a science out of the 
materials that sociology already has available. 
A number of the more specific technical criti- 
cisms mounted by the antiscience position in 
sociology seem to me to cling to a narrowly 
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positivist image of their opponents and do not 
touch a more realistic image of science. 

Criticisms of the concept of causality are 
frequently raised in this spirit. Causal theories 
are dismissed on the grounds that there is no 
such thing as “the cause" of anything; there is 
always a complex of conditions, and those in 
turn have antecedant conditions, which can be 
traced backwards and outwards in an endless 
web. Given causes explain something only 
under particular conditions, which are typi- 
cally taken for granted, especially in statisti- 
cal analyses of survey data that attempt to 
causally explain all the variance in their 
particular sample. Some of the attack on 
causality, however, has come from within the 
scientific camp itself (Gibbs 1972) and does 
not dismiss the aim of testable, generalized 
explanatory principles. 

Certain aspects of this debate are merely ter- 
minological. "Cause" is to come extent a met- 
aphor, a shorthand for referring to our focus on 
a particular portion of a complex of conditions 
that are involved in producing certain out- 
comes. Some of these conditions may be con- 
comitant relationships among parts of a social 
structure, as well as antecedent conditions that 
determine which kinds of outcomes will fol- 
low. (See general discussions by Klein 1987; 
Walker 1987; Meeker and Hage 1988. As Wal- 
lace [1987] points out, there is a variety of 
causal patterns — continuous, episodic, multile- 
veled, etc.) But it is important to retain this 
concept, whether under the term "causality" or 
something equivalent, for it enables us to dis- 
tinguish between explanations that work and 
ones that are vacuous. Functionalist analysis, 
for example, has turned out to be a poor mode 
of explanation, unless it can be translated into 
causal mechanisms (see Stinchcombe 1986, pp. 
80-100). One cannot “explain” something by 
giving it a name, even if that name is "norms" 
or "rules" or "culture" —or for that matter a 
*problematique" or "discourse" —any more 
than one can explain gravity by referring to a 
“gravitationous propensity.” Here “causality” 
is useful by giving us a mechanism, which tells 
us what process is operating and when partic- 
ular outcomes can be expected rather than oth- 
ers. “Causality” saves us from reifications, as 
well as from ideological justifications masquer- 
ading under the guise of explanations. 

As we have seen above, an explanatory 
theory has as its core a model of “how that 
part of the world works,” what the parts are 
and how they fit together. Specific causal 
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propositions fit into such a model and are the 
object of empirical testing, but they depend 
upon the background conditions of the whole 
model. Some of the objections to “causal 
theory” in sociology are directed at particular 
kinds of statistical models (e.g., in the status 
attainment literature) built entirely at the level 
of propositions. But although such models 
may be too embedded in a particular body of 
data from a given historical period and fail to 
make explicit the structural conditions that 
frame these processes, that is not to say that 
such causal processes cannot be incorporated 
in a valid theory of the larger social universe 
(see Campbell 1983). 

Steven Turner (1987) voices a more 
specific objection, to causal statements of the 
form “the more the X, the more the Y." He 
argues that such propositions are literally 
untrue unless the correlation is perfect; but 
empirically there are always exceptions, and 
hence such propositions have no logical 
foundation. Turner denies that imperfect 
correlation can be taken as an approximation 
to true causal relations. He holds that there is 
no path, logically, from general propositions 
(which are always idealized and perfect) to 
the messy world of inexact relationships. 
Statistics provides no answer to this underly- 
ing issue. Theory is always underdetermined 
by data, and a wide-open pluralism of 
theories is the consequence that presumable 
will always be with us. 

Turner’s argument, however, leads to 
absurd extremes. Does anyone really believe 
that if we had a large number of well- 
validated propositions of the form “in a large 
proportion [fill in the range of probability] of 
cases, the more the X, the more the Y,” we 
would know nothing at all? Turner’s argu- 
ment cuts against physical science as well as 
against sociology; again, I would be quite 
happy with the level of approximation and 
pragmatic success that these other sciences 
have achieved, whatever a purist argument 
like Turner’s would say about the logical 
status of such knowledge.5 On the philosoph- 


* As Willer (1987, pp. 43, 220) points out, 
physical scientists typically carry out experiments, 
not to achieve statistical validation, but to discover 
the range of conditions under which a theoretically 
derived relationship holds. Willer (1987, p. 220) 
comments: "physicists were satisfied with an 
experiment if its results came within a factor of ten 
or more of theoretically predicted values. This sort 
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‘ical level, Turner seems to assume a rigidly 


positivist conception of theory and fails to 
recognize that any theory involves interpre- 
tive leaps and pragmatics, including deciding 
that given observations are appropriately 
connected to a given theory. All theories are 
not equally valid; the question is which theory 
works in the largest number of contexts that 
are coherent with one another. 


Obstacles to Cumulation and The 
Organizational Politics of Sociology 


J maintain that when we go looking for it, we 
find bits and pieces of sociological knowledge 
lying all over the landscape. Our problem is 
recognizing what we have and organizing it 
So as to maximize its visibility. Why is this so 
difficult? 

One reason is the fragmentation and 
antagonism within our field. Sociology ‘is 
divided into a large number of specialties. 
This is hardly surprising, since there are many 
thousands of researchers with an interest in 
cultivating their own turf; and since sociology 
takes the entire social world (including its 
causes and effects) as its target, there is an 
enormous range of empirical things that we 
can study. The sheer amount and diversity of 
sociology give us a practical incentive not to 
pay attention to work outside one's own area. 
On top of this, there is a diversity of methods, 
whose proponents frequently regard work 
produced by rival methods as having no 
knowledge value. And there is a further split 
into theoretical schools, which often denigrate 
each other as part of their competition for 
hegemony. These battles are especially in- 
tense when theoretical factions have political 
overtones, or when it is argued that only 
practical knowledge or politically engagé 
expressions of a particular sort are worth- 
while. All these conditions make it difficult 
for us to pick out the places where theoretical 
explanations come together, and to assemble 
the bits of evidence thrown up by different 
approaches into a coherent pattern. 





of experimental error was considered ordinary and 
no one even considered statistically testing the 
result. . . . [W]hat can it possibly mean to say that 
physics is an exact science? . . . [E]xact meant the 
exact use of theory, not necessarily the exact 
production of clean results." 

$ One of the main advantages of the natural 
Sciences may be that there are relatively few direct 
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Perhaps even more important for American 
sociology has been a different type of split, 
‘between descriptive and theory-oriented re- 
search. The latter is the search for general 
explanatory principles; the former is a much 
more straightforward investigation of ques- 
tions that, at least initially, are fully compre- 
hensible to a layperson. This is the class of 
issues such as: how much social mobility is 
there? (are we still the land of opportunity? 
were we ever?) how much racial discrimina- 
. tion, and what progress towards its diminu- 
tion? These kinds of researches could be 
theory-relevant, but it is not necessary that 
one have an explanation in terms of genuinely 
analytical variables in order to produce useful 
information. Statistical technique without 
much theory finds its place most naturally 
here. It is likely this practically oriented 
search for descriptive information that anti- 
positivistic sociologists have in mind when 
they attack the methodological and antitheo- 
retical bias they perceive in sociology. 

This division of labor between our different 
activities need not poison the atmosphere over 
what is a scientific sociology. Practical/descrip- 
tive research could doubtless be done better if it 
were grounded in more analytical theory, but 
for many purposes an atheoretical description is 
enough. What is important to recognize is that 
cumulating an analytical core of explanatory 
propositions is a different enterprise. When we 
attempt to cumulate our general knowledge, we 
have to know what we are looking for and to 
cull it from the mountains of descriptive mate- 
rial that make up so much of the research lit- 
erature. Theory-oriented research and practical- 
/descriptive research can live together 
harmoniously, even if they produce an 
information-retrieval problem. 

More debilitating is an attitude that seems 
to carry over from practical/descriptive work 
into all of sociological research: the tendency 
to regard only the latest data as relevant. If we 
want to know what the equality of educational 
opportunity is, the survey with the latest date 
is obviously of most practical significance. 
But this "bias of the present," a kind of 
headline-chasing in sociology, is one of the 
factors that turn our faces away from pull- 


political implications of'their theories. This has 
enabled them to avoid a source of controversies 
that has clouded the analytical project of the social 
sciences, and of sociology above all. 
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ing together different studies into theoretically 
coherent patterns. 

Data from different time periods are a 
positive advantage in attempting to cumulate 
general explanatory principles. The change in 
empirical distributions that we often observe 
enables us to formulate principles of scope 
and to refine our theories. For example, 
research from industrial sociology and com- 
munity studies from the 1930s through the 
1950s led to general principles about how the 
experience of giving or taking orders pro- 
duces divergent class cultures (summarized in 
Collins 1988, pp. 211-14). The knowledge 
accumulated remains relevant on the analyti- 
cal level, even if (as seems to be the case) the 
harsher forms of authority have diminished 
for most working people in recent decades, 
which would bring about declining differ- 
ences in class cultures. A comparison of 
historical differences would thus not merely 
add particularistic description, but have value 
for testing and extending the theory; its scope 
would be further bolstered by bringing in 
even more extreme instances, which we can 
find most easily by going back to another 
historical epoch in which extremely harsh and 
violent coercive controls were common. 
Analytical theory must be distinguished from 
empirical generalizations about trends over 
time; only the former can give us understand- 
ing of future social patterns, whether or not 
there is a reversal of empirical trends in the 
independent variables. 

Another reason why we fail to cumulate is 
that we place too much emphasis on the most 
recent advances in statistical techniques, and 
insufficient emphasis on the connections of re- 
sults across different methods. As Stinchcombe 
(1984, pp. 55-56) pointed out under the head- 
ing "Why are Generalizations forgotten?”: “the 
biggest correlation coefficients in a body of em- 
pirical materials will ordinarily give rise to the 
biggest path coefficients. . . . Similarly, in al- 
most all cases, the metric coefficients in a re- 
gression equation will be biggest relative to vari- 
ations in causal forces when the path coefficient 
is largest. . . . And, in turn, the big relations in 
loglinear analysis are almost always the same 
ones that have big metric regression coefficients 
. . . .What this means is that almost all of our 
causal knowledge from the time we were using 
correlation coefficients (or even eyeballing cross- 
tabulations) is still our causal knowledge 
throughout the transitions to path analysis, to 
structural equations with metric coefficients, to 
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loglinear analysis.” There are of course ad- 
vances, in the sense that log linear analysis or 
latent variable measurement models enable us 
to see certain difficult patterns, to model certain 
complexities and take account of particular pos- 
sibilities of measurement error. But these are 
developments, so to speak, on the edges of the- 
oretically substantive paradigms, not at their 
core. We should be building on the earlier re- 
sults rather than dismissing them because the 
` research lacked all of today’s technical refine- 
ments. 

As current methodological sophistication 
(e.g., Campbell 1983; Lieberson 1985; Willer 
1987) readily acknowledges, there is no auto- 
matic "methodological fix”; explanatory mod- 
els are always underdetermined by any partic- 
ular set of data, and theoretical considerations 
always enter into our assumptions as to which 
models can be plausibly excluded. If empirical 
patterns are robust enough— that is, if our the- 
ories are on the right track—we will find them 
with a variety of methods. If the theories have 
a weak grasp of the central processes, then what 
we end up with after great statistical refinement 
is likely to be at best a shadow. 

All research methods have their weak- 
nesses. We can overcome this problem by 
showing a coherent pattern among results 
found with different methods. Thus, although 
there are weaknesses in cross-sectional re- 
search (Lieberson 1985), the coherence of 
theory based on all sources of evidence can be 
used to validate our inferences as to what 
factors are operative in a given body of data. 
The existence of experimental evidence helps 
enormously, when tied to a general theoreti- 
cal scheme; so do theoretical connections to 
naturalistic studies (e.g., of organizations or 
of face-to-face interaction), to historical 
studies of dynamics, and indeed to the entire 
range of sociological research. Coherence, if 
it is sufficiently strong, can bail us out of any 
local methodological jam. And this is a 
reason for working to theorize our findings 
everywhere. From a methodologist's angle, 
theory might be regarded as a device for 
storing our understandings of this coherence. 
Hence, ignoring earlier research is a vice and 
not a virtue; the very differences in research 
methods give us an opportunity to validate 
our findings by “triangulation.” 

Sociology is engaged in many different 
enterprises. The extreme diversity of what we 
do tends to keep what we already know 
hidden under a cloud of dust. Our problem, in 
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a sense, is too much knowledge, especially 
since so much of it is on the practical/descrip- 
tive side, which becomes unwieldy if it is not 
boiled down into theoretical generalizations. 
We suffer from the limitations of our 
cognitive capacity, and this cognitive over- 
load reinforces the structurally based tenden- 
cies in the intellectual world to simplify one's 
sociological worldview down to one's own 
research specialty, theoretical camp, or polit- 
ical faction, and to focus solely on the latest 
research data or technique. 


CONCLUSION 


My final point, perhaps the most important, is 
about the mood of sociology. It concerns our 
social relationships, our attitudes toward one 
another within our intellectual field. Much of 
what we express today about each other's work 
is negativistic, hostile, dismissive. This faction- 
alism is debilitating because we need multiple 
approaches in order to cross-validate our find- 
ings. 

For sociology to make progress, we need 
some spirit of generosity, instead of a spirit of 
factional antagonism. This is not the same as a 
policy of “go your own way,” tolerating each 
other but having nothing to do with one another 
intellectually. Building sociological knowledge 
is a collective enterprise, and in more ways 
than one. All human activities are social, and 
Science itself is a process of organizing thought 
collectives. As in other human activities, con- 
flict is inherent within the organization of the 
intellectual world. This is not bad, since con- 
flict is a main source of intellectual dynamics, 
including the processes by which we put for- 
ward new theories and collectively decide which 
ones lead to the best results. But conflict need 
not be extreme. In no form of intellectual life 
do we depend more heavily upon each other 
than in a science. To come together as scien- 
tists, we need to concentrate on the coherence 
of theoretical conceptions across different pieces 
of research. The personal aspect of that intel- 
lectual structure is generosity and good will, a 
positive feeling towards each other's best con- 


.tributions as we grope our way forward to- 


gether. 
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1988 ASR EDITOR’S REPORT 


This is the last report for my term as editor of 
ASR. First, I will summarize the activities for 
this year and then make some observations on 
my term as editor. 


1988 REPORT 


ASR porocessed 483 manuscripts this year, 
slightly up from last year. New submissions 
numbered 352. The acceptance rate of 
processed manuscripts was 17 percent; for 
new manuscripts, 11.4 percent. The average 
editorial review lag was 7.5 weeks, a new 
low. The publication lag was 4.5 months, 
slightly up from last year. ASR published 54 
articles in 1988, 12 research notes, and 4 
comments and replies. The Deputy Editors 
assigned papers to 562 reviewers, an increase 
of 273 over 1985. 

In general, women were equitably repre- 
sented in manuscripts submitted, manuscripts 
accepted, and manuscript reviewed. About 25 
percent of the tenured and nontenured profes- 
sors in four-year colleges are women. When 
manuscripts submitted were classified by the 
sex of the first author, 26 percent were 
women. For accepted manuscripts, 27.5 
percent were women. Among the manuscript 
reviewers, 22.2 percent were women. 


THREE-YEAR REPORT 


Over the last three years, ASA has improved 
the physical quality of its publications. The 
cover stock was improved with coated paper, 
the size of type was enlarged, and shrink- 
wrapping of tbe journals for mailing was 
scheduled. Other improvements are in the 
offing. I have tried to open up the journal to 
more diverse kinds of efforts. I don't know 
how successful I have been, but over my 
tenure, 18.7 percent of all published works 
have been nonquantitative: having either no 
number or only percentages. I have intro- 
duced a section on Problems of the Disci- 
pline, reported the contents of other ASA 
publications, and introduced editorials that 
discuss publication issues. 


An analysis of the content of submissions 

and acceptances generally shows a consistent 
relationship. That is, between 17 and 20 
percent of manuscripts in most subareas of the 
discipline get published. The largest submis- 
sions (over 100 in the interval of my 
editorship) have been in stratifiction, criminol- 
ogy, and race and ethnicity. The next largest 
number of submissions (50-60 manuscripts) 
have been in social psychology, world 
systems, demography, theory, and gender. 
Two areas that may have been “overpub- 
lished” are religion and culture. Areas 
“underpublished” may include: criminology, 
education, gender, and industrial. 
: My perhaps biased impression is that 
neither the editors nor ASR reviewers are 
biased against nonquantitative work. To those 
who feel that a bias exists, I advise, “Get 
your stuff in. Don't create a self-fulfilling 
prophesy." A second bit of advice to improve 
acceptance of manuscripts, "Write better." 
Third, “Don’t overtheorize." You can't test 
three theories with three independent vari- 
ables. My only bitterness as an editor is 
directed toward too many potential reviewers 
who find very lame excuses not to review. 
The problem is severe and I will devote an 
editorial to the subject. Fortunately, these 
irritations are overshadowed by my deep 
gratitude to the Deputy Editors, the Associate 
Editors, and the many conscientious review- 
ers. I take this opportunity to recognize the 
strong, selfless support given by the deputy 
editors, Robert L. Kaufman and Craig J. 
Jenkins. The following outgoing Associate 
Editors have given authors and me enormous 
intellectual and moral support. I cannot thank 
them sufficiently for their extensive and 
dedicated labor. I salute Ronald Aminzade, 
James Baron, Paul DiMaggio, Donna Eder, 
Larry Griffin, Melissa Hardy, and Peggy 
Thoits. We look forward to working with the 
newly appointed Associate Editors: Rodolfo 
Alvarez, Glenn Carroll, Dan Clawson, Randy 
Hodson, Ronald Kessler, Elizabeth Menag- 
han, and Jill Quadagno. 


William Form 


IMPORTANT NOTICE FROM THE EDITOR 


The editors of Social Psychology Quarterly and the American Sociological Review regret to 
report that a 1987 article published in Social Psychology Quarterly (Volume 50, No. 1:1-6) 
has been plagiarized from a 1970 article originally published in the American Sociological 
Review (Volume 35, No. 4:697—710). 

The Social Psychology Quarterly article is titled, “The Interpretive Conception of Social 
Interaction and the Logic of Deductive Explanation," and is authored by Anne E. Foon, who 
gave her address as The Australian National University. The original American Sociological 
Review article was published by Thomas P. Wilson and is titled, “Conceptions of Interaction . 
and Forms of Sociological Explanation." Professor Wilson is currently on the Sociology 
faculty at the University of California at Santa Barbara. In reviewing the case the Publications 
Committee of the American Sociological Association (which produces both journals and holds 
the copyright of the original Wilson article) issued a finding of plagiarism. The editors of all of 
the ASA publications and the elected members of the publications committee wish to 
underscore our commitment to uphold the general scholarly norms regarding plagiarism. 

This notice is being published simultaneously in Social Psychology Quarterly and in the 
American Sociological Review. It is also being circulated to the relevant professional 
organizations. Members of the scholarly community should eliminate all references to the Foon 
(1987, Social Psychology Quarterly) article, citing instead the original source, Wilson (1970, 
American Sociological Review) in all future written work. Foon has requested that an apology 
to Wilson be included in this statement: ^I apologize without reservation for the events which 
have occurred." The editors and the members of the publications committee wish to reaffirm 
that all possible steps will continue to be taken in the review process to avoid such violations 
in the future. 


ERRATA 


ASR apologizes for the following errors in Herbert J. Gans' article "Sociology in America" 
(February 1989): 


p. 10, line 5, should read, "standing, is what I think of as overquantification.” 


p. 12, lines 1 and 2, should read, “hope for power—in an ideal society in which these laws and | 
their formulators — would" l 


p. 12, footnote 13, should read, “replace us even further in doing American” 
p. 15, line 25, should read, “and abuse, convergences of and relations” 


MANUSCRIPTS FOR THE 
ASA ROSE SOCIOLOGY SERIES 


Manuscripts (100 to 300 typed pages) are solicited for publica- 
"tion in the ASA Arnold and Caroline Rose Monograph Series. 
‘The Series welcomes a variety of types of sociological work— 
qualitative or quantitative empirical studies, and theoretical or 
methodological treatises. An author should submit three copies . 
of a manuscript for consideration to the Series Editor, Professor 
Teresa A. Sullivan, University of Texas, Population Research Cen- 
. ter, 1800 Main, Austin, TX 78712. . 





ELITE CONFLICT AND STATE FORMATION IN 
16TH- AND 17TH-CENTURY ENGLAND AND FRANCE* 


RICHARD LACHMANN 
University of Wisconsin 


Class conflict, relative autonomy, and state-centered theories of absolutism are 
critiqued for their inability to isolate causal factors that could explain the differing 
patterns of state formation in England and France, culminating in the crown’s 
defeat in the English Civil War of 1640-46 and the French monarch’s victory over 
aristocratic rebels in the Frondes of 1648-53. This article argues that conflicts 
among the three principal elites—the clergy, monarchs and their officials, and lay 
landlords—are the critical dynamic that explains the contrasting developments of 
absolutism in the two countries. English monarchs won a horizontal national-level 
hegemony over rival elites at the cost of access to resources at the point of 
production, leaving them vulnerable to a class-based rebellion by landowners. In 
France, elites were amalgamated into an absolutist state by their conflicts, and 
thereby lost their separate identities as feudal exploiters as well as the opportunity 


to transform agrarian social relations to their sole advantage. 


Recently sociologists have been advised to 
"bring the state back in," to analyze states 
“as organizations claiming control over terri- 
tories and peoples" and inhabited by elites 
who use the capacities of those organizations 
to "formulate and pursue goals that are not 
simply reflective of the demands and interests 
of social groups, classes, or society" (Skocpol 
1985, p. 9). Claims for state autonomy vary 
in their generality, the spheres of activity or 
functions for which they are made, and the 
extent to which tbe actions of state managers 
are imputed to their conscious articulation of 
self-interest. Such claims are challenged by 
theories that view states as the creations of 
classes in civil society. This article specifies 
the rival hypotheses, testing both them and an 
alternative theory that emphasizes conflict 
among elites for their abilities to explain the 
critical early development of absolutist states 





* Direct all correspondence to Richard Lach- 
mann, Department of Sociology, 1180 Observa- 
tory Drive, University of Wisconsin, Madison, WI 
53706. 

This research was supported by the University of 
Wisconsin-Madison Graduate Research Commit- 
tee. I am grateful to Julia Adams, Ronald 
Aminzade, William Beik, Robert Brenner, Charles 
Camic, James B. Collins, Warren Hagstrom, 
Michael Kimmel, Edgar Kiser, David Parker, 
Theda Skocpol, Charles Tilly, Erik Olin Wright, 
and the anonymous reviewers of ASR for their 
comments on earlier drafts of this article. 
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in England and France during the 16th and 
17th centuries. 

Absolutism is defined in contrast to the 
decentralized forms of political power charac- 
teristic of feudal societies in medieval Eu- 
rope. Prior to the consolidation of absolutist 
states, military force and legal authority were 
exercised by sovereign aristocrats, clerics, 
and urban and provincial corporate bodies 
with hereditary rights to command a portion 
of peasants' agrarian production in areas 
under their control. Those feudal powers were 
undermined during the 16th and subsequent 
centuries as absolutist states tended to monop- 
olize armed force within increasingly defined 
national boundaries. Historians have mea- 
sured the strength of absolutist monarchs by 
(a) the share of resources channeled through 
state organizations as taxes and duties in 
comparison to the surplus still held by 
aristocrats as feudal rents, (b) the capacity of 
state officials to recognize property rights and 
enforce laws that superseded individual aris- 
tocrats' land rights and judicial authority, and 
(c) the extent to which monarchs and their 
alies prevailed in political and military 
conflicts against domestic rebels. 

Absolutist monarchies, as the first national 
states, are the central testing ground for 
theories of state formation. England and 
France make for an especially apt compari- 
son: both states were relatively weak in 1500, 
and by the mid-17th century English and 
French monarchs both had established mili- 
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tary hegemony over their nations and were 
able to extract significant resources from civil 
society. The two states were challenged by 
major rebellions with contrary outcomes. The 
English monarchy was overthrown in the 
English Revolution and Civil War of 1640-46, 
while the French crown defeated the Frondes 
of 1648-53. An adequate theory must explain 
not only the similar growth in the capacities 
of the two states from 1500 to the 1640s, but 
also the contrasting abilities of the monarchs 
to overcome internal armed challenges. 

Most of the evidence analyzed by the 
historians and social scientists who study 
absolutism in the two countries focuses on 
identifying the antagonists in the conflicts at 
the local, regional, and national levels in the 
16th and 17th centuries. At issue are the 
grounds for conflict. What are the salient 
characteristics of opposing groups that ex- 
plain particular conflicts? Why did some 
actors ally with absolutist monarchs while 
others opposed them? This article first 
categorizes general theoretical treatments and 
their specific historical arguments, in terms of 
the ways their authors answer these questions. 
The main perspectives are compared for the 
sorts of evidence they emphasize or overlook, 
as they try to explain which actors benefited 
and suffered from absolutist monarchs’ victo- 
ries and defeats. 

The main body of this article compares the 
conflicts and the protagonists involved in 
state formation in England and France 
between 1500 and the 1640s and explains the 
outcomes of the Civil War and the Frondes. It 
identifies the state elites, their relation to 
classes, and the extent to which the elites 
achieved autonomous action over the two 
centuries. The comparison suggests that 
elites’ autonomy has less to do with the 
capacities of state bureaucracies than with 
factional divisions inside and outside the 
state. In essence, the power of each elite was 
determined less by its own relative or absolute 
capacities, and more by the way in which 
rival elites were incapacitated by the ongoing 
structure of relations among them. 


THEORIES OF CIVIL SOCIETY AND 
THE ABSOLUTIST STATE 


Students of absolutism agree that the English 
and French states increased their military 
power, legal authority, and revenues in the 
16th and 17th centuries. Three general 
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hypotheses have been offered for how power 
and resources were centralized, and whether 
absolutist monarchies were formed at the 
behest or at the expense of feudal aristocra- 
cies. One theory argues that peasant chal- 
lenges to feudal exploitation forced the 
nobility to reorganize their coercive power 
within organs of a centralized state. Thus, 
absolutism served nobles by affording them 
greater security and a larger share of peasant 
production than they would have gained 
through localized struggles. A second theory 
claims that a rising bourgeoisie attained 
economic parity with the nobility during the 
16th century. During the unusual and transi- 
tory balance and stalemate between two 
putative ruling classes, state managers 
achieved a relative autonomy over both 
parties. Their autonomy eroded as soon as the 
bourgeoisie gained sufficient power to defeat 
the nobility and subordinate the state to its 
exclusive interest. A third theory proposes 
that state power and autonomy are cumulative 
and self-generating. As self-serving state 
elites gain revenues, they invest those re- 
sources in armies and bureaucracies that are 
then deployed to capture territory and extract 
further revenues from “citizens.” This consti- 
tutes the process of nation formation. 

` Behind theories that view the absolutist 
state as representative of the nobility, rela- 
tively autonomous, or progressively autono- 
mous, notions are introduced to explain when 
and how state officials and classes engage in 
efficacious action. This section probes those 
often implicit notions as hypotheses and 
identifies the sorts of evidence that supports 
or undermines each theory. 


Marxist Theories 


Perry Anderson represents the first view: 
*one of the basic axioms of historical 
materialism [is] that the secular struggle 
between classes is ultimately resolved at the 
political—not at the economic or cultural — 
level of society" (1974, p. 11). He argues 
that during feudalism the nobility and peas- 
antry were politically efficacious to the extent 
that they were able to strengthen or weaken 
the state. Anderson conceives the state as a 
complex of legal-coercive institutions that 
guarded feudal property relations. 

Anderson traces the development of the 
absolutist state to the feudal crisis of the 14th 
and 15th centuries that shattered the ability of 
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locally based, autonomous nobles to exploit 
their peasants. Serfdom disappeared in West- 
em Europe as peasants demanded freedom 
from seigneurial obligations. Absolutism was 
the response of the aristocracy to this crisis, 
“a redeployed and recharged apparatus of 
feudal domination designed to clamp the 
peasant masses back into their traditional 
social position. . The result was a 
displacement of political-legal coercion up- 
wards towards a centralized, militarized 
summit—the Absolutist state. Diluted at 
village level, it became concentrated at 
‘national’ level” (Anderson 1974, pp. 18-19). 

Anderson gives primary emphasis to class 
struggles between nobles and peasants. In the 
formation of absolutist states, he regards as 
relatively unimportant the conflicts among 
aristocratic factions, even though he acknowl- 
edges that, “for many individual nobles,” 
absolutism “signified indignity or ruin, against 
which they rebelled” (1974, p. 47). How- 
ever, “no feudal ruling class could afford to 
jettison the advances achieved by Absolutism, 
which were the expression of profound 
historical necessities working themselves out 
right across the continent, without jeopardiz- 
ing its own existence; none, in fact, ever was 
wholly or mainly won to the cause of revolt” 
(1974, p. 54). As a result, regionally and 
factionally based rebellions, such as the 
Frondes in France, failed less because of the 
military or fiscal capacities of the absolutist 
monarchy per se and more because most 
aristocrats had no choice but to remain loyal 
to absolutist monarchies upon whom they 
depended for the power and legal legitimacy 
necessary to extract resources from peasants. 

Absolutist states’ aristocratic character is 
further indicated by their relations with the 
rising class of bourgeois. Anderson sees “a 
potential field of compatibility . . . between 
the nature and programme of the Absolutist 
State and the operations of mercantile and 
manufacturing capital" (1974, p. 41), with 
both state and capital growing and profiting 
from the monetization of taxes and rents, the 
sale of state offices, and the establishment of 
protected monopolies domestically and of 
colonial ventures abroad. Nevertheless, in his 
view, the bourgeoisie always remained subor- 
dinate to the nobility in the policies of 
absolutist states. 

To support Anderson’s thesis, there should 
be no evidence that the bourgeoisie played a 
critical role in the anti-absolutist rebellions as 
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long as the bourgeoisie was subordinate to the 
aristocracy. And as long as the aristocracy 
remains the dominant class, absolutist monar- 
chies should have been invulnerable to 
challenge from any source. Thus, Anderson 
contends that the Frondes failed because the . 
aristocracy was unwilling, and the peasantry 
and bourgeoisie were unable, to overthrow 
absolutism. The unique success of the English 
Civil War among 17th-century antistate rebel- 
lions is a sign that “a commercialized gentry, 
a capitalist city, a commoner artisanate and 
yeomanry” developed precociously and gained 
the strength to challenge and defeat an 
aristocratic state (1974, p. 142). To support 
that contention, Anderson needs to demon- 
strate that the interests of the new bourgeois 
class in England differed from those of 
disgruntled elements of the old aristocracy 
who had headed previous rebellions, and 
explain how that new bourgeoisie developed 
within the confines of a feudal system. 

Other Marxists are less convinced about the 
aristocratic character of absolutism. Engels, 
in The Origin of the Family, Private Property 
and the State ([1884] 1972), differentiates 
“the feudal state [which] was the organ of the 
nobility for holding down the peasant serfs 
and bondsmen” from “the absolute monarchy 

which balanced the nobility and 
bourgeoisie against one another” (p. 231). 
Marx, in The German Ideology, found “the 
independence of the State . . . in those 
countries where the estates have not yet 
completely developed into classes . . . in 
which no section of the population can 
achieve dominance over the others" ([1846] 
1970, p. 80). For Marx and Engels, the 
absolutist state was not the instrument of 
noble class rule, as Anderson asserts, but an 
artifact of the nobility's loss of hegemony. 
Marx and Engels believed that the growing 
power of the new bourgeoisie, rather than 
heightened challenges from the peasantry, 
allowed monarchs to recast feudal states once 
controlled by nobles into instruments of 
absolutist rule increasingly staffed by bour- 
geois purchasers of offices sold by the crown. : 
Marx and Engels view absolutist states as 
only relatively autonomous, because once the 
bourgeoisie achieved control over production, 
the state was quickly subordinated to capital- 
ist class interest, regardless of the organiza- 
tional capacities and resources accumulated 
by the state elite during the transitional era. 
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The third set of theories views autonomous 
state elites as the crucial agents of social 
change and argues that their actions explain 
the parallel formation of European absolutist 
states and bourgeois classes. Charles Tilly 
compares European monarchs to Mafia chief- 
tains: who use their military might to threaten 
war upon peoples and territories unless they 
pay “protection” in the form of taxes. War 
within and between nations enhances states’ 
revenue-collecting capacities which, in turn, 
provide the revenues to support ever-larger 
military forces (Tilly 1985). Michael Mann 
(1980, 1986) advances a similar argument for 
England. The elites that command the mili- 
tary and taxation organizations of the state are 
self-interested. 

Tilly and Mann view state-building as a 
process: as state elites build their mutually 
reinforcing fiscal-administrative and military- 
coercive capacities, the growing corps of state 
officials augment their power and their share 
of the national income at the expense of 
nobles and peasants alike. Tilly and Mann 
see the bourgeoisie as an inadvertent by- 
product of state formation. State taxation 
concentrated resources at a national level, 
creating (mainly through military procure- 
ment) the initial markets for capitalist enter- 
prises. More important for Tilly is the effect 
of state tax demands upon the French agrarian 
economy. Rising taxes increased peasants’ 
need for cash, forcing them into markets to 
sell a growing portion of their produce and 
often their labor. When peasants could not 
realize enough cash, they fell into debt, 
eventually losing their land in bankruptcy 
sales to the bourgeoisie. By dispossessing the 
peasantry, the state was indirectly responsible 
for freeing the land for capital and for the 
concomitant genesis of a labor market (Tilly 
1981, pp. 202-6). 

The expansion of the state also altered the 
nature of class mobilization and political 
conflict. Monarchs used their vast resources 
to "create a large class of officials and 
financiers who served their own advantage by 
helping to pay the expense of the state" (Tilly 
1986, p. 132). As monarchs enticed and 
compelled nobles, urban merchants, and new 
bourgeois to tie their personal fortunes to the 
state's fate, the long-standing alliance of 
regional magnates and peasants was broken. 


The crown's victory in the Frondes demon- 
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strated, in Tilly's view, that the nobility had 
more interest in guarding state revenues than 
in protecting special privileges for their 
provincial subordinates. 

Tilly argues that changes in the actors and 
issues involved in conflicts within France are 
the best indicators of tbe extent and conse- 
quences of state formation. State officials and 
taxpayers replaced lords and peasants as the 
principal antagonists in rural conflicts. Rebel- 
lions and lesser forms of resistance were 
increasingly directed against state demands 
for taxes, provisions, or men for armies rather 
than against landlords' requirements for rents 
or feudal labor dues. (Tily 1986, pp. 
119-61). Tillys model would be further 
supported by evidence that the aristocracy and 
bourgeoisie were not cohesive. He argues that 
they divided into two camps. One camp held 
offices, monopolies, contracts, and other 
sinecures and therefore supported the state. 
The other camp were the nobles and mer- 
chants who opposed absolutism because they 
did not benefit from it. 

Several scholars have tried to apply a 
state-centered approach to the development of 
English absolutism. Mann (1980, 1986) 
interprets as evidence of state formation the 
increases in the crown's revenues and the 
growing number of nobles and bourgeois 
dependent upon state favors. He does not 
claim that the state played a major role in the 
development of English capitalism. This 
would be difficult to sustain, since agrarian 
and merchant capitalism developed faster in 
England than in France. Instead, Mann 
(1980, p. 203) views the English state as 
almost exclusively an international military 
actor which did not create a bourgeoisie, but 
merely "pushed classes toward a national 
form of organization." He consigns state 
actors and social classes to two different 
military and economic spheres, arguing that 
they interacted only when the former hindered 
the latter with tax demands or aided capitalists 
in their conquest of foreign markets. As a 
result, Mann is unable to find grounds for the 
1640 Revolution against the monarchy. In- 
deed Mann (1980, 1986) completely passes 
over the Revolution and Civil War in his 
studies of English state development. 

English historians H.R. Trevor-Roper 
(1965) and Lawrence Stone (1970) explain 
the English Civil War as a conflict between a 
“court” composed of self-interested state 
officials and the “country” which was forced 
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to bear the increasing burden of taxes. They 
argue that the division between court and 
-country does not correspond to the sorts of 
class divisions highlighted by Marxist analy- 
ses. Rather, the common feature among the 
proroyalist factions was an interest in govern- 
ment offices or patronage. Although the 
Parliamentary side of the Civil War suffered 
from class and regional differences, it was 
united against court demands. These histori- 
ans differ from Tilly and Mann in regarding 
the Civil War and Frondes as rebellions 
against the unusual, demands and corruption 
of “Renaissance states.” Trevor-Roper con- 
tends (1965, pp. 88-94) that, regardless of 
the outcomes of the mid-17th-century rebel- 
lions, both the victorious French kings and 
the reconstituted English monarchy adopted 
mercantilist policies that were less burden- 
some on their subjects and eventually served 
to spur economic development.! 


! Many historians subscribe, at least implicitly, 
to one of the three perspectives, although they 
define the actors and their interests in somewhat 
different terms from the theories examined here. 
Thus, Christopher Hill (1963), the most prominent 
exponent of the class analysis model for England, 
does not discuss a general bourgeois class interest. 
Instead he describes how purchasers of former 
monastic properties came to share an interest in 
protecting their property rights against crown 
attempts to reappropriate church lands and tithes. 
A similar inability to identify bourgeois and 
aristocratic classes with opposing interests means 
that few English historians have accepted or 
elaborated Dobb's (1947) rendition of the relative 
autonomy model. The main thrust of English 
scholarship, including many of the sources used 
here, resembles the state-centered approach. How- 
ever, the argument, initially expounded by Trevor- 
Roper (1965) and Stone (1970), rejects the 
historical and theoretical sweep of Tilly and Mann 
fro a focus on the particular characteristics of 
“Renaissance states." A recent historical school, 
led by Russell (1971) and Hunt (1983), rejects ail 
theoretical generalizations in favor of viewing the 
Civil War as an amalgam of particularistic, 
localized disputes, tied by a Puritan ideology that 
championed parochial self-interest, though not 
bourgeois acquisitiveness. Russell and Hunt argue 
that Puritanism delegitimized royal government 
without creating or articulating a new group 
interest. As a result, the antistate coalition that won 
the Civil War disbanded once a new state was able 
to maintain order without antagonizing Puritans. 

Historians are divided in their assessments of the 
salience of class categories for understanding 
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As a result of the way they conceptualize state 
formation, existing theories are unable to 
address the politics of anti-absolutist revolu- 
tions. All three perspectives conflate two 
different forms of absolutism—horizontal and 
vertical. Horizontal absolutism is distin- 
guished by the crown’s ability to subordinate 
its two principal rivals at the national level— 
great nobles, referred to by historians and in 
this article as magnates, who fielded their 
own independent armed forces and subordi- 
nated lesser landlords, and clerics organized 
into a national. church. Thus, horizontal 
absolutism exists where the crown exercised a 
monopoly over armed force and dominated 
the national church. Historians’ descriptions 
of English absolutism as it developed in the 
16th and 17th centuries fit this model. While 
in England the Catholic church was subordi- 
nated in a Protestant Reformation, other 
monarchs achieved horizontal control over 
their churches while remaining Catholic (e.g., 
in the Austrian Empire and Poland) at the 
same time as some Protestant monarchs lost 
control over the reformed churches of their 
nations (e.g., Wurttemburg). Another varia- 
tion in horizontal absolutism is the relation- 
ship of national assemblies to the crown. In 
countries (primarily in Eastern Europe) where 
national assemblies remained congresses of 
aristocrats and clerics, horizontal absolutism 
led to the subordination of those assemblies to 


17th-century French politics. The view of the 
absolutist state as an instrument of the aristocracy 
is expressed more often by historians of the 
Revolution, notably Lefebvre (1967) and Robin 
(1970), than by scholars of the centuries covered in 
this article. Most Marxist studies of French 
absolutism adopt the relative autonomy perspec- 
tive; in addition to Porchnev (1963) and Lublins- 
kaya (1968), key studies include Mandrou (1965) 
and Morrill (1978). Non-Marxist French historians 
have also worked mainly within a state-centered 
approach, for example, Goubert (1969-73), Major 
(1980) and Mousnier (1979-84). However, these 
historians reject any sort of class analysis and deny 
the existence of a unified state elite. Instead, they 
analyze the state as an amalgam of “orders” 
competing for prestige and power. Recently 
several scholars have sought to bring class back 
into the analysis of the French state in order to 
explain the difficulties state officials encountered, 
and often failed to surmount, in their efforts to 
control and exploit civil society. (See Castan 1974 
and Asher 1960). 
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Table 1. Four Theories of the Absolutist State 
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Class Analysis Relative Autonomy = State-Centered Elite Conflict 
Interest aristocracy state managers state managers state managers 
represented in horizontal, 
by state aristocracy in 
vertical absolutism 
Cause of peasant aristocratic- previous state monarch’s ability 
cause resistance bourgeois capacities to make vertical 
development balance or horizontal 
alliances 
Absolutism’s limits its not specified aids its aided by horizontal, 
effect on development formation hindered by vertical 
bourgeoisie absolutism 
Antistate rebels bourgeoisie bourgeoisie landlords without locally based 
in England state offices landlords 
Reason for growth of growth of no explanation local gentry's 
monarch's defeat bourgeoisie bourgeoisie isolation from 
in Civil War horizontal absolutism 
Antistate rebels regional aristocracy non-office oppositional 
in France aristocratic holders patron-client 
elements chains 
Reason for aristocracy's state allied with existing state aristocracy tied to 
monarch's interest in weak bourgeoisie capacities vertical absolutism 
victory in absolutist state 
the Frondes 


the crown. However, in England, as I show 
below, the crown purged most clerics and 
many magnates from Parliament. Parliament 
therefore shifted from being an assembly of 
men with national power to a congress 
representing local interests, which, as they 
found common bases of unity, became a new 
opposition to English horizontal absolutism. 
The second form of absolutism resulted 
from monarchs’ inabilities to eliminate rival 
magnates or to dominate the national church. 
As a second-best strategy, rulers formed 
direct ties to locally based officials and 
corporate bodies, hence the term vertical 
absolutism. Over time, the successful construc- 
tion of vertical absolutism created corps of 
officeholders who rivaled clerics, magnates, 
and their retainers in their access to revenues 
and in their control over judicial and military 
organizations. The crown’s aristocratic rivals 
then sought to exploit the power and profits of 
vertical absolutism, seeking to purchase or to 
be appointed to high venal offices. Thus, 
strong vertical absolutism yielded a measure 
of horizontal absolutism as well, as formerly 
independent magnates and clerics became 
officeholders themselves. I describe this 


general form and French absolutism in 
particular as vertical, rather than as a 
combination of vertical and horizontal, to 
emphasize the different starting points and 
developmental trajectories of the two forms 
and to highlight contrasts in the ties between 
magnates and the crown, and between 
magnates and lesser landlords and officehold- 
ers, in societies marked by the two forms of 
absolutism. 

My thesis is that monarchs' abilities to 
pursue strategies that fostered vertical or 
horizontal absolutism depended on the struc- 
ture of relations existing among the elites in 
both countries. All three perspectives tend to 
examine the aristocracy as a class and to 
confine their debates to questions about the 
interests of the aristocracy as a whole and its 
relationship to rival peasant and bourgeois 
classes and to the state. They thus obscure the 
conflicts within the feudal ruling class and the 
ways in which those conflicts became em- 
bedded in the horizontal and vertical forms of 
absolutism. As a result, the theorists are 
unable to account for the divisions among 
aristocrats into loyal and rebellious factions 
during the Civil War and in the Frondes. 
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Moreover, previous efforts to track align- 
ments among classes and state officials have 
paid insufficient attention to the complexes of 
alliances and conflicts in 17th-century En- 
gland and France. 


16TH-CENTURY SOCIAL STRUCTURE 
IN ENGLAND AND FRANCE 


I now lay the groundwork to test my elite 
conflict model against the class, relative 
autonomy, and state-centered approaches, for 
their ability to explain horizontal and vertical 
patterns of absolutist state formation and the 
outcomes of challenges to royal power in 
England and France. I first define elites in 
contrast to classes and state officials, and then 
compare the usefulness of those categories to 
describe the interests and relations among the 
social actors in early 16th-century England 
and France. 

Competing terminologies express the differ- 
ent understandings of theorists of how actors 
converge to conceive and act on their 
interests. A Marxist class analysis of feudal 
society contrasts aristocratic and peasant 
classes. The former comprises all those who 
extract a surplus from the latter class. For 
Anderson, landlords and state officials are 
part of the aristocracy since they share an 
interest in exploiting the peasantry. Indeed, 
Anderson contends (1974, pp. 53-55) that 
anti-absolutist revolts failed because the 
nobles came to realize that they shared a class 
interest with state officials. 

Scholars from the relative autonomy per- 
spective (beginning with Marx 1846 and 
Engels 1884) and from the state-centered 
approach (exemplified by Tilly 1985 and 
Mann 1986) contend that state officials do not 
necessarily share the interests of an aristo- 
cratic or a bourgeois class in all instances. 
Advocates of this approach use the concept of 
a state elite to draw attention to officials’ use 
of a state’s administrative and military means 
to draw taxes and duties from other classes 
and the peasants. For Marx and Engels, state 
officials’ interests are political because they 
rest on a balance of class forces rather than on 
their administrative capacities. Relative auton- 
omy and state-centered theorists agree that 
state officials can act against various classes’ 
interests because they understand that their 
power and income are not dependent on a 
single class or mode of production. If that 
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interpretation is correct, state officials consti- 
tute an elite separate from any class. 

I define an elite as a group that shares 
control over an organization that extracts 
resources from a subordinate class and guards 
access to those resources from rival elites as 
well as from subordinate classes. My use of 
the plural "elites" leaves open to empirical 
investigation the number and characteristics 
of the forms through which actors converge to 
guard and to justify their claims to resources 
and authority. By using the term elite, rather 
than class fraction, I hypothesize that differ- 
ences among ruling groups in feudal societies 
are in the organizational and ideological 
mechanisms they use to appropriate re- 
sources, and not in any particular authority 
over a point or process of production. The 
term class fraction draws attention to the 
realm of production and thus obscures the 
organizational bases that define and differen- 
tiate feudal elites. 

Space limitations prevent me from discuss- 
ing the applicability of my conception of 
elites to societies other than those marked by 
absolutist states. Nor can I compare my use of 
elite with those of Mosca, Pareto, Mills, 
Domhoff, and others. For present purposes, it 
is sufficient to determine if my elite model is 
better able than the other theories to predict 
the salient lines of conflict in the two 
countries during the era of absolutism. My 
model predicts conflicts among clerics orga- 
nized in a transnational Catholic church, lay 
seigneurs mobilized through regional net- 
works headed by great magnates, and kings 
presiding over corps of officials organized on 
increasingly national lines (and less powerful 
groups of merchants organized in corporate 
bodies). 

All the theoretical approaches view inter- 
ests as derived from the ways in which feudal 
actors appropriate resources. One can test the 
competing theories by examining the ways in 
which resources were extracted from peasants 
who constituted the vast majority of produc- 
ers in early 16th-century England and France. 
To the extent that all extractors were drawn 
together by an overriding interest in maintain- 
ing their dominance over the peasantry, then 
the class analysis model would be supported. 
The state-centered theory would appear valid 
if conflicts between rent and tax collectors 
were not subordinated to a common interest in 
exploiting peasants. The relative autonomy 
model also predicts state officials' indepen- 
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dent interests, but regards that as transitory. If 
interests cannot be reduced to those of class 
and state, but instead are grounded in a 
multiplicity of institutions, each of which 
controls resources both at and beyond the 
point of production, then the elite structure 
model should be preferred. Historians, in 
their accounts of social relations in early 
16th-century England and France, concentrate 
on describing the ways in which peasants’ 
obligations to various extractors were deter- 
mined, and examine the legal, military, and 
other means used by individuals and corpo- 
rate bodies to protect and extend their claims 
to resources. Below I examine such accounts 
to identify the bases of cohesion and conflict 
among extractors, and thereby determine the 
validity of the rival theories. 

English and French peasants were sub- 
jected to clerical and royal demands in 
addition to labor obligations and/or dues paid 
in kind or in cash to the lord of the manor. 
The Catholic church was entitled to collect 
tithes in all rural and urban parishes. 
Benefices (the right to tithe revenues in a 
parish) were held variously by parish priests, 
monastic and other church institutions, and 
sometimes laymen. Kings exercised the right 
to collect taxes on wealth and production in 
the territories under their control. At times 
kings’ fiscal authority was complemented or 
rivaled by great nobles who, in addition to 
collecting feudal dues on their estates, also 
demanded fiscal or military tribute from the 
lesser seigneurs, towns, and peasants in the 
regions they dominated. 

The three types of extractors—manor lords, 
benefice holders, and tax collectors—each 
maintained an independent legal and adminis- 
trative apparatus to protect their agrarian 
income against depredations from rival extrac- 
tors as well as from the peasants. Nobles were 
distinguished by their right to exercise 
judicial powers on their estates. English and 
French kings differed from one another and 
over time in their capacities to intervene in 
seigneurial courts’ disposition of disputes 
between manor lords and peasants over land 
rights and rent obligations (Mousnier 1979, 
pp. 492-528; Gray 1963). 

The principal barrier to royal control over 
seigneurial justice was the paucity of judges 
who were directly under crown control. In 
England, only twelve judges were salaried by 
the king and available to hear appeals from 
local judges (Cockbum 1972). The main 
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work of “royal” justice was carried out by the 
more than one thousand Justices of the Peace 
empowered to apply common law to land 
tenure disputes (Gleason 1969, p. 49). 
Although JPs were royal appointees, the 
crown was limited to choosing them from 
among the major landlords of each county if 
those judges were to be respected and obeyed 
by their fellow manor lords. Since those 
manor lori-JPs shared similar interests on 
questions of land rights, it was futile for the 
crown to attempt to influence the direction of 
judicial decisions by dismissing one justice 
and replacing him with another equivalently 
situated landlord. 

French kings operated under similar con- 
straints in appointing judges to the Parle- 
ments, as the high provincial courts were 
called. Almost all 15th- and early 16th- 
century appointees (i.e., the men given royal 
permission to buy offices from retiring judges 
or to purchase new Parlement seats estab- 
lished by the crown) were seigneurs them- 
selves, and many were clients of the one or 
several great nobles who dominated that 
province (Dewald 1980, pp. 69-112). As a 
result, Parlement judges were unreliable 
enforcers of royal edicts designed to limit 
seigneurs' powers over peasants (ibid., pp. 
16—68). 

Ultimately, the critical difference in the 
relationships between English and French 
kings and their judicial appointees lay in the 
fact that English JPs were unpaid, while 
Parlement judges received lucrative incomes 
from their offices. As a result, French nobles 
competed for the right to invest in offices, 
and French judges increasingly defended the 
king and his interests because their own 
authority and the value of their offices were 
intertwined with them. As I show below, 
French kings were able to regulate and sell 
provincial judicial authority by exploiting the 
divisions between nobles who held offices. 
and those who did not. In contrast, English 
landlords exercised enough solidarity on the 
country level to prevent the king from selling 
to a few the authority already enjoyed by all 
(Brenner 1982, pp. 50-60; Dibble 1965; 
Parker 1983). 

The clergy of both countries operated a 
system of clerical courts which exercised 
authority over land tenure and urban com- 
merce under the rubric of guarding the 
church's tithe rights (Hill 1963, pp. 84—92, 
Blet 1959, pp. 88-89). However, English 
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clerics were far more successful than their 
French counterparts at maintaining the inde- 
pendence and fiscal integrity of their offices. 
French kings and nobles controlled appoint- 
ments to most ecclesiastical offices and used 
that control to claim much of the church's 
tithe income. The crown endorsed the legal 
ruses that Parlementaires devised to encroach 
upon the authority of ecclesiastical judges to 
regulate manorial land tenure and familial 
relations (Blet 1959, pp. 88-99). In contrast, 
English clerics were able to prevent the 
Papacy from signing control over most offices 
to the king. English kings did appoint the 
archbishops, but all other appointments were 
determined within the church. As a result, 
English clerical judges had the independence 
to protect the church’s fiscal interest, even 
when that required intervention into landlord- 
peasant disputes over land tenure (Hill 1963). 

The above discussion identifies the way in 
which kings, clerics, and lay landlords each 
were limited in their attempts to enforce 
extraeconomic surplus extraction by rival 
elites’ independent judicial apparatus. Simi- 
larly, each elite’s armed forces (which may 
have had a primary genesis and raison d'etre 
in the struggle to compel peasants to 
surrender part of their production to that elite) 
were most often deployed to guard against 
threats of rival armies from both within and 
without the two countries. 

The division of military force in provincial 
centers forced both English and French 
monarchs to negotiate with aristocrats and 
corporate bodies for the fiscal and military 
means to prosecute foreign wars and to defeat 
usurpers and seccessionists. At the outset of 
the 16th century, English monarchs carried on 
such negotiations in a national Parliament 
attended by nobles, clerics, and urban mer- 
chants. The distribution of the tax burden 
among counties and towns, and the nobility's 
liability for taxes had been determined in a 
1344 compromise (Schofield 1963). French 
monarchs were relatively more successful 
than their English counterparts at collecting 
taxes from the peasants in the pays d'election, 
where there were no provincial estates. They 
were less successful in the pays d'etat, where 
provincial estates existed. However, in both 
types of provinces French kings were forced 
to grant aristocrats full exemption from taxes. 
French kings usually were unable to gather 
the aristocracy of the more recently unified 
French nation in a national Estates General, 
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and so were unable to raise taxes in the pays 
d'etat to the level of the pays d'election 
(Parker 1983, pp. 13-27). 

By the early 16th century, the "dis- 
placement of . . . coercion upwards towards a 
centralized, militarized summit" (Anderson 
1974, p. 19) had concentrated military forces 
at the regional and corporate levels, rather 
than in the hands of national monarchs. 
Aristocrats did not surrender their military 
and financial resources to distant kings. 
Instead, they united under the banner of 
greater nobles, the wealthiest aristocrats of 
their province or county, who already com- 
manded armies and controlled strategic forti- 
fications. In 16th-century France, nobles 
competed with towns and with Protestant and 
Catholic leagues which organized and hired 
armies of their own, while in England armed 
forces remained in aristocratic hands (Major 
1964; Stone 1965, pp. 119—270). 

The above historical discussion suggests 
that the various extractors in early 16th- 
century England and France can be consid- 
ered as a single class only by stretching the 
meaning of that term. Certainly, kings, 
officials, landlords, clerics, and various 
corporate bodies all derived their incomes 
from peasants and shared an interest in 
suppressing peasant resistance. Yet, at the 
same time, various actors, especially clerics 
and kings, sought to limit rivals' abilities to 
exploit peasants as a way of guarding their 
own access to peasant resources. The divi- 
Sions within the not-so-unified feudal "ruling 
class" cannot be reduced to rent-collecting 
aristocrats versus tax-collecting kings. Nor 
does Michael Mann's "four sources and 
organizations of power," the ideological, 
economic, military, and political (1986, pp. 
22—30), get at the bases of power in the two 
countries. All three principal elites exercised 
class-like and state-like authority. Kings, 
regionally and locally based landlords and the 
Roman Catholic church each extracted reve- 
nues and exercised influence within a greater 
or lesser expanse of territory, manned a 
separate judicial apparatus and, except for the 
church, fielded armed forces to protect their 
fiscal, judicial, and ideological claims within 
those lands. 

The following section focuses on how the 
Sovereign powers disbursed among the three 
elites became concentrated to different extents 
and in different configurations. Again, I 
examine the historical evidence to compare 
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my elite conflict model against the other 
perspectives for their abilities to identify the 
social actors critical to the forms of absolut- 
ism that developed in the two countries. 


HORIZONTAL AND VERTICAL 
STRATEGIES OF ABSOLUTISM 


The directions of English and French state 
formation in the 16th and early 17th centuries 
cannot be derived from an inventory of each 
group’s capacities taken singly, but from the 
outcomes of alliances and conflicts within 
which they deployed those capacities. The 
incongruence between the strategic accomplish- 
ments and capacities of groups is demon- 
strated by the contrasting outcomes of the 
Reformation in England and France. The 
more autonomous national church of England 
lost most of its assets and authority in the 
Dissolution of Monasteries, while the French 
clergy preserved its administrative and fiscal 
organization. English historians from all 
perspectives view the Reformation as key to 
subsequent development of English absolut- 
ism and sharply differentiate that nation’s 
political landscape from France’s. Yet, theo- 
rists of the three perspectives give the clergy 
and the Reformation (or its nonoccurrence in 
France) little weight. The following section 
offers an explanation of why the English and 
not the French clerical elite was subordinated 
to the monarchy, and then traces the conse- 
quences of the divergent church fates on the 
strategic opportunities open to both mon- 
archs. I then evaluate the three theories of 
absolutism against my elite-focused model for 
their abilities to explain the patterns of 
conflict and alliance. 

The strength of the English church’s 
institutional autonomy represented weakness 
for the clergy’s position in English politics. 
Since lay landlords lacked influence over 
clerical appointments and were opposed by 
ecclesiastical courts in land tenure conflicts, 
they had no interest in preserving church 
authority and assets. In contrast, French 
nobles, who controlled clerical appointments 
and revenues of church offices, wanted to 
preserve the formal autonomy of the Catholic 
church from royal appropriation. Of the 129 
men appointed as bishops during the reign of 
Francois I (1514-47), 93 were nobles of the 
sword who held land and headed military 
companies independent of the crown. The 
majority of bishoprics were kept within 
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nobles’ families, and incumbent bishops en- 
joyed the de facto right to name successors 
(Salmon 1975, pp. 80-83). Consequently, 
when kings submitted their fiscal and legal 
demands to national assemblies of bishops, 
clerics enjoyed sufficiently strong ties to 
provincial nobles to resist royal threats to 
their interests (Blet 1959). Clerical subordina- 
tion to their aristocratic kin saved clerical 
property from being expropriated by the 
crown. 

The English church’s autonomy from 
county landlords created a situation where the 
monarchy needed to control only a few dozen 
clerics at the top of the hierarchy. The 
hierarchical character of the church, the 
absence of ties between lower-level clerics 
and lay landlords, and Henry VIII's domi- 
nance of the bishops, enabled him to gain 
parliamentary ratification for the transfer of 
revenues and assets from the clerical hier- 
archy to the crown. Through the Dissolution, 
the monarch assumed rights to the third of all 
English manors that had been held by the 
monasteries. Combining his pre-Reformation 
landholdings with the assessed value of the 
monastic manors, Henry VIII was to receive 
200,000 pounds a year. This would have been 
enough to guarantee the crown’s fiscal 
independence from the aristocracy and Parlia- 
ment in peacetime while giving the crown the 
resources necessary to a royal bureaucracy 
(Hill 1963, pp. 3-5). 

The crown was unable to realize the 
promised income from its new estates because 
the Reformation transformed the context of 
the monarch's absolutist strategies. The 
crown still needed lay landlords' support to 
suppress opposition to the Reformation and to 
launch the series of military campaigns that 
began in 1539. In order to accomplish these 
two objectives, by the end of his reign in 
1547 Henry VIII had given away or sold 
two-thirds of the properties he had seized in 
the Dissolution of the Monasteries (Dietz 
1964). Because he was unable to get 
Parliament to agree to sufficient tax increases, 
he had to sell properties to finance wars 
which, while not an ordinary annual expense, 
were normal costs incurred by all European 
kings of that era. Henry's dominance of 
Parliament had been weakened when most 
clerical lords lost their seats after the 
Dissolution of the Monasteries. Lay landlords 
refused to allow the crown to admit the newly 
appointed bishops to Parliament. In short, 
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crown control of bishops no longer translated 
into many Parliamentary votes. This gave lay 
lords control of that body and the power to 
force Henry to pay for his wars through land 
sales rather than through taxes. 

The crown’s administration of former 
monastic lands and its evaluation of those 
- properties for sale were guided more by the 
need to win lay support for the Reformation 
than by an interest in enlarging the king's 
treasury. In the wake of the 1536 Pilgrimage 
of Grace—a rebellion by disestablished priests, 
peasants, and landlords excluded from royal 
patronage who sought to undo the Reforma- 
tion— political considerations became para- 
mount. Henry VIII induced county landlords 
to support the Reformation by appointing 
them as bailiffs to manage the new crown 
estates. The absence of a royal bureaucracy 
precluded the crown's supervision of the 
bailiffs and enabled the latter to lease manor 
lands to themselves and their allies at rents 
that captured much of the estates’ surplus. 
The crown's attempts to revise leases pro- 
voked resistance and forced the crown to 
honor most existing leases (Youings 1955). 

Henry VIII's successors, most notably 
Elizabeth I, spent the substantial remaining 
former monastic assets on patronage to lure 
magnates from their rural bases to the capital, 
and to reestablish royal control over Parlia- 
ment (Stone 1965). The first step in building 
national-level absolutism, the dissolution of 
the monasteries, provided the assets for the 
second step, the disarming of the magnates. 
The enticement of royal patronage trans- 
formed enough nobles into crown dependents 
to give Elizabeth I the military domination 
she needed to defeat the remaining magnates 
who refused to dismantle their country 
fortifications and disband their armed retain- 
ers. The crown achieved a monopoly on 
armed force within England by the end of the 
16th century. 

The crown's expenditures of resources 
seized in the Dissolution of the Monasteries 
transformed networks among lay landlords. 
Prior to the Reformation, lay landlords had 
been members of political blocs led by 
magnates. The removal of magnates to the 
royal court and the elimination of their private 
armies disrupted magnáte hegemony in the 
counties. The magnates in essence were 
absorbed into the royal elite, becoming court 
retainers and the king's obedient allies in 
Parliament. The remaining lay landlords 
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became what historians describe as “the 
gentry.” In many counties, the number of 
gentry tripled in the century following the 
Reformation because new families were able 
to purchase former monastic and royal lands 
(Stone and Stone 1984, pp. 258-59, 269-70). 

County-based gentry were weakened ini- 
tially by the loss of magnate leaders who 
could pressure the crown on their behalf. 
English monarchs created a new political 
dynamic in the counties and prevented the 
reemergence of provincial political strongmen 
by reclaiming the power they had ceded to 
magnates to appoint Justices of the Peace. 
Monarchs, who had spent their financial 
resources on war and on patronage for 
magnates, used appointments to the county 
Commissions of the Peace as a way of 
establishing direct ties to gentry families 
without expending any funds. The crown kept 
the gentry off balance through manipulations 
designed to encourage gentry to organize into 
factions to compete for control of the 
Commissions of the Peace (Barnes 1961). 

The gentry’s main source of new power 
came from the absence of crown and clerical 
interference in the management of their 
relations with the peasantry. After the Refor- 
mation, clerical courts lost the power to 
regulate peasant land tenure (Hill 1963, pp. 
84-92). Crown efforts to protect peasant land 
rights were limited by the paucity of judges 
under direct crown control and by tenants’ 
inability to afford the cost of appealing 
adverse decisions by JPs to the king’s courts. 
As aresult, gentry-dominated county commis- 
sions of the peace assumed the generally 
unchecked power to regulate peasants’ copy- 
hold tenures (Kerridge 1969, pp. 54-58) and 
to determine the residence and wages of 
landless peasants (Kelly 1977). 

The absence of a Dissolution meant that 
16th-century French monarchs lacked the 
fiscal resources to disarm the great nobles and 
draw them away from their provincial bases. 
Unable to challenge magnate rivals and 
their state-like organizations at the national 
level, French kings attempted to create clients 
at the local level who could serve as 
counterweights to the established power blocs 
in the provinces. Many provinces had become 
part of the French nation only in the 15th 
century and, as a result, retained Estates, 
clerical, urban, and judicial organs estab- 
lished when the provinces had been indepen- 
dent or parts of other kingdoms (Parker 1983, 
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pp. 2-13). The crown moved to augment the 
powers of corporate bodies at the expense of 
individual seigneurs, principally by giving 
body members venal rights to their offices. 

Venality (the purchase of the right to hold 
and assign incumbency of an office) served 
the financial interests of both the monarch and 
officeholders at the expense of those who 
supported the offices through taxes and 
duties. The crown, upon establishing a venal 
office (or endowing an existing office with 
venal rights), would also grant the office- 
holder the right to collect taxes, duties, or 
fees that would constitute a return on the 
purchaser’s “investment” in the office. Since 
nobles and clerics were exempt from almost 
all taxes, they were not directly affected by 
the new extractions that supported venal 
offices. Nobles and clerics were disadvan- 
taged only so far as the resources and 
jurisdictions they previously had monopolized 
were encroached upon by venal officeholders. 

The crown’s strategy for building vertical 
ties between the national court and locally 
based venal officeholders increased royal 
revenues. During the 16th century, sales of 
offices surpassed loans as the major source of 
extraordinary royal revenues (Parker 1983, 
pp. 13-39). By 1633, half of all crown 
revenues were derived from the sale of venal 
offices, and from the paulette, an annual fee 
paid by incumbent officeholders in return for 
the right to resell or bequeath their venal posts 
(Treasure 1967, p. 54). Receipts from those 
two sources peaked at 30 million livres in 
1639 (Dent 1967, pp. 247-50). 

While French monarchs gained increasing 
revenues from the sale of venal offices during 
the 16th and early 17th centuries, the crown 
received a decreasing portion of the taxes and 
duties that venal officials collected. In order 
to make venal offices financially attractive to 
potential buyers, and to retain the loyalty of 
the original and subsequent holders of those 
offices, the crown was forced to allow venal 
tax collectors and judicial officers to collect 

. “commissions” on the revenues destined for 
the crown. Those commissions ranged from 
17 percent to 25 percent in the early 17th 
century and rose to over 40 percent by the eve 
of the Frondes in the 1640s (Dessert 1984, 
pp. 46-63). 

The crown’s restricted political control 
over the aristocrats who mainly purchased 
venal offices resulted in contradictory results 
for the monarch: increasing income from the 
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sale of offices at the cost of limiting returns 
from the taxes that venal officials had been 
appointed to collect. (Merchants and manufac- 
turers had the resources to buy urban posts, 
but rarely could they compete with the landed 
aristocracy for the more expensive offices at 
the provincial level [Beik 1985, pp. 34-55; 
Harding 1978; Dewald 1980, pp. 69-112; 
Wood 1980, pp. 95-98]. The crown’s 
massive sale of venal offices created the 
financial and social bases for local nobles to 
distance themselves from the great nobles 
who had dominated their provinces. Where 
seigneurs previously had turned to the princes 
and ducs et pairs for patronage and social 
prestige (Major 1964; Lefebvre 1973), they 
now became able to expand their family 
fortunes by investing in venal offices. Mem- 
bership on a provincial court provided 
office-holding nobles with status and political 
power independent of their ties to great 
aristocratic families. 

French kings, by creating corps of venal 
officials in the provinces, succeeded both in 
subverting the capacities of great nobles to 
mobilize lesser elites to challenge the crown 
on a national level and also in undermining 
the crown's ability to expand its ordinary 
revenues from taxes and duties. Once seigneurs 
were installed in venal offices, they had an 
interest in opposing the creation of new 
positions that would compete for the revenues 
and powers previously reserved to them. 
Thus, while the crown forged ties of 
obligation and dependency with venal office- 
holders, it also created a constituency with an 
interest in preventing it from further expand- 
ing its venal administration. 

While venality freed provincial nobles from 
dependence upon the great nobles, it did not 
make them automatic allies of the French 
crown. Unlike English monarchs who spent 
their windfall from the sale of monastic 
properties to entice the great magnates to 
depend on the crown, l6th-century French 
kings did not have the wealth to support 
notables at court. Instead, the crown used its 
political power to create nonseigneurial sources 
of income, but then it appropriated much of 
that income for the royal account. French 
kings, in their fiscal duress, made themselves 
both the patrons of and competitors with 
venal officials. 

Wars of religion were the archetypal 
conflicts of 16th-century France. French 
kings profited from the struggle between 
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Protestant and Catholic nobles to control 
provincial institutions, clerical tithes, and 
urban governments by alternately supporting 
one or another faction in return for the lion’s 
share of the resources seized by the crown’s 
allies from the losing side (Parker 1980, pp. 
46-94). In towns, factions often shared the 
same religious orientation and merely com- 
peted for political power and control over 
urban revenues (Parker 1980). Provincial 
and urban conflicts, while usually couched in 
religious terms or justified as the defense of 
ancient rights by provincial and corporate 
bodies, were precipitated when the crown 
attempted to expand its revenues by empow- 
ering one faction to collect taxes or control 
resources previously under the authority of a 
rival. 

Provincial and urban factions that were on 
the losing side of the monarch’s venal 
establishments sought allies within and with- 
out the French nation. In contrast to 16th- 
century English monarchs, who succeeded in 
preventing foreign interventions in domestic 
politics, French kings were often confronted 
by political enemies who enlisted foreign 
armies in their cause (Parker 1983, pp. 
271-45). The weakness of the French "state" 
is demonstrated by the frequency with which 
monarchs had to make concessions to domes- 
tic opponents so as not to create an opening 
for foreign intervention (Major 1964). By 
threatening to ally with Protestant foreign 
powers, an array of mostly Catholic nobles 
and Parlementaires forced the crown to call a 
national Estates General in 1614 which, at 
least temporarily, restricted royal authority to 
create new venal offices (Hayden 1974). 

Most 16th-century struggles were confined 
within the French nation. Factions raised their 
own armies to protect privileges against 
encroachments by the crown and its new 
venal allies. Although French monarchs 
succeeded in weakening the capacities of 
great nobles to mount armed challenges, 
venality created new opportunities for armed 
resistance to coalesce (Beik 1985; Harding 
1978). Many of the "antistate" rebellions 
catalogued by Charles Tilly often were 
instigated and fought by nobles and officehold- 
ers who sought to protect their venal 
privileges from the next generation of royal 
concessionaires. 

The three theoretical perspectives presented 
in the introduction each highlight some 
aspects of the initial development of absolut- 
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ism in 16th-century England and France. 

Anderson’s description of absolutism as the 

“displacement of politico-legal coercion up- 

wards towards a centralized militarized sum- 

mit” (1974, p. 19) is fairly accurate for 
France. There, a growing share of the surplus 

was extracted through taxes rather than rents, 

and even non-office-holding landlords be- 

came increasingly dependent upon royal 

edicts and Parlementaire decisions to raise 

peasant rents and to bring land under their 
direct control (Dewald 1980, pp. 162—201). 

Most manorial and magnates' feudal courts 

fell into disuse by the 16th century. Instead, 

lords and peasants took their disputes to 

provincial Parlements or requested reviews by 

the governors, intendants, or their aides. 

Anderson's approach is less helpful for 
understanding England, where the state’s ` 
development was shaped by the singular 
opportunity to appropriate clerical property 
and authority. The English Reformation had 

the unintended effect of reinforcing landlords’ 

abilities to appropriate the vast bulk of 
peasant payments and to regulate agrarian 

production themselves (albeit through commis- 

sions of the peace organized on the country 

level rather than by virtue of seigneurial 

authority on the manor). 

The emphasis given by Marx, Engels, 
Dobb, Porchnev, and Lublinskaya to the 
bourgeoisie as purchasers of state offices and 
as counterweights to the nobility receives less 
support from my analysis of the situation. 
Nobles, more than urban merchants, were the 
principal buyers of state offices in France and 
of the former clerical properties in England. 
A more persuasive case can be made that 
English and French monarchs, by stimulating 
markets in land and in offices, helped develop 
bourgeois interests, rather than the reverse 
causality that Engels posited. Monarchs in 
both countries expanded the ranks of the 
bourgeoisie by granting monopolies in trade 
and manufacture (Parker 1983, pp. 73-81; 
Stone 1970, pp. 85-86). However, many 
monopolists were nobles, and others sought 
to marry into aristocratic families. As I 
explain in the following section, merchants in 
both countries were as likely to ally with 
aristocrats to protect their comparable privi- 
leges as with the monarchs whose fiscal 
demands threatened to appropriate the profits 
from previously granted commercial conces- 
sions. ; 

Finally, Tilly and Mann’s emphasis on the 
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expanding military and fiscal capacities of the 
states misses the crucial contrasts in the two 
cases. The English crown grew wealthy by 
receiving and spending a windfall from the 
Dissolution of the Monasteries. Yet it was 
unable to build a bureaucracy capable of 
collecting taxes without the cooperation of 
Parliament and the gentry tax collectors. 
English monarchs gained military security by 
disarming the armies of rival magnates. In 
contrast, 16th-century French kings did not 
achieve military hegemony against domestic 
rivals. Although the French "state" did 
increase its revenues dramatically in the 16th 
century, most of the funds were collected and 
retained by venal officials whose interests 
often opposed those of the crown. 

The problems faced by theorists from the 
three perspectives in describing English and 
French absolutism stem from their failures to 
give sufficient causal weight to conflict 
among elites and to the divergent fates of 
clerics in the two countries. Elite relations 
created the opening for the Reformation in 
England and foreclosed it in France. The 
ability of English monarchs to build a 
horizontal absolutism derived from the weak- 
ness of the clergy, not from the crown's 
previous capacities, either absolutely or in 
comparison to the aristocracy. 

The inability of French kings to eliminate 
or buy out their national rivals forced them 
into the more gradual effort, which Tilly 
described, of building mutually reinforcing 
fiscal and military capacities by creating and 
manipulating rival corps of venal officehold- 
ers. However, the venal form of those 
achievements left French kings as the arbi- 
ters, rather than the executives, of state 
power. “Venality tied absolutism to its feudal 
past by concentrating a new form of private 
ownership of public authority which enabled 
rich and influential subjects—noble or bour- 
geois—to share in the profits and prestige of 
the state. It was a new expression of the 
king’s inability to control his society without 
conciliating his most powerful subjects” 
(Beik 1985, p. 13). Venality did not provide 
the basis for an autonomous or bureaucratic 
state. Instead, it incorporated elite interests 
once represented through independent institu- 
tions within the structure of absolutism, 
making the state itself the site for conflicts 
among elites, each with its own interests and 


jects. 
The difficulties with viewing the absolutist 
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state as representative of the interests of one 
class against another, or of a state elite 
against all classes outside the state, become 
more apparent in the following section where 
I identify the forces aligned on either side in 
the English Civil War and the French 
Frondes. Members of a state elite, as defined 
by Tilly or Mann, and of classes, as defined 
by the other theorists, were divided in their 
support for the English and French monarchs. 
The Civil War and the Frondes tested the 
extent to which English horizontal and French 
vertical absolutism were vulnerable to chal- 
lenges. The historical analysis in the follow- 
ing section also serves to test the explanatory 
power of the three theories against one 
another and against my elite conflict hypoth- 
esis. 


STATE POWER AND 
CLASS RESISTANCE 


The English kings of the early 17th century 
enjoyed virtually unchallenged national sover- 
eignty, though they possessed only limited 
ability to administer law and command 
resources at the county and local levels. Their 
efforts in the 16th century to preserve and 
enhance national horizontal hegemony fore- 
closed later strategies that might have allowed 
the government to penetrate county gentry 
organization. By subordinating the clergy and 
eliminating the magnates and their local 
bases, the crown allowed the gentry to 
consolidate landownership and local political 
authority. The crown created a structure in 
which gentry could not participate in national- 
level politics and therefore had no interest in 
supporting royal interests in the counties. The 
shift of manors from clergy and crown to 
gentry meant that transfers of surplus to the 
crown would have come at the expense of the 
gentry and to a lesser extent from urban 
merchants who were often also landlords and 
intermarried with rural gentry families (Stone 
and Stone 1984, pp. 257-58). Thus, for the 
gentry, the royal court became not a source of 
patronage and status, but a threat to their 
income and property. 

The outcome of the 16th-century conflicts 
between the clergy, nobility, and crown made 
English absolutism unique. No other monar- 
chy enjoyed the opportunity to eliminate rival 
elites at the national level. As a result, only 
the English crown was in a position to lose 
access to local manorial surpluses and author- 
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ity. Only in England was the integral feudal 
connection between national power and local 
lordship severed. The English crown came to 
head the first modern state, a political 
apparatus separate from and, as the English 
Civil War demonstrated, ultimately depen- 
dent upon the gentry who controlled produc- 
tion. 

The gentry became a distant class in the 
sense that it did not bave to share its control 
over agrarian land or labor with the crown, 
clergy, or magnates. As a result, the gentry 
could challenge crown authority on the 
national level without risking its authority 
over peasants on the local level. That is why 
the peasant rebellions that occurred concur- 
rently with the English Revolution and with 
the Frondes had such different consequences 
for landlords in the two countries. In France, 
as I demonstrate later in this section, local 
landlords continued to share control over the 
peasantry and indeed became increasingly 
dependent upon royal officials for extracting 
revenues from their tenants. Thus, elite 
Frondes against the crown also had the effect 
of undermining officials' and seigneurs' joint 
authority over peasants. The English king's 
loss of authority in 1640, and Charles I's 
subsequent efforts to exploit factional rival- 
ries among gentry in the Civil War, did not 
disrupt the local mechanisms used by the 
gentry to dominate tbeir tenants. The relative 
paucity of peasant rebellions during the 
English Civil War and the ease with which 
popular resistance was crushed (Charlesworth 
1983, pp. 39-41), confirm that the English 
king was militarily and ideologically superflu- 
ous to gentry class rule.? 


? My analysis agrees in part with that of 
Goldstone (1983) who argues that existing rela- 
tions among elites and the state, and not peasant 
rebellions, determined the outcomes of the English 
Civil War and the Frondes. However, his explana- 
tion for those conflicts differs from mine. He 
contends that the Civil War and the Frondes were 
instances of "state breakdown" caused by popula- 
tion increases that led to the subdivision of peasant 
holdings, landlessness, falling real wages, and 
rising prices, and in turn sparked peasant rebellions 
across Europe. Goldstone further contends that the 
rise in the proportion of landless English peasants, 
from 11 percent before 1560 to 40 percent after 
1620, resulted from simple population increase and 
the resultant division of tenancies among sons. 
Elsewhere (Lachmann 1987, pp. 100—141) I argue 
that the gentry used their control over the county 
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In contrast, the crown, once it had 
exhausted its monastic endowment, became 
dependent upon the gentry for the resources 
needed to wage foreign wars. The crown's 
attempt between 1634 and 1639 to collect a 
tax for military expenses, the Ship Money, 
was emblematic of the situation. The gentry 
collected and paid the tax as long as it was 
used to equip the navy, a policy they 
supported. However, when Charles engaged 
in war, the gentry refused to collect the tax 
(Barnes 1961, pp. 203-43). Merchants proved 
capable as well of challenging in Parliament 
crown attempts in 1608 and after to increase 
custom duties (Dietz 1964, pp. 372-76). The 
crown lacked alternative sources of income. 
By 1640, the crown and subordinate clergy 
held less than 10 percent of English manors, 
compared with 40 percent before the Dissolu- 
tion (Lachmann 1987, p. 98). 

Charles I, stymied in his efforts at state 
formation, attempted to break out of the 
structural constraints created by the previous 
century of absolutist strategy. He sought to 
win allies at the county level with the promise 
of patronage from assets to be expropriated 
from landlords not allied with the crown. 
However, more landlords stood to lose 
property and privilege under Charles I than 
could gain from the distribution of patronage. 


Commissions of the Peace, and their influence in 
Parliament, to gain approval for schemes to evict a 
plurality of peasants from the land in the century 
from 1540 to 1640. Goldstone’s model ignores the 
widespread practice of primogeniture among peas- 
ants (Howell 1975) and the fact that it was the 
portion of landless peasants and not of tiny 
holdings that rose in the century before 1640. He 
cites: Spufford (1974) to support his argument, 
even though in two of her three case studies 
middle-sized copyhold tenancies disappeared only 
after landlords imposed Parliamentary ascertain- 
ment surveys or after a landlord reached an 
enclosure agreement with freehold tenants that 
disadvantaged copyholders. 

This is not the occasion to attempt to resolve my 
disagreements with the demographic model re- 
cently championed by Goldstone. What is at stake 
here is the effect of peasant rebellions upon the 
relative capacities of different elites to prevail in 
their struggle to control the state in England and 
France. Goldstone’s denial of the gentry’s greater 
control over tenants relative to that of French 
seigneurs leaves him unable to explain why 
peasant rebellions so weakened the Frondeurs 
while having a far less significant effect on the 
antiroyalist side in England. 
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Charles’s efforts to raise revenues by manip- 
ulating monopolies and trade privileges alien- 
ated London merchants who “backed radical 
new governors . . . who seized power from 
the Royalist old guard by means of a peaceful 
political revolution in 1641” (Stone 1970, p. 
106). While Charles I's statecraft has been 
compared unfavorably to that of his predeces- 
sors, the horizontal absolutism created by 
earlier monarchs left Charles with the choice 
of setting policy in accord with the wishes of 
Parliament and the county gentry or of 
exercising autonomy and thereby provoking 
revolution. The mobilization of a majority of 
gentry yeomen and merchants against crown 
demands, and the monarch's defeat in the 
Civil War, were determined by the elite 
structure that the crown had created in its 
earlier pursuit of absolutism at the national 
level. 

In their failure to eliminate magnates who 
held regional power, French monarchs pre- 
served access to lesser landlords who directly 
collected rents and taxes. Venality was a 
second-best strategy to compete with power- 
ful magnates and clerics for the allegiance of 
locally based seigneurs. Unable to subordi- 
nate the clergy or magnates in full, the crown 
secured revenues by selling military, jurisdic- 
tional, and honorific titles to rivals of the 
magnates within the provinces, and to com- 
petitors of the old oligarchies in the cities. 

The political costs of new taxes and offices 
for French kings escalated in the first decades 
of the 17th century, an era of rapid escalation 
in military costs and of almost continuous 
domestic and international war even after the 
end of the Thirty Years’ War. The crown's 
military expenses rose from 5 million livres 
per annum in 1600—1609 to 16 million in the 
1620s, 32 million in 1635, and 38 million in 
1640 (Parker 1983, p. 64) The crown 
attempted to meet those expenses by selling 
new venal offices and by demanding that 
incumbent officials and corporate bodies 
increase their payments to the crown. Such 
increases would have compounded the finan- 
cial pressures already created by competition 
from new venal officers who had been given 
the right to tax the same production and trade 
as the older officials. Urban and provincial 
officials used tax strikes to force the crown to 
rescind sales of new venal posts. As a result, 
crown revenues from venal office sales, 
which had peaked at 30 million livres in 
1639, collapsed in the following two decades, 


AMERICAN SOCIOLOGICAL REVIEW 


bottoming out at an insignificant 800,000 
livres in 1661 (Dent 1967, pp. 247-50). 

The French kings of the early 17th century 
“neither taxed nor spent at will and local 
elites retained a great deal of control over the 
practical workings of the [fiscal] system” 
(Collins 1988, p. 2). However, unlike the 
English monarchs who had lost their hold 
over the gentry-JPs with the elimination of 
magnate power, the French kings had been 
unable to dislodge the great nobles’ provincial 
power. French monarchs found that they 
needed to ally with the most powerful 
provincial nobles if they hoped to extract 
further resources from the lesser aristocrats 
within each province. Provincial governors 
appointed from among the great aristocratic 
families became the main instrument of 
royal policy at that time. 

The most successful governors, in terms of 
preventing fiscal or military rebellion within 
their provinces, were those men with indepen- 
dent networks of clients made up of lesser 
landlords and provincial officials to whom 
they afforded political and military protection 
(Harding 1978; Bonney 1978). By siding with 
their clients to prevent the creation of new 
competitive venal offices, governors ensured 
that the incomes and market values of existing 
offices escalated dramatically. A seat in the 
Parlement of Provence at Aix worth 3-6000 
livres in 1510 rose in value to 40—50,000 in 
1633, an increase of 400 percent after 
accounting for inflation (Kettering 1978, pp. 
221—25). Similar increases occurred in other 
provinces and for other offices (Dewald 1980, 
pp. 131-61; Collins 1988, pp. 80-87). 
Governors used their influence over appoint- 
ments to the shrinking supply of new or 
vacant offices to reward their clients. Addi- 
tionally, governors maneuvered to ensure that 
their allies among venal officials controlled 
the Parlements, lesser courts, and urban 
councils. 

Governors found that they could best 
advance their political and financial fortunes 
by acting as mediators between crown and 
provincial interests. When governors were 
able to head off sales of crown offices and tax 
increases, they won the loyalty of urban and 
provincial officials who had been threatened 
by the crown's fiscal demands (Harding 
1978). Governors were able to compound 
their political control by supporting the 
crown's demands for tax increases and then 
rebating most of the increase to favored allies 


ELITE CONFLICT AND STATE FORMATION 


in the form of commissions for collecting the 
taxes. This allowed governors to punish 
obstreperous officials by excluding them from 
the largesse made possible by the new taxes. 
As long as governors were able to buy the 
loyalty of a majority of their Parlements, 
lesser courts, Estates, and urban councils, 
they could prevent a unified opposition by 
those provincial bodies. 

Through the governors, French monarchs 
solved the problem that plagued 17th-century 
English kings: how to win control over those 
local officials who had direct access to 
resources. French kings could accomplish this 
because they had not been able to eliminate 
magnates who had consolidated their control 
over the provinces. However, the French 
crown paid a high fiscal cost for peace with 
provincial officials. Aside from the loss of 
revenues from the sale of venal offices, crown 
direct tax receipts failed to keep up with 
inflation in the decades leading up to the 
Frondes (Dent 1967). Despite the drastic 
increase in the nominal direct tax burden from 
10 million livres in 1610 to 36 million in 
1635, most of it was used to pay commissions 
to officials or interest to financiers on old 
crown debts, or was spent by the governors 
within the provinces. Although some of the 
tax revenues spent by the governors went for 
crown-approved military and administrative 
activities within the provinces, kings often 
were unable to divert monies for projects not 
supported by governors or for use outside the 
province. Revenues received or controlled by 
the king reached a peak in 1635 and then 
declined as a result of tax strikes by peasants, 
unwillingness of officials to collect new taxes 
or surcharges, and officials’ skimming an 
increasing portion of taxes actually paid 
(Collins 1988, pp. 163-64 and passim). 

The crown reacted to the fiscal crisis 
created by rising military expenses and falling 
tax receipts by pressing governors to super- 
vise provincial tax collectors better. The 
crown circumvented governors unwilling or 
unable to antagonize their provincial clients 
by appointing intendants to supervise tax 
collection and by cutting back on the 
resources kept by provincial elites. Height- 
ened fiscal extractions caused an aristocratic 
reaction. Sharon Kettering (1982), in a 
comparison of the Parlements which joined 
the Frondes and those which remained loyal 
to the crown, found that judges rebelled in 
provinces where governors were allied with 
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intendants or were ineffectual at preventing 
the crown's representative from establishing 
new venal courts that cut into their revenues. 

The Frondes were aimed at limiting the 
crown's ability to establish new offices or to 
issue decrees that would undermine the 
existing powers of venal bodies. A common 
demand was to abolish the intendants. The 
Parlement of Paris articulated the Frondeurs' 
sentiments by demanding that judges retain 
their power to decide whether to register, 
amend, or reject royal decrees. Provincial 
Parlements echoed that position within their 
provinces. The Parlements were supported by 
a variety of provincial and urban groups. 
Venal officials threatened by governors and 
intendants' efforts to augment crown revenues 
were the principal supporters of the Frondes. 
Some of the most powerful governors stood 
with their clients and against crown policies 
that would have interjected intendants into 
provincial politics, undermining the gover- 
nors' bases of authority (Harding 1978, pp. 
199-212; Moote 1971; Bonney 1978). 

Despite the interest of venal officeholders 
to limit crown power, the Frondeurs were 
unable to unite and sustain a rebellion. 
Although they began the rebellion with 
superior military forces and strategic advan- 
tages, they were defeated because the crown 
reached accords with particular opponents and 
thereby isolated and then defeated the remain- 
ing Frondeurs (Moote 1971, pp. 316—54; 
Kettering 1978, pp. 277-97). Michael Mann 
errs in accounting for the divergent outcomes 
of the Civil War and the Frondes by 
emphasizing the preexisting military capaci- 
ties of the monarchs. He contends that the 
French kings commanded armies that could 
be turned against domestic as well as foreign 
enemies, while English monarchs concen- 
trated their military investments in a navy 
more appropriate for an island kingdom 
(1986, p. 478). In fact, both Charles I and his 
Parliamentary opposition were able to raise 
armies as well as the French king, the 
magnates, and corporate bodies involved in 
the Frondes on both sides. We must examine 
the alliances and conflicts of the Frondes, 
rather than the monarch's previous military 
capacities, to explain why some provincial 
elites broke with their fellow aristocrats and 
submitted to crown rule, thereby providing 
the crown with the necessary armies and 
fiscal resources to outman and outlast the 
Frondeurs' forces. 


158 


French lay seigneurs never were able to 
establish direct domination over peasants 
unmediated by rival feudal elites. French 
agrarian class relations retained the character- 
istic feudal form of multiple elites regulating 
and profiting from agrarian production. French 
monarchs’ absolutist strategies had weakened 
clerical and magnate control over land tenure, 
while they enhanced the powers of Parlement 
and lesser judges to intervene in landlord- 
tenant relations. Rather than a single class of 
landowners as in England, a number of 
French elites jostled for the rights to collect 
taxes and rents from the peasantry. After a 
century and a half of crown manipulation of 
provincial elites, and after the reorganization 
of aristocratic power within venal offices, 
each elite’s authority came to rest upon royal 
grants of “privileges which were subject to 
differing interpretations and which were 
defined in reference to the king” (Beik 1985, 
p. 219). 

Once the Frondeurs had rejected crown 
regulation of crown-granted privileges, pro- 
vincial factions were left to sort out their 
overlapping jurisdictions. As a result, the 
weaker elements within the Frondeur coali- 
tions fared less well in competition with 
provincial rivals than they had as crown 
subordinates. Officials and nobles whose 
positions the crown had most recently estab- 
lished and who were not clients or allies of 
the leading Frondeurs were the least likely to 
have joined the Fronde, or most likely to 
abandon quickly the rebels’ common front 
(Kettering 1982, pp. 294—304). Frondeurs 
were interested in fighting each other as well 
'as the crown. While the crown's venal 
strategy had not pacified provincial elites, it 
had precluded the formation of a common 
front of similarly situated nobles in opposition 
to the royal court. 

The inability of French provincial elites to 
act as a class, demonstrated by their conflicts 
with one another over privileges, was con- 
firmed by their failure to suppress the massive 
peasant resistance during the Frondes. The 
crown's venal strategy had diminished the 
seigneurs' capacities to collect rents and to 
exercise military domination over the peas- 
ants, and substituted instead state-mediated 
taxation collected by venal offices. As a 
result, when peasants went on tax strikes 
against the crown under the cover of the elite 
Frondes, Frondeurs’ venal incomes from tax 
collection were lost. Frondeurs had few cash 
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reserves and were unable to support the 
private armies needed to defeat both the 
crown and peasant rebellions. The monarch, 
by contrast, could raise funds from great 
financiers whose investments in royal debt 
required the crown to dominate the provinces 
to ensure repayment. Financiers provided the 
crown sufficient resources to outlast and 
defeat the Frondeurs (Bonney 1981 pp. 
213-14, 228-41; Parker 1983, pp. 95-117). 

The Frondeurs' inability to sustain armed 
resistance against the crown or to defeat 
peasant rebellions resulted from the crown's 
establishment and manipulation of venal 
offices that had fused the powers of the 
various elites into a single state structure. 
"The rulers could not subtract their sphere 
from the larger politv because their authority 
was based on a system of shared power and 
graduated privilege which was presided over 
by the monarch. Without the king there could 
be no hierarchy of authorities and no ‘division 
of labor' among them. Yet there was no 
possibility of a royal ‘takeover’ either because 
the king relied on his social allies within the 
province and had no alternative to their rule” 
(Beik 1985, p. 219). The fiscal crisis of the 
early 17th century demonstrated the limits of 
crown control over provincial nobles, just as 
the Frondes proved that aristocrats and venal 
officials could not form a class outside of the 
absolutist state. The political settlement 
achieved after the Frondes was not the 
monarch’s victory over the aristocracy, nor of 
state bureaucrats over civil society, but the 
accommodation of formerly autonomous feudal 
elites within a single state structure. 


CONCLUSION: ELITE CONFLICT AND 
STATE FORMATION 


The debate among scholars of absolutism 
centers on who benefited and lost from the 
formation of national states in 16th- and 
17th-century England and France. The three 
main theoretical perspectives argue that the 
beneficiaries of absolutism were also the 
agents of its advancement. Thus, for Ander- 
son the aristocratic class organized and gained 
against peasants and bourgeois from the 
centralization of state power. Proponents of 
the state-centered and relative autonomy 
models contend that state officials, separate 
from the principal classes of feudalism and 
capitalism, built up state capacities for their 
own ends. The relative autonomy thesis 
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argues that state officials were aided at 
particular moments by the weaker of the two 
principal classes, noble or bourgeois, against 
the stronger class because the weaker class 
was also helped by a strong state. My 
historical analysis suggests that interests and 
actors cannot be defined simply in terms of 
class or state. I argue that the relevant actors 
are elites, organized around a multiplicity of 
organizational sites. I now review the histori- 
cal evidence to clarify where my elite-focused 
model is helpful to identify the bases and 
explain the outcomes of conflicts in the two 
societies. 

At the outset of the 16th century, lay 
landlords, clerics, and monarchs and their 
agents were organized to protect their inter- 
ests from one another’s encroachment as well 
as from peasants’ resistance to meeting the 
separate obligations imposed by each elite. At 
times elites shielded peasants from a rival 
elite’s attempt to extend is power by appropri- 
ating additional agrarian resources. Their 
conflicts make it difficult to analyze the three 
elites as a single class. 

The principal forces that reordered the 
relationships among the three elites and with 
peasants during the 16th century were evident 
in the maneuvering by each elite to strengthen 
its organizations against those of the rival 
elites. The differential abilities of each elite in 
the two countries to extend its organizational 
authority at the expense of rivals resulted in 
two different patterns of absolutism. The 
English development is described as horizon- 
tal absolutism to emphasize the monarch’s 
ability to sever clerics’ and some aristocrats’ 
ties to agrarian production while absorbing 
the heads of those two elites within a unified 
national court. Those crown “victories” 
created two separate sites of political power: 
an isolated monarchy on the national level, 
and a unified gentry that did not have to share 
with other elites its access to peasant 
production on the local level. 

A state elite with only an indirect and 
parasitic relationship to both peasants and a 
capitalist gentry was the result, not the 
creator, of absolutism. The state elite, which 
is seen as the agent of absolutism in both the 
state-centered and relative autonomy models, 
was instead an artifact of English state 
formation. The process by which the crown 
severed its clerical and aristocratic rivals from 
their agrarian bases created a new elite of 
monarch and clients. Where the English 
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monarch of the early 1500s had been one of 
three elites defined by a national organization 
supported from direct rights to peasant 
production, a century later the crown had 
absorbed its clerical and magnate rivals into a 
single organization with a monopoly on 
military power and political authority at the 
national level. Yet the conflicts that amalgam- 
ated portions of three elites within one state _ 
also removed those actors from direct control 
over the agrarian economy, transforming 
them into the sort of state elite described by 
state-centered theorists. At the same time, the 
seigneurs excluded from positions of power 
and patronage in the absolutist state were 
transformed into a capitalist gentry in the 
absence of royal power at the local level. 

The English Revolution was indeed a 
conflict between a state elite and a gentry 
class whose interest was defined by its new 
monopoly of control over agrarian produc- 
tion. In that confrontation, the military and 
administrative capacities emphasized by state- 
centered theorists proved less powerful than 
the gentry's ability to pursue its interests 
vis-à-vis the peasantry without reference to 
the royal state's grants of authority. Even 
when Parliament and the gentry split into 
opposed factions in the Civil War, Charles I 
was unable to recruit enough allies from 
among the gentry, London merchants, or 
county yeomen to win. The king was such an 
uninviting ally because he was not vital to the 
maintenance or expansion of any elite's 
economic power. The king lacked the re- 
sources to sustain a patronage network 
sufficient to dominate national politics, and 
therefore a crown victory would have harmed 
more gentry, yeomen, and merchants than it 
could have aided. 

The Frondes, in contrast, fit less well into 
the archetypal struggle between tax-collecting 
state officials and tax-paying state subjects. 
The French crown's inability to achieve 
horizontal dominance over national rivals 
forced it to cede areas of sovereignty to the 
institutions now controlled by other elites. 
The French crown could weaken rival elites 
only by creating more sovereign institutions, 
and thereby new venal elites. The Frondeurs' 
defeat was less a consequence of the crown's 
independent capacity to deploy force, and 
more a consequence of the embedding, over 
two centuries, of each elite's interests within 
a vertically organized state. Antistate rebel- 
lion failed because most elites were sited 
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within that state and could express their 
interests only through it. 

Anderson is correct in describing the 
absolutist state as a redeployment of the 
feudal ruling class. However, in looking to 
the struggles between peasants and landlords 
for the dynamic of state formation, he is 
unable to explain the differences in the rise of 
English and French absolutism, nor the early 
demise of the former. Both Anderson’s class 
analysis and Marx and Engels’ relative 
autonomy model lead us astray in depicting 
the bourgeoisie as external to, if for a time 
dependent upon, the absolutist state. The 
bourgeoisie emerges differently in my analy- 
sis. The varying sites at which it organized 
and defended its interests in the two countries 
appear less the product of an autonomous 
process of capitalist development and more 
the result of openings created by the particular 
paths of elite conflict and absolutism in 
England and France. In England, elite 
conflict created a structure of gentry class rule 
outside of, and hence capable of destroying, 
absolutism, and it set up an opportunity for 
London merchants to oppose the king. In 
France, vertical absolutism created narrow 
opportunities for holders of capital to buy 
offices and to invest in state debts and 
precluded a broad alliance of officeholders 
and landlords against the crown. The nature 
of the 17th-century French bourgeoisie forced 
it to define its interests with the state in the 
Frondes, while the locally based English 
gentry was divorced from the state. Bourgeois 
strategies in the two countries were deter- 
mined by that class’s structural location, not 
by an absolute measure of strength or 
maturity. Possibilities for bourgeois political 
agency developed as artifacts of conflict 
among elites. 

The state should be brought back into the 
study of social relations in early modern 
England and France, not as an actor or an 
interest, but as the outcome of long chains of 
conflicts among a complex of actors. Absolut- 
ism will be understood only when its diverse 
forms are brought into the larger study of how 
feudal elites transformed themselves and 
thereby formed new structures within which 
they struggled and ruled. 
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RELIGION AND THE RISE OF LIBERAL-DEMOCRATIC 
IDEOLOGY IN 17TH-CENTURY ENGLAND* 


DAVID ZARET 
Indiana University 


In classical and contemporary sociology, key elements of liberal-democratic 
ideology are seen as secular extensions of Protestant ideas. This case study 
provides a different analysis that emphasizes the problem of religious conflict and 
radicalism in early liberal-democratic ideology. Proponents of the new ideology 
rejected key tenets of their Puritan heritage, adopting deistic beliefs that 
legitimated pluralism and tolerance and opposed the older Puritan ideal of godly 
politics. Building on recent work in the sociology of culture, the paper outlines an 
analytic strategy for explaining change in ideological systems. Ideological change 
emerges out of the interaction of contextual pressures and intellectual precedents, 
as a collective response by ideological innovators to problems of authority. The 
analysis in this study shows how historical events can form an episodic context 


which structures this problem of authority. 


Religion and the rise of liberal democracy is 
not a topic that has provoked much contro- 
versy among sociologists. Unlike the debates 
over religion and capitalism, there is a 
uniform account of the Protestant origins of 
the democratic culture in Anglo-American 
society. The fundamental tenet of this cul- 
ture —the priority of tolerance over revelation 
as a precondition for the pluralist pursuit of 
utility —is seen as a secular extension of 
Protestant beliefs, such as the sanctity of 
conscience or the priesthood of all believers. 

This paper provides a different account of 
religion and liberal-democratic ideology. It 
challenges the view that Protestantism was a 
source of democratic ideas.! Liberal- 
democratic ideology was not a secular 
extension of Protestant beliefs; it developed 
explicitly against the application of these 
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! There are aspects of universalism in democ- 
racy that do not derive from religion, for example, 
citizenship as an extension of aristocratic privi- 
leges. 
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beliefs to politics, rejecting the claim, then 
prevalent in Protestantism, that politics and 
religion were inseparable. This rejection, by 
John Locke (1632-1704) and others, shaped 
key tenets of the new ideology: in religious 
matters, tolerance; in politics, the priority of 
empirical perceptions of utility over revela- 
tion. 

Still, Locke and other Whig writers came 
from Puritan backgrounds and maintained 
Protestant identities. What made their commit- 
ment to a secular framework for politics 
compatible with a Protestant identity was the 
radically altered version of Protestantism — 
rational religion or deism—that they adopted 
in place of Puritanism. Their religion was as 
far reformed from the Reformation, from the 
world of Luther, Calvin, and the Puritans, as 
the Reformation was from medieval Catholi- 
cism. . 

At the core of these developments lay the 
problem of religious conflict. Liberal- 
democratic ideology was a response to the 
radicalism and intellectual dissonance gener- 
ated by conflicts internal to Protestantism. It 
was not opposed solely to the divine-right 
theories of monarchy upheld by the Whig’s 
opponents, the Tories. In response to prob- 
lems posed by claims to revelation in public 
life, liberal-democratic ideology secularized 
political discourse and revised Protestantism 
so as to excise nearly every element of 
revelation from religious life. Whereas the 
standard sociological account points to the 
Protestant sources of liberal-democratic ideol- 
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ogy, this paper advances nearly the opposite 
claim: the new ideology developed as its 
proponents adopted deistic beliefs and re- 
jected central tenets of their Protestant 
heritage. 

These empirical issues raise a broader set 
of theoretical issues that pertain to the 
sociology of knowledge and its application in 
analyzing ideological systems. The standard 
account derives from Durkheim and Weber. 
Their specific views on Protestantism and 
democracy reflect an analytic strategy that 
analyzes cultural change as an aspect of 
differentiation (Durkheim) or rationalization 
(Weber) in history. Obviously, if the standard 
account is as wrong as I think it is, then the 
analytic strategy that produces such accounts 
requires revision. Building on recent develop- 
ments in cultural sociology, this paper 
outlines an alternative strategy that leads to 
subtler and empirically more sustainable 
analyses of change in ideological systems. 

First, I present the analytic strategy and 
evidence for a new account of Protestantism 
and liberal-democratic ideology. Next, I 
describe the older account, its origins in work 
by Durkheim and Weber, and its empirical 
shortcomings. In its place, I then develop the 
new account that has a better fit with the data 
that disconfirm the older accounts. 


THEORETICAL AND 
EMPIRICAL ISSUES 


The task of this paper is to explain ideological 
change. This requires an analytic strategy for 
drawing causal inferences about the sources 
of ideological change and evidence that can 
sustain these inferences. 


Theoretical Issues 


The analytic strategy in this paper derives 
from recent work in the sociology of culture 
(e.g., DiMaggio 1987; Gieryn 1983; Gris- 
wold 1983; Swidler 1986; Wuthnow 1987; 
Zaret 1985). This work is critical of older 
macrosociological studies of culture that rely 
on overly simple, reductionist models of the 
relation of culture to social structure. Causal 
relations between culture and social structure 
are held to be sufficiently complex so as to 
preclude the specification of a simple, 
invariant relationship. For example, the 
mimetic aspect of culture—as a reflection 
of its social setting—may be variable. Under 
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different conditions, artistic genres (see Di- 
Maggio 1987) or beliefs (see Swidler 1986) 
can possess different degrees,of autonomy 
from their social structural setting. 

Order and change in ideological systems 
cannot be explained directly by social struc- 
tural or cultural factors. These factors are 
important, but their influence is mediated by 
the specific contexts for the specialized tasks 
of cultural production. It is in these contexts 
that ideological producers respond to the 
problem of contested authority. For example, 
recent studies examine art worlds (Becker 
1982), professions (Gieryn 1983), state insti- 
tutions (Wuthnow 1987), and social move- 
ments (Zaret 1985) as organizational contexts 
in which ideological producers respond to 
challenges to their authority. Although these 
contexts, and the problems of authority in 
them, have social structural and cultural 
components, neither the contexts nor the 
problems of authority are reducible to these 
components. In each instance, empirical 
specification must establish the parameters of 
this context in which ideas about the social 
world (ideologies), as they are defined, 
invoked, and altered, undergo what Karl 
Mannheim (1936, p. 78) called their existen- 
tial conditioning. 

This paper furthers this line of research by 
showing that the context mediating between 
ideology and social structure can be framed 
not only by an organizational context but also 
by an episodic context—by historical events. 
Sequences of historical events can provoke 
changes in beliefs about the social world, 
such as the French Revolution and the modern 
idea of revolutionary praxis. Other examples 
include Fussell’s account (1975) of a new 
literary sensibility etched by the experience of 
World War I; Elder's analysis (1974) of 
changing views on family and work as 
responses to the depression; and Schwartz's 
argument (1983) that a new type of political 
leader, the uncharismatic hero, reflected 
Washington's role in the War for Indepen- 
dence. 

Like organizational contexts, episodic con- 
texts influence the probability and direction of 
ideological change because they present 
problems of authority for the ideological 
producers. And in both contexts, the issue of 
contested authority cannot be reduced either 
to ideological or social structural factors. 
Althougb these factors shaped the English 
Revolution—the episodic context in this 
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paper—they predetermined neither its course 
nor the problem of contested authority in it. 
These theoretical reflections lead to the 
following strategy for explaining ideological 
change. Specification of an organizational or 
episodic context should guide descriptions of 
the contested authority that confronts ideolog- 
ical producers. Ideological change can then 
be analyzed in terms of the interaction of 
these contextual pressures and the intellectual 
precedents used to alter the ideology (see 
Zaret 1985). The salience of any given 
precedent derives from its utility for resolving 
or minimizing the problem of contested 
authority. This utility can derive from the 
conceptual structure, the rhetoric and imag- 
ery, or the social distribution of a precedent. 


Empirical Issues 


This analytic strategy requires evidence of 
ideological change as a collective response to 
the problem of contested authority. The 
problem and the response should be located in 
an organizational or episodic context. Causal 
inferences about ideological change as a 
response to contested authority should be 
supported by textual evidence and by other 
evidence that links the ideologues as a group 
united by shared experiences (social back- 
ground, cohort, education), by networks of 
friendship, patronage, and formal organiza- 
tions, and by access to the intellectual 
precedents for the new ideology. 

The analysis in this paper refers to two 
ideological groups that partly overlap: the 
proponents of natural religion and the Whig 
theorists. The episodic context commences in 
the English Revolution, beginning with the 
Civil Wars between the King and Parliament 
(1642-48), the Regicide (1649), the creation 
of the Republic (1649-53) and Cromwell’s 
Protectorate (1653—58), and the Restoration, 
in 1660, which restored the Stuart monarchy. 
In this context, sectarian attempts to apply 
Protestant tenets to politics, to build a holy 
commonwealth, generated considerable disso- 
nance and. radicalism among Protestants. 
Liberal-democratic ideology developed, along 
with natural religion, as a collective response 
to these intellectual and social problems. 

The appearance of modern democratic 
ideas in 17th-century England represents a 
classic case of ideological change. The new 
ideology rejected a fundamental premise in 
the political discourse of both the Puritans and 
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the Royalists. Instead of religious rationales 
for political choices, the new ideology put at 
the center of politics the adjudication of 
empirical claims on individual utility. At a 
stroke, Locke dismissed the rationale that 
both sides, Puritan and Royalist, had ad- 
vanced during the English Revolution. The 
ideology of Locke and other Whig writers— 
for example, Anthony Ashley Cooper, the 
Earl of Shaftesbury (1621-83); Algernon 
Sidney (1622-83)—did not dominate English 
political life until well after the 17th century. 
Complete toleration, the destruction of corpo- 
rate privileges and royal prerogatives, and a 
universal franchise occurred in the 19th and 
20th centuries. But this only highlights the 
precociousness of the new ideology and 
makes its appearance all the more puzzling. 
Although the subsequent development of 
early Whig thought is a complex topic (see 
Ashcraft 1986), it is clear that Locke, Sidney, 
and the Earl of Shaftesbury developed the 
fundamental tenets of modern liberal democ- 
racy. 

The proponents of the new political and 
religious doctrines constituted cohesive groups 
within the high cultural establishment of early 
modern England, united not only by their 
ideas but also by social background and 
shared experiences and by friendship, patron- 
age, and formal organizations. They reached 
adulthood during the revolutionary era and 
had attended-Oxford or Cambridge. They 
came from the middle to upper levels of 
English society, from the merchants and 
tradesmen to the gentry and aristocracy. Most 
grew up in Puritan families but all rejected 
Calvinist theology and the sectarian ideal of a 
holy commonwealth (Jones 1961; Lacey 
1969). 

Complex interpersonal ties united the Whig 
ideologues. Shaftesbury, the leader of the 
early Whigs, was Locke’s aristocratic patron 
and political mentor; and though Shaftesbury 
and Sidney were not friends, they shared 
many of the same supporters and friends. The 
other Whig ideologues cited in this article 
collaborated with Locke, Sidney, and Shaftes- 
bury (Ashcraft 1986). Similar interpersonal 
ties also united the proponents of the radically 
revised Protestantism—natural religion—that 
was associated with early Whig ideology. Not 
all proponents of natural religion were Whigs, 
but most Whig theorists, such as Locke, 
supported it. The central figures in this 
development (for details, see below, p. 172) 
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included prominent individuals in the church 
establishment after the Restoration, the Royal 
Society, and the Neoplatonic philosophers at 
Cambridge. Locke and Shaftesbury belonged 
to the Royal Society, and Locke was an 
intimate acquaintance of the liberal church- 
men and Cambridge philosophers (M. Jacob 
1976; Shapiro 1969). 


THE STANDARD SOCIOLOGICAL 
ACCOUNT 


Before Durkheim and Weber codified the 
argument, it was a commonplace in sociolog- 
ical analysis that liberal-democratic ideology 
was an extension of Protestantism. Comte, de 
Tocqueville, and Marx described Protestant- 
ism as a symptom of the global forces that 
leveled ancien regimes and established demo- 
cratic rule. This view guides Comte’s nega- 
tive appraisal of the revolutionary implica- 
tions in “the dogma of unbounded liberty of 
conscience” ([1830—42]1896, 2:148—54). It 
also appears in the more sympathetic observa- 
tions of Tocqueville, for whom "Puritanism 
was not merely a religious doctrine, but 
corresponded in many points with the most 
absolute democratic and republican theories" 
([1835—40]1945, p. 33). Marx saw Protestant- 
ism as capitalist ideology: in capitalism, 
“Christianity with its cultus of abstract man, 
more especially in its bourgeois develop- 
ments, Protestantism, Deism, etc., is the 
most fitting form of religion" ([1867]1967, p. 
79). Puritanism was the ideology of the 
“bourgeois revolution" in England; and after 
“the real aim had been achieved, when the 
bourgeois transformation of English society 
had been accomplished, Locke supplanted 
Habakkuk" ([1852]1979, p. 105). 


Durkheim and Weber 


These views influenced Durkheim and Weber, 
who saw democratic ideology as a secular 
extension of tendencies in Protestant religion, 
liberalism as Protestantism without God. 
What distinguished their views from the 
antireligious rationalism of the Enlightenment 
is the sociological supposition that some 
varieties of Profestantism inadvertently pro- 
moted the liberalizing dissociation of politics 
and religion. In Protestantism, Durkheim and 
Weber found a radical individualism that 
legitimated the idea of individual rights, 
intellectual tolerance, and the ideal of a 
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universal franchise. The latent liberalism in 
this religion, admittedly not foreseen or 
favored by Luther or Calvin, was unleashed 
by secularization. On this, Durkheim and 
Weber are essentially of one mind, but then 
their thinking diverges on three points. 

First, Durkheim’s analysis of seculariza- 
tion shows how the differentiation of social 
structures leads to new divisions between the 
sacred and the profane; Weber emphasizes 
instead the rationalization of charisma. Sec- 
ond, Durkheim adopts a mimetic or correspon- 
dence theory that makes change in religious 
and political beliefs a function of social 
structural change. Weber’s doctrine of elec- 
tive affinity points to the mutual influences of 
economic, religious, and political spheres as 
they undergo rationalization. Third; the two 
prior points indicate why Durkheim devel- 
oped a more limited treatment of Protestant- 
ism as a symptom of modern individualism, 
whereas Weber saw Protestantism as both a 
symptom and a cause of this individualism.? 

Durkheim held that "if there is one truth 
that history has incontrovertibly settled, it is 
that religion extends over an ever diminishing 
area of social life. . . . gradually political, 
economic and scientific functions broke free 
from the religious function, becoming sepa- 
rate entities and taking on more and more a 
markedly temporal character." This process 
“has continued uninterruptedly from earliest 
times," for it "is bound up with the basic 
conditions for the development of societies" 
([1893]1984, pp. 119-21). Differentiation not 
only separates religious and political func- 
tions but also produces moral individualism, 
which Durkheim regarded as the essence of 


. Protestant ideals ([1897]1951, pp. 158-59). 


Subsequently, these ideals appeared in poli- 
tics, in democratic ideologies that displayed, 
in secular form, the moral ideals of Protestant- 
ism. These ideologies are secular religions in 
which “this cult of man has as its primary 
dogma the autonomy of reason and as its 
primary rite the doctrine of free inquiry” 
([1898]1973, p. 49). 

Weber's analysis of Protestantism's politi- 





? The point about Durkheim applies with even 
greater force to Marx, and to modern Marxist 
scholars, who regard Protestantism and liberal- 
democratic ideals as reflections of the formal 
principles of exchange (e.g., Goldmann 1973; 
Marcuse 1972, pp. 56—78; and, more generally, 
see Friesen 1974). 
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cal implications reflects his preoccupation 
with rationalization. For Weber, the rational- 
ization of religion leads to increasingly 
dualistic worldviews, to what he calls ethical 
religion, which sharply separates worldly and 
spiritual realities. Ethical religion portrays the 
universe “as a cosmos governed by imper- 
sonal rules” and disenchants it as a lawlike 
order created by a God of, but not in, the 
world. Evil and suffering appear as conse- 
quences of human weakness, not evil or 
capricious gods. In place of magical and 
highly ritualized practices, there arises the 
ethical pursuit of salvation, a venture guided 
by abstract beliefs that has as its goal the 
alignment of human conduct with divine will. 
This is Weber’s analysis of the attack on 
ritualism in the Reformation: “By far the 
strongest such devaluation of magical and 
institutional grace occurred in Puritanism” 
([1915]1948, p. 282, [1922]1978, pp. 422-39, 
506, 574). 

Political implications arise from the empha- 
sis on inner religion and the rejection of 
ritualism.? Among the Protestant sects, this 
led to a general defense of the 'rights' of 
individual conscience. Weber saw the “char- 
ismatic glorification of reason" in the 18th- 
century Enlightenment as a direct outcome of 
the highly rationalized beliefs held by the 
Protestant sects. The Puritan appears in 
Weber's work as a John the Baptist to 
Thomas Paine (“My mind is my church"). 
Liberal-democratic values emerge from the 
sectarian credenda, for "the genuine sect 
must demand the non-intervention of the 
political power and freedom of conscience for 
specifically religious reasons." The political 
legacy of "the rationalistic sects" includes the 
idea of individual rights based on natural law, 
tolerance, and an insistence on the "sepa- 
ration of church and state" ((1922]1978, pp. 
863, 1208-9). 


The Standard Account in 
Contemporary Sociology 
Currently, the analysis of Protestantism and 


democracy reiterates two themes from Durk- 
.heim and Weber. First, Protestantism is 





? How the rationalization of worldviews inter- 
acts with the rationalization of economic, legal, 
and political institutions, and whether it leads to 
democratic outcomes, are for Weber empirical 
questions. 
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linked, as symptom or cause, to moderniza- 
tion; second, the interpretation of Protestant 
doctrine reveals a manifest and/or latent set of 
liberal implications for political life. 

In work by Parsons and those influenced by 
him, discussion of Protestant beliefs reflects 
neo-evolutionary assumptions about long- 
term historical change. In the “heart and soul 
of Calvinism,” writes Little, “is the idea of a 
differentiated social order," an idea that leads 
to voluntarism and consent as organizing 
principles in the newly demarcated realms of 
politics and religion (1970, pp. 222-23; see ` 
also Bellah 1976; Gould 1987; Parsons 1977). 
Although writers more directly influenced by 
Weber display his antipathy toward evolution- 
ary theories, their work retains Weber's 
abiding interest in rationalization, which 
emphasizes, though to a lesser degree than 
neo-evolutionism, elements of continuity, 
cumulation, and irreversibility in the master 
trends of modernization (see Roth 1979; 
Schluchter 1981). : 

On the Protestant sources of democratic 
ideology, there is broad agreement that "the 
origins of the value-commitment within 
Western democracies derive most broadly . . . 
from the Protestant Reformation in which 
each person came to be seen as equally sacred 
in the eyes of God" (Prager 1985, p. 188). 
The standard account defines this value- 
commitment, common to Protestantism and 
democracy, in terms of universalism (Nelson 
1969, pp. 73-87; Parsons 1978, pp. 199—204). 
Minor variations in this account reflect the 
many aspects of Protestantism that are cited 
as sources of universalism in democracy. 

Bellah states that "the 'priesthood of all 
believers’ . . . was- applied to secular politics 
in the course of the Puritan Revolution in 
England and eventuated in the secular demo- 
cratic theory of John Locke" (1970, p. 68, 
but cf. 1986, pp. 142-43; Hawthorn 1976, 
pp. 11-12). Bendix links the doctrines of 
‘justification by faith’ and ‘the priesthood of 
all believers’ to “the constitutional transfor- 
mation of England during the seventeenth 
century” (1978, pp. 309-13). Little (1970) 
sees similar implications in the ecclesiological 
doctrines of the early Presbyterians in En- 
gland. Parsons found democratic implications 
in the doctrines of ‘justification by faith’ and 
predestination (1977, p. 132). In a recent 
commentary on Parsons, it is the sanctity of 
conscience whose secular extension is said to 
give rise to “theories of democratic rights” 
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(Alexander 1983, pp. 130, 384). According 
to Tiryakian (1975, pp. 24-25; and see 
Hammond 1980), Puritan thinking on the 
covenant is infused with voluntarism, which 
“underlies the basic American value orienta- 
tion of individualism"; "our emphasis on 
civil liberties can be shown to derive from 
basic Covenant premises of the Puritan 
mind." Skepticism on this appears in Berger, 
but he identifies tolerance as a secular 
extension of the belief in the sanctity of 
conscience (1977, pp. 150-51). Finally, 
Martin (1979, pp. 25-29) cites Arminian 
trends in Protestantism as a source of 
egalitarianism in liberal democracy: it was a 
secular extension of the theology of universal 
grace. 

However, two important studies dismiss 
the claim that Puritan ideas inspired demo- 
cratic conceptions of politics. Walzer argues 
that Calvinism was a repressive response to 
the disorder and anxiety in modernization. 
“The triumph of Lockean ideals, on the other 
hand, suggests the overcoming of anxiety. 
. . . The struggle against the old order seems 
largely to have been won by Locke's time. 
. . . Lockean liberals found it possible to 
dispense with religious, even with ideologi- 
cal, controls" (1965, p. 303). More recently, 
Fulbrook advances a structural analysis which 
concludes that there “was not anything 
inherent in the religious or social characteris- 
tics" of Puritanism that set it in opposition to 
efforts by the Stuarts to create an absolutist 
monarchy (1983, p. 16). It was the mon- 
archy's control over the church—this en- 
forced prerogative rule—that led the mon- 
archs to oppose Puritan plans for reforming 
the church, and this made Puritanism an 
opponent of absolutism. 

Although these two studies contradict the 
standard account, they do not adequately 
address the central issue in this paper: what 
was the relationship between liberal- 
democratic ideology and the religious factor?* 
Fulbrook shows why religious groups, such 
as the Puritans, supported or opposed the 
absolutist monarchies; but this does not 
establish why or even whether the anti- 
absolutist Puritans supported a  liberal- 





4 In part, this is because Walzer covers the 
period from 1540 to 1640 and Fulbrook covers 
1560—1640, although both allude to later develop- 
ments. Ás noted above, liberal-democratic ideol- 


ogy appears later. 
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democratic view of politics. Walzer thinks 
there is a relation between Puritanism and 
Lockean liberalism: “it is perhaps one of 
historical preparation," in which the internal- 
ization of moral controls by Calvinism made 
plausible "liberal confidence" (1965, pp. 
301-3). My analysis revises this view: 
Lockean liberalism did not, pace Walzer, 
emerge when anxiety dissipated; it was a 
response to anxiety, generated by sectarian 
attempts to implement the Puritan ideal of 
godly politics. 


HISTORY AND THE STANDARD 
SOCIOLOGICAL ACCOUNT 


Recent historical research provides little 
support for the standard account. Some 
historians regard assertions about an intrinsic 
link between Puritanism and democracy as 
residues of an old historiographic tradition— 
Whig history—that survive in the feckless 
generalizations of sociologists (e.g., Hexter 
1979, pp. 231-33, 237-40; Macfarlane 
1979). More recently, there is the ultimate 
anti-Whig view: the events of the English 
Revolution had no major impact on intellec- 
tual habits (Finlayson 1983; Sommerville 
1986, p. 238). 

The standard sociological account receives 
some support in work by other historians 
who, not coincidentally, look more favorably 
on the uses of social science in history. They 
argue that Puritanism supported democratic 
developments, but they also call attention to 
the nonreligious factors that imparted to 
Puritanism ideas on political equality and 
justice (see Hill 1973). For example, popular 
aspirations to participate in politics inspired 
Puritan efforts to democratize the church 
(Manning 1978). 

Many historical specialists think that Puri- 
tanism had only fortuitous links with demo- 
cratic developments, as it had strongly 
authoritarian tendencies that opposed any 
liberalizing withdrawal of religion from the 
political arena. For these historians, neither 
was there anything in English Puritanism that 
led inevitably to Locke, nor was this 
development surprising (Stone 1972; Under- 
down 1985; Zagorin 1970). This is the 
conclusion to be drawn from Fulbrook’s 
argument,” and it also applies to Puritanism in 





5 Puritanism opposed absolutist rule "not be- 
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17th-century New England, where “the Con- 
gregational system was developed by men 
who were allied with those financial and 
political leaders whose ends were best served 
by an extension of the people's liberties” 
(Ziff 1968, pp. 27-28). 


The Christian Commonwealth 


The historical flaw in the standard account is 
a narrow view of Protestantism. In associat- 
ing Protestant tenets, such as 'justification by 
faith', and democratic principles of equality, 
tolerance, and voluntarism, the sociological 
account refers to Puritan thinking about the 
elect and the virtues of renewing grace. But 
for the great majority of reprobates, there 
were coercive discipline, enforced by church 
and state, and God's restraining grace.© To 
infer from Puritan remarks on the elect and 
renewing grace—emphasizing equality and 
voluntarism in religion—liberal political com- 
mitments is a great mistake, for the business 
of politics is, after all, the governance of the 
outward actions of the masses. The same 
mistake would lead one to find an affinity for 
tolerance and voluntarism in Lenin’s writings 
on the party. 

Central to the Puritan view of politics was a 
community united in obedience to its austere 
God. It distinguished between church and 
state, but this only pointed to the religious 
ends of politics: enforcing a holy common- 
wealth. In Christian commonwealths, “there 
may be no toleration of any other religion. 
For that which is the end of God’s laws must 
also be the end of all good laws in all 
commonwealths and kingdoms, namely, to 
shut up the people in the unity of one faith” 
(Perkins 1608-9, 2:289). Puritans wanted 
“the godly magistrate not only to countenance 
the painful ministry but also to compel all the 
people to yield obedience thereunto” (Brinsley 
1622, p. 471). As “God’s vice-regent,” a 
magistrate must not only suppress offenses 
“against public peace and human society, but 


cause of specific ideas about the role of individual 
conscience . . . but because of the peculiar 
structure of the society and the state in which it 
arose” (Fulbrook 1983, p. 176). 

$ A famous Puritan divine wrote that restraining 
grace "bridles and restrains the corruptions of 
men’s hearts from breaking forth into outward 
actions, for the common good, that societies may 
be preserved” (Perkins 1608-9, 2:131). 
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blasphemies, heresies, swearing, Sabbath- 
breaking, and such sins as more directly 
oppose God and his worship” (Taylor 1612, 
sig. *3). 

Recent studies (for examples, see Zaret 
1985, pp. 82-83) describe the Puritan agenda 
for godly politics. Efforts to enforce piety and 
social discipline appeared in regulatory initia- 
tives in towns ruled by Puritan oligarchs and 
in rural areas where Puritan clerics built 
alliances with local magistrates. These initia- 
tives attacked prevailing mores in a popular 
culture that the Puritan mind associated with 
the immorality and irreligion of the alehouse. 
A coercive, intolerant politics of moral 
reform lay at the heart of Puritanism, which 
saw public enforcement of piety and social 
discipline to be a way for the elect to honor its 
God. In its theory and’ practice, the Puritan 
vision of godly politics had little, if any, 
relevance to liberal-democratic principles of 
tolerance and voluntarism. 


Radicals and Sectaries 


These principles are often associated with the 
radical sectaries who diverged from the 
Puritan mainstream. But this development 
also offers little support for the standard 
sociological account. Radical opposition to 
the Presbyterians and Independents in the 
English Revolution upheld the right of the 
godly not to be coerced by the state. Like the 
Puritan mainstream, the radicals upheld the 
ideal of godly rule (Solt 1959). This ideal, in 
all its authoritarian implications, animated the 
Independents, whose titular head was Crom- 
well. “The impulse to form Independent 
congregations was, at root, disciplinarian in 
nature, not libertarian: to create, not asylums 
from tyranny, but superior vehicles for 
godliness” (Lamont 1986, p. 78; see Jordan 
1936, pp. 291-92). The same point applies to 
the initial emigration of the Puritans to New 
England: their inability to impose a coercive 
moral regime in England led them to try to 
establish it elsewhere (Miller 1956, pp. 
5-11). 

Calls for toleration did appear among some 
sectaries. But this was a concession to 
necessity, forced by sectarian conflicts among 
the godly, not a principled call for fuH 
religious liberty. Twenty years ago, Wolff 
noted, "It is now an historical commonplace 
that the great Anglo-American tradition of 
religious liberty can be traced to just such a 
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grudging acceptance of de facto heterodoxy 
and not to early Protestant devotion to the 
freedom of conscience” (1965, p. 15). 
Sectarian arguments for toleration noted that 
regimes with a state religion inevitably 
harmed the godly and, worse, usurped the 
divinely appointed day of judgment. Toler- 
ance was proposed by some sectarians who 
regretfully noted the inability of human 
institutions to accomplish the chief aim of all 
sectarians: to separate the saints and the 
sinners (Jordan 1936; Lamont 1986). Thus, 
the strongest support for toleration in a few of 
Puritanism’s sectarian offshoots reflects a 
radical sense of religious intolerance. And 
among most of the sectarian groups spawned 
by Puritanism, in Old and New England, “it 
seems clear that their toleration does not 
extend beyond the unity of belief which only 
the saints . . . have" (Solt 1959, p. 64). 

The Levellers were a radical group that 
advocated toleration and democratic ideas.7 
During the Civil War, they arose as activists 
in the New Model Army that fought against 
the King. Under their influence, the Army 
became the bulwark of support for defeating 
and executing the king, for toleration, and for 
proposals to enlarge the political franchise. 
Although the Levellers had social and organi- 
zational ties to the religious radicals, their 
ideology was not an extension of Puritan 
ideals. It had several important secular 
sources that enabled it to go beyond the 
authoritarianism in godly politics. 

Three secular sources were central to 
Leveller thought. First, popular aspirations 
for political participation (see above, p. 168, 
and Hirst 1975) led to speculation on 
democracy in religious life. Second, the 
doctrine of “native birthrights” and the myth 





7 Proponents of a republic also appeared in more 
elite social circles, for example, James Harrington 
(1611-77), Henry Marten (1602-80), and Henry 
Neville (1620-94), the editor of the English edition 
of Machiavelli’s collected writings. They moved 
away from Christianity toward Deism and, as was 
said about Marten (Williams 1978, pp. 126-27), 
their general attitude toward religion was “one of 
detachment, sometimes amused, sometimes not.” 

8 [n politics, said Milton, there is knowledge of 
“the general reason of that government to which 

. . subjection is required; how much more ought 
the members of the church . . . seek to inform their 
understanding in the reason of the government 
which the church claims to have over them" (1953, 
1:747). 
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of the “Norman Yoke” (see Hill 1964, pp. 
75-82) provided a rationale for political and 
religious liberty by presenting it as a recovery 
of the rights lost long ago by the Anglo- 
Saxons. Finally, there was a modern doctrine 
of natural rights, possessive individualism 
(Macpherson 1962), based on ‘right reason’ 
which “neither morality nor divinity among 
men can or may transgress.” From this 
doctrine, Leveller writers justified religious 
and economic rights, arguing that if there are 
“penalties and restrictions put on men’s 
consciences, they cannot be termed free-born 
people . . . but rather . . . men barred of all 
propriety and goods" (Wolfe 1967, pp. 158, 
309). Democratic ideas among the Levellers, 
though tied to religious concerns, derived 
their distinctively modern features from 
nonreligious sources (Ashcraft 1986, pp. 
161—64). 


THE NEW ACCOUNT 


The historical evidence clearly does not 
sustain the standard sociological account. In 
the Puritan vision of godly politics, principles 
of consent and tolerance apply only to a 
spiritual elite; for the rest of the political 
community, there is a coercive theocracy. In 
most of the sectarian groups spawned by 
Puritanism, the authoritarian implications of 
godly rule are even more pronounced.? And 
for those radicals with modern democratic 
ideals, secular influences were decisive. The 
inescapable conclusion is that there may be a 
relationship between Puritanism and demo- 
cratic ideology but the standard sociological 
account of it is wrong. 

An alternative hypothesis begins with the 
problem of contested authority in the revolu- 
tionary era, 1640-60. The problem was 
Protestantism, more specifically, the intellec- 
tual dissonance and social radicalism that 
arose from godly efforts to apply religious 
tenets to politics. Liberal-democratic ideology 
solved this problem by removing religion 
from politics, by rejecting Puritan theology 
and its vision of godly politics, substituting in 
its place natural religion. Thus, the relation- 
ship between Protestantism and democratic 





? Some sectarian groups—for example, the 
Quakers—renounced political agendas and re- 
treated from the political realm (see Hill 1984). 
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ideology is that the latter was a response to 
perceived failures in the former. 


Sectarian Conflict and Doctrinal Anxiety 


Religious conflict among the godly was the 
inner dynamic of revolutionary politics and a 
critical problem for the Puritan opponents of 
absolutism. Parliament’s war with thé King 
gave Puritanism an opportunity to implement 
its vision of godly politics. But as the godly 
seized control of Parliament and its army, and 
destroyed the authority of the state church, 
Puritanism itself disintegrated into rival fac- 
tions, for example, Presbyterians, Indepen- 
dents, Baptists, Seekers, Ranters, Muggleto- 
nians, Quakers, and Fifth Monarchists. 
Lacking institutional controls, the emphasis 
on inner religion, on ‘justification by faith’, 
led to sectarian conflict, as the tenets and 
practices of any one sect could be seen by 
others as further instances of the worldliness 
that oppressed the saints. In this context, 
sectarianism was a fertile ground for social 
radicalism (see Hill 1973), which developed 
as a charismatic revolt against those worldly 
institutions—for example, the universities, 
the church, law, and the state—that were seen 
as impediments to the creation of a holy 
commonwealth. 

Sectarian conflict created doctrinal anxi- 
ety. This followed the widespread perception 
of dissonance in the competing claims of 
rival, but equally godly, sects. This doctrinal 
anxiety was as acute as the salvation anxiety 
that Max Weber saw in Calvinism. Whereas 
salvation anxiety refers to the problem of 
knowing one’s spiritual destiny, doctrinal 
anxiety is a problem posed by conflict among 
essentially similar groups. In The Protestant 
Ethic, Weber finds evidence for salvation 
anxiety in writings by Richard Baxter and 
other Puritans. But, oddly, Weber says little 
about the evidence for doctrinal anxiety in 
these writings, which is, in my judgment, 
even more apparent. 

Doctrinal anxiety dominates religious life 
in this era. “What can appear more ugly, than 
to see among the professors of religion, 
children opposing their fathers, scholars 
contesting with their masters, inferiors slight- 
ing and crossing their superiors?" asked Isaac 
Barrow (1630—77), a moderate churchman, 
mathematician, and teacher of Newton (1859, 
4:36—37; see Whichcote 1751, 2:24). Accord- 
ing to Gilbert Burnet (1643-1715), a bishop 
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and Whig politician, “There is nothing that 
defeats the ends of religion more, and does 
more naturally lead to all manner of sin and 
impiety . . . than intestine heats and divisions 
about it” (1681, p. 3). Baxter, the Puritan 
divine, also thought that religious conflict 
was a principal cause of sin (1653, pp. 4-5, 
352). 

This doctrinal anxiety is not limited to 
religious texts by Puritans and others. It. is 
also a major theme in the scientific and 
philosophic literature of this era, as in 
Thomas Sprat’s propaganda for the Royal 
Society, The History of the Royal Society, 
which was, in part, a response to “the great 
ado which has .been made in raising, 
confirming, and refuting so many different 
sects and opinions of the Christian faith” 
(1667, p. 25). This problem appears in 
Locke’s Essay Concerning Human Under- 
standing, where the sensationalist epistemol- 
ogy addresses the following question: “Where 
shall one find any, either controversial debate 
or familiar discourse, concerning honor, faith 
grace, religion, church, & etc., wherein it is 
not easy to observe the different notions men 
have of them?" ([1690]1894, 2:109). 

In some writers, doctrinal anxiety reflects a 
prior attraction to sectarian religion, as it did 
for Milton (1608-74) and Bunyan (Hill 1978; 
Bunyan 1841, 2:157, 209). A friend of Locke 
and Sidney, Benjamin Furley (1636-1714), 
had tried many sectarian groups before 
joining and, at last, quitting the Quakers: “I 
am sick of anything that looks like sectism," 
he told Locke (Locke 1976, 3:625—26). In an 
influential treatise against religious enthusi- 
asm, the renowned Cambridge philosopher 
and church intellectual, Henry More, con- 
fessed that "I have a natural touch for 
enthusiasm in my complexion” (More, 16622, 
p. vi). 

Doctrinal anxiety could also express elite 
disdain for the popular milieu of sectarian 
religion. This is evident in Locke’s reaction 
when, as a young man, he saw a Quaker 
fanatic, James Nayler,!? who had been seized 
by officials; in Hobbes, who had nothing but 
scorn for the enthusiast's claims to revelation; 


10 In 1656, Nayler rode a donkey into Bristol, 
claiming that he was the messiah. Puritan Parlia- 
mentarians deliberated more than a week before 
deciding not to execute him. Instead, he was 
branded, bored through the tongue, whipped, and 
imprisoned. 
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and in Robert Boyle (1627-91), when, in 
1647 as a young man, he discovered 
sectarianism in London (Locke 1976, 1:43—44; 
Hobbes [1651] 1962, p. 562; Westfall 1958, 
p. 115). A founding member of ‘the Royal 
Society, John Evelyn (1620-1706), recorded 
in his diary, in 1653, the “feculent stuff” of a 
sermon by a lay preacher, a "mechanic," 
whose “purport was that no danger was to be 
thought difficult when God called for shed- 
ding of blood, inferring that now the saints 
were called to destroy temporal governments" 
(Evelyn 1857, 1:286—87). 

At the Restoration, the political problem 
posed by the sectaries was solved by the 
execution of the regicides, the reestablish- 
ment of the church, censorship, and the 
punitive laws against the godly (now known 
as dissenters). But “it was a commonplace 
that in the 1660s that Church itself was an 
inadequate instrument for the policing of 
consciences” (Shapin and Schaffer 1985, p. 
296). The problem of sectarianism led to a 
shift in the thinking of educated elites on the 
relationship between religion and society. It 
was no longer possible to assume that religion 
would stabilize the social order: in politics, 
religion could facilitate the mobilization of 
popular grievances. The problem inhered in 
Protestantism, for when poorer Protestants 
consulted their consciences, they did not 
always find the abiding respect for property 
that God seemed to implant in more affluent 
consciences. 


Tolerance 


One response to doctrinal anxiety was more 
tolerant attitudes toward religious truth. 
Tolerance resolved this problem when con- 
flicts among the godly led some observers to 
the view that the same piety might apply to 
otherwise hostile parties. Doctrinal anxiety 
receded before the claim that there was no 
singularly true creed, only a plurality. of 
reasonable beliefs. 

This response to sectarian conflict was the 
Latitudinarian position, held by moderate 
church leaders after the restoration,!! by 
prominent members of the Royal Society, 





! They included Archbishop John Tillotson 
(1630-94); Gilbert Burnet (1643-1715), the Bishop 
of Salisbury; Thomas Sprat (1635-1713), the 
Bishop of Rochester; and John Wilkins (1614— 
1672), the Bishop of Chester. , 
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such as Boyle, Evelyn, and Isaac Newton 
(1642—1727),?? and by the Cambridge Neopla- 
tonists who had strong intellectual and 
personal ties to the first two groups!? (see M. 
Jacob 1976). According to Glanvill, “All 
opinions have their truth, and all have what is 
not so." The sectarian inability to acknowl- 
edge this was the cause of conflict among the 
godly. 


All are in the truth, and all mistaken; every sect 
is in the right . . . and everyone is out, by the 
sentence of all the rest. Here's religion, says 
one. Nay but its here, says the next; and a third 
gives the lie to them both. And then they scuffle 
and contend, till they have talked themselves out 
of sense, out of charity, and out of breath. 
(1661, p. 65, 1681, p. 313; and see Barrow 
1859, 9:577—86; Sprat 1667, p. 346) 


The influence of these views is even evident 
in the writings of Richard Baxter, a staunch 
Calvinist. But he was also an acquaintance of 
Boyle and Locke, and dismayed that “you see 
so many godly men on this side, and so many 
on that." Baxter advised lay readers to seek 
guidance from tolerant clerics: "Let him be 
Lutheran, Calvinist, Arminian, Episcopal, 
Independent or Presbyterian, so he be sound 
in the main and free from division" (1653, 
pp. 509-10, 1677, p. 97). 

Not all Latitudinarians were Whigs, but 
their views influenced the liberal thinkers, 
such as Locke ([1690]1894, 2:109-10, 120, 
[1689]1983). The commitment to tolerance 
by Locke, Neville, and Sidney "sprang as 
much from distrust of any law but the natural 
law of reason, and dislike of fanatic, 
presbyterian, . . . Laudian or papist, as from 
any interest in personal revelation or in rule . 
by the good" (Robbins 1969, p. 50). 


Reason and Revelation 


Tolerance was legitimated by a renewed 
emphasis on the role of reason in religious 
life. In this context, reason denoted secular 
knowledge —knowledge derived from experi- 
ence, of the laws and maxims that governed 





12 Many members of the Royal Society were 
Latitudinarian clerics, including the clerics listed in 
n. ll, Isaac Barrow (1630—77), and Joseph 
Glanvill (1636-80). 

P Their leader, Henry More (1614-87), collab- 
orated with Glanvill. Other members included 
Ralph Cudsworth (1617-88) and Benjamin Which- 
cote (1609-83). 
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the natural and social worlds. New Protestant 
doctrines that emphasized reason in religion 
were developed by members of the Royal 
Society, the Cambridge Neoplatonists, the 
Latitudinarian clerics, and the Whig ideo- 
logues. They did not replace religion with 
reason; they held that nothing in religion was 
incompatible with reason. Still, this radically 
revised Puritan thinking—and the broader 
Protestant tradition laid down by Luther and 
Calvin—which emphasized the corruption of 
reason and its limited role in religion (see 
Morgan 1986, pp. 43-61). 

For Locke and the other Whig ideologues, 
there are “no suggestions of the Holy Spirit 
but what are always agreeable to, if not 
demonstrable from, reason” (1976, 2:504; see 
Hunt 1680, p. 13; Tyrrell 1692, Sig. A7). 
The same point, almost word-for-word, was 
made by Henry More (1662b, p. 39). So 
argued Bishop Wilkins and the Latitudinarian 
churchmen (Shapiro 1969), and Boyle and 
leaders of the Royal Society (Westfall 1958, 
p. 175; see also J. Jacob 1977). 

Intolerance and radicalism were inevitable 
when religion rejected reason in favor of 
unbridled revelation (Barrow 1859, 3:399; 
Glanvill 1661, pp. 95-105; More 1662b; 
Whichcote 1751, 2:215). Locke wrote that 
“to this crying up faith in opposition to 
reason, we may, I think, in good measure 
ascribe those absurdities that fill almost all the 
religions which possess and divide mankind.” 
The danger of revelation was that it legiti- 
mated irrationality in religion. “If such 
groundless thoughts as these concern ordinary 
matters, and not religion, . . . we call it 
raving and everyone thinks it a degree of 
madness, but in religion men accustomed to 
the thoughts of revelation make a greater 
allowance to it" ([1690]1894, 2:426, 1976, 
2:500). 

This appeal to reason was a means for 
attacking sectarianism. But it must be empha- 
sized that this was not intended to uphold 
atheism. Attacks on enthusiasm frequently 
cited the problem, real or imagined, of 
atheism. According to More (1662b, p. 2), 
“atheism convinces enthusiasm that reason is 
enemy to faith; enthusiasm's ravings convince 
atheism of the opposite case" (see also 
Glanvill 1661, pp. 103, 209). When these 
writers argued that nothing in religion is 
incompatible with reason, they were certain 
that reason would reveal God's existence as 
well as the irrationality of the sectaries. 
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This confidence in reason clearly reflects 
the role of Baconian science as an intellectual 
precedent. The appeal to reason in religion 
relied on the growing prestige of science in 
the Royal Society and the Universities. 
Advocates of science developed the Baconian 
theme that order in nature pointed to God's 
existence (Merton [1938]1970, pp. 82-86, 
102-7). The well-publicized successes of the 
new science (see Webster 1976) further 
fueled confidence in the potential of reason 
and its compatibility with religion. Finally, 
there were the many ties, noted above, that 
linked the scientists to the Latitudinarians, the 
Neoplatonists, and the Whig theorists. It is, 
then, hardly surprising that the appeal to 
reason in religion relied on the theme of 
providence-in-nature. 


From Providence-in-Society to 
Providence-in-Nature 


The appeal to reason and the theme of 
providence-in-nature were not entirely novel,!4 
but they led to a rejection of an old idea that 
was central to the Puritan vision of godly 
politics or to any scheme that provides a place 
for religion in politics. This is the idea that 
some political institutions have a divine 
mandate. ' 

The existence of such a mandate was called 
into question by the events of the English 
Revolution. Many found it difficult to discern 
the hand of providence in any human 
institution, group, or event because of the 
drastic reversals in policy and regime, from 
1640-1660, that frustrated the Puritan goal of 
a holy commonwealth. Midway through that 
era, Baxter complained: 


We have looked for wonders from Scotland, and 
what is come of it? We looked that war should 
have even satisfied our desires, and when it had 
removed all visible impediments, we thought we 
should have had such a glorious reformation as 
the world never knew! And now behold, a 
Babel, and a mangled deformation! (1653, ‘sig. 
A10) 


The Restoration, with its ungodly monarchy 
and repression of nonconformists, intensified 
this sense of failure (see Hill 1984). "In 
historical writing it became increasingly 
unfashionable after the mid seventeenth 





14 For medieval precedents, see Chenu (1979) 
and Murray (1985). 
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century to explain events in terms of God's . 


providence” (Thomas 1973, p. 127). 

The eclipse of providence-in-society by 
providence-in-nature was a response to the 
same issue of contested authority that had led 
some to emphasize the virtues of reason and 
tolerance in religion. Providence-in-society 
connoted the sectarian aim of godly rule. 
Providence-in-nature referred to the visible, 
well-advertised discoveries of the scientists. 
Order and harmony in nature were evidence 
of God's existence; and in nature God's glory 
was not troubled by the religious conflicts that 
had disturbed the social order. 

Thus, when More described the evidence 
of God's existence, he reiterated old argu- 
ments about the harmony of elements in 
nature, But there is conspicuously little 
evidence of harmony in More's discussion of 
religion and the social order. Among the 
“external effects" of religion are “bloody 
massacres, the disturbances and subversion of 
commonwealths . . . most savage tortures . . . 
the extirpating and dispossession of whole 
nations, as it has happened in America." 
Religion's "internal effects" include the 
terrorizing of conscience and "making bitter 
all the pleasures and contentments of this 
life." More concludes that "it is manifest that 
if there were no God, no spirit, no life to 
come, it were far better that there were no 
such religious propensities in mankind" 
(1662a, pp. 83-84; see also Glanvill 1661, 
pp. 245-48, 1681, p. 276; Wilkins [1675]1693 
pp. 78-82, 173—75). A similar point was 
made to Locke by his friend James Tyrrell, a 
Whig theorist. Tyrrell praised Locke's inten- 
tion to return to "the study of natural things," 
for this gives us "the highest reasons to 
admire the wisdom and goodness of their 
creator, whereas few civil histories show one 
any more than the great corruption and 
pravity of human nature" (Locke 1976, 
4:243). 

This shift from providence-in-society to 
providence-in-nature used several intellectual 
precedents. The Baconian theme of providence- 
in-nature was the most important precedent 
because, as noted above, religious conflict 
had no place in nature. This amiable idea led 
Barrow, Boyle, and Glanvill to argue that 
"the proper study of nature bridges religious 
and political divisions among men" (J. Jacob 
1977, pp. 117-18; Barrow 1859, 3:399; 
Glanvill 1661, pp. 229-31). Other intellectual 
precedents reinforced skepticism about 


AMERICAN SOCIOLOGICAL REVIEW 


providence-in-society. These were established 
by the growing sophistication of historical 
and philological studies,!5 which showed that 
all social institutions and languages were 
historical accretions of circumstance and 
custom, devoid of any evident mandate from 
God. 


Natural Religion and Liberal Democracy 


Natural religion was the term contemporaries 
used for the doctrine that elevated reason over 
revelation and located providence in nature 
and not society. The doctrine’s adherents 
included the liberal-democratic ideologues, 
who used it to justify tolerance in public life 
as a precondition for the pluralist pursuit of 
utility. Natural religion was antithetical to the 
Protestant ideal of godly politics because the 
emphasis on providence-in-nature prevented 
any specific religious creed or political 
arrangement from laying claim to a divinely 
privileged status. This was the ideological 
basis of the separation of religion and politics 
in liberal democracy. 

‘In natural religion, the idea of providence- 
in-nature implied that religious sentiments 
reflected human nature and were not imposed 
on it by grace. Whichcote argued that religion 
“is no stranger to human nature,” a point 
commended in a preface to his writings by the 
Earl of Shaftesbury (Whichcote 1751, 3:5-7, 
181). This is source of the “liberal confi- 
dence” in human abilities that, according to 
Walzer (1965, p. 302), separates Puritanism 
from Lockean liberalism. An optimistic 
appraisal of human nature replaced the 
pessimistic Puritan doctrines, so central to its 
godly politics, that underlined the corruption 
of reason and humanity’s abject dependence 
on divine grace. In rejecting this pessimism, 
natural religion taught that we are reasonable 
creatures who live in the best of all possible 
worlds. 

This optimistic religion was antithetical to 
nearly every key tenet of Puritanism. Natural 
religion rejected the doctrines of predestina- 
tion and covenant theology (see Shapiro 
1969, p. 62; M. Jacob 1976, pp. 41-42). It 
also neglected "the central doctrine of 
Christianity, the mediation of Jesus to redeem 


15 At this time, the chief English figure in 
philology was John Wilkins, the Latitudinarian 
bishop and fellow of the Royal Society. 
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man,” and “instead of religiosity” natural 
religion focused on “a code of social 
conduct” (Westfall 1958, pp. 118-25, -134, 
157, 161). This may only slightly overstate 
the case. Beyond the existence of God and an 
afterlife, the core tenets of natural religion 
constituted a utilitarian creed with a few 
ethical maxims, such as the golden rule. “The 
laws of God," said Tillotson, "are reason- 
able, that is, suitable to our nature and 
advantageous to our interest" (1728, 1:57). Jt 
is “not possible for us to contrive any rules 
more advantageous to our own interest than 
those which religion does propose" (Wilkins 
[1675]1693, p. 393). 

Although this jettisoned the formal theol- 
ogy of Calvinism—indeed, of most of 
Protestantism—the utilitarian framework in 
natural theology accommodated many of the 
Social values in Puritanism, stressing asceti- 
cism in everyday life and obedience to 
authority (see Tillotson 1728, 1:37, 2:297-98; 
Whichcote 1751, 3:219-224; Wilkins 
([1675]1693, pp. 293-94, 336-37). But 
unlike the conflict between society and 
Puritanism or its sectarian offshoots, there is a 
complete lack of tension between natural 
religion and society. From here it is only a 
short step to the world of the Enlightenment, 
where the union of morality and utility, in 
that best of all possible worlds, has no 
religious veneer. 

This is why Locke and other Whig writers 
had no trouble with the Puritan principle "that 
religious experience must be granted a place 
in any adequate doctrine of authority" (Cragg 
1975, p. 300). Their religious thinking lacked 
any specifically religious commitments be- 
yond the bland acknowledgement that provi- 
dence created a wonderful world. For this 
reason, Locke advocated civil tolerance for 
all, except those who denied the existence of 
providence. Even so, Lock stated unequivo- 
cally that "there is absolutely no such thing 
. . . aS a Christian commonwealth”!® (Locke 
[1689]1983, pp. 73, 93). 

Proponents of liberal-democratic ideology 
insisted that "Christianity had no special 
teaching in connection with politics" (Gooch 
1927, p. 279) and they maintained a 


16 Locke thought that atheists would not observe 
their oaths and contracts (see also Wilkins, 
[1675]1693, p. 298). In the early 18th century, 
proponents of toleration, such as Mandeville, 
included atheists within its scope. 
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Protestant identity, which was vital to the 
Whigs’ opposition to absolutism. They were 
Protestants, God’s Englishmen, but divinity 
was irrelevant to politics: because their 
religion held utility, not revelation, to be the 
basis of the political order. Government was 
ordained by God, but "God having in the 
institution of magistracy, confined such as 
shall be chosen rulers, within no other limits 
in reference to our civil concerns, save that 
they are to govern for the good of those over 
whom they come to be established, it remains 
free and entire to the people . . . to prescribe 
and define what shall be the measure and 
boundaries of the public good" (Ferguson 
1689, p. 6; and see Hunt 1680, p. 8; Tyrrell 
1681, p. 243). 

For the Earl of Shaftesbury, the defense of 


England's liberties was inseparable from the 


cause of Protestantism. Yet he thought that 
neither religion nor politics "has been much 
advantaged by the union, it having never yet 
appeared that divinity has been greatly 
bettered by policy or that policy has been 
anywhere mended by divinity" (in Whichcote 
1751, 3:3-4). Algernon Sidney saw nothing 
incompatible in his adherence to Protestant- 
ism while he argued that civil liberties 
derived, not from divinity as Filmer argued, 
but from nature, utility being the measure of 
politics ([1698]1805 1:313—314, 423). 

Certainly Locke had little difficulty in 
reconciling his belief in providence with a 
utilitarian doctrine that upheld individual 
rights: "I find no difficulty to suppose the 
freedom of mankind, though I have always 
believed the creation of Adam." Political 
institutions derive from natural rights that are 
revealed by reason, not revelation. Entry into 
civil society merely secures these rights and 
does not create new ones. The irrelevance of 
theology in this theory makes plausible the 
almost complete toleration advocated by 
Locke, for "men of different professions may 
quietly unite . . . under the same government 
and unanimously carry the same civil interest, 
and hand-in-hand march to the same end of 
peace and mutual society, though they take 
different ways to heaven" ([1690]1960, pp. 
169, 375-76; 1976, 1:110). 


CONCLUSION 


The main empirical issue is whether Protes- 
tantism was a source of liberal-democratic 
tenets, as the standard sociological account 
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maintains, or whether it was an obstacle to 
democratic ideas, a political failure and the 
source of problems for which Lockean 
ideology was intended as a solution, as I have 
argued. Clearly, the relationship between 
religious and democratic ideas in 17th-century 
England is far more complex than the 
standard sociological account of it. Although 
that account correctly points to the impor- 
tance of religious factors, it is flawed in 
emphasizing the continuity between Puritan- 
ism and liberal-democratic ideology. The 
continuity is more mythical than real, and 
emphasis on it obscures the shift from 
Puritanism to natural religion among liberal 
thinkers. This shift eliminated religious obsta- 
cles to a pluralist view of politics as the 
pursuit of individual utility; yet it also 
provided this essentially secular conception of 
politics with a religious rationale and, more 
specifically, with a Protestant identity. 

Walzer (1965) and Fulbrook (1983) are 
therefore correct in claiming that Puritanism 
did not have inherently democratic implica- 
tions. Both point to structural factors, to 
modernization (Walzer) or the political soci- 
ology of absolutism (Fulbrook), in explaining 
why Puritanism became inadvertently allied 
with democratic developments. This study 
has not addressed these claims, but it has 
shown that the origins of liberal-democratic 
ideology are bound up with religious prob- 
lems that neither Walzer nor Fulbrook 
discusses. 

The persistence of the standard account of 
Protestantism and democracy requires a brief 
comment. What other aspects of sociology 
have survived as long and as intact as this? In 
part, the persistence and stability of the 
standard account reflect factors internal to 
sociology, namely, the influence of Durkheim 
and Weber. Factors external to the discipline 
may be equally important. The standard 
sociological account has a popular analogue, 
civil religion (see Bellah 1976, pp. 168-89), 
which traces—in ritual iterations of May- 
flower history in classrooms, Sunday schools, 
and political rhetoric— democratic ideas to 
religious sources. Its resonance with this civic 
mythology may further explain the persis- 
tence and stability of the standard account in 
our academic discipline.!? 


17 Another part of the explanation may lie in the 
Protestant origins of many U.S. sociologists 
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The main theoretical issue in this study 
pertains to the sociology of knowledge and its 
application to the analysis of ideological 
change. Building on recent work in the 
sociology of culture, this study has outlined 
an analytic strategy for explaining change in 
ideological systems. The strategy focuses on 
the episodic and organizational contexts for 
cultural production, analyzing ideological 
change in terms of the interaction of intellec- 
tual precedents and contextual pressures. By 
treating episodic and organizational contexts 
as variables that mediate between ideological 
change and its cultural and social structural 
determinants, this strategy leads to subtler 
and more sustainable analyses than can be 
obtained by seeking to establish direct links to 
cultural and structural factors. For the analy- 
sis of ideological change, this avoids the 
problems associated with reductionism and 
intellectual history, which respectively place 
too much weight on social structural: or 
cultural factors. 

In analyzing the relationship between 
religion and democratic ideology, I have 
therefore not argued that structural factors are 
unimportant. But their causal relevance lies in 
the way they enter into the episodic context in 
which the ideologues responded to the 
problem of contested authority. Certainly, 
one could expand my account to show, for 
example, the relevance of early capitalism for 
liberal-democratic ideology. In destabilizing 
the balance of power between Crown and 
Parliament, capitalism was a cause of the 
English Revolution; and in creating a broad 
entrepreneurial stratum, it provided a strong 
basis of support for the Puritan clerics (Zaret 
1985) and Whig leaders (Ashcraft 1986). 
These are wholly unobjectionable assertions, 
but they are also useless for explaining the 
origins of a Lockean conception of politics. 
As we have seen, these origins lie in a 
collective response by innovators to problems 
of contested authority that were resolved with 
the aid of several intellectual precedents. The 
relevance of capitalism, then, is determined 
by its role in shaping the relevant context and 
problem of contested authority. The same 
point applies to other analyses that, following 


(Vidich and Lyman 1985). Alexander (1983, pp. 
132, 384) thinks that “Parscns’ complex relation to 
the Puritan heritage is evident"; Parsons cited 
Bellah's essay on civil religion (1976) "more than 
any other statement on 20th-century culture.” 
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Durkheim, emphasize differentiation or, fol- 
lowing Weber, rationalization, instead of the 
capitalist connection. 

Thus, the theoretical issues raised by this 
study go beyond the case at hand to some of 
the broadest questions in sociology. How are 
cultures related to social structures? What is 
the relationship between cultural change and 
other types of change? Is there always a 
discrepancy between the manifest interests 
upheld by ideological systems and the latent 
interests of the ideologues? Answers to these 
questions will not emerge from one or a few 
case studies. But the promise for our 
discipline at large contained in recent devel- 
opments in the sociology of culture is that 
they point to a reexamination and possible 
resolution of these larger macrosociological 
issues. 
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COLONIAL EMPIRES AND THE CAPITALIST WORLD-ECONOMY: 


A TIME SERIES ANALYSIS OF COLONIZATION, 1640—1960* 


TERRY BOSWELL 
Emory University 


For most of the history of the capitalist world-economy, imperial conquest was the 
principal factor in creating a periphery to the European imperial core. This study 
focuses on how the dynamics of the capitalist world-economy affected the pattern of 
colonization in the periphery. Colonization is considered a hierarchical alternative 
to market relations, increased when market relations perform poorly and slowed 
when the market expands. A time series regression analysis, covering the “longue 
durée" of the whole system, provides initial comprehensive quantitative support for 
two central propositions in world-system theory. The findings indicate that long 
waves of economic expansion and periods of unincentric hegemony negatively 
affected the rate of colonization. Major wars among core states had no immediate 
impact but had long-term positive effects. A shift in the international regime 
brought about by the rise of socialist states also contributed to the decline of formal 
colonization. The findings point to the utility of long waves, hegemony, and 
international regimes in long-term historical studies of the trade-off between 


market and hierarchical relations. 


Despite the historical importance of colonial 
conquest in creating a global system, coloni- 
zation has not been a central concern in 
world-system theory. Although capital accu- 
mulation is the primary source of expansion 


of the capitalist world-economy, imperial ` 


conquest introduced capitalism to the world 
outside of Europe. Theories of capitalist 
imperialism explain how the development of 
capitalism and nation-states in Europe re- 
sulted in powerful imperial states. European 
imperialists expanded into external areas of 
the world in order to open commodity and 
investment markets, control labor, monopo- 
lize resources and trade, and protect those 
investments from competing core states (Ei- 
senstadt 1963; Fieldhouse 1073; Brewer 
1980). 

Colonization, as the direct and formal 
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political acquisition of states or territory in the 
periphery, is but one form of imperialism 
(Etherington 1984). Outside Europe, it was 
the principal political form for incorporating 
external areas into the capitalist world- 
economy.! Incorporation was a continuous pro- 
cess, occurring in roughly three phases (Hall 
forthcoming, 1986). The earliest colonies were 
usually coastal trading posts for merchants in- 
volved in long-distance preciosity exchange with 
the contact periphery, such as the spice trade. 
Following rivers upstream, these were later su- 
perseded by settler colonies involved in the pro- 
duction of necessities, primarily mining and 
cash-crop agriculture using coerced labor. Af- 
ter 1870, "occupation" colonies became com- 
mon where a small number of European so- 
journers coerced an indigenous population into 
production for the world-economy (Fieldhouse 
1973). 

The importance of imperial conquest is 
diminished by Wallerstein's (1974) definition 
of world-systems as multisocietal units with a 
self-contained economic division of labor. He 
divided these into two types: world-empires 
politically united under a single state, and 


! Colonization of the already incorporated pe- 
riphery also occurred when imperial states ex- 
tended their sovereignty over previously indepen- 
dent nations, such as the colonization of the 
Chinese port cities of Macao, Hong Kong, and 
Kiochow. 
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world-economies with no political unity. 
Wallerstein attributed the original dynamism 
of capitalism to the lack of world-empire. He 
sought to explain how unequal exchange led 
the kingdoms of Europe to become interde- 
pendent and unequal nation-states in a world 
capitalist division of labor rather than compo- 
nents of a united empire. His explanation of 
world capitalist development thus focused on 
economic expansion of commodity markets in 
Europe rather than global expansion by 
conquest. 

What got lost in this Eurocentric focus on a 
self-contained world-empire is the relation- 
ship between the economic dynamics of the 
world-economy and the political dynamics of 
the set of colonial empires (British, Dutch, 
French, Spanish, American, etc.) that domi- 
nated the globe until very recently. Since the 
16th century, empires have expanded into the 
periphery rather than conquering the Euro- 
pean core, despite attempts by the Hapsburgs 
and others. To be sure, these empires were 
not self-contained. They were embedded in 
the division of labor of the world-economy. 
But the imperial states were global, ruling 
multisocietal systems where the subordinated 
societies fulfilled distinct functions in an 
imperial division of labor. Rather than 
reducing relationships between imperialists 
and colonies to relationships between core 
and periphery, expansion of the world-system 
by colonial conquest can be contrasted with 
expansion through (unequal) exchange. This 
study focuses on the relationship between the 
expansion and contraction of the colonial 
empires and the dynamics of the capitalist 
world-economy of which they are a part. The 
central question is this: How do changes in 
the capitalist world-economy affect the pat- 
tern of colonization? 

In an important synthesis of previous 
research, Chase-Dunn and Rubinson (1979) 
developed three propositions for investigating 
this question. They proposed that the rate of 
colonization fluctuates with two long-term 
dynamics of the capitalist world-economy, 
economic long waves and the cycle of 
hegemony (pp. 462-63). Colonization in- 
creases both during long-wave periods of 
extended economic stagnation and during the 
cyclic absence of a single world hegemony. 
Third, less colonization takes place during 
major wars among core states. 

In this study, I utilize a time series 
regression analysis to examine the relation- 
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ship between colonization and the three 
systemic dynamics of the capitalist world- 
economy. Unlike most previous research, this 
analysis covers almost the entire existence of 
the capitalist world-system. Chase-Dunn and 
Rubinson (1979) also point out that concepts 
about the structural and dynamic properties of 
the whole system differentiate world-system 
theory from traditional theories of imperial- 
ism, modernization, international relations, or 
Marxist political economy. Despite the impor- 
tance of these properties for world-system 
theory, few scholars have conducted empiri- 
cal quantitative research on them for the 
whole system over the "longue durée" (but 
see McGowan 1985; Goldstein 1985; Kiser 
and Drass 1987). Instead, quantitative re- 
search has concentrated on cross-national 
studies of the effects of dependency on the 
postcolonial periphery, chiefly the effects on 
economic growth. The typical unit of analysis 
is nation-states, rather than the world-system 
as a whole. Lacking independent states, 
colonies are usually left out of cross-national 
data sets by definition. 

Bergesen and Schoenberg (1980) and 
McGowan (1985) have completed the only 
previous quantitative world-system research 
on colonization. They coded Henige's (1970) 
list of colonial governors and plotted net 
colonization over time. Visually comparing 
periods of high and low colonization with 
long-wave periods and the cycle of hege- 
mony, they found support for the first two 
Chase-Dunn and Rubinson propositions. On 
the other hand, they suggested that coloniza- 
tion increases rather than decreases during 
wars among core nations. 

While Bergesen and Schoenberg's evi- 
dence is plausible, they did not provide 
statistical tests for their conclusions. McGo- 
wan (1985, p. 478) applauded their “pio- 
neering" work as free from the problems of 
cross-sectional data analysis but criticized 
their lack of statistical rigor. In a regression 
analysis using only the variables of time, 
war-time, and "colonial cycles," he found 
support for the positive relationship between 
war and colonization and called for further 
analysis that would control for international 
political factors frequently neglected in world- 
system research. 

In the sections that follow, I first explain 
Chase-Dunn and Rubinson’s propositions 
concerning long waves, the cycle of hege- 
mony, and major intercore wars. Their theory 
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- omits an explanation of how world-systemic 
variables affect the decision making of 
imperial states.? I fill this gap by extrapolat- 
ing propositions from the theory of market 
versus hierarchy trade-offs in institutional 
economics (Williamson 1975, 1985; Boswell 
forthcoming). Following McGowan's (1985) 
unfulfilled suggestion, I establish temporal 
limits on the validity of the propositions at 
1640-1960 based on Wallerstein’s (1974, 
1980) description of the origin of the 
capitalist world-economy and the concept of 
international regimes (Keohane 1984). The 
central findings indicate that, as expected, 
both long waves and the early unicentric 
phase of hegemony negatively affected colo- 
nization. Unexpectedly, major wars had no 
immediate effect, but war-bred political 
instability had a long-term positive effect. 


CAPITALISM AND COLONIZATION 


Colonization predates capitalism, playing an 
important role in imperial expansion during 
antiquity. During the Middle Ages, imperial 
states annexed preexisting principalities rather 
than colonizing distant external areas (with 
the exception of the Crusades) (Eisenstadt 
1963). The first colony established outside of 
the European-centered economy is Ceuta, 
colonized by Portugal in 1415 (Henige 1970). 

McGowan (1985, p. 485) points out that 
the availability of data on colonization from 
1415 to 1969 does not necessarily imply that 
the entire period is relevant for one's theory. 
Since world-system theory predicts changes 
in the pattern of core-periphery colonial 
relations as a result of capitalist dynamics, the 
relevant period is that between the consolida- 
tion of the capitalist world-economy and the 
time when colonization is no longer a 
principal feature of core-periphery political 
relations. According to Wallerstein (1974), 
the capitalist world-economy originated dur- 
ing the "long 16th century,” circa 1450-1640. 


? World-system factors set constraints on, and 
incentives for, state actions. How directly these 
incentives affected decision makers depends on the 
‘structure of society and the state, such as the 
relative autonomy of rulers (Kiser, forthcoming). 
Unfortunately, no long-term quantitative data on 
international decision-making processes is avail- 
able. Imperial states are treated as unified 
corporate actors and only successful decisions can 
be measured. . 
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The establishment of a world division of labor 
and predominance of market relations is not 
complete until near the end of this period. 
Since world capitalism is not an unequivo- 
cally integrated system prior to 1640, my 
analysis of its affects on colonization begins 
at 1640. 

Colonial expansion was limited by the 
amount of uncolonized territory in the periph- 
ery, but it ended as a viable form of 
core-periphery political relations before this 
plateau was reached. Although the number of 
colonies peaked during 1920-1940, vast areas 
of the periphery remained politically indepen- 
dent. The most obvious were China, which 
Japan attempted but failed to conquer; Persia, 
which was already divided into spheres of 
influence; and Siam, which both Britain and 
France openly coveted. Another obvious 
possibility was recolonizing parts of the 
periphery for commercial and/or military 
needs along the model of the U.S. Panama 
Canal Zone. Had colonization remained a 
viable political form, the numerous U.S. 
military bases leased around the world could 
have easily constituted a string of colonial 
"zones." 

Rather than continuing to expand, since 
World War II colonization has nearly evapo- 
rated as an important measure of core political 
control over the periphery. Although decolo- 
nization has occurred throughout history, it 
accelerated after World War II. From 1945 to 
1965, 93 colonies were terminated and only 
12 were established, the last being the British 
Indian Ocean Territory in 1965 (Henige 
1970). Colonial revolts increased with eco- 
nomic development and the spread of nation- 
alist and socialist ideologies in the periphery. 
Imperial states in the core also eventually 
rejected the legitimacy of formal colonialism. 
Proletarianization had reduced the profitabil- 
ity of coerced labor while liberation move- 
ments dramatically increased the costs of 
colonial administration and military subjuga- 
tion. As a result, imperialists began to 
question the moral and economic justifica- | 
tions for colonization. Self-determination of 
internal colonies in Europe (such as Yugosla- 
via and Ireland) became a powerful moral 
argument after the debacle of World War I 
(Grimal [1965] 1978, pp. 7-107). But unti] 
World War LH, only the United States and 
Soviet Union supported self-determination of 
external colonies. The United States had the 
fewest colonies and much to gain from the 
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breakup of the older empires. After the 
Russian Revolution, Lenin based Soviet 
influence in the periphery largely on the 
appeal of anti-imperialism (Lenin [1914] 
1970, pp. 595-648). 

After the Second World War, decoloniza- 
tion took on a much greater political signifi- 
cance. The success of the Russian and 
Chinese revolutions made them models for 
liberation movements. While past indepen- 
dence movements were usually led by colo- 
nial landlords and bourgeoisie, after World 
War II decolonization was more commonly 
achieved by mass liberation movements and 
guerrilla armies led by left nationalists. 
Imperial states were faced increasingly with 
the possibility that resisting decolonization 
would spur the growth of Communist Party 
led liberation movements and increase Soviet 
geopolitical influence in the “Cold War.” For 
instance, with the exception of Indonesia, 
. Communist Parties were in the forefront of 
liberation movements throughout Southeast 
Asia (Goodwin 1988). Relinquishing direct 
political control from the imperial state to a 
local state dominated by the indigenous bour- 
geoisie decreased costs while increasing the 
safety of core economic investments from na- 
tionalization by socialists, what Kolko (1968, 
p. 604; 1976) calls a “controlled liberation” 
(see also Darby 1987). Imperial states began to 
officially renounce colonization for "feder- 
alist” alternatives (such as the French Commu- 
nity in 1958). Rather than slowing the drive 
toward independence as intended, these alter- 
natives served as frameworks for granting in- 
dependence while maintaining the appearance 
of association (Grimal [1965] 1978; Albertini 
[1966] 1971). 

The postwar elimination of colonialism 
marks a structural shift in the "international 
regime" (Keohane 1984). Pinpointing an 
exact year is somewhat arbitrary, but an end 
date must be established for analytic pur- 
poses. This is facilitated by locating a formal 
corewide agreement on the issue inscribed in 
an international treaty or institution (Haggard 
and Simmons 1987). While the United 
Nations technically supported self- 
determination since its founding, the clearest 
and most important declaration was the 1960 
"Declaration on the Granting of Indepen- 
dence to Colonial Countries and Peoples" 
(1960). It proclaimed colonialism as contrary 
to fundamental human rights and to the UN 
Charter. This was not simply another rhetori- 
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cal incident because 22 colonies were granted 
independence in 1960. Therefore, 1960 marks 
the end of colonization as an appropriate 
measure of core political contro] over the 
periphery. Colonial data after 1960 are 
increasingly misleading because they do not 
include the more important and subtler forms 
of neocolonialism. 


THE LONG-TERM DYNAMICS OF THE 
CAPITALIST WORLD-ECONOMY 


Economic Long Waves 


According to Chase-Dunn and Rubinson's 
(1979) propositions, the three world-system 
factors that determine the pattern of core 
colonial control over the periphery are 
economic long waves in the core, the cycle of 
world hegemony, and reoccurring major wars 
among core powers. Long waves, or "Kon- 
dratieffs” are extended periods of economic 
expansion and stagnation. The length of long 
waves is variable, but they typically last 
between 40 and 60 years. The literature since 
Kondratieff's ([1926] 1979) and Schumpe- 
ter's (1939) pioneering work is currently 
undergoing a revival as a result of the 
extended stagnation in the 1970s and 1980s 
and the recent stock market crash (a selected 
annotated bibliography is found in Barr 
1979).3 
From a world-system perspective, the 
international division of labor integrates 
national economies such that long waves are 
relatively synchronized into a single world 
phenomenon, at least in the core (Chase- 





3 In a synthesis of the long-wave theories, 
Boswell (1987) concluded that expansive periods 
are caused by a bunched series of technical, social, 
and political accumulation innovations that rapidly 
increase productivity. Growth rates are relatively 
high and recessions are light. Expansion is 
prolonged by the multiplier effect of bunched 
investment and the addition of modifying innova- 
tions. Periods of stagnation follow when the 
multiplier effect is exhausted and demand has 
saturated. Growth rates relatively decline, reces- 
sions are steep and prolonged, and capital 
accumulation is hindered by sluggish demand and 
cautious investment. Long waves are distinct from 
business cycles, four- to seven-year cycle of 
recessions and recoveries. Long waves structure 
the economic environment and the buoyancy of 
both supply and demand in the business cycle but 
do not cause them. 
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Dunn and Rubinson 1979, p. 465; Wallerstein 
1984). Long waves have played a distinctive 
role in world-system studies of the historical 
development of the capitalist world-economy 
(Braudel 1973; Frank 1978). For instance, 
Wallerstein’s two volumes (1974, 1980) are 
thematically structured according to long- 
wave periods. 

Chase-Dunn and Rubinson (1979, pp. 
462-63) contend that “during periods when 
average economic production in the system as 
a whole is expanding, the structure of control 
is less direct and political domination by core 
states over peripheral areas is relaxed. During 
periods when average economic production is 
contracting and competition among core states 
'is increasing, political domination is in- 
creased over older colonies and new areas are 
brought under the colonial domination of 
individual core states." They expect core 
colonization of the periphery to rise during 
long waves of economic stagnations but 
decrease during expansions. Long-wave ex- 
pansions also increase the rate of decoloniza- 
tion. During an expansion, the resources 
available to the lower classes increase and the 
coercion of the market decreases leading to a 
relative downward shift in power (Gordon, 
Edwards, and Reich 1982). Liberation move- 
ments are thus more likely and more potent 
(Chase-Dunn 1987, forthcoming). 


Markets versus Hierarchies 


A missing link in their world-system theory is 
a notion of how economic waves affect the 
political action of imperialist states. The 
theory can be expanded by considering 
colonial conquest as an alternative form of 
capitalist expansion similar to other market 
versus hierarchy trade-offs (Williamson 1975, 
1985; Boswell forthcoming). Colonization 
establishes a hierarchical organization that has 
monopolistic privileges over peripheral land, 
labor, production, or trade. Competitive 
markets push capitalist expansion, but monop- 
olies can increase profits without expansion 
by raising prices. High prices induce market 
competition, so monopolies are unstable 
unless supported by political power. During a 
long stagnation, capitalists (and kings) pres- 
sure state officials for monopolistic privileges 
and territorial expansion. At the same time, 
state revenues decline unless taxes are raised 
(which would prolong the stagnation). How- 
ever, military resources do not automatically 
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decline, especially when empires face threats 
of war or colonial revolt. Thus empires 
confront declining nonmilitary resources and 
fewer operational options relative to their 
military ones. Hierarchical expansion within 
the core is highly expensive (war between 
core states declines during stagnations; Gold- 
stein 1985), pointing them toward the periph- 
ery. Colonial conquest then becomes an 
increasingly attractive strategy because it uses 
an available resource (the military) and 
potentially increases taxable resources. 

The reverse processes occur during ex- 
tended economic expansions. Profits rise in 
the core, making monopolistic charters and 
military conquest a relatively less attractive . 
alternative. Imperialist states experience less 
intense pressure to grant monopoly privileges 
and have more nonmilitary resources and 
options when the market is expanding. All 
else equal, the higher risk and overhead of 
colonial relative to core investment make the 
latter preferable. Furthermore, monopolistic 
policies can be guaranteed only for that part 
of the world-economy under the political 
jurisdiction of the imperialist state. Politically 
secured privileges in one empire insulate 
colonial monopolies from more innovative 
and more productive producers in other 
empires. Over the long term of the wave, 
investments will shift away from the less 
productive monopoly and may even become a 
drag on core profits. Colonial conquest 
becomes an irrational policy for imperial state 
actors when it interferes with a robust market 
expansion.* 


The Cycle of Hegemony 


The second determinant of variation in core 
political control over the periphery is the 
cycle of hegemony. Wallerstein defines 
hegemony as a historical period of low 
international rivalry when one core power 
manifests “simultaneously productive, com- 
mercial, and financial superiority over all 





* Kondratieff makes a similar argument, claim- 
ing that, “in the capitalistic economic system, new 
regions are opened for commerce ing those 
periods in which the desire of old countries for new 
markets and new sources of raw materials becomes 
more urgent than theretofcre. It is equally apparent 
that the limits of this expansion of the world 
economy are determined by the degree of this 
urgency” ([1926] 1979, p. 539). 
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other core powers” (1980, p. 39, emphasis 
his). The Netherlands in the 17th century, 
then Britain in the mid-19th and the United 
States in the mid-20th centuries have D 
widely recognized as hegemons. 

Hopkins and Wallerstein (1979). divide the 
cycle of hegemony into four phases of rise 
and fall. Phase one, ascending hegemony, is a 
period of conflict between core powers when 
potential hegemons are rising to meet the 
decline of the former hegemonic power. 
Phase two, hegemonic victory, occurs when a 
core power clearly surpasses competing 
contenders. Phase three is hegemonic matur- 
ity, when the hegemon has reached the full 
extent of its economic and political power. 
Lastly, during the phase of hegemonic 
decline, instability and conflict grow between 
the declining hegemon and other core powers. 

Another way to characterize the cycle is in 
two periods: à unicentric period when a single 
core state has full hegemony and a multicent- 
ric period when the hegemon is.declining and 
contenders are ascending. Chase-Dunn and 
Rubinson (1979, p. 463) suggest that during 
the unicentric period, hegemons impose "free 
trade" on the world-economy and "they 
interfere with the colonial ties between other 
core states and their empires." Leading state 
and economic officials in the hegemon tend to 
support a free trade strategy because it enjoys 
a substantial competitive economic advan- 
tage. Undermining the monopolistic colonial 
privileges of other core empires allows the 


hegemon to take advantage of its competitive - 


superiority in the periphery. Colonization is 
expected to decrease during the unicentric 
phases of hegemonic victory and mature 
hegemony. 

A hegemonic nation penetrates the world 
market to such an extent that injuries to the 
legemon have negative consequences. for the 
world-economy as a whole. Mature hegemons 
tend to benefit the entire core by increasing 
the stability of the world-economy and 
reducing the risk of investment. On the other 
hand, during multicentric periods, trade and 
investment are considerably more unstable. In 
order to reduce the level of instability in the 
part of the world-economy they control, 
imperial states will make trade relations more 
bilateral through increased colonization. In 
terms of markets and hierarchies, imperial 
states replace market transactions with colo- 
nial hierarchies during periods of high 
uncertainty in trade relations. As a result, 
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colonization is predicted to increase during 
the multicentric period of hegemonic ascent 
and decline. 


Major Wars Among Core States 


The final Chase-Dunn and Rubinson (1979, 
p. 463) proposition is that colonization 
decreases during ‘periods when the core 
powers are warring among themselves. In a 
later elaboration, Chase-Dunn (forthcoming, 
pp. 459-64) contends that during a war, 
military preoccupation and resource depletion 
reduces the ability of imperialists to initiate or 
support new colonial expeditions and impede 
their resistance to independence movements. 
The more intense the war, the less coloniza- 
tion expected. In addition, even those impe- 
rial states not directly involved in a major war 
will be affected because all core states are tied 
to a network of formal and informal alliances 
and because any major war has the possibility 
of becoming a global war. Nonbelligerent 
imperial states will tend to divert resources to 
preparing for potential conflict and indirectly 
supporting allies. 

On the other hand, Bergesen and Schoen- 
berg (1980) come to the opposite conclusion. 
They consider major wars among core powers 
to be an indicator of political instability 
resulting from a relative equality of political 
power. They argue that, similar to the 
occurrence of instability during periods when. 
a world hegemon is lacking, more political 
control over trade and thus more colonization 
would. occur during war periods. McGowan 
(1985, p. 492) found (limited) empirical 
support for this proposition in his regression 
analysis of colonization that included time, 
the interaction of time and war periods, and 
“colonial cycles" as independent variables.* 
While an important beginning, his analysis 
did not include other world-system or control 
variables and failed to correct for autocorrela- 
ted errors (see critiques of McGowan's model 





* McGowan's “colonial cycles" is a 390-year 
sine wave variable derived from a pattern he 
observed in the residuals. Bergesen (1985) and 
Schoenberg (1985) challenge the temporal specifi- 
cation of the "cycles" and suggest they result from - 
long waves or hegemony. Since both long waves 
and hegemony are measured directly, I have 
excluded the atheoretical “cycles” variable from 
my analysis. Including it might mask the effects of 
other variables of theoretical interest. 
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by Bergesen [1985] and Schoenberg [1985]). 

. Also, both theories overlook the option (often 
goal) of taking the enemy’s colonies through 
war rather than conquering new areas. 
Stealing colonies results in no net gain and 
will reduce new conquests to the extent that it 
diverts resources. Once all the variables are 
included and corrections made, major wars 
are expected on balance to have a negative 
effect on colonization because of the diver- 
sion and exhaustion of resources used in 
colonial conquest. 


DATA AND VARIABLES 


The data on colonization are drawn from 
Bergesen and Schoenberg’s (1980) coding of 
Henige’s (1970) exhaustive list of all known 
colonial governors since 1415. In this data 
set, the life of a colony is determined by when 
the first colonial governor was officially 
established and the last terminated. The 
colonization variable (COLON) is the cumu- 
lative net of annual establishments minus 
terminations, which provides an overall 
indicator of the total number of colonies for 
each year. Colonial establishments and termi- 
nations that were solely the transfer of a 
colony from one imperialist to another are 
thus excluded. Starting with 98 colonies in 
1640, cumulative net colonization peaked at 
168 colonies in 1921, and 1923-1925, then 
fell to 93 in 1960, the last year in our study. 
This is one of the few quantitative 
world-system data sets that is both “free of 
cross-sectional problems” and covers “a 
‘longue durée’ sufficient to reveal structural 
change” in the system (McGowan 1985, p. 
478). Establishing or terminating a colonial 
governorship is not a trivial political act. It 
requires both a physical movement of an 
administration and a legitimate claim in the 
“international regime” to recognize a territory 
as part of an imperial state’s domain (or to 
have to relinquish that claim to dominion). 
While this is the best available data source 
on colonization, it has certain limitations. 
Trading, settler, and occupation colonies are 
treated the same, yet the theory of markets 
and hierarchies would not apply to the first 
type if they were simply posts with no 
production or trade to monopolize. Internal 
colonies and subjected provinces, such as the 
Celtic fringe in the British Empire or Serbia 
in the Austria-Hungarian Empire, are not 
included. As a result, territories that eventu- 
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ally became part of nation-states are not 
counted. For example, Alaska is counted as a 
Russian colony until 1865 but never as a U.S. 
colony. The data also include colonies held 
by semiperipheral states, the most important 
being those held by Portugal. However, the 
same dynamics that affect core imperialists 
should affect semiperipheral ones as well, 
only more severely. For instance, Barbosa and 
Hall (1986, p. 111) concluded that Portugal’s 
semiperipheral position amplified “Brazil’s 
susceptibility to fluctuations in the world- 
economy.” More importantly, semiperipheral 
imperialists typically serve as transmission 
belt surrogates for core imperialists, the most 
important example being the transmission belt 
relationship between Portugal and Britain 
after the Methuen treaty of 1704 (Parry and 
Hopkins 1970).§ 


World-System Variables 


The long-wave variable (WAVE) is dummy 
coded 1 for expansion years and 0 for 
stagnation years (see Appendix 1). Dates for 
long waves prior to 1790 are derived from 
Goldstein’s synthesis of price indexes (1985). 
Rising prices are considered a proxy for 
increased production, declining prices for 
decreased production, direct measurement of 
which is not available until 1790. Mandel’s 
(1980) dating based on production series is 
used after 1790. The dummy coded variable 
is intended to measure the effects of reoccur- 
ring periods or eras rather than annual 
fluctuations in economic conditions. It is 





5 Measuring colonization by governors does not 
at first appear to measure their relative importance 
(McGowan 1985, p. 478). However, the most 
important colonial regions were rarely under the 
rule of a single governor. For example, Britain's 
most important colonial possessions, North Amer- 
ica and India, were made up of 13 and 17 separate 
colonies respectively (Henige 1970, pp. 83-190). 
While the importance of each individual colony 
cannot be ascertained, the data indirectly indicates 
the relative importance of colonial regions that 
upon decolonization became nation-states. Also, 
the piecemeal establishment of governors in the 
separate colonies yet near simultaneous termina- 
tion in a region reflects the processes of colonial 
conquest and liberation. As a result, the overall 
pattern of colonization is similer to historical 
descriptions by Fieldhouse (1973) and Choucri and 
North (1975). . 
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expected to have a negative effect on 
colonization. 

Four hegemony variables are derived from 
Hopkins and Wallerstein’s (1979) dating of 
the phases in. the cycle of hegemony. 
Hegemonic victory (HEGVIC) (1815-1849, 
1920-1944), hegemonic maturity (HEGMAT) 
(1850-1872, 1945-1967), and hegemonic 
decline (HEGDEC) (1650-1692, 1873-1897) 
are each coded 1, 0 otherwise. Hegemonic 
ascent phases (1798-1815, 1897-1920) and 
inconclusive time periods are excluded (see 
Appendix 1). A clear hegemon exists in only 
the hegemonic victory, maturity, and decline 
phases. Several contending potential hege- 
mons, none with an a priori likelihood of 
success, exist during the ascent and inconclu- 
sive periods. The hegemonic victory and 
maturity variables are expected to have a 
negative effect on colonization, while hege- 
monic decline is expected to have a positive 
effect. 

.Major wars are defined as wars between 
Great Powers that involve at least 50,000 
troops and last two years or more (Levy 1983, 
pp. 10-19, 47-48). The effect of major wars 
is not expected to be uniform but instead to 
depend upon the intensity of the war. The 
more intense the war, the less colonization. 
War intensity (WARINT) is the logged index 
of annual battle deaths during a major war as 
a proportion of total European population 
(Levy 1983, pp. 87-91). It is expected to 
have a negative effect on colonization. 


Control Variables 


Major wars have received the most attention 
in long-wave research as both a dependent 
(Goldstein 1985; Thompson and Zuk 1982) 
and an independent variable (McGowan 1985; 
Kiser and Drass 1987) but may not be the 
most important type of war when it comes to 
colonization. Civil wars in core states would 
` also tend to preoccupy and exhaust their 
military and political apparatus, retarding 
colonization. Thus, civil wars in the core 
are also expected to have a negative effect 
on colonization. The civil war variable 
(CIVWAR) is derived from dummy coding 
Wright’s (1942, p. 641) list of civil wars 1 for 
years when an internal war occurred and 0 
otherwise (no intensity measures are avail- 
able). Unfortunately, Wright’s coding and 
concept of what constitutes a civil war are 
unclear. His list included a number of 
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colonial wars such as the American Revolu- 
tion, the Haitian Revolt, and the Latin ~ 
American Revolt. To construct the civil war 
variable, wars with more than one participant 
(i.e., not internal) were subtracted. The data 
were updated using Small and Singer’s 
appendices (1972). 

Another factor that should obviously ac- 
count for part of the variation in colonization 
is the number of imperialist states. As more 
states enter the competition for overseas 
colonies, more colonies should be estab- 
lished. The imperialist variable (IMPERIAL) 
is measured by the number of states that 
owned at least one colony per year (unfortu- 
nately a measure of failed imperialists is not 
available). It is derived from Henige (1970) 
and ranges from 5 to 11. Also included is.a 
variable that measures the growth of Euro- 
pean population under the assumption that 
periods of rapid growth would be periods of 
increased colonization for purposes of settle- 
ment. The variable (LPOPCHAN) is the log 
of the annual percent change in European 
population. European population figures are 
derived from Wright (1942, p. 612) and 
United Nations (1984, pp. 11-50) with a 
linear plot used to map missing data. 

The shift in the legitimacy of colonialism in 
the international regime is measured by 
mapping one of the major causes of that shift, 
the rise of socialist states and Soviet influ- 
ence. The number of socialist states was 
coded from the World Almanac (1987), 
starting with 1 in 1917 and ending with 13 in 
1954. Socialist states are defined as countries 
ruled by a Communist Party. While obviously 
a rough measure, this variable (SOCSTAT) 
maps the limited political influence during the 
interwar period but rapid growth of Soviet 
power and influence following World War II. 
To be sure, decolonization occurred for 
reasons other than the threat of socialism. 


Nationalism has been the most important . . 


element in spawning liberation movements. 
With the eventual commodification of eco- 
nomic relations, the utility of colonization for 
imperialists has also greatly decreased. How- 
ever, the number of colonies was nearly at its 
peak in 1940, and nationalism or commodifi- 
cation cannot explain the near universal 
acceptance of decolonization shortly thereaf- 
ter. In addition, an expansive long wave and a 
period of mature hegemony overlap with the 
postwar decline in colonization. Including the 
socialist states variable controls for the 
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uniqueness of the end of colonization, 
allowing the effects of waves and hegemony 
to be separately assessed. 

Two final control variables were used in 
the analysis. A time-trend variable, (YEAR) 
measured by the years 1640 to 1960, is added 
to eliminate any unmeasured linear effects 
and reduce the possibility of spurious effects 
(McGowan 1985, p. 490; Jacobs 1988, p. 
863). One likely unmeasured linear variable 
is the cumulative effect of commodification. 
In order to account for the inherent inertia in 
colonization, a lagged dependent variable 
(LAGCOLON) is also included in the equa- 
tion. Rapid structural change in large histori- 
cal processes such as colonization is unlikely. 
Empires are neither built nor destroyed in a 
year. The start-up costs of a colonial 
apparatus, such as governors, bureaucrats, 
and military regiments, are high, as are the 
military and economic costs of liberation. 
Furthermore, conquest of the colonial world 
was divided among the various imperialists, 
which diffuses and delays the impact of 
common world conditions. A simultaneous 
(de)colonization would be nearly impossible. 
As a result, the responses of imperial or 
colonial actors to world conditions are 
constrained and delayed by inertia in the 
colonial systems. Changes in particular world- 
system variables can only explain partial 
adjustments in colonization. However, histor- 
ical inertia also means that the causal effects 
of any large-scale historical variable are not 
played out in a single year. Instead, the 
effects are distributed over time, long after 
the initial occurrence. The strength of inertia 
is indicated by how close the coefficient of 
the lagged dependent variable approaches one 
(Hicks 1984, p. 170; Johnston 1972, pp. 
292-303). Including the lagged dependent 
variable turns the model into a dynamic one 
that examines the change in colonization over 
time. 


REGRESSION RESULTS 


Regression equations were estimated using a 
time-series model that corrects for first-order 
autocorrelation of errors by means of general- 


ized differences (White 1978). All the 





7 The SHAZAM econometrics computer pro- 
gram has a sophisticated procedure for time-series 
analysis that includes an iterative Cochrane-Orcutt 
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independent variables were lagged one year to 
reflect the time lag between changing condi- 
tions in the world-system and net coloniza- 
tion, but only one year in order to minimize 
overlap between different long-wave and 
hegemony periods. In the first equation in 
Table 1, only the five world-system variables 
and the lagged dependent variable are exam- 
ined. The equation has a very high? (.985), as 
is common with the inclusion of a lagged 
dependent variable, and it is devoid of 
first-order autocorrelation error (Durbin's 
h = -—0.066). As expected, colonization 
declined on the average during extended 
expansion periods net of other variables as 
indicated by the significant negative relation- 
ship for the long-wave variable. Both the 
unicentric hegemony variables (victory and 
maturity) also had expected negative effects. 
Unexpectedly, the slope estimate for the 
hegemonic decline variable also had a nega- 
tive sign, although it failed to reach signifi- 
cance. The slope estimate for the war 
intensity variable was effectively zero. 

On the full model that includes the control 
variables (equation 1.2), six variables—long 
waves, hegemonic victory, imperialists, civil 
wars, socialist states, and the lagged depen- 
dent—reached significance at the 0.05 level 
or better. The equation also has a good fit 
(R? = .986) again as a result of inclusion of a 
lagged dependent variable, and is devoid of 
first-order autocorrelation error (Durbin’s 
h = 0.235). The long-wave variable main- 
tains its significant negative relationship as 
does the hegemonic victory variable. But, 
with the inclusion of the socialist state 
variable, hegemonic maturity no longer reaches 
Significance. The imperialist variable shows 
the expected positive relationship, and the 
socialist states measure of international re- 
gime shift has the expected negative relation- 
ship. Regardless of how these two control 
variables are interpreted, the long-wave and 


type method for calculating rho, correction for 
autocorrelation, and an option specifying that 
calculation of the estimated variances be adjusted 
for the presence of a lagged endogenous dependent 
variable. Prior to familiarization with SHAZAM, I 
estimated the regression equations in Table 1 using 
the more commonly used pseudo-GLS (ARI) 
method on SPSSX (Hibbs 1974, Hicks 1984). 
While the final results are nearly identical, the 
accuracy, speed, and options of Shazam make it 
far superior (despite the name). 
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Table 1. Regression Analyses* of Cumulative Net Colonization on Long Wave, Imperial, Hegemony, War, Socialist, 
Population, and Time Trend Variables 
Dependent Variable: Colonization (COLON) 


(1.2) (1.2) 


Independent Variables B beta B beta 
Long waves (LAGWAVE t-1) — .820 —.019* — 1.348 — .032* 
Hegemonic victory (LAGHVIC 1-1) -1.311 — .0214* — 1.733 — .032* 
Mature hegemony (LAGHMAT 1-1) — 1.491 — .024* — .676 —.011 
Hegemonic decline (LAGHDEC t-1) —.022 —.004 —.552 - 011 
Log war intensity (WARINT 1-1) — .000 — .000 -016 .002 
Civil war (CIVWAR r-1) — .860 — .022* 
Imperialists (IMPERIAL 1-1) .709 .044* 
Socialist states (SOCSTAT 1-1) —.315 — .034* 
Log pop. change (LPOPCHAN. 1-1) —.211 — .006 
Lag dep. variable (LAGCOLON t-1) 7 .979 .978* .968 .967* 
Time trend (YEAR) — .000 — .001 
Constant 3.540* — .094 

R 985 .987 

Adj. R? 985 .986 

Durbin- Watson 1.803 1.810 

Durbin's h — .066 235 

rho 317 2270 


* p<.05 one-tailed test. 
** p.10 one-tailed test. 


* The analysis was corrected for first-order autocorrelation (ARI). 


hegemonic victory variables remained signif- 
icant despite their presence. 

While the civil war variable had a signifi- 
cant negative effect as expected, the major 
war variable is still far from significant. The 
rate of European population growth also has 
no significant impact on colonization. Popu- 
lation growth may be important for some 
colonies (such as settler colonies) and some 
historical periods, but it apparently does not 
affect the entirety of colonization systemati- 
cally over the “longue durée.” 


Alternatives and Diagnostics 


Dates for long waves based on both prices 
(Kondratieff [1926] 1979; Goldstein 1985) 
and production are widely used (Mandel 
1975, 1980; van Duijn 1983; Kiser and Drass 
1987). The price and production dates differ 
widely for the period of 1790-1825 (Gold- 
stein 1985, p. 413). The expansion period 
that began in 1790 ends in 1813 according to 
the price data, but not until 1825 according to 
the production data. Because price is only a 
proxy for production, all dates that rely on 
prices suffer from possible misclassification 
due to spurious causes of inflation, such as 
war (Wallerstein 1984, p. 561; Schumpeter 
1939, p. 298). While historical data from this 
period is limited, theoretically, production 


data should be superior to price data. As in 
other recent studies of long waves, Mandel's 
dates were chosen on a priori theoretical 
grounds (Pfister and Suter 1987, p. 243; Kiser 
and Drass 1987, p. 289; van Duijn 1983). 
After the analysis was completed, inspec- 
tion of the residuals indicated that selection of 
the production over the price-based dating 
Schemes was highly advantageous for the 
theory. The disputed period, 1814 to 1825, 
includes the main period of decolonization in 
Spanish America. In particular, 31 colonies 
were liberated during 1821, the most in any 
single year. The production dating puts this 
decolonization period during a long expansion 
as predicted by the theory; the price dating 
excludes it. While. the production dating is 
more theoretically sound, it is important to 
know whether effects of long waves rest upon 
8 disputed period. In equation 2.1 on Table 2, 
long waves have been recoded using the 


: price dates for the disputed period (i.e., 


1814-1825-0). As one can see, while the 
coefficient for the altered variable is reduced, 
it remains significant, albeit at only the 0.10 
level. None of the other variables is much 
affected. 


8 An alternate procedure is to retain the 
production dating of long waves and examine 
whether 1821 is an unduly influential case. Using 


190 


AMERICAN SOCIOLOGICAL REVIEW 


Table 2. Regression Analyses* of Cumulative Net Colonization on World-System and Control Variables: Alternative 
Models i 


Dependent Variable: Colonization (COLON) 


Independent Variables B 
Long Waves (LAGWAVE 1-1) —.665 
Hegemonic victory (LAGHVIC 1-1) —1.598 
Mature hegemony (LAGHMAT 1-1) — 906 
Hegemonic decline (LAGHDEC 1-1) — 208 
Log war intensity (WARINT 1-1) .004 
Civil war (CIVWAR 1-1) —.821 
Imperialists (IMPERIAL 1-1) 610 
Socialist states (SOCSTAT 1-1) — .329 
Log pop. change (LPOPCHAN. 1-1) —.219 
Lag dep. variable (LAGCOLON 1-1) .970 
Time trend (YEAR) .000 
Log war intensity (WARINT 1-5) — 
Constant 

R 

Adj. R? 

Durbin-Watson 

Darbin’s h 

rho 


* p<.05 one-tailed test. 
** p<.10 one-tailed test. 


Q.1* (2.2 
beta B beta 
— 016** — 1.465 — 034* 
— .029* —1.778 — .032* 
—.015 —.781 —.013 
— 004 — 611 - 012 
.001 —.019 — ,003 
—.021* ~ 951 —.024* 
.038* 763 .047* 
— .036* —.344 - .037* 
— 007 —.198 — .006 
.970 .964 .963 
.002 — 001 — 005 
NS .116 .018* 
—1.243 2.641 
.986 .987 
.986 .986 
1.822 1.822 
125 51 
.307 .286 


* The analysis was corrected for first-order autocorrelation (AR1). 
> Long Waves was recoded, 1814-1824 =0 (stagnation) in equation 2.1. 
* The first War Intensity variable was unlagged in equation 2.2. 


A second issue is the surprising insignifi- 
cance of major war intensity, given McGo- 
wan's (1985) earlier finding.? The positive 


discrete data points in single years makes influen- 
tial cases an important problem in the quantitative 
study of historical processes (Bollen and Jackman, 
1985). A Cook's D test on the residuals found no 
significant undue influence. The highest score was 
for 1821 (D=.9485, sig. = .49). While the 
influence of 1821 is not significant overall, it may 
have a particular effect on the long wave or other 
variables. To test this, a dummy variable for 1821 
(YEARD - 1 in 1821) was added to equation 2.2. 
Adding a single-case dummy variable makes its 
influence as high as possible and the effects of 
other variables i t from it, obviously a 
stringent test (Jackman 1987). When added to 
equation 1.2, YEARD had a highly significant 
coefficient (b = — 28.365, s.e. — 1.92), and the 
coefficients for the other variables were (some- 
what) reduced. However, with the exception of 
civil war, the same variables remained significant, 
indicating robust findings given rather stringent 
conditions. The importance of civil war may be 
linked entirely to the internal conflicts in Spain and 
Portugal during 1821, which aided the liberation of 
Latin America. 

? McGowan used a different measure of major 
wars, which might explain the difference in 
results. He dummy coded major wars (1 = war) 


sign is also unexpected, although this squares 
with political instability theory. If both 
theories are relevant, the expected negative 
effect of resource depletion may be sup- 
pressed in the model by the positive reaction 
to political instability. These crosscutting 
effects can be separated if they occur at 
different time points. Resource depletion and 
diversion from colonization should be greatest 
while a war is occurring. On the other hand, 
reactions to political instability would be 
muted during a war but come to the fore 
afterwards as combatants jockey for position 
in the new hierarchy. We might expect war to 
initially retard colonization, but later induce 
increases (see Chase-Dunn, forthcoming). A 
second reformulation (equation 2.2) includes 
war intensity twice, once unlagged and the 
second lagged five years (t-5). The signs of 
both coefficients are in the expected direc- 
tions, but only the second, lagged variable 
reaches significance. Postwar political insta- 


and then created an interaction term of war times 
year. Substituting this variable for WARINT in a 
separate analysis of equation 1.2 had no effect. 
The new coefficient was also essentially zero (b = 
— 0.0001, se — 0.0002) and no change occurred 
in the other coefficients. 
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bility among the Great Powers apparently 
leads some to increase their rate of coloniza- 
tion, but only after the war is well over (or at 
least has lasted five years). 

Finally, in all the equations the effect of 
inertia is exceptionally high, as indicated by 
the coefficient of the lagged dependent 
variable. It accounts for the high R? values, 
which can be deceptive (see Appendix 2 on 
alternative interpretations of inertia). One 
way statistically to minimize its effects 
without affecting the other coefficients is to 
subtract it from the dependent variable, 
creating a change variable (CHGCOLON = 
COLON - LAGCOLON). As a result, the 
coefficient for the lagged dependent variable 
becomes minus one its previous value (b — 1) 
(Johnston 1972, pp. 292—383). A regression 
analysis was performed on the change vari- 
able, using the same model as equation 2.2. 
Equation 2.2 (Y = a + b Yg_1) + bX + u) 
and the change equation (Y — Ya, = a + 
(b —1) Yo—1) + bX + u) are equivalent. The 
results indicate a respectable but much lower 
amount of explained variance with an R, of 
0.227. The coefficients for this model are not 
reported since they are exactly the same for 
the independent variables in equation 2.2, 


except for the lagged dependent variable 


which falls to 0.968 — 1 = 
(s.e. =0.014). 


— 0.032 


DISCUSSION 


Of the three world-system propositions, most 
support is found for the idea that expansive 
imperial colonization has an inverse relation- 
ship with extended economic expansion and 
contraction. Markets versus hierarchy theory 
appears to make a useful contribution to 
explaining the pattern of colonization. Impe- 
rial states are less likely to construct colonial 
hierarchies when markets are expanding and 
vice versa. The effect of long waves remains 
despite inclusion in the model of additional 
control variables. Nor do the effects depend 
upon a disputed measure or an influential 
case. The findings also lend support for the 
existence and historical importance of eco- 
nomic long waves in general. 

The proposition that unicentric global 
hegemony retards colonization was only 
partially supported in the analysis. Only the 
phase of hegemonic victory was both consis- 
tent with the theory and consistently signifi- 
cant. The victory phase occurs when the new 
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hegemon surpasses its rivals after a long 
competitive ascent. The findings here suggest 
that victorious hegemons support “free trade” 
in order to consolidate their superior position, 
but slacken support once trade relations have 
been realigned. Most importantly, while the 
theory predicts that hegemons support the 
decolonization of competing empires, it is 
silent on decolonization of their own posses- 
sions. British imperialists may have preferred 
independent states in Latin America to the 
Spanish Empire, but they saw no similar 
advantages in an independent India during the 
same period. In the US, which had only a 
small colonial empire, strong support for 
international decolonization under Franklin 
Roosevelt dropped off precipitously after the 
United States reached. mature hegemony 
following World War II (Grimal [1965] 1978, 
pp. 148—56). 

In regard to the effects of major wars on 
colonization, the theory is divided between 
the expected negative effects of resource 
depletion and positive effects from reactions 
to political instability. The initial model did 
not resolve this difference since no clear 
effect of major war intensity was found net of 
other variables. I attempted to resolve these 
differences by including two war variables, 
one with an expected immediate negative 
effect and the second with an expected 
five-year lagged positive effect. Only the 
latter was significant, supporting political 
instability theory. Historically, acquiring col- 
onies from defeated empires has been a 
central purpose of interimperialist war such as 
the British taking of Canada from France and 
the allied distribution of German colonies 
after World War I (Darby 1987, pp. 15-30; 
Choucri and North 175, p. 154—60; Ethering- 
ton 1984). Yet this only produces a redistri- 
bution of colonies with no net increase. More `. 
important was the colonial expansion of 
victors into the spheres of influence of 
defeated empires. For instance, British colo- 
nization of the Indian subcontinent acceler- 
ated after France was no longer a viable 
competitor. 

Support is also ix for the theory of 
"controlled liberation," but it applies only 
where imperialists yielded independence to 
avoid a socialist or pro-Soviet takeover. If no 
acceptable alternative was available, the 
theory- predicts that they would stiffen resis- 
tance. For instance, the official U.S. explana- 
tion for its postwar decline in support of 
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decolonization was the need for “restraint” 
where rapid decolonization might result in 
“Communist imperialism” (see documents 
quoted in Grimal [1965] 1978, pp. 175-76). 
The rise of socialist states may have had 
mixed effects on the rate of colonization, but 
the net effect is negative. Rather than “the 
highest stage of capitalism” (Lenin [1917] 
1939), the peak in colonialism around War 
War I represented the highest stage of 
(formal) imperialism.: 


CONCLUSION 


Previous quantitative support for world- 


system theory has come primarily in the form 
of cross-national studies of development 
during recent time periods. In contrast, this 
: study provides quantitative statistical support 
for the operation of world-economic dynam- 
ics at the level of the whole system for a 
320-year period. It replicates and expands 
upon conclusions from previous research that 
have been frequently cited as evidence of the 
efficacy of the theory even though the theory 
has not heretofore undergone rigorous statis- 
tical analysis. Two of the three Chase-Dunn 
and Rubinson (1979) propositions were at 
least partially supported and tbe theory was 
elaborated by treating the propositions in 
terms of markets versus hierarchies. The 
conclusions from this study indicate that 
colonization is an alternative form of systemic 
expansion when economic markets are stag- 
nating and, to a lesser extent; when political 
power in the core is multicentric.!? No clear 
support was found for the negative effects of 
resource depletion during a major war, but 
subsequent changes in the balance of power 


19 The effects of hegemony may be diminished 
by using three dummy variables to measure a 
single process, resulting in possible multicollinea- 
rity. An alternative operationalization of hegemony 
is to use only one variable, coded 1 or hegemonic 
victory and maturity, and —1 for hegemonic 
ascent and decline or no hegemon. This variable 
reflects Chase-Dunn and Rubinson's (1979) bipo- 
lar concept of hegemony rather than Hopkins and 
Wallerstein's (1979) four phases. Substituting this 
single variable (lagged 1 year) for the there- 
hegemony variable (HEGVIC, HEGMAT, AND 
HEGDEC) in equation 2.2 yields a significant 
coefficient (b = —.537, s.e. = .263) without 
much altering the rest of the equation. 
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eventually lead to an acceleration of coloniza- 
tion. 

While recent research has greatly improved 
its quality, the quantitative evidence for long 
waves and hegemony is frequently mixed and 
Sketchy, especially data from prior to 1848. 
That crude dummy variables based on this 
data were significant in models explaining 
global events of major historical consequence 
lends credence to the viability of the 
concepts.!! Colonial governorship also fails 
to measure both internal colonies and the 
intensity of incorporation into the world- 
economy, such as the difference between 
trading, settler, and occupation colonies. 
Since imperialists could respond to multicent- 
ric hegemony or stagnation by intensifying 
political domination of existing (internal) 
colonies rather than adding new ones, using 
governorship mutes their full effects, contrib- 
uting to the low levels of explained variance. 
In addition, the theory of markets and 
hierarchies may not apply to trading colonies, 
and their prevalence in the earlier years may 
have depressed the relationships. Compara- 
tive case studies of colonization are needed to 
investigate differences in the intensity and 
types of colonies. 

While no substitute for detailed compara- 
tive and interpretive historical research, 
quantitative research enhances our confidence 
in the theory. Analyzing aggregate dynamics 
over extensive time periods tests "the basic 
notion of a system" (Stinchcombe 1982, p. 
1394). The breakthrough in assembling pre- 
20th-century data on colonialism (Bergesen 





!! Modelski and Thompson (1988, Appendix B) 
recently measured “world leadership" from 1494 
to the present based on the concentration of naval 
power among great powers. The variable ranges 
from near equal distribution in 1648 at 0.164 to 
1.00 from 1946 to 1959 with the U.S. nuclear 
monopoly (mean = 0.367). While explaining the 
differences between their theory and Wallerstein's 
is beyond the scope of this study, hegemony and 
world leadership are kindred concepts (the former 
more economic, the latter more political) with 
similar empirical patterns (correlation with HEG- 
MAT - 0.46, HEGDEC = -—0.13, HEGVIC - 
0.02). A strong negative effect on colonization was 
found when the log of world leadership (lagged 
one year) was substituted for the hegemony 
variables in equation 2.2 (b — —2.899, s.e. — 
0.950) indicating a robust model (see also, 
footnote 10). The rest of the model remained 
basically unchanged. 
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and Schoenberg 1980), major wars (Levy 
1983), long waves (Goldstein 1985), and 
naval power (Modelski and Thompson 1988) 
makes such research a realistic possibility. 
Further analysis of this data is a logical next 
step in cumulating support for world-system 
theory. In order to focus on strictly capitalist 
economic dynamiics, I began the time-series 
analysis at 1640. Research beginning earlier 
should indicate whether theories of long 
waves or markets versus hierarchies are 
appropriate in noncapitalist or transitionary 
periods. Studies of core-periphery political 
relations after 1960 should not use colonial 
governorship as a measure because of the 
shift in the international regime. ' 

The results of this study lend credence to 
only to the propositions concerning coloniza- 
tion, but also to the historical importance of 
long waves and hegemonic phases that are 
central to world-system theory. Their effects 
should be relevant for studies of internal 
colonialism and nation building, such as an 
explanation of the pace of continental expan- 
sions by the United States or the Russian 
Empire (see Hall, forthcoming). Extending 
the analysis of Goldstein's (1985) data on 
long waves and war is another obvious step, 
and examining Wallerstein’s propositions 
about world development with quantitative 
data is another interesting research path. 

The political relationship of core nations 
(and their semiperipheral allies or surrogates) 
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with the periphery has been articulated in the 
form of colonial empires for most of the 
history of the capitalist world-economy. The 
theory suggests that political domination 
should intensify during stagnations and peri- 
ods of multicentric hegemony. Subsequent 
research will need to define core political 
control of the periphery in terms other than 
colonization in order to study later time 
periods or to study noncolonial control 
patterns prior to 1960. Such research is 
necessary for complementing the existing 
economic theories with a well-rounded politi- 
cal analysis of the world-system. 
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rebellion, income inequality, economic growth 
and regime repressiveness. 


APPENDIX 2: Interpretations of Inertia — 

There are two polar interpretations of inertia in 
time-series equations. The first interpretation 
specifies that inertia in the dependent variable 
prevents external conditions from having more 
than marginal effects. The small coefficients found 
in this and most other time-series equations that 
include a lagged dependent variable are accurate 


APPENDIX 1. Coding for Long-Wave and Hegemony Variables,* 1650-1789 


Economic 
Long Wave Hegemonic Victory 
Period Code Period Code 
1650-1688 0' 1650- 0 
1689-1719 1 
1720-1746 0 
1747-1761 1 
1762-1789 0 1814 
1790-1825 1 1815- 1 (G.B.) 
1826-1847 0 
1848-1873 1 1849 
1850- 0 
1874-1893 0 
1894-1913 1 
1914-1939 0 1919 
1920- 1 (U.S.) 
1940-1967 1 1944 
1945-- 0 


Hegemonic Maturity Hegemonic Decline 
Period Code Period Code 
1650- 0 1650- 1 (U.N.) 

1672 
1673- 0 
1849 
1850- 1 (G.B.) 
1872 1872 
1873- 0 1873- 1 (G.B.) 
1896 
1897 0 
1944 
1945- 1 U.S.) 


Sources: Long Waves: Mandel (1975, 1980), Goldstein (1985); Hegemony Variables: Hopkins and Wallerstein 


(1979). 


* The phase of hegemonic ascent and period of no hegemon, 1673-1814, 1897-1919, are excluded. 
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indicators of the total impact of these variables. 
This might be especially true of world-systemic 
variables which measure changes in the global 
environment that constrain and motivate state 
actions, but do not tap the internal dynamics of 
individual states or empires. 

A second interpretation views inertia as delaying 
the effects of the independent variables, distribut- 
ing them over time. While measured at one point 
in time, a major historical process—such as a long 
wave, rise of a global hegemon, or world war— 
could have a distribution of (declining) effects 
spread over several years. The reasons for the 
continued effects are the inability (and unwilling- 
ness) of imperial actors to quickly modify the 
colonial structure (in a single year) in response to 
changing conditions. The second interpretation 
leads one to the assumption that, rather than an 
index of the effect of the lagged dependent 
variable, its coefficient (b;) is the repository of the 
effects of all the variables after their initial 
“impact.” This coefficient can be used to trans- 
form the coefficients of the other variables from 
indexes of particular period-to-period “impacts” to 
coefficients of the cumulative "equilibrium" ef- 
fects. The transformed coefficients index the sum 
of all distributed impacts on "Y" of a sustained 
change in "X" from the first impact through to the 
last (Johnston 1972, p. 292-303). The estimated 
regression coefficients for the independent vari- 
ables (bj) are thus deflated and should be adjusted 
upward. On the average, if we made the Kyock 
assumption of a declining geometric sequence, the 
adjustment equals b; / (1 — bı) (Hicks 1984, p. 175; 
Johnston 1972, pp. 299—302). For instance, 
equation 1.2 would be rewritten as: 


Y = —42.13 WAVE —54.16 HEGVIC — 21.12 
HEGMAT -—17.25 HEGDEC —0.50 WARINT 
— 26.88 CIVWAR + 22.15 IMPERIAL — 9.84 
SOCSTAT - 6.59 LPOPCHAN - .0003 YEAR 


According to the Kyock assumption, distributed 
lag effects decay exponentially over time, techni- 
cally, for infinity. The equation could also be 
reestimated using the Almon model where the 
effects are distributed polynomialy over time 
and/or to a "truncated" form with a finite series of 
lags (Hibbs 1974). Differences between an "infi- 
nitely" declining and a judiciously "truncated" 
Kyock model should be small. 

In sum, while the results presented in the text 
are probably conservative with respect to the 
qualitative conclusions, they are consistent with a 
range of quantitative conclusions. At one extreme, 
the first interpretation suggests that the indepen- 
dent variables explain a modest 20 percent of the 
variance in colonization. Much of the process must 
be left to a combination of unformulated (or better 
measured) world-level factors and more micro- 
scopic processes and events. At the other extreme, 
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the second interpretation suggests a reformulated 
model that generates only partial adjustments in 
"Y" for changes in "X" for any single period, but 
about 98 percent of the variation once cumulated. 
While the first interpretation errs in underestimat- 
ing the total distributed lag effects of the 
independent variables, the second interpretation 
errs in overestimating these effects by allocating to 
the independent variables all the variation ex- 
plained by the lagged dependent variable. At this 
point, one can be confident that the quantitative 
power of the model lies somewhere between these 
two extremes, but it cannot be determined where. 
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This paper extends Perrone’s conceptualization of positional power of labor and 
investigates its impact on three labor outcomes. We conceptualize positional power 
as embedded in the structural relations among interconnected spheres of 
production in the economy. We maintain that positional power is a three- 
dimensional construct. It entails (1) the ability of workers in a given industry to 
disrupt operations of other industries “upstream” in the production process, (2) 
the ability to disrupt operations “downstream” in the production process, and (3) 
the ability to disrupt production locally within their own industry. After developing 
new measures of these three dimensions from interindustry input-output flow 
matrices, we estimate the impact of positional power on three labor outcomes: (a) 
labor’s organizational capacity, (b) labor militancy, and (c) labor’s economic 
standing. Our findings suggest that positional power positively influences all three 
of these dimensions. We discuss the significance of these results for future 
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research. 


INTRODUCTION 


Sociologists have a long-standing interest in 
the sources of economic power among 
competing classes. Power can be defined as 
the ability of individuals or collectivities to 
obtain and retain advantages in the economy 
and the polity. Many sociologists have been 
concerned with the sources of workers’ 
powers although the question of how workers 
obtain and retain power is a matter of much 
debate. Some view power as a result of 
differing “market capacities” that workers 
bring to the labor market (Parkin 1971; 
Giddens 1973). Marxists view power as a 
dynamic feature of the contradictory tenden- 
cies in capitalism; the concentration and 
centralization of capital and the collective 
organization of production encourage the 
formation of trade unions and working-class 
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action (Marx [1867] 1967). Edwards (1979) 
and his collaborators, for example, discuss 
the historical evolution of “control systems” 
in the capitalist firm and characterize capital- 
ist development as class power struggles 
about and within “social structures of accu- 
mulation” (see Gordon, Edwards, and Reich 
1982). Others emphasize the importance of 
labor’s organizational strategies (Rubin, Grif- 
fin, and Wallace 1983) and/or capitalists’ 
counterorganizational strategies (Griffin, Wal- 
lace, and Rubin 1986). Recent theories of the 
dual economy (e.g., Beck, Horan, and 
Tolbert 1978) and their derivatives (Hodson 
and Kaufman 1982) have demonstrated differ- 
ential earnings power for workers in different 
sectors of the economy, implying strongly 
that workers have differential power vis-a-vis 
their employers. Finally, several theorists 
from widely divergent perspectives have 
pointed to the emergence of a “new class” of 
technocrats, intelligentsia, and experts who 
exert power deriving from their privileged 
access to key information or knowledge 
(Gouldner 1979; Mann 1973; Bell 1973). 
‘While each of these perspectives has 
merits, none precisely captures the structural 
bases of power embedded in the linkages 
among different spheres of production. They 
fail to account for how, in strategically 
situated industries, a small number of workers 
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can sometimes exercise power far beyond 
their numbers by interrupting the flow of key 
intermediate products and services, thereby 
disrupting production in related industries and 
possibly throughout the whole economy. In 
exceptional circumstances, work stoppages in 
key sectors could create secondary systemic 


damage that causes economic crises and. 


threatens social harmony. Acknowledging 
and extending this understanding of “worker 
power” as “industrial centrality,” Perrone 
(1983, p. 414) contends that many theories 
incorrectly assume that conflict takes place in 
an “amorphous and undifferentiated” eco- 
nomic system where workers in different 
sectors are endowed with roughly equal 
capacities for struggle. With the arguable 
exception of economic segmentation theories, 
no perspective adequately explains the differ- 
entiated power of workers in different sectors 
of the economy. 

Before his untimely death, Luca Perrone 
was developing a promising theory for 
understanding the positional sources of la- 
bor’s power. His aborted effort produced two 
articles that Erik Wright edited into publish- 
able form (Perrone 1983, 1984). These 
articles initiated two fruitful lines of inquiry. 

First, Perrone sought to develop a method 
to measure the positional power of labor that 
is anchored in the interdependencies among 
different:spheres of production. Using stan- 
dard interindustry input-output flows among 
16 Italian industries, Perrone argued that the 
positional power of workers in different 
industries stems from their differing capaci- 
ties to block (through a work stoppage, for 
example) production not only in their own 
industries, but also in industries that serve as 
suppliers and customers. Thus, some workers 
have "disruptive potential" proportionately 
far greater than their actual numbers because 
their influence ripples throughout many indus- 
tries. Workers whose influence is confined to 
their own industry have more limited disrup- 
tive potential. 

Second, he suggested that such a measure 
might be systematically related to various 
labor outcomes such as higher wages and 
propensity to strike. The empirical evidence 
in Perrone's analysis was mixed on these two 
counts. Perrone's measure was positively 
related to average wages, but not related to 
Strike activity. Both results conformed with 
expectations, but the latter finding left open 
two possibilities. On the one hand, the 
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relationship between positional power and 
overt conflict may be curvilinear, not linear 
(Wright 1984). On the other hand, the 
nonsignificant result may be a by-product of 
the crudeness of the positional power mea- 
sure, the strike measure, or poor model 
specification. Thus, while Perrone's analysis 
was hampered by data limitations and by bis 
inability to complete the project before his 
death, his two published papers lay a solid 
conceptual foundation but leave enough 
unresolved questions to warrant further re- 
search. 

Two commentaries by Wright (1984) and 
Griffin (1984) accompanied Perrone's second 
article. They noted several limitations to the 
analysis and suggested possible modifications 
to be incorporated into future research. We 
review only a few of those recommendations 
here. First, they suggested the need to refine 
the measurement of positional power. For 
instance, they encouraged the development of 
a continuous measure of positional power that 
is more sensitive to small variations in power. 
Also, whereas Perrone measured only the 
disruptive potential of workers in supplying 
industries, workers in receiving industries 
may also have this potential. Wright also 
noted several alternative methods of standard- 
izing the positional power variable. Second, 
the commentators suggested remedies to 
several analytical and model specification 
problems apparent in Perrone's analysis. For 
example, Griffin (1984, p. 425) noted 
Perrone's failure to consider the counterbal- 
ancing features of capitalist strength or of 
indicators of the business cycle in his models. 
Perrone also neglected to include measures 
tapping the "industry's ability to pay" or 
unionization. We will discuss these problems 
and our solutions below as they become 
relevant. 

This paper extends the line of inquiry 
started by Perrone. Based upon our reading of 
Perrone's work and the commentaries, we 
develop a more sophisticated measurement 
strategy that yields a three-dimensional con- 
ceptualization of positional power. Next, we 
provide empirical evidence for how positional 
power relates to three domains of critical 
importance to workers: (a) the organizational 
capacity of labor; (b) labor militancy; and (c) 
the economic standing of labor. Then, we 
discuss possible extensions of the logic of the 
positional power approach to other issues 
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within and beyond the social stratification 
literature. 


MEASURING POSITIONAL POWER 


Our measures of positional power are derived 
from a 30 x 30 matrix of interindustry 
input-output flows that was produced by 
aggregating a more detailed matrix available 
from the Department of Commerce. These 
data were available for four years (1963, 
1967, 1972, and 1977) and were matched 
with industry-based data for the same 30 
industries for the same four years. The flows 
in these matrices represent the dollar value of 
goods and services that are exchanged among 
different industries. They thus capture the 
flows only of intermediate goods and services 
and not of goods and services produced for 
the final consumer market. Since these 
absolute values are highly correlated with 
industry size, we considered several alterna- 
tive methods of standardizing the data, some 
of which were suggested by Wright (1984) in 
his commentary.! We eventually settled on 
Perrone’s original strategy of standardizing by 
the number of workers in the industry. This 
seemed more reasonable than any alternative 
we considered since the ultimate purpose of 
the measure is to approximate a per worker 
contribution to industry production.? This 
method also serves as an adequate control for 
industry size. 

Our measurement of positional power 
attempts to overcome several shortcomings of 
Perrone’s original measure. First, Perrone 
created only one measure based on the 
volume of goods and services per worker sent 
“downstream” in the production process, 
assuming that the disruptive potential of 
workers lies exclusively in their ability to 
create shortages in industries that use their 
products. Yet workers also may have disrup- 
tive potential “upstream” in the production 
process by curtailing the demand for products 





l For example, one could standardize the 
financial flows by the number of firms in the 
sending sector or by the total value of output in the 
sending sector. 

? Alternative formulations of this type of 
measure may be better suited for other purposes. 
For example, if one were interested in the 
economic interdependencies among firms in differ- 
ent industries, it might be appropriate to standard- 
ize the input-output flows by the number of firms. 


199 


from supplying industries (Wright 1984). For. 
example, the impact of a major strike in the 
auto industry may create both of these effects. 
In a long strike, the decline in automobile 
production hurts auto retailers and other 
industries that purchase cars. However, such 
a strike will also have a severe impact on 
supplying industries such as primary and 
fabricated metal products, rubber, and glass 
by creating backlogs and oversupplies. We 
therefore argue that the positional power of 
workers is not unidimensional and consider it 
an open question whether workers have more 
disruptive potential “upstream” or “down- 
stream.” We create two measures, the 
“receive vector” and the “supply vector,” to 
represent the upstream and downstream dimen- 
sions of positional power, respectively. 
Second, although Perrone’s measure is 
explicitly designed to measure the threat of 
systemic economic damage, some measure of 
localized damage should also be included in 
the analysis. For example, workers in the auto 
industry may be able to raise their wages 
merely because of the threat of damage they 
can inflict within their own industry. Yet 
Perrone’s conceptualization precludes a direct 
test of this hypothesis. To explore this 
possibility, we create a "self-supply vector" 
that taps the flow of goods and services 
occurring exclusively within each industry. 
Our conceptualization of positional power is 
thus three-dimensional: positional power may 
reside in the ability of workers to disrupt 
production in their own industry ("self- 
supply" vector), in industries upstream in the 


‘process (“receive” vector), or in industries 


downstream in the process (“supply” vector). 

Third, Perrone used what we call a 
"threshold" method of computing standard- 
ized industry flows. Consider for now only 
the supply vector. Perrone assigned a score of 
1 for each industry to which Industry A sent 
more than a threshold level of value (500,000 
lira) per worker, and otherwise assigned a 0. 
To create an ordinal measure of positional 
power for Industry À, he then determined 
scores for the supply vector by summing these 
0-1 values across all 15 industries to which 
Industry A sent goods and services. He then 
transformed this ordinal variable, using natu- 
ral logarithms to create the positional power 
measure. This procedure was repeated for 
each of the 16 industries. $ 

Perrone’s threshold method for construct- 
ing the supply vector can be represented 
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mathematically as we show below. He first 
determined the value of goods and services 
per worker in the sending industry that was 
sent to each of i industries using the following 
formula: 


t = (S/Ws) 


where t; is the value of goods and services 
per worker in the sending industry supplied to 
the ith industry; S; is the value of goods and 
services supplied by the sending industry to 
the ith industry; W, is the number of workers 
in sending industry s. Next, he decided 
whether this value was above or below a 
predetermined threshold value (in his case 
500,000 lira). Then, :; was converted to a 
dichotomous variable (Tj) scored 1 if above 
the threshold value and O if below the 
threshold value. In the next step, the 
threshold values were summed and trans- 
formed by natural logarithms to create the 
supply vector for each sending industry. 
Mathematically, 


k 
SV, = INÈ T) + D 


i=l 


where SV, is the supply vector for sending 
industry s; T; is the dichotomous variable 
derived from Equation (1). These values are 
summed across k industries before being 
transformed by natural logarithms. (The “+ 
1” in the formula avoids the log transforma- 
tion of a O value, which is undefined.) This 
operation is repeated to create a supply vector 
score for each industry in the analysis. 

The problem with such a construction, 
aside from being only a “pseudocontinuous” 
measure, is that it is insensitive to variations 
far below and far above the threshold level. 
To remedy this problem, we propose a 
“logarithmic” method for computing the 
supply vector that has the form: 


k 
SV, = Y, LN(S/W,) + 1) 


i=] 


where SV, is the supply vector for sending 
industry s; S; is the value of goods and 
services supplied by the sending industry to 
the ith industry; W, is the number of 
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employees in sending industry s.? The value 
within the parentheses is transformed by 
natural logarithms and then summed across k 
industries. Thus, using the logarithmic method, 
we first derive the value per worker sent to 
each industry, take the natural logarithms of 
these values, and sum the logarithms. 

Both Perrone's method and our method 
have analogous forms to derive the receive 
vector (although Perrone did not consider this 
in his original work). For example, using our 
logarithmic method, the receive vector is 
computed as 


k 
RV, = Y, LN (RIW,) + 1) 


inl 


where RV, is the receive vector for receiving 
industry r; R, is the value of goods and 
Services received by the receiving industry 
from the ith industry; W, is the number of 
employees in receiving industry r. i 
The logarithmic method has two desirable 
effects. First, it creates a true continuous 
measure that utilizes more of the information 
in the input-output flows. Second, it gives 
greater weight to industries that send some 
goods and services to every other industry 
than to industries that send a large volume to 
a single industry. Consider two industries, A 
and B, in a 30 x 30 matrix, each of which 
sends $29,000 of goods and services per 





3 One reviewer of this paper suggested that the 
supply vector should be standardized by the 
number of workers in the industry to which goods 
and services are supplied. His or her reasoning was 
that damage should be gauged on the basis of 
workers in the affected industry. We disagree with 
the reviewer on this point. Since our purpose is to 
construct a measure that reflects the power of 
workers in the supplying industry, we feel it should 
be standardized by the workers in the supplying 
industry. However, this conceptual disagreement is 
moot when one considers the empirical differences 
between our measure and the one suggested by the 
reviewer. When we compared the supply vector 
suggested by the reviewer with our own, we found 
that they were correlated at .95. When we 
reestimated our models substituting the reviewer’s- 
version of the supply vector for our own, the 
substantive differences were virtually nil. None of 
the models showed an improvement in the 
performance of the supply vector, and in only one 
case (organizational effort) did a previously 
significant effect turn nonsignificant. 


POSITIONAL POWER OF AMERICAN LABOR 


worker to other industries. However, Industry 
A sends all of that value to a single industry 
whereas Industry B sends an equal value 
($1,000) to each of the other 29 industries. 
By our logarithmic method, Industry A will 
have a supply vector score of LN(29,000 + 
1) = 10.28; Industry B will have a supply 
vector score of 29 x LN(1,000) + 1) = 
200.35. Thus, the logarithmic method will 
correctly assign greater disruptive potential to 
workers in industries that send a moderate 
amount of goods and services per worker to 
many industries.* 

The self-supply vector is computed in an 
analogous fashion, except that it involves 
only one cell entry per industry. Thus the 
formula for the self-supply vector is 

SSV; = LN((S/W,) + 1) 

where SSV, represents the self-supply vector 
for the ith industry; S; is the value of goods 
and services in the ith industry; W; is the 
number of workers in the ith industry. 
Transformation using natural logarithms puts 
the self-supply vector on the same metric as 
the supply and receive vectors. 

In the analysis below, we are primarily 
interested in the impacts of the supply and 
receive vectors on the various outcomes 
because, like Perrone, we feel that positional 
power rests mainly on workers’ ability to 
inflict “surplus damage” outside their own 
industries. These two dimensions tap the 
potential systemic impacts of workers’ ac- 
tions. Secondarily, we are interested in the 





*Two reviewers of this paper questioned 
whether the logic of our method might be 
reversed—that is, whether one should attribute 
more disruptive potential to workers who can 
inflict much damage to a single industry than to 
those who inflict some damage to many industries. 
We believe this alternative specification fails to do 
justice to the principle of systemic damage which 
we are trying to tap. Like Perrone, we feel that 
systemic damage rests on the principle that 
workers can cause “economic or social injury 
disproportionate to the subjective cost of undertak- 
ing it” (Perrone 1984, p. 414). If we hold constant 
the absolute level of damage that is created, the 
disproportionalities of systemic damage are maxi- 
mized when more actors in the system are hurt by 
it. Our measure thus attributes more power to 
workers whose impact touches more sectors in the 
economy than those who affect only a single 
industry. 
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impact of the self-supply vector, which taps 
the potential localized impact of workers' 
actions on the flow of production within their 
own industries. We expect this vector also to 
influence labor outcomes. 


DATA AND METHODS 


The data in this analysis are limited by the 
availability of the Department of Commerce 
input-output flow matrices published for 
1963, 1967, 1972, and 1977. Having aggre- 
gated these data into the 30 x 30 form 
consistent with the 30 major industries in 
Table 1, we then matched a set of industry 
variables from secondary published sources 
for each of these years. All these variables 
represent industry-specific indicators of firm 
strength, business cycles, and other variables 
suggested by Griffin (1984). From these data 
we constructed a pooled, cross-sectional data 
set with 120 cases (30 industries X 4 time 
points). All variables were measured at time ¢ 
except the dependent variables which we 
constructed as three-year moving averages at 
times t-- 1, t+2, and t+3. This method of 
constructing the dependent variable follows a 
suggestion by Perrone (1984, p. 419) and 
comments by Griffin (1984, p. 426) and 
accomplishes two things. First, it smooths out 
potentially idiosyncratic and volatile fluctua- 
tions in the dependent variable that may be 
apparent in a single year. Second, it reduces 
the likelihood of simultaneity bias in the 
specification of the models by allowing all the 
independent variables in the model to precede 
in time the dependent variable. 

We consider seven dependent variables in 
this analysis, two each for organizational 
capacity and economic standing outcomes and 
three for labor militancy outcomes. First, to 
measure organizational capacity of labor, we 
use industry-specific measures of (a) the 
percentage of workers unionized and (b) the 
percentage of workers involved in NLRB 
union representation elections. The former is 
a conventional measure of union density and 
indicates the extent of actual organization in 
an industry. The latter measure is an index of 
organizational effort—that is, the extent to 
which workers in an industry have mobilized 
sufficiently to warrant an NLRB representa- 
tion election and make themselves eligible for 
union membership. We expect the upstream 
and downstream dimensions of positional 
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power to be positively associated with both 
these measures of organizational capacity. 

Second, we measure labor militancy with 
industry-specific measures of (a) strike fre- 
quency, (b) average strike duration, and (c) 
strike volume. Strike frequency, the number 
of strikes per 1,000,000 workers in an 
industry, is a conventional indicator of 
propensity to strike (Hibbs 1976). Average 
duration, the number of persondays lost per 
striker, measures the length of strikes. Strike 
volume represents the number of persondays 
lost to strikes per worker and approximates 
the percent of total working time lost to 
strikes. Thus, this measure is the best single 
indicator of the actual economic damage of 
strikes. Further, strike volume is closest to the 
measure Perrone used in his analysis of 
personhours lost as a result of strikes.5 
Perrone (1984) had no clear expectation of the 
relationship between positional power and his 
measure of strike activity, arguing only that 
potential and actualized conflict are not 
conceptually or empirically identical. We 
expect a positive relationship, however, 
reasoning that workers who hold positional 
power will be more likely to exert it. Thus, 
we expect positional power to stimulate both 
strike frequency and strike volume. Since so 
little research has been done on the determi- 
nants of strike duration, we are noncommital 
on expectations for this variable. On the one 
hand, workers wielding positional power 
might be able to shorten the length of 
conflicts in which they are involved. On the 
other hand, positional power may give 
workers more staying power to withstand the 
hardship of strikes, causing strikes to get 
. longer. Workers with positional power may 
realize that to be successful, strikes must 
extend long enough to inflict costs on other 
industries. 

Third, we measure economic standing of 
labor with industry-specific measures of (a) 
average wages and salaries and (b) average 
fringe benefits. The former measure, which 
was used by Perrone (1984), is the most 
direct industry-level indicator of economic 





? We do not use Perrone's original strike 
measure because the number of personhours lost to 
strikes is not standardized by the number of 
workers or the total amount of working time 
available in the industry. As a result, industry 
measures of this variable are likely to be highly 
correlated with industry size. 
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rewards to workers. Notwithstanding the fact 
that salaries of managerial people and execu- 
tives are included in the numerator, the 
measure overwhelmingly reflects the eco- 
nomic conditions of workers. Similarly, the 
average fringe benefits measure represents the 
total nonwage and salary portion of the wage 
bill and includes such benefits as pensions, 
health insurance, and sick leave. Because this 
is the fastest growing portion of the wage bill, 
we consider it separately from wages per se. 
We expect our measures of positional power 
to be positively associated with both indica- 
tors of economic standing. 

Both Wright and Griffin noted in their 
commentaries several possible specification 
problems in Perrone's original analysis. It is 
beyond our ability to attend to all of the 
commentators' recommendations in one pa- 
per, but we do try to address the most salient 
ones. First, we include in our analyses a 
battery of relevant controls that could argu- 
ably compete with positional power as 
explanations for the dependent variables. 
These variables include indicators of capitalist 
strength, the business cycle, unionization, 
and dummy variable controls for aggregate 
conditions. Our basic model is as follows: 


DP = bọ- + b,*1963 Dummy + b2*1967 
Dummy + 63*1972 Dummy 
+ b,*Sales/Inv + bs*Change Employ 
+ be*Change Profits + b7*Concentration 
+ bg*Avg Assets + bo*Avg Estab Size 
+ b,o*Union Density + 5,,*Supply 
Vector +  bj*"Receive Vector + 
bi3*Self-Supply Vector + e 


Where DP represents the dependent variable; 
1963 Dummy, 1967 Dummy, and 1972 
Dummy represent dummy variables for three 
of the four panels in the analysis (see below); 
Sales/Inv is the sales/inventory ratio, an 
indicator of industry-specific business cycles; 
Change Employ is the five-year change in 
industry employment; Change Profits is the 
five-year change in profits per personhour 
worked; Concentration is the percentage of 
corporate sales held by the industry’s largest 
firms; Avg Assets is the average dollar value 





' é Longitudinal measures of concentration for all 
industries in the economy have not been developed 
previously. Past researchers have, however, devel- 
oped cross-sectional measures from a variety of 
sources. One method that has been used is to take 
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of assets per firm in the industry; Avg Estab 
Size is the average number of employees per 
establishment in the industry; Union Density 
is the percentage of industry work force 
belonging to unions; and Supply Vector, 
Receive Vector, and Self-Supply Vector are 
the three positional power measures described 
above. The sales/inventory ratio, the change 
in employment, and change in profits tap 
dimensions of the business cycle in each 
industry. Concentration, average assets, and 
average establishment size represent various 
dimensions of capitalist strength in the 
product market and may also serve as 
resources for workers (Hodson and Kaufman 
1982). Union density is a key indicator of 
labor’s strength in contests with capital. 
Finally, the three dummy variables serve as 
proxies for the impacts of aggregate-level 
variables such as inflation and unemployment 
and/or secular trends in the dependent 
variable.” Together, these variables represent 


the percent of corporate sales or assets held by 
“large firms" (i.e., firms above a certain size 
threshold in terms of sales or assets) as a measure 
of concentration. This method works fine in 
cross-sectional analyses but not so well in 
longitudinal designs. Simply as a result of 
inflation, more firms are likely to cross the 
threshold that places them in the "large firm" 
category, thus biasing the measure upward over the 
years. In developing our concentration measure, 
we adapted the preceding strategy to our longitudi- 
nal data with the help of z-scores. In each of the 
four years in our panel design, we computed the 
percent of corporate sales held by firms with (a) 
greater than $100,000,000 in assets and (b) greater 
than $250,000,000 in assets. We then created 
z-scores of these two measures and averaged the 
z-scores. After repeating this sirategy for each year 
in the analysis, we then entered the z-scores in the 
pooled analysis. This method accurately captures 
the cross-industry component of variation in 
concentration within each year, but may distort the 
cross-panel component of variation. Fortunately, 
for a measure such as concentration the cross- 
industry component of variation is by far much 
larger than the cross-panel component. (There is 
much more variation in concentration across 
industries than within the same industry over 
time.) We are thus confident that our measure 
captures the salient dimensions of market concen- 
tration. 

7 Griffin's (1984) commentary mentioned the 
need to control for aggregate economic factors 
such as unemployment and inflation. Of course, in 
any given year, aggregate indicators are constant 
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likely alternative explanations for the class- 
relevant outcomes that we investigate. 

To this basic model, we make slight 
modifications for some dependent variables. 
In the union density model, we drop union 
density as a regressor and add strike fre- 
quency. Recent research: has suggested that 
the collective action involved in launching 
strike actions may spur union growth in some 
contexts (Rubin et al. 1983; Wallace, Rubin, 
and Smith 1988). In the three labor militancy 
models (strike frequency, duration, and vol- 
ume), we add a dummy variable for the 
mining industry since this industry is notori- 
ously much more strike-prone than other 
industries. All these measures (except the 
dummies) are constructed as ratios, not 
absolute levels, thus serving as proximate 
controls for industry size. All measures 
involving dollar values in either the numera- 
tor or denominator were converted to constant 
dollars before the analysis to minimize the 
statistical impact of inflation on the measure- 
ment of the variables (the time dummy 
variables, of course, control for whatever 
effect inflation may exert causally on the 
dependent variables). In the Appendix, we 
report further details about all variables and 
their sources. 

Another limitation to Perrone's original 
analyses is that he had only 16 cases at one 
point in time, meaning he had limited degrees 
of freedom to experiment with complex 
model specifications. Our analyses employ a 
pooled, cross-sectional and longitudinal de- 
sign in which four cross-sections (or panels) 
of 30 industries each are included in the same 
analysis, thus yielding a total of 120 
observations.? Following Hannan and Young's 
(1977) suggestion, we first conducted analy- 
ses on each panel separately and determined 
that the pattern of relationships among our 
variables was fairly stable across panels.? 


for all industries in a given panel. In a pooled 
analysis such as this one, the dummy variable 
approach serves as an adequate proxy for these 
factors. 

3 There is a slight violation of the assumptions 
of this design in that the time intervals between our 
four panels (four, five, and five years) are not 
identical. However, this does not seem to be an 
egregious violation. When we reestimated our 
models excluding the 1963 panel, none of the 
substantive results were affected. 

? We also performed more formal tests for 
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Pooling the four panels increases the degrees 
of freedom, thus allowing for the exploration 
of a larger number of hypotheses, and 
generates more efficient parameter estimates. 
It is usually risky to estimate models with 
ordinary least squares regression with this 
type of data set because of the possibility of 
autocorrelation and heteroskedasticity. There- 
fore, we utilize a generalized least squares 
procedure to estimate a cross-sectionally 
heteroskedastic and time-wise autoregressive 
model (see Kmenta 1971, pp. 508-14) to 
correct for this problem. Probably because we 
have only four time points in our series, we 
were unable to estimate separate autoregres- 
sive parameters for each industry. We thus 
imposed the restriction that all industries must 
have the same autoregressive parameter, a 
slight modification to the basic Kmenta model 
(1971, p. 512, model 12.32). Thus, in 
estimating this model, we assume we are 
tapping a fairly stable structural process that 
was present in the American political econ- 
omy throughout the period 1963-1977 (see 
Freidland and Sanders 1985, for another 
example of this metbod). 

Because of the relatively small number of 
cases, we show three levels of statistical 
significance up to and including the .10 level. 


statistical interaction induced by differences in the 
slopes across each panel by comparing the 
explained variances of “fully saturated" interactive 
equations with those obtained by the far more 
parsimonious "main effects only" equations. The 
“saturated” equations universally explained more 
variance than the simpler models, of course, but 
generally explained less variance after the R?s were 
corrected for the loss of degrees of freedom 
induced by the inclusion in the equation of forty to 
forty-eight interaction terms. There are two clear 
exceptions to this generalization, strike frequency 
and strike size. Almost all of the significant 
interaction in the strike frequency equation, 
however, was due to the (for our purposes here, 
substantively uninteresting) interaction of panel 
date and the mining dummy variable. With those 
particular interactions removed from the saturated 
frequency equation, the adjusted R? falls below 
that estimated by the “main effects" equations. 
Inclusion or exclusion of those interactions, 
moreover, did not alter the substantive conclusions 
generated with either equation. The determinants 
of strike size appeared to differ so substantially 
across panels that the estimation of a "pooled" 
equation made little sense, and so we do not 
present any results for this indicator of labor 
militancy. 
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Our analysis is primarily exploratory in 
nature. Therefore, despite the fact that we 
have expectations about some of our variables 
(particularly the supply and receive vectors), 
we utilize two-tailed tests of significance. We 
utilize Buse's R square to determine the 
proportion of explained variance in the pooled 
GLS models because it has been shown to be 
appropriate for estimating goodness-of-fit in 
such models (see Buse 1973). 

We note that there are several possible 
criticisms of Perrone's original analyses 
which we do not address in this paper. 
Notably, we do not attempt to specify 
nonlinear or interactive relationships among 
our independent variables, some of which 
were discussed by Wright (1984) in his 
comment on Perrone's article. We have a 
more limited agenda in this paper to 
construct a workable conceptualization and 
measurement strategy for the positional power 
of labor and to establish its relationship to a 
range of labor outcomes. In future research, 
we intend to explore alternative model 
specifications and applications of the posi- 
tional power concept in more detail.!? Also, 
the reader should note that although extensive 
literature exists for each of the three substan- 
tive areas we investigate below, we do not 
discuss that literature at length. Our main 
focus in this paper is to discern the contribu- 
tion of a new set of variables—the positional 
power measures —to an explanation of labor's 
militancy, organizational capacity, and eco- 
nomic standing. 


ANALYSIS 
Validating the Measure 


Using the logarithmic method described 
above, we first computed the supply, receive, 
and self-supply vectors for our 30 industries 


10 We did, however, perform considerable 
exploratory analyses of the interactions and 
nonlinearities theoretically identified by Wright 
(1984) and Griffin (1984). We did find evidence of 
some of these effects, but their significance was 
heavily conditioned by equation specification and 
panel year. Moreover, the substantive interpreta- 
tion of those that appeared more robust was murky 
because of excessive collinearity among the 
interactive and nonlinear variables themselves. 
Thus, given our purposes here, we decided to 
present only the linear and additive effects. 
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for each of the four years in the analysis. In 
Table 1, we show the actual supply, receive, 


and self-supply vectors from the 1977 input- 


output matrix (values for other years are 
available from the authors upon request). To 
repeat, these data include only the exchange 
of goods and services among industries and 
not production for the final consumer market. 
While production for consumers is also 
important, we. focus here only on the 
overlooked dimension of interindustry ex- 
change as a basis of positional power. Since 
production for the consumer market is not 
reflected in these measures, the receive vector 
is typically larger than the supply vector. The 
values for the transportation equipment indus- 
try (autos, etc.) are particularly striking in 
light of our earlier discussion. The receive 
vector is over 24 times larger than the supply 
vector in this industry. All this suggests that 
when one considers the production process in 
toto, workers may have more disruptive 
potential “upstream” than “downstream” in 
the process. The self-supply vector is always 
‘smaller than either of the two others simply 
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because it is constructed from only a single 
cell in the 30 x 30 matrix. 

We have thus far argued for the conceptual 
independence of these three dimensions of 
positional. power, but we have yet to 
demonstrate this empirically. We address this 
issue in Panel A of Table 2, where two sets of 
correlations are presented. The first is the 


average of the four panel-specific bivariate ` 


correlations (i.e., one correlation each for the 
1963, 1967, 1972, and 1977 panels) among 
the supply, receive, and self-supply vectors. 
The second set is the “pooled” correlations 
among these three vectors across panels (i.e., 
the correlations obtained when we use all 120 
cases as if the data had no time dimension). 
Both sets of correlations suggest extremely 
low associations; indeed, the largest single 
correlation constituting the average is only 
.234, that between the supply and receive 
vectors in 1967 (not shown). The correlations 
suggest near orthogonality among tbe three 
positional power measures; empirically, as 
well as hypothetically, the three dimensions 
do not seem to overlap. 


Table 1. Computed Supply/Receive Vectors for 30 Industries, 1977 


SIC 

Code Industry 

01 Agriculture 

10 Mining 

15 Construction 

20 Food 

21 Tobacco 

2 Textiles 

23 Apparel 

24 Lumber and wood products 
25 Furniture 

26 Paper products 

27 Printing and publishing 
28 Chemicals 

29 Petroleum products 

30 Rubber products 

31 Leather products 

32 Stone, clay, and glass 

33 Primary metals 

34 Fabricated metals 

35 Nonelectrical machinery 
36 Electrical machinery 

37 Transportation equipment 
38 Professional instruments 
39 Miscellaneous manufacturing 
40 "Transportation 

48 Communications 

49 Utilities 

50 Wholesale trade 

52 i Retail trade 

60 Finance, insurance, real est. 
70 Services 


Supply Receive Self-Supply 
57.66 120.01 8.94 
84.48 . 135.97 7.73 
95.94 119.04 1.74 
81.76 141.76 9.56 
118.13 128.91 10.30 
102.27 122.32 9.04 
83.74 113.67 8.60 
108.17 134.44 9.22 
21.22 143.71 5.33 
140.45 144.00 9.36 
91.91 119.48 7.96 
147.23 155.85 9.60 
169.48 164.38 9.14 

“145.81 143.55 7.34 
87.08 123.13 8.15 
114.97 : 141.44 8.13 
107.63 150.73 9.62 
110.56 141.29 7.79 
107.07 138.24 8.56 
98.00 140.13 8.61 
60.54 158.64 9.36 
104.36 141.59 7.42 
117.97 153.23 7.48 
135.14 122.40 8.14 
121.19 88.46 6.54 . 
148.62 124.61 9,81 
117.50 97.57 6.05 
71.66 83.24 3.97 
122.62 99.07 9.17 
103.38 121.57 7.51 
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Table 2. Zero-Order Correlations Among Three Posi- 
tional Power Variables 1963-1977 


Panel A. Panel-Specific Correlations Among the 


Three Positional Power Measures 
Average Correlation for Four Panels: 
Supply Receive  Self-Supply 
Supply — .175 .153 
Receive — .062 
Self-Supply f - 
Pooled Correlations Across Panels: 
Supply Receive — Self-Supply 
Supply _ .193 .021 
Receive — 071 
Self-Supply — 
Panel B: Cross-Panel Correlations of Positional 
Power Measures With Themselves 
Panels Supply Receive — Self-Supply 
1963-67 .989 976 .994 
1967-72 .760 .782 .994 
1972-77 .995 .994 .996 


We view these interindustry exchanges as 
relatively enduring features of the economy 
that change gradually and only with large- 
scale changes in industrial organization. If we 
find little stability from year to year in 
industry inflows and outflows, we would 
suspect our measurement strategy is flawed. 
In Panel B of Table 2, we present correlations 
for the supply, receive, and self-supply 
vectors with themselves for temporally adja- 
cent panels. The correlations are uniformly 
high; the correlations for all three vectors 
approach unity for all comparisons except 
between 1967 and 1972. Even here, the 
instability is quite localized. One industry 
accounts for the lowered supply vector 
correlation. (With that industry, tobacco, 
removed, the correlation increases to .94). 
Only three industries (construction, wholesale 
trade, and finance, insurance and real estate) 
cause the lowered receive vector correlation 
(removing those industries raises the correla- 
tion to .95). We conclude from Panel B that, 
despite some localized instability in one 
period, our positional power measures gener- 
ally tap a stable structural dimension of 
industrial differentiation, at least as that is 
assessed over the 15 years of our study. 


Organizational Capacity 
Much sociological research has focused on 
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the impact of organizational capacity of 
workers on a variety of outcomes such as 
earnings. But little research has explored the 
structural underpinnings of organizational 
capacity. Do workers in strategically located 
industries have an advantage over other 
workers in their ability to organize collec- 
tively? Does the threat of disrupting produc- 
tion and inflicting economic loss beyond the 
localized sphere of production put pressure on 
employers to accede to workers’ demands for 
unionization? If positional power is present 
and mutually recognized by labor and capital, 
it should create a basis on which labor could 
build a strong trade union movement. Whether 
this power is ever wielded should not 
particularly affect its utility as an instrument 
of labor organization as long as both parties 
acknowledge its presence. Hence, we expect 
that our measures of positional power should 
be positively associated with both actual 
levels of organization and potential for 
organizing the unorganized within an indus- 
try. Actual levels of union organization are 
measured directly by union density. Potential 
union organization is measured by our 
organizational effort variable, the percentage 
of industry workers who petition the National 
Labor Relations Board for union recognition. 
In Table 3 we present models estimating 
determinants of these two variables. 

Turning first to the determinants of union 
density, several of the controls are statisti- 
cally significant. All three panel dummy 
variables, for example, are significant with a 
pattern corresponding to the well-known 
secular decline in union organization through- 
out the period of the analysis. Density is 
negatively affected by change in employment. 
Neither of the other two business cycle 
variables is statistically significant. Union 
density is positively related to both concentra- 
tion and average establishment size, but not 
related to average assets per firm. The two 
significant effects (concentration and establish- 
ment size) suggest that union density flour- 
ishes within large, oligopolistic firms. 

Bearing more directly on our concerns, the 
supply vector significantly enhances union 
density (p < .10), but the receive vector 
exerts no significant influence on union 
density. When we compare the standardized 
regression coefficients, the supply vector is 
not among the strongest in the table. Indeed, 
it appears that the supply vector effect is 
vitiated by the inclusion of the concentration 
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Table 3. Determinants of Organizational Capacity, 1963-1977 











Union Density Union Org. Effort 

b B b B 
1963 Dummy .031*** .089 3,87*** .142 
1967 Dummy .024** .067 .585 .021 
1972 Dummy .017** 048 - -.172 ~ 006 
Sales/Inv. ratio ~ 1.40E-5 — .004 — 027 — 090 
Change employ. —.075* — .054 5.59 .052 
Change profits 8.76E-5 .024 — .012 — .040 
Concentration .046*** .296 —1.67 —.138 
Avg. assets —1.36E4 —.045 — .044* -.190 
Avg. estab. size 7.30E-4A*** .290 .068*** 347 
Strike frequency 1.07E-5 .013 _ — 
Union density — — 2.98 .039 
Supply vector 4.05E-4* .088 .061* .170 
Receive vector 6.51E-4 .088 .183*** 319 
Self-supply vector — 012*** — 263 —.165 — 045 
Constant .192 -21.11 
Rho .868 462 
Buse's R square .378 .679 


* Significant at p<.10, two-tailed test. 
** Significant at p<.05, two-tailed test. 
*** Significant at p<.01, two-tailed test. 


variable (these two variables are correlated at 


.39, suggesting that the supply vector effect - 


operates partly through market concentra- 
tion). When concentration is not included in 
the model, the supply vector is somewhat 
stronger and more significant. Thus, it 
appears that workers' potential for disrupting 
production “downstream” in the production 
process is a somewhat important determinant 
of union organization. The negative sign of 
the self-supply vector is difficult to interpret. 
Perhaps it indicates that high volumes of 
within-industry economic exchange blunt 
. Whatever impact is created by linkages with 
outside industries and thus inhibit unioniza- 
tion. 

Analysis of the determinants of union 
organization effort is also instructive. The 
1963 dummy is significant, indicating a sharp 
falloff in organizational effort after the 
mid-1960s. None of the three business cycle 
variables is statistically significant. However, 
among the capitalist strength variables, aver- 
age assets per firm is negative and statistically 
significant, and average establishment size is 
positive and significant. These results are 
theoretically consistent. Industries large in 
economic scale are likely to have the 
hecessary resources to fend off and discour- 
age attempts at union organization. Some of 
these industries are already highly unionized 
and offer little opportunity for further union- 
ization. However, if we hold these factors 


constant, unions are still likely to target 
industries with large establishments because 
such industries offer the highest potential 
payoff for their efforts (Griffin et al. 1986). 
Curiously, the union density measure is 
positive, but not significant, as a predictor of 
union organization effort. 

Here we find even stronger support for our 
positional power thesis. Both the supply and 
receive vectors are positive and statistically 
significant as expected. Among all variables 
in the model, the receive vector has the 
second largest standardized coefficient. This 
suggests that while the organizability of labor 
is associated with labor’s ability to disrupt 
production both “upstream” and “down- 
stream” in the process, the former is more 
important. The self-supply vector is not 
significant, thus yielding the first confirma- 
tion for one of Perrone’s implicit hypotheses: 
the consequences of labor’s positional power 
hinge significantly on the threat of surplus 
damage to other industries, not localized 
damage. 


Labor Militancy 

As noted earlier, the expectations about the 
impact of positional power on labor militancy 
are unclear. Traditionally, research on the 
determinants of strike frequency has assumed 
that when the timing is right, labor will use 
the strike weapon to advance their economic 
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and noneconomic interests (Hibbs 1976). 
Perrone did not find this result in his Italian 
research. It should be noted, however, that 
Italian strike patterns are fairly unique among 
Western industrial nations and may defy 
conventional explanations (Shorter and Tilly 
1974; Snyder 1975). However, Wright of- 
fered the alternative explanation that the 
relationship between positional power and 
strike activity may be curvilinear. Space 
considerations prevent us from exploring 
extensively nonlinear and/or interactive model 
specifications in this paper (but see note 10). 
Instead, we test only for the possibility of the 
linear relationship, expecting a positive asso- 
ciation between our positional power vari- 
ables and both strike frequency and volume. 
As for strike duration, our expectations are 
mixed, and the paucity of previous research 
on this outcome gives little guidance. In 
Table 4 we show the results from our analysis 
of the determinants of labor militancy. 
Several of the controls in the strike 
frequency model are statistically significant 
and in a plausible direction. None of the panel 
dummy variables is significant, suggesting no 
discernible trend in the dependent variable 
after other variables are considered. All three 
business cycle variables (the sales/inventory 
ratio, change in employment, and change in 
profits) are statistically significant, supporting 
the frequently observed connection between 
strikes and the business cycle (Ashenfelter 


Table 4. Determinants of Labor Militancy, 1963-1977 


Strike Frequency 
b B 

1963 Dummy — 1.02 — .003 
1967 Dummy — 1.09 — .003 
1972 Dummy 2.55 .007 
Sales/Inv. ratio —.213** — .053 
Change employ 143.65*** .097 
Change profits -.122* —.031 
Concentration — 10.43** — .063 
Avg. assets .053 017 
Avg. estab. size — ,229*** — .085 
Union density 198.06*** .189 
Mining dummy 677.23*** -149 
Supply vector —.078 —.016 
Receive vector ].75*** 223 
Self-aupply vector —.73 — .015 
Constant — 183.79 

Rho —.215 

Buse's R square .819 


* Significant at p<.10, two-tailed test. 
** Significant at p<.05, two-tailed test. 
*** Significant at p<.01, two-tailed test. 
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and Johnson 1969). In addition, strikes are 
negatively associated with concentration and 
average establishment size, suggesting that 
these indicators of capitalist strength deter 
expressions of labor unrest. This supports 
critics of Perrone's original work who sug- 
gested that these variables are necessary to 
specify the model correctly. As expected, 
strike frequency is positively associated with 
union density and the mining dummy. 

Both the supply and self-supply vectors are 
negative but not significant, the former 
finding being consistent with Perrone's Italian 
analyses. But Perrone did not consider the 
impact of the receive vector which, as we 
showed in our illustration from the automo- 
bile industry, may be quite relevant. Consis- 
tent with our expectations, the receive vector 
is positively associated with strike frequency 
and statistically significant. Thus, potential 
systemic damage upstream in the production 
process appears to be an important factor 
predicting the frequency of strikes. 

In the strike duration model, all three panel 
dummy variables are negative and significant. 
The pattern among these variables suggests a 
secular trend for strikes to get longer during 
this period. Strike duration is positively 
associated with changes in employment, but 
not related to either of the other two business 
cycle variables. Duration is positively associ- 
ated with concentration and negatively associ- 
ated with average assets. Union density is not 





Strike Duration Strike Volume 
b B b B 
—13.25*** — 635 — 243*** —.110 
—10.76*** ~ 515 — .005 ~ 002 
—7.08*** — .339 — .066 — .030 
—.021 —.093 —9.27E-5 — .004 
15.39*»* .187 1.79*** .204 
—.005 -.022 9.47E-4 .041 
2.4]*** .260 .069 .070 
— .028* — 158 —.001 —.061 
—.021 —.141 6.16E-4 .039 
—2.90 ~ .050 2.23*** .358 
— 8.06* —.160 1.96*** .366 
.012 .042 —.001 — 051 
.117*5** 266 .007*** .154 
.448** .161 .019 .065 
10.81 —.973 
.261 .425 
.646 .634 


POSITIONAL POWER OF AMERICAN LABOR 


significant, but the mining dummy variable is 
negative and significant, probably owing to 
the number of short wildcat strikes that occur 
in mining. . . 

Of the three positional power variables, the 
receive and self-supply vectors are both 
positive and significant. The supply vector is 
not significant but, importantly, it is rendered 
nonsignificant by the inclusion of the concen- 
tration variable. It seems that the effect of the 
supply vector variable is explained through 
concentration. Other than the panel dummy 
variables, the receive vector has the largest 
standardized regression coefficient in the 
model. These results, then, support the view 
that positional power gives workers staying 
power in protracted conflicts with employers. 

Finally, we turn to the model estimating 
strike volume, a variable which approximates 
the total amount of labor power withheld. The 
volume variable is also closest to Perrone's 
strike measure of total personhours lost as a 
result of strikes. Among the panel dummy 
variables, only the negative 1963 variable is 
statistically significant. The change in employ- 
ment variable is positive and significant, but 
neither of the other business cycle variables is 
significant. None of the variables tapping 
capitalist strength is significant. As with tbe 
strike frequency model, the union density 
variable and the mining dummy variable are 
both positive and significant. 

The receive vector has a positive, signifi- 
cant impact on strike volume, displaying a 
pattern consistent with results for the other 
two strike activity variables. The standardized 
regression coefficient is relatively small 
compared to other significant variables in the 
model. Consistent with earlier results, the 
supply vector is not significant. The self- 
supply vector is not significant either. Over- 
all, these results suggest that strike activity is 
linked primarily to the disruptive potential 
workers hold with industries upstream in the 
production process. Our brief discussion of 
the disruptive potential of workers in the auto 
industry is instructive here. Because strikes in 
major industries can frequently be antici- 
pated, industries downstream in the process 
can prepare for them by stockpiling needed 
supplies from the struck industry. It is 
virtually impossible, however, for industries 
upstream in the process to make analogous 
preparations. In fact, supplying industries 
may face a decline in demand even before 
strikes occur as employers in the struck 
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industry anticipate diminished need for suppli- 
ers' goods and services. 


Economic Standing 


Ultimately, positional power may be impor- 
tant to workers because it translates into 
additional earnings power. Perrone's analysis 
suggested that this was true for Italian 
workers, but model specification problems 
undermined confidence in his results. The 
controls in our model represent plausible 
alternative explanations for higher earnings 
and should thus alleviate much of this 
concern. Our analysis of economic standing is 
shown in Table 5. Both of the dependent 
variables in this table—average wages and 
average fringe benefits—are represented in 
constant 1977 dollars and thus have been 
purged of any inflationary bias. 

Several of the controls are significant 
determinants of average wages. All three 
dummy variables are negative and significant, 
indicating a secular increase in real earnings 
throughout the period. Of the three business 
cycle variables, only the change in industry 
employment is positive and significant. All 
tbree indicators of capitalist strength are 
significant determinants of wages: concentra- 
tion and assets size are positive, and average 
establishment size is negative. As one would 
expect, the union density variable is a positive 
and significant determinant of wages. All 
these relationships (except perhaps the estab- 
lishment size effect) are plausible and consis- 
tent with previous research. 

Turning to the variables of most theoretical 
interest, both the supply and receive vectors 
are positive and significant determinants of 
average wages. Of the three, the receive 
vector has a slightly larger standardized 
regression coefficient, but not the largest 
among all variables in the model. These 
results suggest that Perrone's finding in the 
Italian data, that wages are positively linked 
to the supply of goods and services to other 
sectors, holds for the United States also. 
However, receipt of goods and services from 
other sectors is also critical to the determina- 
tion of wages. 

A similar pattern is revealed in the model 
of determinants of average fringe benefits. 
The same assortment of control variables is 


- Significant except the change in employment 


and change in profits variables. However, 
unlike the results for average wages, only the 
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Average Wages Average Fringe Benefits 
b B b B 

1963 Dummy —1.31*** —.190 -]1.27*** —.371 
1967 Dummy —.909*** —.131 —]1.10*** — .322 
1972 Dummy —.273*** — .040 — 531 *** — 155 
Sales/Inv. ratio —1.33EA — .010 —8.68E-4 —.023 
Change employ 1.6359 .060 221 .016 
Change profits 6.87E-5 .001 9.84E-5 .003 
Concentration 1.06*** .345 .616*** .405 
Avg. assets .005*** .083 005*** .186 
Avg. estab. size — .006** —.112 —.002*** —.091 
Union density 4.28*** 221 1.53*** .160 
Supply vector .O11*** 121 .002 .038 
Receive vector .026*** 179 .010*** .139 
Self-supply vector .072 .078 .039*** .085 
Constant 6.28 455 

Rho .939 .408 

Buse's R square 741 915 


* Significant at p<.10, two-tailed test. 
** Significant at p<.05, two-tailed test. 
*** Significant at p<.01, two-tailed test. 


receive and self-supply variables are positive, 
significant determinants of fringe benefits. 
The supply vector variable, once again, loses 
significance when the sales concentration 
variable is added to the model. The standard- 
ized coefficient of the receive vector is the 
strongest of the positional power variables, 
but not among the strongest in the model. 
Still, positional power appears to be a major: 
determinant of fringe benefits, the fastest 
growing portion of the total wage bill. — 


DISCUSSION 


Taken in total, our results strongly support the 
notion that positional power is important in ex- 
plaining multiple labor outcomes.!! In the seven 


1! Our models may be biased by our failure to 
consider in the analysis a measure of the 
substitutability of labor. It is extremely difficult to 
measure this concept directly in an aggregate 
analysis such as this one. One indirect method 
would be to control for the occupational structure 
in an industry. In analyses not shown here, we 
reestimated our models including percentages of 
the industry work force in eight major occupational 
categories (professional, managerial, clerical, sales, 
craft, operative, service, and laborer). When we 
compared these models with the ones reported in 
the paper, we found only minor differences in the 
estimates of the efficacy of the positional power 
variables. In the organizational effort model, for 
example, only the supply vector turned nonsignif- 
icant. In the strike volume equation, the receive 


regression models, the supply vector is in the 
expected direction and achieves statistical sig- 
nificance three times. Two other times this vari- 
able would have been significant if concentra- 
tion had been excluded from the analysis. The 
receive vector is significant in six out of seven 
models, failing to reach significance only in the 
union density model. The self-supply vector is 
significant in three of seven models. Impor- 
tantly, then, all three dimensions, particularly 
the two indexing disruptive potential beyond 
the localized sector, appear relevant for under- 
standing positional power. Conceptually, these 
dimensions represent very distinct aspects of 
positional power available to workers though 
they may not be neatly separate in everyday 
struggles with employers. It is even possible 
that these dimensions never quite percolate to 
the conscious level in capital-labor confronta- 
tions, but rather remain just below the surface. 
This does not, however, diminish their impor- 
tance in the structural determination of labor 
Outcomes, as our analyses suggest. 

A remaining question is why does posi- 
tional power of labor have such an impact on 


vector turned nonsignificant. By and large, then, 
our results seem relatively robust when available, 
plausible controls for labor substitutability are 
included. But we readily concede that even those 
few changes to our results induced by the inclusion 
of the industry-specific occupational distribution 
suggest that the issue remains open and certainly 
deserves greater empirical scrutiny. 


POSITIONAL POWER OF AMERICAN LABOR 


organization, militancy, and economic stand- 
ing? The key, we feel, is found in the role 
that strategically located groups exert in a 
highly interdependent economic system. In 
such contexts, economic or social damage can 
“spill over” and become systemic. The threat 
of genuinely systemic damage enlarges the 
scope of the conflict, drawing in the state, the 
media, the public-at-large, and capitalists in 
upstream and downstream industries. No 
longer is there conflict between an isolated 
group of workers and their employers, but 
rather pressure is brought to bear by other key 
actors to avert a strike before it happens or 
resolve it if it is already under way. Thus, in 
key spheres of production, capital-labor 
conflict is highly politicized: the mere threat 
of prolonged strikes in coal, automobiles, or 
steel sends tremors throughout the economy. 
The broader the potential damage, the more 
politicized and the more “public” conflict 
becomes. This is not to say that workers who 
have positional power are necessarily more 
conflict-prone, but rather they have a better 
arsenal of weapons at their disposal if conflict 
escalates to the point of a work stoppage. In 
fact, a web of state regulations and institution- 
alized relationships has evolved to contain the 
potential damage of conflict in certain sectors 
of the economy. In turn, these relationships 
tend to further solidify the positional power of 
workers in the system. In general, then, when 
positional power is present and recognized by 
the key actors in the system, the leverage of 
workers in the system is enhanced. !? 


CONCLUSION 
This paper has attempted to advance Per- 





12 Marxists have long recognized the possibility 
of using precisely this sort of class power in 
confrontations with capital or the state. This is 
shown not only in Perrone’s (1983) discussion but 
also in Gay’s (1962) biography of the German 
Marxist revisionist, Eduard Bernstein. According 
to Gay (pp. 240-41), Bernstein, as early as 1906, 
“knew the phrases ‘mass strike’ and ‘must be 
fought under certain conditions’ were vague and 
needed greater precision. This he attempted to give 
them: the mass strike need not be a universal work 
stoppage. It should involve workers in certain vital 
categories whose failure to show up for work could 
paralyze the whole economy. Bernstein mentioned 
transport and communications, light and power, 
and food distribution as key factors in any modern 
industrial economy.” . 
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rone's theory of the positional power of labor. 
On balance, the measures we developed and 
the empirical analyses we presented support 
this conceptualization and suggest its poten- 
tial utility in future studies of working-class 
power. Our research has been largely explor- 
atory, however, and we therefore should 
caution readers about some of the limitations 
of the research. First, while we see our 
logarithmic method as far superior to Per- 
rone's original threshold method, there may 
be still better methods of conceptualizing 
positional power. However, any such mea- 
sure must hinge on the logic of the intercon- 
nectedness of differing spheres of production. 
Second, we concede that there may well be 
other bases of positional power besides the 
ones we identify. One potential aspect of 
positional power that some may regard as an 
omission from this analysis is an industry's 
interface with the final consumer market. We 
made this omission deliberately because we 
felt it was important to focus for now only 
upon the economic exchanges among inter- 
locking spheres of production. However, 
future research may want to explore how the 
relationships uncovered here are mitigated by 
connections with the consumer market. Third, 
in its present form, our model of positional 
power cannot fully account for the role of the 
state in societal production. Although pub- 
lished input-output matrices include flows 
involving government, the absence of many 
key variables (e.g., concentration, profits) for 
the state sector would make estimation of 
regression models impossible. 

We also did not consider the fact that the 
state and other actors in the political economy 
frequently play a preemptive role in capital- 
labor conflicts. The state has an arsenal of 
weapons available to it to intervene in 
situations that threaten economic stability 
and/or social cohesion. The Reagan adminis- 
tration's, smashing of the Professional Air 
Traffic Controllers Organization's (PATCO) 
strike in 1981 is exemplary in this respect. 
Such displays obviously alter the risk-benefit 
calculus of both labor and management as 
they contemplate such serious confrontations 
and may, in turn, modify the exercise of 
positional power. Similarly, organizational 
and counterorganizational strategies by capi- 
tal and labor may add a different spin to the 
dynamics revealed here. On occasion, con- 
certed efforts by capital or timorous labor 
leadership may blunt the positional power of 
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labor. One wonders, for instance, whether the 
relationships observed in this paper for the 
1960s and 1970s hold for the 1980s. 

None of these observations shakes our 
confidence in the general thrust of the model 
developed in this paper. Positional power is 
not an ephemeral phenomenon, but rather one 
that is embedded in the deep structure of 
economic exchange. We are encouraged by 
evidence gleaned from casual observation of 
the political economies of most Western 
advanced industrial nations. In these coun- 
tries, industries such as mining, construction, 
heavy manufacturing, and transportation are 
typically the most highly organized, the most 
strike-prone, and the highest paid of all 
sectors in the economy. This suggests strongly 
a common structural basis for these patterns 
that transcends the idiosyncrasies of culture, 
politics, and business cycles. Of course, 
casual observation is no substitute for rigor- 
ous comparative analysis. Since most coun- 
tries publish the types of input-output flow 
matrices needed to compute the positional 
power measures, such comparative analyses 
are possible and likely to be instructive. The 
limited comparative data from Perrone’s 
Italian study and our American study are 
encouraging. Despite quite different methods 
and modeling procedures, both studies found 
a positive association of the supply vector on 
wages and no consistent effect of the supply 
vector on strike propensity. 

Future research should also delve more 
deeply into historical patterns displayed in 
individual countries. Although our analysis 
spans roughly the 1960s and 1970s, it does 
not attempt to identify subtle shifts in 
positional power throughout this period, nor 
does it examine the effect of structural 
interdependence on change in the labor 
outcomes we analyze here. We did note that 
our positional power measures were quite 
stable across industries from period to period. 
This would seem to indicate that patterns of 
economic exchange among industries change 
very slowly and, therefore, that positional 
power changes slowly. However, large-scale 
industrial changes such as adoption of more 
capital-intensive methods of production will 
induce gradual changes in the arithmetic of 
our positional power measures. We could also 
assume, ceteris paribus, that to the extent 
economic spheres of production become more 
complex and interlocked over time, positional 
power of workers ‘will increase. But ceteris 
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paribus conditions are stringent indeed, often 
at variance with empirical reality. If eco- 
nomic interdependencies were to change 
substantially, industrial characteristics would 
likely change, as would the firms, workers, 
and employers in those industries. Thus 
speculation is difficult. 

Future researchers might consider more 
disaggregated studies of positional power than 
the one presented in this paper. While we 
used major industry categories in our analy- 
sis, the Department of Commerce input- 
output matrices are produced at the more 
detailed three-digit SIC level. Using compa- 
rable industry-level data, a more disaggre- 
gated analysis might refine and modify many 
of the arguments developed thus far. Ulti- 
mately, such indicators might be appended to 
individual-level data sets as proxies for 
industrial-level resources in studies of earn- 
ings or other job outcomes (Hodson and 
Kaufman 1982). If the results of our analyses 
are an indication, the positional power of 
labor should have relevance for individual- 
level outcomes also. 

Finally, the logic of the positional power 
approach might be extended to other topics in 
the social sciences. For instance, one could 
use data based on the economic flows among 
different states or regions of a country to 
construct analogous measures of regional 
power in an interdependent national econ- 
omy. A similar logic might be utilized to . 
explore patterns of economic and/or political 
dependency in the world system (as it has 
been to a degree already). By using this 
approach to analyze patterns of exports and 
imports among countries, for instance, one 
could develop a quantitative scheme for 
understanding economic dominance in the 
world system. In short, this method could be 
applied to just about any sociological problem 
in which actors in a system are differentially 
linked by a pattern of interdependent ex- 
change, the resulting consequences of which 
are differences in power among those actors. 
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APPENDIX 
A. Variables: 


1. 1963 Dummy—Dummy variable scored 1 
for 1963 and O for 1967, 1972, 1977. 

2. 1967 Dummy —Dummy variable scored 1 
for 1967 and 0 for 1963, 1972, 1977. 

3. 1972 Dummy —Dummy variable scored 1 
for 1972 and 0 for 1963, 1967, 1977. 

4. Sales/Inv Ratio—Sales inventory ratio — 
(Total sales in $1,000,000) / (Total inventories in 
$1,000,000). Sales: IRS. Inventories: IRS. 

5. Change Employ—Five-year % change in 
employment = (Employ, — Employ ,.s) / 
(Employ,..5). Employees: EHE. 

6. Change Profits—Five-year % change in 
profits before taxes in real dollars — (Profits, — 
Profits, 5) / Profits,.5. Profits: NIPA. 

7. Concentration —Z-scores created within each 
year by taking the percent of sales held by firms 
with (a) greater than $100,000,000 in assets and 
(b) greater than $250,000,000 in assets; and 
averaging the two z-scores. Sales: IRS. 

8. Avg Assets— Average assets per fim = 
(Assets in $1,000,000) / (Number of firms). 
Assets: IRS. Firms: IRS. 

9. Avg Estab Size— Average establishment 
size = (Number of employees) / (Number of 
establishments). Employees: CBP. Establishments: 
CBP. 

10. Union Density — Percentage of workers who 
are union members = (Number of union mem- 
bers) / (Number of employees). Union Members: 
DNUEA. Employees: EHE. (Union data available 
biannually; intervening years were interpolated). 

11. Mining Dummy — Dummy variable scored 1 
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for mining industry (SIC 10) and O for all other 
industries. 

12. Strike Frequency —Number of strikes per 
million workers = (Number of strikes * 1,000,000) 
(Number of employees). Strikes: AWS. Employ- 
ees: EHE. 

13. Strike Duration— Average length of strikes 
= (Number of persondays lost to strikes) / 
(Number of strikers). Strikes: AWS. Strikers: 
AWS. 

14. Union Organization Effort — Percent of work- 
ers involved in NLRB union representation elec- 
tions — (Employees eligible for NLRB election) / 
(Number of employees). Employees eligible: 
NLRB. Employees:EHE. 

15. Average Wages— Average wages and sala- 
ries per worker = (Wages and salaries in 
$1,000,000) / (Number of employees). Wages and 
salaries: NIPA. Employees: NIPA. 

16. Average Fringe Benefits— Average fringe 
benefits per worker — (Total compensation - Total 
wages and salaries in $1,000,000) / (Number of 
employees). Total compensation: NIPA. Wages 
and salaries: NIPA. Employees: NIPA. 

17. Supply Vector —XLN((S/W,) + 1), where 
Sı = value of goods and services supplied by the 
sending industry to the ith industry; W, = number 
of employees in the sending industry s). Value: 
IOS. Employees: EHE. 

18. Receive Vector—X LN ((R/W,) + 1), 
where R; = value of goods and services received 
by the receiving industry to the ith industry; W, = 
number of employees in the receiving industry r. 
Value: IOS. Employees: EHE. 

B. Sources: _ 

AWS: U.S. Department of Commerce, Bureau 
of Labor Statistics, Analysis of Work Stoppages, 
annual. 

CBP: U.S. Department of Commerce, Bureau of 
the Census, County Business Patterns, annual. 

DNUEA: U.S. Department of Labor, Bureau of 
Labor Statistics, Directory of National Unions and 
Employee Associations, biannual. 

EHE: U.S. Department of Labor, Bureau of 
Labor Statistics, Employment, Hours, And Earn- 
ings, annual. 

IOS: U.S. Department of Commerce, Bureau of 
Economic Analysis, The Input-Output Structure of 
the U.S. Economy, occasional (and associated data 
tapes). 

IRS: U.S. Department of the Treasury, Internal 
Revenue Service, Statistics of Income: Corporate 
Income Tax Returns, annual. 

NIPA: U.S. Department of Commerce, Bureau 
of Economic Analysis, National Income and 
Product Accounts, 1929-1974; 1977, Survey of 
Current Business, annual. 

NLRB: National Labg 


<a 





214 


REFERENCES 


Ashenfelter, Orley and George E. Johnson. 1969. 
“Bargaining Theory, Trade Unions, and Strike 
Activity.” American Economic Review 59:35-49. 

Beck, E.M., Patrick M. Horan, and Charles M. 
Tolbert, II. 1978. “Stratification in a Dual 
Economy: A Sectoral Model of Earnings 
Determination.” American Sociological Review 
43:704—20. 

Bell, Daniel. 1973. The Coming of Post-Industrial 
Society. New York: Basic Books. s 

Buse, A. 1973. “Goodness of Fit in Generalized 
Least Squares Estimation." American Statisti- 
clan 27:106-8. 

Edwards, Richard C. 1979. Contested Terrain. 
New York: Basic Books. 

Freidland, Roger and Jimy Sanders. 1985. “The 
Public Economy and Economic Growth in 
Western Market Economies.” American Socio- 
logical Review 50:421—37. 

Gay, Peter. 1962. The Dilemma of Democratic 
Socialism. New York: Collier. 

Giddens, Anthony. 1973. The Class Structure of 
Advanced Societies. London: Hutchinson. 

Gordon, David M., Richard Edwards, and Michael 
Reich. 1982. Segmented Work, Divided Work- 
ers. New York: Cambridge University Press. 

Gouldner, Alvin. 1979. The Future of Intellectuals 
and the Rise of the New Class. New York: 

Griffin, Larry J. 1984. "Reviewers! Comments." 
American Sociological Review 49:412-21. 

Griffin, Larry J., Michael Wallace, and Beth A. 
Rubin. 1986. "Capitalist Resistance to the 
Organization of Labor before the New Deal." 
American Sociological Review 51:147-67. 

Hannan, Michael T. and Alice A. Young. 1977. 
“Estimation in Panel Models: Results on 
Pooling Cross-Sections and Time Series." Chap- 
ter 2 in Sociological Methodology, edited by 
David R. Heise, San Francisco: Jossey-Bass. 

Hibbs, Douglas. 1976. "Industrial Conflict in 
Advanced Industrial Societies." American Polit- 
ical Science Review 70:1032-58. 





AMERICAN SOCIOLOGICAL REVIEW 


Hodson, Randy and Robert L. Kaufman. 1982. 
“Economic Dualism: A Critical Review." Amer- 
ican Sociological Review 47:727-39., 

Kmenta, Jan. 1971. Elements of Econometrics. 
New York: Macmillan. 

Mann, Michael. 1973. Consciousness and Action 
among the Western Working Class. London: 
Macmillan. 

Marx, Karl. [1867] 1967. Capital. New York: 
International Publishers. 

Parkin, Frank. 1971. Class Inequality and Political 
Order. New York: Praeger. 

Perrone, Luca. 1983. “Positional Power and 
Propensity to Strike.” Politics and Society 
12:231-61. 

Perrone, Luca. 1984. "Positional Power, Strikes, 
and Wages." American Sociological Review 
49:412-21. 

Rees, A. 1962. “Industrial Conflict and Business 
Fluctuations.” Journal of Political Economy 
60:371-82. 

Rubin, Beth A., Larry J. Griffin, and Michael 
Wallace. 1983. “ ‘Provided Only That Their 
Voice Was Strong': Insurgency and Organiza- 
tion of American Labor from NRA to Taft- 
Hartley.” Work and Occupations 10:325—-48. 

Shorter, Edward and Charles Tilly. 1974. Strikes 
in France, 1830-1968. Cambridge: Cambridge 
University Press. 

Snyder, David. 1975. “Institutional Setting and 
Industrial Conflict: France, Italy, and the United 
States." American Sociological Review 40:259— 
78. 

U.S. Department of Commerce. Bureau of Eco- 
nomic Analysis. 1963/1967/1972/1977. The 
Input-Output Structure of the U.S. Economy. 
Washington, DC: U.S. Government Printing 
Office. 

Wallace, Michael, Beth A. Rubin, and Brian T. 
Smith. 1988. “American Labor Law: Its Impact 
on Working-Class Militancy, 1901-1980.” So- 
cial Science History 12:1-29. 

Wright, Erik Olin. 1984. "Postscript." American 
Sociological Review 49:421—25. 


THE REPRODUCTION OF PARENTING . 


ROBERT Max JACKSON . 
New York University 


es Using a sohasi model, this.analysis shows that Chodorow’s.theory (described 
in The Reproduction of Mothering, 1978) does not work. The theory claims . 


daughters became more nurturing than sons because mothers do more nurturing 
than fathers. Society then functionally allocates child rearing to women because 
they do it better and enjoy it more. This self-reproducing cycle supposedly explains 
why women's responsibility for rearing children persisted so long: The analysis in 
this article represents the social process transmitting nurturing in the theory with a 
stochastic model. Applying the model over several generations shows that the 
theoretical process does not preserve sex differences. Instead, it causes the 
proportion of women’ and men who nurture children to become steadily more 
equal. This analysis also considers several likely strategies for modifying 
Chodorow’s theory to overcome the contradictory results revealed by the stochastic 


model. None succeed. 


INTRODUCTION 


By applying a mathematical model to the 
transmission of nurturing between genera- 
tions, this analysis shows that the theory 
unfolded in Chodorow’s The Reproduction of 
Mothering (1978) fails to explain women’s 
child-rearing responsibilities. The theoretical 
processes she describes would reduce, not 
sustain, sex role differences. The theory links 
two propositions: it claims that if only women 
nurture children, daughters develop nurturing 
personalities but sons do not; and it asserts 
. that sex differences in personality decide who 
rears children. Therefore, it argues, women’s 
mothering monopoly in each generation 
guarantees their daughters will inherit their 
nurturing role and later—as part of a 
never-ending cycle—these daughters will 
pass it on to their own daughters. We can 
model this theory by specifying the likelihood 
that a child develops a nurturing personality 
as a joint function of the child’s sex and of the 
nurturing activity of parents. This model 
allows us to see how the processes described 
by the theory would affect the distribution of 
nurturing in a society after several genera- 
tions. It shows that the transmission process 
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described by Chodorow's theory, when ap- 
plied with any assumptions that are botti 
theoretically and empirically plausible, can 

not explain the persistence of women’s 
child-rearing responsibilities. Instead, the 
theory would lead to the opposite result. The 
processes she describes would, if present, 
eradicate any differences between women and 
men. . 


THE ASSUMPTIONS AND ARGUMENT 
OF CHODOROW’S THEORY 


Others have claimed that Chodorow’s analy- 
sis distorts families’ effects on children’s 
personalities and adult behavior. But all 
accounts seem to assume that Chodorow’s 
theory would account for the persistence of 
women’s child-rearing responsibility if its 
assumptions bow. dhie the transmission process 
were correct. A stochastic model of the theory 
shows this is not true. To build this model, I 
will begin by specifying the logic of Chodo- 
row's theory. (Throughout this article, when I 
refer to "Chodorow's theory" I mean .the 
sociological reasoning in her argument. My 
analysis has no bearing on Chodorow's 
extensive effort to advance psychodynamic 


theories.) 


Chodorow's theory claims to show why it 
*has unquestionably been true—that women 
have had primary responsibility for child care 
in families and outside them; that women by 
and large want to mother, and get gratifica- 
tion from their mothering; and . . . women 
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have succeeded at mothering” (1978, p. 7). 
Contending that neither biological differences 
nor socialization can account for women’s 
child rearing, she relies instead on Freudian 
and Parsonian theory (pp. 11-39). From 
Freudian analysis (especially the work of 
object relations theorists such as Harry 
Guntrip and W. R. D. Fairbairn and the work 
on mother-child relations by D. W. Winni- 
cott) the theory deduces mechanisms of 
psychological development that will produce 
sex differences in personality. From Parson- 
ian functionalist theory it derives an argument 
that such a sex difference in personalities 
would cause a sexual division of labor. 
Chodorow's creative synthesis links these 
arguments to explain how the sexual division 
of labor reproduces itself.! 

The theory’s fundamental proposition con- 
tends that parents’ nurturing activities cause 
sex differences in the nurturing propensities 
of their children (p. 169). Specifically, 
because mothers nurture while fathers do not, 
daughters become nurturing and sons do not. 
“The sexual and familial division of labor in 
which women mother creates a sexual divi- 
sion of psychic organization and orientation” 
(p. 209). Chodorow refers to the pattern in 
which one parent does most of the child 
rearing as “asymmetric” parenting. Accord- 
ing to the theory, when a nurturing parent and 
a child share the same sex, the child identifies 
more strongly with the parent, the parent 
identifies more strongly with the child, and 
their relationship escapes troubling sexual 
tensions (pp. 159—209). Because mutual 
identification and the absence of sexual 
tension promote ‘a nurturing propensity, 


1 The most fundamental difference between 
socialization explanations (Weitzman 1984) and 
Chodorow’s theory is due to the reflexive nature of 
her theory (and not, as will be discussed below, the 
use of psychoanalysis). Chodorow’s model explic- 
itly attempts to be self-contained: the processes 
producing female child rearing in one generation 
also ensure that the pattern will repeat again in the 
next generation. Socialization theory would achieve 
this goal only if it argued not only that children 
learned different motives, skills, and expectations 
regarding child rearing, but also that socialization 
gave children an enduring, precise commitment to 
socialize their own future children the same way 
they had been socialized. Socialization may 
explain how people acquire cultural expectations, 
but it plays no part in determining the content of 
those expectations (cf. Marini and Brinton 1984). 
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nurturing mothers produce nurturing daugh- 
ters and nonnurturing sons. (Chodorow makes 
no clear theoretical distinction between the 
amount of time spent with children and the 
quality of emotional care. For consistency 
with her theory, I will assume the two are 
congruent.) 

Furthermore, according to the theory, 
nurturing propensities formed during child- 
hood regulate how much child rearing adults 
do. The theory attributes the sexual division 
of labor to differing abilities, needs, and 
aspirations (pp. 208-9, 214). “Women’s 
capacities for mothering and abilities to get 
gratification from it are strongly internalized 
and psychologically enforced, and are built 
developmentally into the feminine psychic 
structure” (p. 39). “Women’s mothering 
includes the capacities for its own reproduc- . 
tion. This reproduction consists in the produc- 
tion of women with, and men without, the 
particular psychological capacities and stance 
which go into primary parenting” (p. 206). 
The theory contends, therefore, that a func- 
tional allocation of social positions causes 
women to rear children because they do a 
better job and are more motivated.? 

Chodorow claims considerable historical 
significance for the family processes and 
psychological forces she emphasizes. Respond- 
ing to a critique of her theory by Judith 
Lorber, Chodorow writes: 


Lorber holds that we can explain why modern 
women mother in terms of the separation of 
home and work, income inequality, the ideology 
of patriarchy, and a psychological ideology 
about mothering. But this theory of causation 
reverses the true case. The family division of 
labor in which women mother was responsible 
for the relegation of women to the home as 
home and work separated, and this division 
accounts for the sex segregation in paid work 
that produces income inequality. (Lorber, Coser, 
Rossi, and Chodorow 1981, p. 508) 


Chodorow’s theory, in summary, has a 
simple structure. Women raise children. Girls 
form a nurturing propensity while boys do not 


? This functionalist explanation of child rearing 
differs from other attempts to apply Freudian 
theory to the sexual division of labor. Others focus 
on conflict and suggest men seek control over 
women to reduce anxiety (e.g., Dinnerstein 1976; 
Rubin 1975; Slater 1968). In contrast, Chodorow 
emphasizes the social allocation of people to 
positions based on their skills and motivation. 
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because children acquire the propensity dis- 
played by the parent whose sex they share. 
Because of functionalist selective mecha- 
nisms, women’s greater nurturing propensity 
brings them major responsibility for rearing 
children. Thus, a sex difference in child 
rearing will sustain itself across generations. 


OVERCOMING THE THEORY’S SOCIAL 
PSYCHOLOGICAL WEAKNESSES 


While Chodorow’s theory has had extensive 
and enduring influence,? it has not escaped 
criticism. The critiques have focused on the 
theory's social-psychological assumptions 
(while ignoring the logic of its macrosocio- 
logical argument). It is possible to remedy 
some of the flaws the criticisms reveal 
without changing the theory's logical struc- 
ture. To fully test the theory's potential, my 
model will include these refinements. 


Criticisms of the Theory 

The theory's critics contend that the processes 
transmitting personality propensities to chil- 
dren have more variable and less predictable 
outcomes than the theory predicts and that 
psychoanalysis offers a poor analytical frame- 
work. Rossi (1981) argues that Chodorow has 
little empirical evidence to support her 
assumptions about parents and children. 
Recent research has not filled this gap. 
Studies of children show they develop a sex 
difference in nurturing, but not in the way 
Chodorow describes (Fogel, Melson, and 
Mistry 1986). Studies comparing male and 
female child rearers find little evidence of 
differences in their nurturing or its effects on 
children (e.g., Defrain and Elrich 1981; 
Maccoby and Jacklin 1974; Risman 1986; 
Santrock and Warshak 1979). Gerson's (1985) 
study of adult women choosing whether to 
emphasize family life or work life found 
compelling evidence that adult experiences 
greatly limit the influence of childhood 
experiences on adult commitments. Other 
sociologists (e.g., labor market analysts) 
ignore Chodorow's work without directly 


? In the 1980s Chodorow’s work averaged well 
over 50 references per year in the Social Science 
Citation Index. Also, other important works, such 
as Carol Gilligan's In a Different Voice (1982), 
refer to Chodorow's argument as a foundation on 
which they can build their own theoretical edifices. 
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criticizing it. They often seem to believe its 
merit depends on the validity of psychoana- 
lytic theory, which they consider untenable. 
In short, critics have questioned the theory's 
validity by challenging its social-psychological 
assumptions. 


Reformulating the Theory 


We can make the theory less vulnerable to 
these criticisms by relaxing two assumptions. 
First, we can remove dependence on the 
validity of psychoanalytic theory. Chodo- 
row's emphasis on psychoanalytic theory has 
mistakenly focused critical attention on per- 
sonality acquisition rather than on her model 
of the full social process that sustains a sexual 
difference in personalities and roles. The core 
of her theory, however, could be valid even if 
psychoanalytic premises were not, and there- 
fore we do not need the assumptions about the 
social psychology of personality develop- 
ment. Second, we can also avoid Chodorow's 
tendency to imply that all people, both as 
children and as adults, live in families with 
nurturing, stay-at-home mothers and nonnur- 
turing, rarely-present fathers. Thus we can 
change it into an explicitly probabilistic 
theory. These two changes preserve the core 
of Chodorow's theoretical contribution while 
dispensing with two primary weaknesses that 
have drawn criticism. 


The Irrelevance of Psychoanalysis’ 


The theory’s critical assumption about the 
transmission of child-rearing propensities 
from parents to children is empirical, not 
theoretical. If asymmetrical parenting pro- 
duces a greater nurturing propensity among 
children with the same sex as the caretaking 
parent, then it meets the needs of Chodorow's 
theory no matter what social-psychological 
mechanisms cause it to happen. 

Any plausible hypothetical process that 
would selectively transfer a nurturing propen- 
sity to children with the same sex as a 
nurturing parent will successfully substitute 
for psychoanalytic assumptions in the theory. 
For example, social learning theory (Bandura 
and Walters 1963) would suggest that aspira- 
tions and abilities arise in good part from 
children's efforts to imitate their parents, and 
that these efforts generally focus on the parent 
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whose sex a child shares.* This implies that if 
women do most of the child rearing in one 
generation, the women of the next generation 
will have greater desire and skill.for child 
rearing than men. This example shows how 
we can directly replace the Freudian assump- 
tions of Chodorow's theory with another 
theory of identification. 

We can, therefore, discard psychoanalytic 
assumptions while modeling the social pro- 
cesses of Chodorow's theory. For the theory 
to work, asymmetrical parenting must gener- 
ate somewhat different personalities in boys 
and girls. While we do not have the evidence 
to decide conclusively if asymmetric parent- 
ing has this effect, we know that women, on 
the average, show greater interest than men in 
caring for children, and it is likely that 
childhood development causes some of this 
difference. Hence, it is a credible empirical 
hypothesis, and my model of the theory will 
assume the pattern of parents' nurturing 
influences children's nurturing propensity as 
Chodorow expected. But the model will use 
no assumptions about why this happens. 


Accommodating Variation 


Chodorow's presentation of the theory also 
suffers unnecessarily by using an absolute 
language that neglects variation. To achieve 
plausibility, the theory must become probabi- 
listic. 

The Theory's Universalistic Style. Using a 
style common to psychoanalytic theory, 
Chodorow forges her theoretical analysis 
around the image of a "traditional" middle- 
class family: a married couple with the wife 
devoting herself to rearing children while her 
husband concentrates on his job and shares 
little of the child-rearing activity. All infants 
develop full, close emotional ties to an 
always-present, always-nurturing mother, and 
none has significant emotional ties to an 
aloof, rarely-present father. All these children 
reach a "correct" gender identification with 
the same sex parent, and they become 
heterosexual by symbolizing suitable sex 
objects with the opposite sex parent. In 





^ This should not be confused with socialization. 
In socialization theory, the motives and expecta- 
tions of parents determine their influence over 
children. Here the emphasis is on children's 
identification with their parents, regardless of the 


parents' aims. 
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consequence, (all) women become nurturing 
and (all) men do not. This style of assuming 
universal outcomes creates serious theoretical 
problems because it misrepresents the empir- 
ical conditions it claims to explain. 

"Deviant" Families and Sex Role Excep- 
tions. We know that social reality does not 
correspond to the imagery of "traditional" 
family life. In the United States by 1984 only 
42 percent of all children under six years were 
living with both parents and had a mother 
who did not hold a job (U.S. Department of 
Labor 1985, p. 124). In recent years, men 
who raise children alone or take major 
responsibility in couples have inspired a wide 
range of research (Ambert 1982; Chang and 
Deinard 1982; Defrain and Elrich 1981; Greif 
1985; Hipgrave 1981; O'Brien 1982; Pleck 
1985; Radin 1982; Rossi 1984; Santrock and 
Warshak 1979; Santrock, Warshak, and 
Elliott 1982; Smith and Smith 1981). In 1984 
widowed, divorced, separated, or single men 
were raising one-third of a million children 
under five years of age (U.S. Department of 
Labor 1985). 

People stray from the "traditional" family 
image for many reasons. Mortality, marital 
instability, unrestrained passions, economic 
imperatives, and ambitions all limit confor- 
mity to the model of an intact traditional 
family with a strict division of labor. Fathers 
and mothers die, spouses flee, pregnancies 
occur outside marriage, unavoidable unem- 
ployment throws men back into their homes, 
simple economic need forces some women to 
find jobs, and other women seek employment 
to fulfill their personal aspirations. 

As a result, not all women are nurturing, 
not all men are nonnurturing, and not all 
children grow up in a "traditional" family. 
We have no good data to yield a precise 
estimate of how many men have nurtured 
children or how many children grew up 
receiving a significant proportion of their 
nurturing from men. Some sources of devia- 
tions from the imagery of a traditional family, 
such as divorce and women's employment, 
have shown a strong increasing historical 
trend. But even in the mid-19th century, 
death, spouse abandonment, male unemploy- 
ment, and female employment assured that a 
minority of children grew up in surroundings 
unlike the traditional family. 

While variation plays no role in Chodo- 
row's presentation of her theory, introducing 
it does not contradict her own interpretation. 
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Although Chodorow largely presents her 
theory in absolute terms, her work does not 
claim that this faithfully mirrors reality. On 


the contrary, she recognizes that “all women. 


do not mother or want to mother, and all 
women are not ‘maternal’ or nurturant... . 
[And] some men are more nurturant than 
some women" (1978, p. 215 [emphasis in 
original]. She also remarks that “the evi- 
dence I have seen suggests that . . . at least 
some men parent just as well as women" 
(Lorber et al. 1981, p. 512) And she 
sometimes expresses her major assertions 
probabilistically rather than absolutely. For 
example, she says, “as a result of having 
been parented by a woman, women are more 
likely than men to seek to be mothers" (1978, 
p. 206 [emphasis added]). Moreover, she 
asserts that her claim of personality differ- 
ences “is not to argue that all women want to 
mother" (Lorber et al. 1981, p. 503), and she 
says, "I concede that all claims about gender 
differences gloss over important differences 
within genders and similarities between gen- 
ders" (1978, p. 215). 

Thus, to give the theory plausibility, my 
model will assume it says that most (but not 
all) women nurture children and most men do 
not. Furthermore, most children grow up with 
a nurturing mother (or mother substitute) and 
a nonnurturing (or absent) father. The model 
must also represent the significant minority 
who deviate from these norms. This theoreti- 
cal adjustment will accommodate empirical 
reality while keeping the original argument 
intact. $ 

Imperfect Transmission to Children. Just as 
women have not done all child rearing, it is 


not tenable that traditional family organiza- ' 


tion will cause all daughters to form an adult 
psychological propensity for nurturing or all 
adult sons uniformly to lack it. We have no 
data to apply directly to nurturing, but we can 
supply a conservative estimate by considering 
another adult propensity, heterosexuality. 
Like nurturing (in the theory), the develop- 
ment of sexual orientation depends on early 
childhood experience and concerns behavior 
subject to strong social norms. Moreover, 
Chodorow’s own explanation of the sex 
difference in nurturing depends on the 
formation of heterosexual object choice. 
Conservative estimates suggest that at least 3 
to 4 percent of adult men and 2 to 3 percent of 
women in the United States are exclusively 
homosexual (Gagnon 1977, pp. 253-56). 
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These proportions would grow if we included 
part-time homosexuals. Given the social 
sanctions against homosexuality and the depth 
of cultural indoctrination toward heterosexu- 
ality, we can deduce that the proportion of the 
population whose personalities favor homo- 
sexuality must exceed these numbers. If 
children develop a nurturing propensity as 
described by Chodorow’s theory, deviations 
from the theoretical norm must be at least as 
common as homosexuality. Thus, at least 
several percent of the daughters and the sons 
growing up in traditional families will acquire 
a nurturing propensity different from the 
norm for their sex. 

As a corollary to this extension of the 
theory, we also need to expand the argument 
to include families that diverge from the 
norm. This is already implicit within Chodo- 
row’s specification of the theory. Although 
she restricts her analysis to “traditional” 
families, with a nurturing mother and nonnur- 
turing father, the theory implies that reversing 
the parents’ nurturing propensities would 
reverse the outcomes for children (as noted by 
Rossi [1984, p. 10]). Furthermore, because 
the theory attributes the sex differential in the 
transmission of nurturing to a difference 
between the parents, it implies that in families 
where the parents both have, or both lack, a 
nurturing propensity, daughters and sons will 
have the same likelihood of developing 
nurturing personalities. The theory does not 
try to specify the results in such families. But 
the theory logically implies that the propor- 
tion of children acquiring a nurturing poten- 
tial when both parents are nurturing must be 
at least as large as the proportion of daughters 
within “traditional” families who become 
nurturing. Similarly, the proportion of chil- 
dren acquiring a nurturing potential when 
neither parent is nurturing must be at least as 
small as the proportion of sons within 
“traditional” families who become nurturing. 

The theory implies these outcomes be- 
cause—and this is one of its strengths— 
biological sex itself has no causal effects; 
instead it is nurturing, the time spent caring 
for a child and the quality of emotional 
relationships, that causes differences in chil- 
dren. That women should be the sex caring 
for children is theoretically incidental, al- 
though it has empirical primacy. 


A STOCHASTIC MODEL 
Having qualified Chodorow’s theory to rem- 
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edy (as much as possible) the weaknesses 
spotted by past critics, it is now possible to 
build a stochastic model representing the 
transmission of nurturing propensities be- 
tween generations. I will translate the theory’s 
fundamental propositions into mathematical 
statements after specifying assumptions about 
parts of the process that were not completely 
explored in the theory. Then I will use the 
model to explore the theory’s ramifications. 


Nurturing States 


Like Chodorow, I will assume that all 
members of the population fall into one of 
two mutually exclusive states: they are either 
nurturing or nonnurturing. Realistically, nur- 
turing activity lies on a continuum from 
almost none to excessive. Using a simplified 
assumption of only two nurturing states has, 


however, no substantive consequences for the ` 


outcomes of the model and avoids diverging 
from Chodorow's statements of the theory.5 
Following the theory, I also assume that 
people will fill adult roles fitting their 
personalities so that the orientation toward 
nurturing formed during childhood controls 
adult nurturing activity. 

This gives four types of couples (or 
families) based on the nurturing propensity of 
the spouses. These include: the “traditional” 
family with a nurturing wife and nonnurturing 
husband, the “antitraditional” family with the 
nurturing tendencies reversed, the “sym- 
metric nurturing” family with both spouses 
nurturing, and the “symmetric nonnurtur- 
ing.” 


Probabilities for Children Acquiring a 
Nurturing Propensity 


A model needs a fully specified relationship 
between parents’ nurturing propensities and 
those of their children. Parenting will follow 
four possible patterns based on the nurturing 
propensities of the wife and husband. I 
represent these by a matrix of transmission 
probabilities, illustrated by Table 1. In this 





5 Because Chodorow treated nurturing largely as 
a dichotomous state, her theoretical statements do 
not indicate how intermediate nurturing propensi- 
ties might result from, or contribute to, the 
reproduction of parenting. Thus, we can only 
introduce intermediate states by extending the 
theory. 
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Table 1. General Transmission Matrix: Probability that 
Sons and Daughters Acquire a Nurturing 
Propensity as a Function of Parents’ Nurturing 


Nurturing 
Child 
Parents (Probabilities) 

Type of Couple Father Mother Sons Daughters 
Symmetric nurturing Yes Yes Pi P, 
Symmetric 

nonnurturing No No Py Py 
Antitraditional Yes No P3 P, 
Traditional No Yes P, Pa 


table P, to P4 denote the probabilities that a 
daughter or son will develop a nurturing 
propensity in each of the four possible kinds 
of couples. Only four probability variables 
are necessary to cover the eight probability 
conditions because the model restricts some 
probabilities to equal each other. The last line 
of the table shows the “traditional,” asymmet- 
ric couple that is the focus of Chodorow’s 
theory. The theory demands that P}, the 
probability a daughter becomes nurturing, is 
much larger than P4, the probability for a son. 
The theory also implies that the probabilities 
will be exactly reversed for the "antitradi- 
tional" couple with reverse parental roles. 
The assumptions of the model make the 
probabilities the same for sons and daughters 
in each of the first two types of (symmetric) 
couples. The theory further implies that P, 
(the probability for nurturing couples) is at 
least as large as P4 (the probability that 
daughters become nurturing in traditional 
couples) and that P2 (the probability that 
children become nurturing when neither 
parent is) is at least as small as P4 (the 
probability that sons of traditional couples 
become nurturing).5 With these specifica- 
tions, this simple matrix embodies the 
fundamental propositions of Chodorow's the- 
ory as part of a stochastic process. 


Demographic Conditions 
To model the transmission of traits between 





$ Violations of these conditions would defy 
logic by implying that daughters with two 
nurturing parents were less likely to become 
nurturing than daughters with one nonnurturing 
parent or that sons with one nurturing parent were 
less likely to become nurturing than sons with two 
nonnurturing parents. 
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generations also requires specific marriage 
and birth rates. To avoid adding new 
propositions to Chodorow’s theory, I will 
assume a stable population in which nurturing 
propensities do not affect marriage and 
. reproduction. More specifically, I will as- 
sume the following: that nurturing and 
nonnurturing people marry equally often, 
people ignore nurturing propensity while 
choosing mates (random mating), and all 
types of parents bear children at the same 
rate. The assumptions supplied here follow 
the most direct line of reasoning available: as 
Chodorow's theory does not propose that 
nurturing propensities affect mating or repro- 
duction rates, we should assume they do not. 
Later in this analysis, I will show that these 
particular assumptions have no importance, 
because all plausible demographic assump- 
tions lead to the same substantive conclu- 
sions. 

The stochastic model formed by these 
various assumptions allows us to see the 
history of a population over time as people 
marry and have children, and their children 
grow up and. repeat the process. Given any 
beginning distribution of nurturing in the 
population, the demographic assumptions 
about mating regulate what proportion of 
parents fall into each of the four parent 
groups. Substituting any probabilities meeting 
the specified conditions into Table 1 will 
create a transition. matrix congruent with 
Chodorow's theory that links the nurturing 
propensities of children to the nurturing 
activities of their parents. The probabilities of 
the transition matrix than tell us the distribu- 
tion of nurturing propensities in the next 
generation. Thus, combining the transition 
matrix with the demographic assumptions 
defines a process that governs the distribution 
of nurturing propensities among women and 
men in each generation based on the distribu- 


tion in the preceding generation. 


An Illustration 


The meaning of the transition matrix and 
stochastic model becomes clearer if we look 
at an example. This example (Table 2) uses 
simple probabilities that conform to the 
expectations of Chodorow’s theory. Consis- 
tent with the theory, the matrix says that 
“traditional” couples (whom the theory as- 
sumes to predominate) will find that 90 
percent of their daughters develop a nurturing 
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Tabie 2. Illustrative Transmission Matrix Based on the 
Expectations of Chodorow’s Theory 
Nurturing 
Children 
(Probabilities) 
Type of Couple Sons Daughters 
Symmetric nurturing .90 .90 
Symmetric nonnurturing .10 .10 
Antitraditional .90 .10 


Traditional 10 90 


propensity but only 10 percent of their sons 
do so. The “antitraditional” couples with 
nonnurturing mothers and nurturing fathers 
(whom the theory assumes to be rare) have 
exactly the opposite influence on their 
children: most sons become nurturing while 
most daughters do not. Ninety percent of both 
daughters and sons of “symmetric nurturing” 
couples become nurturing. In families with 
two nonnurturing parents, only 10 percent of 
the children become nurturing. 

The model also needs some starting 
distribution of nurturing propensities in the 
population. I will assume, again choosing 
numbers that follow the intent of Chodorow’s 
argument, that at the starting point of my 
investigation in a hypothetical society, 90 
percent of the adult women have a nurturing 
propensity and 10 percent do not; men have 
just the opposite distribution. Under these 
assumptions, at first 81 percent of couples 
follow the “traditional” pattern, only 1 
percent are “antitraditional” with a nurturing 
father and a nonnurturing mother, and each 
type of symmetrical couple makes up 9 
percent. Using the illustrative transmission 
matrix in Table 3, I show the resulting 
distribution of individuals and couples for 
each generation from the 1st through the 20th. 

The outcomes in this table emphatically 
contradict the claim of Chodorow’s theory 
that it explains why women’s responsibility 
for child rearing endures in society. The 
outcomes also clash with most people’s 
commonsense expectations. Starting with a 
population in which most’ women have a 
nurturing propensity and most men lack it, 
and using a reproduction model that says most 
children born to “traditional” couples will 
develop “traditional” propensities, we expect 
that the starting differences between the sexes 
will stay unchanged over time. Instead, as 
shown in the table, the differences between 
the sexes decline. From a historical perspec- 
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Table 3. Ilustrative History of the Distribution of 
Nurturing (Based on the Transmission Matrix 


of Table 2) 

Gener- 5 ED) _ Couples (36) 
ation Ww M W-M MW mw Mw mW 
0 .90  .10 .80 09 09 .01 .81 
1 .82  .18 .64 5 .5 .03 .67 
2 6 24 .51 18 .18 .06 .57 
3 0  .30 Al .21 .21 09 .50 
4 66 .34 33 22 2 11 A44 
5 63.37 .26 23 23 .4 40 
6 60  .40 .21 24 24 16 37 
7 S8 .42 7 44. 24 .17 M4 
8 .57 43 13 35 25 19 32 
9 S55 45 al 25 25 20 31 
10 54 46 .09 25 325 21 29 
11 53 47 .07 .35 25 22 .29 
12 53.47 .05 35 25 22 328 
13 .52 48 04 35 25 23 2 
14 52 48 .04 25 285 2323 22 
15 .51 .49 .03 25 25 224 A26 
16 51 .49 .02 25 25 24 26 
17 51  .49 .02 25 .25 24 26 
18 51 .49 .01 35 .25 .24 26 
19 51 .49 01 25 28 24 26 
20 .50  .50 01 35 25 35 25 


Note: W signifies nurturing women, M nurturing men, 
w nonnurturing women, and m nonnurturing men. The 
couples notation follows suit, so that, for example, Mw 
signifies a couple with a nurturing husband and 
nonnurturing wife. 


tive, they decline rapidly. Assuming there are 
between four and five generations each 
century, this model suggests that in one 
century women's child nurturing responsibil- 
ity shrinks from being nine times as great as 
men's to only twice as much. “Traditional” 
families drop from 80 percent to 40 percent of 
families. After two centuries, the sex differ- 
ences become negligible. Eventually, both 
sexes become evenly split between nurturing 
and nonnurturing and the distribution stabi- 
lizes. 

While the results of this example contradict 
the theory's major argument, it might seem 
possible that this happened only because I 
used peculiar numbers. How general is this 
result? I will now mathematically prove that, 
under any assumptions consistent with the 
real world, the described by this 
theory will always move a population toward 
an equilibrium where the same proportion of 
"women and of men have a propensity for 
nurturing. After that, I will consider the 
substantive reasons why the model produces 
this outcome. 
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THE GENERAL MODEL: THE 
MATHEMATICAL NECESSITY OF 
CONVERGENCE? 


Chodorow's theory implies that the process 
that creates nurturing propensities will force- 
fully preserve sex differences across genera- 
tions. The illustration above contradicts this 
expectation. With each successive generation 
the distribution of nurturing changes to make 
women and men more alike until they look 
the same. It is possible to show that the 
stochastic process will always produce this 
outcome using any probabilities in the 
transition matrix consistent with the theory. 

The model representing Chodorow’s theory 
of the reproduction of mothering resembles 
stochastic models used in population genetics 
and Markov chain models. These models 
primarily allow us to assess how random 
events obeying certain rules over time lead to 
predictable outcomes, particularly stable equi- 
libria (Cavalli-Sforza and Feldman 1981). 
Population genetics uses stochastic models to 
analyze the transmission of traits across 
generations. The well-known Hardy-Weinberg 
theorem in population genetics, for example, 
shows how simple Mendelian genetic trans- 
mission produces (in only one generation) a 
stable distribution of genotypes in a popula- 
tion, regardless of the original distribution. 
Markov chains model recurring probabilistic 
transitions among a definable (usually finite) 
set of choices. The best-known applications 
of Markov chains in sociology involve social 
mobility tables (Prais 1955; Bartholomew 
1973; Boudon 1973), used, for example, to 
show that over time mobility processes (if 
constant) could produce a stable occupational 
distribution independent of the original occu- 
pational distribution. 

The models of population genetics better 
represent Chodorow’s theory, so I will follow 
their form of argument here. Because the 
stochastic model of Chodorow’s theory dif- 
fers somewhat from common models based 
on either population genetics or Markov 
chains, it is not possible to derive its general 
properties directly from proven theorems in 
either area. The models of population genet- 
ics generally do not apply to a bilateral 
sex-linked process such as that described in 





7 Gary King of the Political Science Depart- 
ment, Harvard University, generously contributed 
the mathematical proof contained in this section. 
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Chodorow’s theory, because only the Y- 
chromosome of the sex-linked chromosomes 
carries effective genes and only women can 
pass on sex-linked traits. Hence, Chodorow’s 
model resembles those normally used for 
sex-linked “inheritance” in population genet- 
ics but has a more general form and purpose. 
The model of Chodorow’s theory differs even 
more from most Markov chains, because it 
involves two transformations between succes- 
sive generations: the formation of couples and 
the production of individuals by couples. It is 
practical to represent the process as a Markov 
chain only with restrictive assumptions (a 
Markov analysis is available from the author 
in Jackson [1988]). As it is possible to show 
the implications of the full process with a 
model like those of population genetics, I will 
focus on it. 

To begin, let me define nomenclature. For 
generation i, the following symbols denote 
the proportions of men, of women, and of 
couples in the population: 


symbol represents the population propor- 
tion of... 

Mi nurturing men 

m, nonnurturing men 

W, nurturing women 

Wi nonnurturing women 

MW, nurturing = man/nurturing woman 
couples (“symmetric nurturing”) 

mw, nonnurturing man/nonnurturing 
woman couples ("symmetric non- 
nurturing") 

Mw, nurturing man/nonnurturing woman 
couples ("antitraditional") 

mW, nonnurturing man/nurturing woman 


couples (“traditional”) 
The transmission probabilities for a nurturing 
propensity will be represented by P; fori = 1 
to 4 as in Table 1. 

The distribution of nurturing among women 
and men decides the prevalence of each type 
of couple through random mating. Each type 
of couple will constitute an expected propor- 
tion of all couples equal to the product of the 
proportions of the relevant types of women 
and men. For example, random mating will 
make 81 percent of couples “traditional” 
when 90 percent of women are nurturing and 
90 percent of men are nonnurturing. More 
generally, for generation i, 


MW, = MW, l (1) 
mw, = mw, = (1 — MDO — Wọ) Q 
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Mw, = Mw; = M(1 — W) (3) 
mW, = mW; = (1 — M)W.. (4) 


Given the probabilities that daughters and 
sons in each type of family will become 
nurturing, the distribution of couples within 
each generation regulates the expected distri- 
bution of nurturing among individuals in the 
next generation. The expected distribution of 
nurturing for generation i is a function of the 
distribution of couples in the preceding (i — 
1) generation and the rates (or probabilities) 
from the transmission matrix (Table 1) 
linking parents and children. Thus 


W = P\MW,_, + P2mw,_; 


+ PsMw,_, + P3mW,_; (5) 
M, = P\MW,_, + P2mw,_; 
+ P4Mw,., + PamW,. ,. (6) 


Now we can consider the problem of 
convergence. We want to find what happens 
to the sex difference in nurturing propensity. 
By subtracting equation (6) from (5) we get 
the sex differential: 


W; — M, = P4Mwr,., + P3mW,., 
— P4Mw;., — PamW;-, (7) 


=(P4 — P3(Mw,., - mW,.). (8) 


And by substitution from equations (3) and 
(4) we get 


W; — M, =(P4 — P3KM,-,[1 — Wii] 
== M, Wi) (9) 


= (P3 — PV, — Mj). (10) 


Thus, the sex differential in each generation is 
a simple function of the sex differential in the 
preceding generation. Moreover, because 0 < 
P; = 1 for all i, and accordingly |P; — P,| = 
1, it follows that, as i goes to infinity, 


limit |W, — M] =0 for0 S |P} ~ P4| «1 
(11) 
=|Wo — Mol 
for |P; — P4 = 1. 
(12) 


In words, this says that the sex differential 
will approach closer to zero with every 
generation. 

We can solve for the exact proportion of 
nurturing characteristic of both sexes at equi- 
librium (see Appendix for complete deriva- 
tion). If the outcomes for symmetric couples 
mirror the related outcomes for asymmetric 
couples (the simplest and most straightforward 
assumption), then at equilibrium 


ie Ps 
P, + Ps —1 
E ES 4 |. 
P4 


(13) 


This equals one-half if and only if the 
bracketed factor in the denominator equals 
zero. This event occurs if one-half of all 
children born to families with asymmetrical 
patenting develop a nurturing propensity ([P3 
+ Pa] + 2 = 44). The convergence point 
goes up or down as does the transmission bias 
toward nurturing or nonnurturing. 

These results prove that the stochastic 
process defined by the model, with one 
exception, always converges to an equilib- 
rium where the proportion of women with a 
nurturing propensity equals the proportion of 
men with a nurturing propensity. The only 
exception would occur if P4 minus P4 could 
equal one. In substantive terms, for this 
exception to occur, all "traditional" families 
(with a nurturing mother and a nonnurturing 
father) must make absolutely all daughters 
into nurturing mothers and all sons into 
nonnurturing fathers. (Conversely, if there 
were families with nurturing fathers and 
nonnurturing mothers, the opposite out- 
comes must occur.) In reality, as dis- 
cussed above, social-psychological processes 
do not transfer any aspect of personality 
(including heterosexuality) with this absolute 
uniformity and rigidity; therefore, we can 
dismiss this prospect as empirically 
impossible.’ 

In short, this stochastic model shows that— 
with any plausible assumptions about trans- 
mission rates—the reproduction process of- 
fered by Chodorow to account for the 
persistence of women’s child-rearing activi- 
ties really leads to the opposite results. 
Instead of preserving sex differences in the 
propensity to nurture, it would end them. 





5 Even this extreme possibility, if it were to 
exist, would not produce a difference in the 
nurturing propensities of the sexes. Rather it would 
continue to reproduce any initial sex differences. It 
might converge to an equilibrium state different 
from the initial distribution of nurturing, but it 
would increase or decrease nurturing equally for 
the sexes so as to leave unchanged the differences 
between men’s and women’s aggregate propensity 
for nurturing. If the initial population showed no 
sex differences, the process would sustain that 


equality. 
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THE GENERAL MODEL: WHY IT 
CONVERGES 


Why does this model of the reproduction of 
nurturing imply that the sexes will always 
tend toward equal nurturing propensities? The 
theory emphasizes an action, nurturing, that 
both sexes can do. Because this action is 
independent of biological sex, the theory 
necessarily treats women and men entirely 
symmetrically. But with such symmetry, the 
transmission process always pushes a popula- 
tion toward equal rates of nurturing for 
women and men. 

Let us look at how this works using the 
example shown in Table 3. With each 
generation, the proportion of nurturing women 
falis while the proportion of nurturing men 
rises until they reach the same level. How 
does this happen? First, consider only women. 
Because this example uses slightly more 
restrictive assumptions than does the general 
model (by setting the probabilities for trans- 
mitting nurturing to children of symmetric 
couples to equal the coinciding probabilities 
for asymmetric couples, i.e., P; = P3, P; = 
P4), the probability that a daughter becomes 
nurturing depends only on the nurturing 
propensity of her mother; equally, a son's 
nurturing propensity depends only on his 
father. In the example, nurturing mothers 
produce 90 percent nurturing daughters while 
nonnurturing mothers produce only 10 per- 
cent nurturing daughters. 

Why do these numbers cause the propor- 
tion of nurturing women to decline? Look at 
the first generation of change. As nurturing 
women account for 90 percent of the children 
and 90 percent of their daughters become 
nurturing, they cause 81 percent of the 
second-generation women to be nurturing. 
This falls 9 percent short of their own 
proportion in the first generation. The rest of 
the first generation are the 10 percent 
nonnurturing women, and since only 10 
percent of their daughters become nurturing, 
they add only | percent to nurturing women 
in the second generation. This brings the 
proportion of nurturing women to 82 percent 
in the second generation. 

What, in words, is the social process at 
work here? For clarity, assume every woman 
has one daughter so we can use absolute 
numbers rather than proportions. Because the 
transmission of nurturing is variable, some 
daughters differ from their mothers. Conse- 


REPRODUCTION OF PARENTING , 


quently, neither nurturing nor nonnurturing 
women reproduce their own numbers in the 
next generation. As a result, the number of 
nurturing women will stay unchanged across 
generations only if nonnurturing women 
produce as many nurturing daughters as 
nurturing women produce nonnurturing daugh- 
ters. If not, the number of nurturing women 
must go down or up. In particular, when the 
number of nurturing women is high, the 
nurturing women will produce more nonnur- 
turing daughters than the nonnurturing women 
will produce nurturing daughters; this moves 
the next generation closer to the equilibrium 
point where the two groups of women 
produce the same number of daughters unlike 
themselves. 

How does this end sex differences in 
nurturing? Here we see the importance of the 
theory’s symmetry. The same process causing 
the proportion of nurturing women to move 
toward an equilibrium also affects men 
exactly the same way. As a result, the 
proportion of men who are nurturing moves 
toward the same equilibrium point as does the 
proportion of women who are nurturing. 

It is easy to miss this point by falling into a 
conceptual trap by focusing too narrowly on 
one kind of family, the so-called “traditional” 
family with a nurturing mother and nonnurtur- 
ing father. The theory assumes that most 
families are traditional and that most children 
will acquire the same nurturing inclination as 
the parent whose sex they share. Intuitively, it 
seems that the traditional family should 
continually reproduce itself and remain pre- 
dominant. But this reasoning fails to face the 
entire process defined by the theory. 

To avoid this conceptual error, we need to 
think of the full range of people and couples 
in a society. In the simplified form of the 
model used in Table 2, this means consider- 
ing both nurturing mothers (or fathers) and 
nonnurturing mothers (or fathers). In the 
general model, without the simplifying assump- 
tions used in Table 2, we must consider all 
four kinds of couples and families, as the 
transmission rates for nurturing depend on the 
nurturing propensities of both wife and 
husband. The logic of the process, although 
more complex, is unchanged. All four kinds 
of families produce children who tend to 
resemble their parents of the same sex; that is, 
each creates children after its own image. But 
because a significant minority of children 
differ from their parents, traditional families 
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cannot produce enough traditional children to 
reproduce themselves. The population be- 
comes stable only if the other three family 
types produce as many traditional children as 
the traditional families produce antitraditional 
children. Because symmetric couples produce 
equal numbers of nurturing males and fe- 
males, this process cannot reach a balance 
until the proportion of traditional families 
equals the proportion of antitraditional fami- 
lies. Until then, the proportion of women and 
of men who are nurturing people will shift 
toward the same equilibrium point. 


WOULD OTHER ASSUMPTIONS WORK? 


No change in the assumptions of this 
stochastic model can make it work as 
Chodorow intended. Because mathematical 
modeling forces greater specification of the 
theory, it invites speculation that the model 
(and theory) could work if we changed the 
specification in some reasonable way. But no 
strategy can simultaneously preserve the logic 
of Chodorow's original argument, stay consis- 
tent with empirical reality, and define a 
process that would account for women's 
child-rearing responsibility. To see why this 
is so, let us briefly consider each of the 
strategies holding some hope of salvaging the 
theory: using transmission probabilities that 
would slow down convergence; changing the . 
demographic assumptions to allow nurturing 
propensities to affect mating and birth rates; 
allowing the process to affect the two sexes 
differently for reasons not expressed in the 
theory; and reducing the theory's claim that it 
describes a sufficient cause for the persistence 
of a sexual division of labor to a more modest 
claim that it describes a contributory cause. 


A Slower Convergence Rate? 


The preceding analysis has not considered 
how fast the process described by Chodo- 
row's theory would move the sexes toward 
the same rate of nurturing. The proof offered 
above shows that the sexes' nurturing must 
grow more alike with each generation. Could 
we avoid the implications of this conclusion 
by choosing empirically plausible assump- 
tions that slow the rate of change enough that 
sex differences do not diminish significantly 
within realistic time spans? No. 

The rate at which the process moves the 
population toward equilibrium depends on the 
degree of difference in the outcomes for sons 
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and daughters born to asymmetric couples 
(regardless of the outcomes for symmetric 
couples). From equation (10) above we know 
that 


Wi — M, 


T (14) 
Wii m M, 


= (P, — P4). 


This means that the larger the disparity 
between daughters and sons produced by 
asymmetric parents, the Jarger the proportion 
of the sex difference in the distribution of 
nurturing left after each generation of change. 
At one extreme, if daughters and sons were 
equally likely to become nurturing (P4 — P4 
= 0), then any sex differential would entirely 
disappear in one generation. At the other 
(empirically impossible) extreme, if tradi- 
tional families produced only nurturing daugh- 
ters and nonnurturing sons, the process 
described by Chodorow's theory would not 
change the distribution of nurturing by sex. In 
Table 4 I show how conditions between these 
extremes affect the rate of decline in the 
nurturing sex differential. The first two 
columns show the proportionate decline in the 
nurturing sex differential after about a century 
for various levels of outcome differences in 
traditional families. To illustrate how this 
works, I show in the last five columns what 
would happen to the example presented in 
Table 3 if the transition probabilities were 
varied to match the outcome differentials in 
the first column. 

No choice of probabilities can cause the 
theory to fulfill Chodorow's expectations. As 
discussed above, to be empirically plausible, 
the theory must accommodate the existence of 
nurturing men and nonnurturing women and 
the variable outcomes characteristic of all 
natural and social processes transmitting 
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psychological conditions. So the theory must 
assume a significant minority of children 
deviate from the personality propensity of 
their same-sexed parent. Recall that even the 
transmission rates for heterosexuality are 
below 95 percent. But sex differences in 
nurturing decline substantially even if we 
assume an empirically unlikely degree of 
transmission rigidity much greater than the 
known transmission rates for heterosexuality. 
If, for example, "traditional" families pro- 
duce a nurturing personality in only 1 out of 
40 sons while installing nurturing propensities 
in 39 out of 40 daughters, in five generations 
nearly one-fourth of the sex differential would 
disappear and one-third of all families would 
deviate from the traditional model (Table 4). 

Hence, no empirically valid specification 
of differential nurturing outcomes can slow 
convergence enough to make the theory an 
acceptable explanation of women's persistent 
obligations for child rearing. Even under the 
most theoretically favorable assumptions con- 
sistent with empirical reality, the transition 
process contradicts the expectations of Cho- 
dorow's theory: it would reduce any existing 
sex differential in nurturing propensities to a 
socially significant degree. 


Demographic Possibilities? 


My model of Chodorow's theory has assumed 
nurturing propensities influence neither mat- 
ing nor birth rates. Because Chodorow 
ignored demographic issues, any other assump- 
tions would extend, and fundamentally change, 
the theory. Once we begin to look at the way 
the theoretical processes work over time, 
however, these issues become unavoidable. 
Would any possible change in the demo- 
graphic assumptions reverse the outcomes of 


Table 4. How the Size of the Asymmetry Effect (P4 — P4) Influences the Decline of the Nurturing Differential in 
Five Generations (Illustrations Assume Starting Proportions of Wo —.9, Mo—.1, mWp=.81) 





Differential 
Remaining 
Effect [= — Ms Probabilities 
[P — Pal Wo — M, P3 

.95 T1 .975 
.90 .59 .95 
.85 44 .925 
80 .33 .90 
70 7 85 
60 .08 80 
50 .03 75 


25 51 49 


Illustrative Processes 
Proportion of population 


after 5 generations 

P, Ws Ms mW; 
025 81 19 66 
.05 74 26 .54 
075 .68 432 .46 
.10 .63 .37 .40 
15 57 A3 .32 
.20 53 .47 .28 
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the model so that it conformed to the 
expectations of Chodorow’s theory? No. 

Mate Choice. Nurturing propensity could 
affect mate choice in several ways: people 
might prefer to marry someone with a like 
disposition, they might prefer someone who 
differs, or everyone could prefer to marry the 
same type (e.g., everyone wants a nurturing 
spouse). Each possibility would change- the 
distribution of couples produced by the 
transmission process: the first would produce 
more symmetric couples, the second more 
asymmetric couples, and the third more 
symmetric couples of the preferred type. But 
none of these three conditions would change 
the rate of convergence toward equilibrium. 
In each case, the difference between women’s 
and men’s nurturing declines toward zero at 
the rate stated in equation (14) above.? These 
marriage patterns lead to the same decline in 
sex differentials because the criterion for 
mating preferences —the presence or absence 
of a nurturing propensity —applies to both 
sexes equally. 

Nurturing might also affect whether people 
ever marry. Differential rates of ever marry- 
ing have the same implications for the 
reproduction of nurturing as differential birth 
rates. If nonnurturing people marry less often, 
for example, it has the same consequence as 
if they married equally often but had fewer 
children. Thus, the following analysis show- 
ing that variable birth rates cannot slow the 
process's elimination of sex differences ap- 
plies equally to changed assumptions about 
rates of avoiding marriage. 

Birth Rates. The model used in this 
analysis also assumed that birth rates were the 


same for all types of couples. If we allow . 


varying birth rates, the process will work out 
differently. As an extreme example, if only 
couples with two nurturing parents had 
children, in one generation the process would 
reach an equilibrium where most people of 
both sexes were nurturing. Although a 
nurturing propensity could influence people's 
appetite for children, no adjustment to birth 
rates consistent with both the real world and 





? A mathematical proof of these findings is 
available from the author (Jackson 1988). The 
point of convergence is largely irrelevant to the 
argument here, but the process would also 
converge to the same point as random mating 
under all these demographic conditions if P, = P; 
and Pa = P4. 
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the theory will allow the model to work as 
Chodorow intended. 

If nurturing propensities influence birth 
rates, then nurturing people long for children 
either more or less than nonnurturing people. 
The two possibilities have like implications 
for the. model, but only the first seems 
plausible. If it were true, then couples with 
two nurturing parents might have more, and 
couples with two nonnurturing parents might 
have fewer, children than asymmetric cou- 
ples. At an extreme, couples with two 
nurturing parents would have twice (or some 
other multiple) as many children as asymmet- 
ric parents while two nonnurturing parents 
would have no children. Even at this extreme, 
the process converges as before. Compared to 
the outcomes when nurturing does not affect. 
birth rates, assuming that nurturing parents do 
have more children shifts the equilibrium 
point (because it produces more nurturing 
parents of both sexes) and changes the pace at 
which women and men grow alike only 
slightly (increasing it for most interesting 
cases). !0 i 

Hence, if nurturing parents have more 
children than do nonnurturing parents, the 
process still ends sex differences because it 
still affects women and men in the same way. 
The only way that variations in the birth rates 
among the four types of couples could undo 
the transmission process’s resistance to sex 
differentials would be to treat the sexes 
differently. This would necessarily introduce 
a logic of determination altogether different 
from Chodorow’s theory. 


Treating the Sexes Differently? 


The model representing the processes of 
Chodorow’s theory consistently ends sex 
differentials regardless of the transmission 
probabilities or demographic assumptions 
used. This happens because the theory 
concerns nurturing actions and personality 
states, not biological sex. Without any causal 


10 The precise effects depend on further specific 
assumptions. Variable reproduction rates across 
the types of couples introduce a fluctuation in the 
aggregate birthrate across generations. This re- 
quires an adjustment in the equations for the 
reproduction of nurturing. Overall, if birthrates are 
proportional to the number of nurturing parents in a 
couple, the process will converge faster; a proof is 
available from the author (Jackson 1988). 
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connection between biological sex and nurtur- 
ing, any coincidental association between sex 
and nurturing disappears over time because of 
the randomization effects of a probabilistic 
transmission process. 

To some, this may suggest another possible 
tactic for changing the model so that its 
outcomes might conform to the theory’s 
expectation: we could assume that the process 
somehow affects women differently from 
men. We could, for example, assume that 
daughters become nurturing more often than 
do sons in comparable circumstances. This 
would mean that daughters in traditional 
families become nurturing more often than 
sons in antitraditional families. Alternatively, 
we could get the same effect if nurturing 
affects the mating or reproductive behavior of 
women differently from men. 

The strategy of treating the sexes differ- 
ently can successfully model sustained un- 
equal child rearing, but it fails to make the 
theory work as intended. The decisive causal 
agent would no longer be asymmetrical 
parenting but women’s greater susceptibility 
to nurturing or to nurturing’s effects on 
marriage and reproduction. Instead of Chodo- 
row’s argument, treating women and men 
differently in the model implies theories 
emphasizing social causes (such as social 
power differences or discrimination) or bio- 
logical causes (such as repeated pregnancy 
and the need to breast-feed infants). 

To show how this works, I will change the 
model in a way that uniformly favors 
nurturing in women. I will assume that the 
probability that daughters become nurturing 
exceeds the probability for sons in compara- 
ble circumstances by some constant differ- 
ence. This model will move a population 
toward an equilibrium where the ratio be- 
tween the proportion of all women who are 
nurturing and the proportion of nurturing men 
will equal the ratio between the probability 
that daughters become nurturing in antitradi- 
tional families and the probability that sons 
become nurturing in traditional families. For 
example, to yield twice as many nurturing 
women as nurturing men, daughters of 
“antitraditional” couples must become nurtur- 
ing twice as often as sons of “traditional” 
couples.!! 


11 This uses the simplifying assumptions that 
outcomes in symmetric couples equal the associ- 
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To give an idea of what this means, let me 
apply the revised model to the example shown 
in Tables 2 and 3. I will assume again that 90 
percent of daughters become nurturing in 
traditional families while only 10 percent of 
sons do. To produce an equilibrium with four 
times as many nurturing women as men, for 
example, antitraditional families must pro- 
duce nurturing in 40 percent of their daugh- 
ters (4 times 10 percent). The likelihood that 
daughters in any type of family become 
nurturing must exceed that of sons in 
comparable circumstances by 30 percent (40 
percent less 10 percent). This gives sons of 
antitraditional families a 60 percent chance of 
becoming nurturing. Simply put, in this 
example daughters' probability of becoming 
nurturing uniformly exceeds sons’ by 30 
percent for reasons that have nothing to do 
with parents’ nurturing. Given the starting 
assumptions, the greatest equilibrium ratio 
possible would match the outcome originally 
predicted by common sense: an equilibrium 
of 90 percent nurturing women and 10 percent 
nurturing men. To reach this, however, 
children in all types of families must develop 
nurturing at the same rate as in traditional 
families. But if all four types of families have 
the same outcomes, it means that parents’ 
nurturing has absolutely no effect. 

Consequently, introducing a female predis- 
position for nurturing can successfully model 
a process sustaining sex differentials in 
nurturing, but only by abandoning the logic of 
Chodorow’s theory. Asymmetric parenting no 
longer decides the issue. Women become 
nurturing (or marry or have children) more 
often than men under comparable circum- 
stances. The nonfamilial conditions responsi- 
ble for these differences in the nurturing 
probabilities for daughters and sons have 
become the decisive causal agents. The model 
now implies a theory that some conditions 
other than nurturing and family organization 
(such as employment discrimination or norma- 
tive expectations) cause women’s greater 
responsibility for rearing children. But the 


ated extreme outcomes in asymmetric couples. The 
assumptions imply that daughters with symmetric 
nurturing parents become nurturing at the same 
rate as daughters of traditional couples while 
daughters with two nonnurturing parents become 
nurturing at the same rate as those in antitraditional 
families; sons follow a similar pattern. A proof is 
available from the author (Jackson 1988). 
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influence of asymmetrical parenting is the 
keystone to Chodorow’s theory. Remove this 
keystone and the entire structure collapses. 


A Contributory Cause? 


Could Chodorow’s theory describe a process 
that helps sustain women's . child-rearing 
responsibilities although it is not their sole 
cause? For this to be true, other influences 
such as women’s exclusion from employment 
opportunities would work in parallel with sex 
differences in personality development so that 
they jointly caused women’s child-rearing 
responsibilities. Chodorow has occasionally 
implied this strategy by attributing some 
responsibility for women’s child rearing to 
social conditions exterior to the family. She 
says, for example, that “income inequality 
... makes it. . . rational, even necessary for 
fathers . . . to be primary wage-earners" 
(1978, p. 35). And she contends that the 
social division of labor by sex defines men by 
their employment and women through their 
family (p. 178). This strategy would abandon 
the theory’s claim to wholly account for 
women’s “mothering” through familial pro- 
cesses but insist that it identifies a primary 
contributory cause. Surprisingly, this tactic 
cannot salvage the theory, because it misinter- 
prets the theory’s effects. 
_ The process transmitting nurturing in Cho- 

dorow’s theory forces the distribution of 
nurturing propensities toward sex equality. It 
is not a weak cause of sex differentials. It is 
not even neutral. A neutral transmission 
process would act like cultural inertia: it 
would preserve any pattern. À neutral trans- 
mission process would not shift a population 
away from equality, but it would sustain any 
pattern of equality or inequality produced by 
other means. This is not the case here. Begin 
with a population strongly skewed toward 
"traditional" expectations: women make up 
the bulk of the nurturing members. Apply the 
model of the transmission process and after 
just several generations the distribution moves 
dramatically toward sex equality, as illus- 
trated in Table 3. This means that this 
reproduction process would not combine with 
other influences to preserve women's child- 
rearing responsibilities. Instead, it would 
oppose those other influences. 

Chodorow's theory about the reproduction 
of nurturing propensities is a theory of 
social-psychological processes that favor equal- 
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ity. It cannot combine with other causal 
explanations to explain women's child rear- 
ing. Just the opposite. If true, Chodorow's 
theory means that formidable social condi- 
tions must have kept women responsible for 
rearing children by defeating the equalizing 
influence of the nurturing process. This 
suggests that keeping women responsible for 
child rearing took even more powerful causes 
than would otherwise have been needed. 


CONCLUSION 


After putting the theory in its strongest 
possible form, this analysis has shown that 
the process described by Chodorow cannot 
sustain asymmetrical parenting under any 
plausible assumptions. Instead, if people's 
responsibility for rearing children depends 
primarily on the amount and quality of their 
parents' nurturing as her theory argues, then 
the process transmitting parenting motives 
and skills to children will inevitably push the 
population toward sex equality. 

The central sociological argument implied 
by Chodorow's theory has not received the 
attention it merits. Some have questioned the 
theory's assumptions about the dynamics 
within families, some have raised doubts 
about its assessment of psychological devel- 
opment, and others have objected to its 
indifference to structural causes. But every- . 
one, to my best knowledge, has assumed that 
the theory could work if its assumptions about 
personality development were correct. 

Chodorow's theory claims that asymmetri- 
cal parenting functions as a self-sustaining 
system over time. This occurs because 
asymmetrical parenting produces sex differ- 
ences among children in the capacity and the 
desire to care for others. *Mothers reproduce 
mothers through the creation of 
asymmetrical personality characteristics in 
daughters and sons" (Lorber et al. 1981, p. 
501). Women become better equipped and 
more willing to raise children. Men become 
better equipped and more willing to take part 
in the detached relationships characteristic of 
the economy and political order (notice that 
the theory implies that employers and apolo- 
gists for gender inequality have been right: 
women have not been made of the stuff to 
compete with men for jobs or public office). 
Relying on a functionalist argument, the 
theory assumes society allocates roles and 
positions to those fittest to fill them. 
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To treat the theory’s sociological argument 
justly, I develop a model heeding its primary 
assumptions and conclusions, while avoiding 
dependence on associated — but logically inde- 
pendent— controversial aspects of Chodo- 
row's theoretical work. In particular, I show 
the theory's argument about the transmission 
of nurturing across generations does not 
depend on the validity of psychoanalytic 
assumptions and that it can accommodate 
variable outcomes. Chodorow's formulation 
of the theory has received criticism for its 
rigid universalism and dependence on psycho- 
analytic speculation. But the model used here 
remedies these weaknesses while preserving 
the theory's fundamental argument. Hence, 
while these issues may reveal flaws in the 
theory's conception, they offer no decisive 
evidence about its potential validity or utility. 

A stochastic model of the theoretical 
process operating over time goes beyond 
these criticisms to establish decisively that the 
theory cannot work as previously believed. 
The model shows that the process transmit- 
ting nurturing propensities between genera- 
tions always eliminates any sex differential. 
Under any reasonable empirical assumptions, 
the process would seriously diminish any 
existing differential within a few generations. 

To bolster this finding, I have shown that 
no apparent strategy for rescuing the theory 
from this critique succeeds. The strategies 
considered include the following: choosing 
extreme transmission probabilities to slow 
down the movement toward equal rates of 
nurturing, allowing nurturing propensities to 
affect mating or childbearing rates, introduc- 
ing explicit sex differences for the transmis- 
sion of nurturing, and reducing the explana- 
tory claims of the theory to more modest 
dimensions. None of these strategies allows 
the theory to account for women's child- 
rearing responsibilities while staying consis- 
tent with empirical reality. : 

Although this analysis shows that Chodo- 
row's theory cannot explain the long-term 
persistence of women's responsibility for 
child rearing, the theory does loosely fit the 
pattern of women's increasing equality in 
modern history. The model, if true, implies 
that social constraints on opportunities and 
gender identity or biologically derived differ- 
ences in the sexes' reproductive activities 
must have preserved women's responsibility 
for rearing children across the centuries. It 
also predicts that if social or biologically 
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related conditions stop inducing women to 
nurture children more than men do, the sex 
difference in nurturing would disappear. This 
effect would not occur immediately. Instead, 
the distribution of child rearing would ap- 
proach equality over several generations. 
Thus, the model implies that there would be a 
lag in the change of social roles, but the 
change to equality would be inevitable. While 
it is difficult to compare these predictions to 
the muddy course of modern history, they 
seem reasonably consistent with events. We 
know that the structural barriers barring 
women from nondomestic roles have greatly 
declined over the past century, although the 
change has been gradual and is yet incom- 
plete. Similarly, the great drop in the birth 
rate (from about five to about two births per 
woman) and elimination of infants! depen- 
dence on breast-feeding have removed the 
most obvious biologically derived influences 
on the sexual division of labor. While women 
in the United States still do much more child 
rearing than men, child rearing plays a 
considerably reduced role in women’s lives, 
and men’s contributions to child rearing have 
increased. The changing disparities between 
women’s and men’s expectations and oppor- 
tunities may fully account for the distribution 
of child caring at each historical stage. But 
many people believe that personality differ- 
ences also promote the lingering differences 
in child rearing, because women so often do 
more, even if the wife has just as good or 
even better chances for economic success 
than her husband. If this is true, then 
women’s modern progress fits the predictions 


' of the stochastic model: as social structural 


and biologically derivative causes dissipate, 
gender differences in child rearing decline 
gradually but steadily, paced by the intergen- 
erational decline in personality differences. 

This analysis. shows that Chodorow's 
theory cannot work as intended. The stochas- 
tic model proves the process described by the 
theory would never create or even sustain a 
sex differential in nurturing. Instead, it would 
oust any existing sex differential and then 
keep women and men nurturing at the same 
rate. If the theory has any utility, it will be to 
help explain inequality's decline, not its 
origin or persistence. 
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APPENDIX 
Equilibrium Points 


It is possible to solve for the proportion of 
nurturing adults when the stochastic process 
reaches equilibrium using the probabilities in the 
transition matrix. To simplify the solution, assume 
that all the probabilities in the transition matrix are 
between (but not equal to) zero and one. This 
means that in none of the four kinds of couples do 
the children of either sex all become nurturing or 
all become nonnurturing. (Mathematically, it 
removes those cases with absorbing states.) This 
avoids empirically implausible cases that are 
also mathematically peculiar.!? The proof in the 
text above shows that this stochastic process will 
always converge to an equilibrium with equal 
proportions of women and men having nurturing 
propensities. 

By definition of the model, 

M; - P\MW,..; + P4mw,., 

+ P3Mw,_,; + PamW,.,. (6) 


And by substitution (from equations 1 to 4 for the 
distribution of couples), 


M; =P\M,_\W,, + PX1 — M0 

i — W,L) + PaM; — Wii) 
+ Pal — M;,LOW;.,. (15) 
But as i goes to infinity, and 0 = |P4 — P4] < 1, 
them limit |M; — Wj| = 0 as shown above 
(equation 11). Therefore as i goes to infinity, that 


is at convergence, where C stands for the value 


that both M; and W; converge to, 


C = P,C* + Pa — CY + PxC 
— C «P4(C — CÌ). (16) 


And by expanding, transposing, and reorganizing, 


(P, + P) — (P3 + PJC? + [72P; 
*(P.P.-DJC + Pp = 0. (17) 


This is a straightforward quadratic equation that 
can be solved by direct substitution in the usual 
formula. 

Under special conditions the solution reduces to 
a linear form that allows us to see how the 


12 For example, if all children of symmetrical 
couples become like their parents (i.e., P, = 1 and 
P5; = 0), then there are two possible equilibrium 
states: one with all people being nurturing and 
another with no one being nurturing. 
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transmission rates of asymmetrical couples affect 
the outcomes. Specifically, if (P; + P5) = (Pa + 
P4), (and P4 — P4 does not equal the empirically 
impossible value of one), the quadratic equation 
gives the linear solution shown in equation 13 
above. 
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THE RADICALIZATION OF THE BRAZILIAN 
CATHOLIC CHURCH IN COMPARATIVE PERSPECTIVE* 


KEVIN NEUHOUSER 
Indiana University 


The organizational transformation literature traditionally has focused on the 
processes by which organizations become politically more conservative. Recent 
changes in the Brazilian Catholie church, however, provide the opportunity for 
examining the conditions under which organizational transformation can take a 
politically radical direction. I argue that the church’s radicalization can be 
understood only by examining the interaction of the organizational value set and 
. elite response to environmental changes. Opportunity for organizational 
transformation is created by environmental changes, but the form that 
transformation takes is constrained by the value-defined terms of legitimate 
discourse. This interplay of opportunity and constraint explains both the 
transformation of the Brazilian church and the direction of that transformation. To 
develop this argument, I present a historical analysis of the process of change in 
Brazil and then reinforce the conclusions from this case study with a comparative 


analysis of Argentina, Brazil, Chile, and Colombia. 


INTRODUCTION 


While presiding over mass on March 24, 
1980, Archbishop Oscar Romero was assassi- 
nated by a professional gunman linked to at 
least one powerful member of the EI 
Salvadoran government. This brutal murder 
has come to symbolize the increasingly 
violent struggle between church and state in 
Latin America (Lernoux 1982, p. xi; Mont- 
gomery 1983, p. 81). Romero’s death, 
however, is only the most dramatic in a long 
series of church/state clashes. In the last two 
decades, hundreds of nuns, priests, and 
bishops have been tortured, expelled, and 
assassinated for their opposition to the 
socioeconomic status quo. Ironically, the 
most severe repression of the Roman Catholic 
church in the 20th century is occurring on the 
most “catholic” continent. 

State violence towards the church did not 
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erupt out of a vacuum. The Catholic church in 
Latin America has become an increasingly 
outspoken critic of oppressive regimes and 
the exacerbation of economic inequalities by 
dependent capitalist development. Going be- 
yond rhetoric, the church has actively encour- 
aged grass-root organization and political 
participation by both the urban and rural 
lower class (e.g., Montgomery 1983). Thirty 
years ago, or even twenty, who would have 
predicted that an oppositional stance towards 
the political and economic structures would 
become the official position of the Latin 
American Council of Bishops at Medillen in 
1968? 

In this paper, I explore not only how this 
radical transformation! of the church oc- 
curred, but also why the process has been so 
uneven across Latin America. I first analyze 
the Brazilian church, which (arguably) is the 


.most progressive on the continent. This 


analysis is further developed by comparing 
the Brazilian case with Argentina, Chile, and 
Colombia, to demonstrate how differences in 


! The term radical is used in a double sense. 
First, the shift from support to opposition to the 
Status quo represented rapid and far-reaching 
ideological movement by the church. Second, the 
church's oppositional stance, while less extreme 
than those of certain other groups, had an 
extremely disruptive impact on Latin American 
societies, precisely because it was the church that 
was in opposition. 
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national context and church resources account 
- for variation in its opposition to the status 
quo. 

The argument is that an organizational 
crisis, provoked by environmental changes, 
produced a willingness among Brazilian 
bishops to innovate. The value set of the 
church, however, constrained their possible 
solutions.? This combination of opportunity 
and constraint provided tbe opening for 
bishops who opposed the status quo on 
religious grounds to institutionalize their 
position within the church. In the Brazilian 
case, institutionalization occurred through the 
founding of the National Council of Brazilian 
Bishops (CNBB) and the Ecclesiastical Base 
Communities (CEBs). The CNBB and the 
CEBs served to organize those sectors of the 
church committed to an oppositional stance 
and provided the resources necessary to 
promote that stance within both the church 
and Brazilian society. 


THEORETICAL BACKGROUND AND 
LITERATURE REVIEW 


The “Weber-Michels model" has been the 


dominant theme in the organizational transfor- 
mation literature. According to this model, 
* Whatever the form of goal transformation, it 
is always in the direction of greater conserva- 
tism (the accommodation of organizational 
goals to the dominant social consensus)" 
(Zald and Ash 1966, p. 327). The processes 
by which this conservative organizational 
movement occurs are evoked by such telling 
phrases as “the iron law of oligarchy” and the 
“routinization of charisma” (Michels [1915] 
1962; Weber [1922] 1964, pp. 363-72). The 
familiarity of these concepts attests to their 
power and general acceptance in the sociolog- 
ical discipline. 

Scholars have begun to demonstrate, how- 
ever, that organizational transformation can 
occur in other than conservative directions 


? By “organizational value set,” I am referring 
to that set of values which represents the universe 
of legitimate discourse within the organization. 
These values are not only the values endorsed by 
organizational elites, but also the minority posi- 
tions that are acknowledged to be legitimate. In 
defining the terms of discourse within an organiza- 
tion, the value set influences the relative probabil- 
ity that transformation will be in any particular 
direction. 
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(e.g., Jenkins 1977; Schwartz, Rosenthal, 
and Schwartz 1981; Zander 1985, pp. 
129-44). Two major models explain organi- 
zational transformation in a nonconservative 
direction: organizational transcendence and 
elite response. 

Wood (1975, 1981) developed the concept 
of “organizational transcendence” to argue 
that the values of an organization can 
legitimately spur organizational action in 
directions individually disapproved of by 
many or even a majority of its (conservative) 
members. The weakness of this approach is 
that it cannot account for variation in an 
organization’s political stance. Since the 
values of an organization like the Catholic 
church are relatively constant, the emergence 
of a new orientation remains unexplained. 
Thus, authors (e.g., Lernoux 1982; Foroohar 
1986) who have emphasized the role of 
religious values in the radicalization of the 
Latin American church have to argue that the 
value change comes from outside (specifi- 
cally, Vatican ID. While this shifts the 
problem of explaining change from Latin 
America to Rome, there are more serious 
problems. First, what accounts for the wide 
variation in the political stances of national 
churches, given a single set of “catholic” 
values? Second, how can the oppositional 
stance be explained in light of actual 
resistance from Rome (e.g., Veja 6 Feb. 
1985, p. 5; Veja 15 May 1985, p. 67; Veja 19 
March 1986, pp. 100-101)? 

The second major model in the organiza- 
tional transformation literature is that of 
“elite-response”. An oppositional stance will 
be adopted only when "the preferences of 
dominant groups would be served" (Westhues 
1976, p. 307). Dependent on the organization 
for their status, the "preference" of elites is 
for continued organizational growth/survival 
(Westhues 1973, p. 109). Patterns of resource 
distribution in the environment, however, 
affect the ability of organizations to meet 
these goals (Aldrich 1979, pp. 61-70). When 
goal attainment becomes problematic, organi- 
zational elites respond by seeking new means 
of gaining access to crucial resources in the 
environment. A variant in the elite-response 
model argues that new elites may emerge to 
resolve the organizational crisis. The exis- 
tence of a problematic environment "provides 
the possibility of an important power base to 
organizational actors who can cope with it 
adequately. . . . The successful creation of 
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such an internal base leads to 
modifications in organizational goals" (Scott 
1981, pp. 274—75). Bruneau (1974, 1982) has 
applied the elite-response : model to the 
Brazilian church. Increasingly incapable of 
competing for the allegiance of the popula- 
tion, the church adopted an oppositional 
stance as a "different model of influence." 

The model of elites pragmatically respond- 
ing to environmental changes with an opposi- 
tional stance is insufficient on two grounds. 
First, the model does not explain why an 
oppositional stance is chosen as the response 
to organizational decline. It also fails to 
consider possible alternatives to the politically 
risky choice of opposing the status quo. 
Second, the model predicts that a violent state 
reaction should cause the church to moderate 
if not abandon its opposition. Levine (1981), 
Smith (1982), and Bouvier (1983) have all 
argued that bishops, in fact, did respond to 
repression by becoming more conservative. In 
. Brazil, however, the opposite occurred: the 
church radicalized under state repression 
during the 1960s and 1970s and under 
Vatican disapproval during the 1980s. 

I argue that the process of radicalization in 
the Brazilian Catholic church can be under- 
stood only be focusing on the interaction 
between the organizational value set and elite 
response to environmental changes. Opportu- 
nity for change is created by the organiza- 
tional crisis, but solutions are constrained by 
the boundaries of legitimate discourse. In the 
Brazilian case, some bishops advocated a 
religious opposition to the status quo as a 
means of countering a deteriorating environ- 
ment. This oppositional stance was available 
in the church's value repertoire and was 
therefore a legitimate response to a loss of 
popular support, whereas other possible 
solutions (e.g., ordination of women and 
married men) that were at odds with the 
church's existing value set were rejected out 
of hand. The institutionalization of the 
oppositional stance in the CNBB and the 
CEBs, then, prevented a retreat from the 
oppositional stance when the state responded 
with repression. 


THE RADICALIZATION OF THE 
BRAZILIAN CHURCH 


Historical Background 


From the beginning of Portuguese coloniza- 


235 


tion, the church was closely tied to the 
monarchy. The church was dependent upon 
the state for "building the first churches. . . 
paying the clergy, nominating bishops, approv- 
ing documents, selecting sites for convents" 
(Bruneau 1974, p. 13). The only group in the 
Brazilian Catholic church able to maintain 
independence from the Portuguese crown 
were the Jesuits, because of their direct access 
to the Pope and their independent sources of 
financing. The Jesuits resisted the Portuguese 
utilization and treatment of Indian labor and 
were ultimately expelled from Brazil in 1759 
(Bruneau 1974, pp. 18-19). 

From 1500 until the establishment of the 
First Republic in 1889, the Catholic church in 
Brazil was characterized by two traits: 
staunch support of the monarchy and institu- 
tional weakness. Both characteristics derived 
from the church's dependent position. With 
the Republic came separation of church and 
state, a separation which both parties wel- 
comed at the time. For the church, separation 
meant independence, while for the state, now 
controlled by men influenced by European 
positivism and liberalism, it meant freedom 
from the restraints of an archaic institution 
(Bruneau 1974, pp. 29-33). 

Although institutionally independent of the 
state, the church was not willing to play a 
marginal role in civil affairs. From 1889 to 
1930, the church accumulated institutional 
resources to regain political influence by 
demonstrating that church support was neces- 
sary for governmental stability. The culmina- 
tion of this movement came in 1930. By 
supporting Getulio Vargas's successful coup 
against the First Republic, the Brazilian 
Catholic church returned to political power, 
but now as an independent partner (Bruneau 
1974, pp. 32-39). 

Although its relation to the civil authority 
passed through several distinct phases, through 
four and a half centuries the basic position of 
the church vis-à-vis the state was one of 
consistent support of the social order. In turn, 
the state provided important resources to the 
church. After the ouster of Vargas in 1945, 
however, the church could no longer (either 
through formal or informal relations) depend 
on the state to maintain its dominant position 
in Brazilian society. The elimination of state 
support took on increasing significance in the 
following years as the church sought re- 
sources with which to counter a critical loss 
of influence. It is in this context, then, that 
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after 1945 the Brazilian Catholic church, for 
the first time, adopted an openly oppositional 
stance to the status quo. 


Religious Critique of the Status Quo As 
Minority Position 

Catholic social teachings have never fit neatly 
with capitalist social relations. According to 
Weber ([1915] 1976, pp. 328-40; [1904-5] 
1958), Catholicism has been in tension with 
the increasing rationalization of the economic 
and political spheres. Where state and capital- 
ist economy require impersonal relations 
based on physical violence and market forces, 
the Catholic “ethic of brotherliness” empha- 
sizes interpersonal relations based on love.3 
In addition, Berger (1969, pp. 111-12) 
argues that the process of secularization has 
been antithetical to Catholicism, which views 
all spheres of life as imbued with “sa- 
credness.” Thus, throughout the 20th cen- 
tury, Catholic Popes have intermittently 
warned of the materialism and individualism 
that dominate capitalist industrialization and 
their danger for the Catholic faith (Bigo and 
De Avila 1983, pp. 173-227). 

This oppositional strain within Catholi- 
cism, however, has rarely provoked open 
confrontation with the political and economic 
status quo. In Latin America, for over 400 


years the church faithfully legitimated the 


state (and the economy it defended) and 
received state support in return. Within this 
tradition of church support of the status quo, a 
prophetic minority has been heard, but not 
heeded. This minority oppositional stance in 
Latin America goes back at least as far as 
1511, when the Dominican friar, Antonio de 
Montesinos, denounced the Spanish abuse of 
the Indians by declaring from the pulpit, 
“You are in mortal sin . . . for the cruelty and 
tyranny you use in dealing with these 
innocent people. . . . Be certain that, in such 
a state as this, you can no more be saved than 
the Moors or Turks" (Herring 1961, pp. 
173-74). Like the Jesuits some 250 years 





3 The descendents of the radical Reformation 
(Mennonites, Friends, Brethren), who emphasized 
this ethic, also have traditionally opposed the 
socioeconomic status quo. However, the point is 
not that any particular group had these values while 
others did not, but that their presence within the 
Catholic repertoire was a necessary (though not 
sufficient) factor in the church’s transformation. 
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later, Montesinos’ oppositional stance was 
effectively silenced when he was recalled to 
Europe. 

The theology of liberation, then, which is 
the theological basis for the “preferential 
option for the poor” currently advocated by 
the Brazilian church, is related to a sub- 
merged dissident strain in Catholic doctrine. 
The writings of liberation theologians and 
declarations by bishops are carefully linked to 
past church documents in an attempt to 
establish the legitimacy/orthodoxy of their 
positions (de Kadt 1971, pp. 25-26; Boff 
1984; Bonino 1975; Galilea 1978; Gutierrez 
1973). What is new, though, is that opposi- 
tion to the status quo has become, for the first 
time, the primary position of the church in 
Brazil. 


Environmental Changes and 
Organizational Crisis 


In the 20th century, but especially after 1945, 
a variety of forces in Brazil threatened the 
dominant position of the Catholic church. 
Bruneau (1974, pp. 55-67) identified two 
basic social processes that weakened the ties 
between church hierarchy and laity in Brazil. 
The first was rapid rural to urban migration 
which destroyed the peasant/landowner rela- 
tion through which the church traditionally 
had exercised influence over the predomi- 
nantly poor rural population. The second was 
the decreasing importance of Catholic schools 
in which the urban middle and upper classes 
had been educated. While these were not the 
only social processes weakening popular ties 
to the church, they exemplified an environ- 
ment in which individuals were increasingly 
free to identify themselves as something other 
than “Catholic.” Thus, in the early 1960s, the 
CNBB published a list of “four mortal 
dangers” that threatened Latin America: 
naturalism, Protestantism, spiritualism, and 
Marxism (Antoine 1973, pp. 53-54). 

The nominally Catholic Brazilian popula- 
tion, for the first time, was being offered a 
variety of attractive alternatives to the status 
quo. The 1890 census reported that 98.9 
percent of the Brazilian population identified 
themselves as Catholics, but in 1940, this 
figure had declined to 95.0 percent. During 
the 1940s the decline quickened, with only 
93.5 percent of the population identified as 
Catholic in the 1950 census (Labelle and 
Estrada 1964, p. 79). Significantly, the 
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percentage of practicing Catholics was drop- 
ping even more rapidly.^ 

The critical resource for reevangelizing the 
laity, however, was becoming increasingly 
scarce. In 1912, Brazil had one priest per 
5700 inhabitants, but by 1950 this had 
decreased to only one priest per 6900 
inhabitants, the lowest ratio in South America 
(Wilkie 1987, p. 203). Thus, Brazil had 
relatively few priests to counter the increas- 
ingly aggressive prosyletizing efforts of other 
religious and ideological groups. 

The growing strength of such disparate 
groups as Communists, Protestants, and 
spiritists warned some priests and bishops that 
continuing the church alliance with the 
economic and political elites (and the status 
quo which they represented) would continu- 
ally erode popular allegiance to the Catholic 
church. For the first time, the socioeconomic 
needs and desires.of the nonelite majority of 
Brazilian society became a major concern of 
the Catholic hierarchy. The process by which 
that sector of the church which held opposi- 
tional values were able to use this historical 
situation to affect church policy began with 
the creation of the CNBB. 


Establishment of the CNBB 


The founding of the CNBB in 1952 was 
largely spearheaded by the energy and vision 
of Dom Helder Cámara (perhaps the most 
"progressive" Brazilian bishop), who was its 
General Secretary for over 10 years. One of 
the major goals behind the formation of the 
CNBB was to "encourage the whole institu- 
tion to take an active interest in social 
change" (Bruneau 1974, p. 108). The CNBB 
was specifically created as a vehicle to 
mobilize the Brazilian church' to oppose the 
status quo in society. 

Dom Helder, however, was not alone in his 
vision for social change. In the CNBB, he 
was joined by bishops with similar views, the 
majority of whom were from the impover- 
ished Northeast of Brazil (Bruneau 1974, p. 
108; de Kadt 1971, p. 26; Lernoux 1982, p. 





* Between 1900 and 1970, the percentage of 
"practicing" Christians in Brazil dropped from 80 
percent to 62 percent. Given the fact that Catholics 
represented well over 90 percent of all professing 
Christians during this time period, the drop must 
be almost entirely attributed to the Catholic 
population (Barrett 1982, p. 186). 
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32). The bishops from the relatively affluent 
south, on the other hand, historically have 
been the "least concerned" with socioeco- 
nomic problems (Bruneau 1974, p. 108). The 
bishops in Rio de Janeiro and So Paulo, 
though, have traditionally been the most 
prominent in Brazil, because of their proxim- 
ity to the centers of wealth, government, and 
communications. The CNBB provided a 
platform from which the numerical strength 
of the Northeastern bishops could overcome 
their social marginality (Bruneau 1974, pp. 
71-72, 109; de Kadt 1971, pp. 26-27). 
While devoid of any jurisdictional powers 
over the bishops, the CNBB came to be 
accepted as the voice of the Brazilian Catholic 
church, not only by Brazilians in general, but 
by the bishops themselves (Morais 1982, p. 


:17). The CNBB acquired a "semi-monopoly 


of the communication between the Church 
and the state and the Church and the exterior" 


(Marcio Alves 1978, p. 67). The position of 


the CNBB as the voice of the church was 
greatly enhanced when later attacks on the 
church required an institutional response 
which only the CNBB could provide. (Bru- 
neau 1974, pp. 113-17). 

Not only was the CNBB a platform from 
which to address society, but it also provided 
a means of influencing other Brazilian 
bishops (de Kadt 1971, p. 26), other national 
councils of bishops, and even the Latin 
American Council of Bishops—CELAM 
(Mutchler 1971, p. 111). One mechanism by 
which the CNBB has been able to influence 
the opinions of the Brazilian bishops is the 
“Curso de Atualização de Bispos,” a type of 
refresher course in liberation theology. The 
courses are offered free to invited bishops, 
with the result that each year, after their 
participation in the course, a number of 
“moderate” bishops begin voting with the 
“progressives” in the CNBB (Veja 9 July 
1986, p. 27). Another important tool for 


influencing the opinions of both hierarchy and 


laity is the Center for Religious Statistics and 
Social Investigations (CERIS), founded by 
the CNBB in 1962 (Cloin 1966, p. 106). By 
conducting studies and publishing statistics on 
Brazil, CERIS acts as a mechanism for 
shaping the hierarchy's perception of the 
church's relation to Brazilian society. 

The CNBB also provided an avenue by 
which radical priests could bypass conserva- 
tive bishops in whose dioceses they worked. 
The priests often were more radical than the 
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bishops because of their grassroots involve- 
ment in Brazilian society (de Kadt 1971, p. 
29). Prior to the establishment of the CNBB, 
however, these priests had very limited access 
to any bishop other than their immediate 
superior. The structure of the CNBB provided 
opportunities to influence and receive support 
from any bishop. The formation of the CNBB 
has also stimulated priests to organize, since 
there was now an institutionalized audience 
for their concerns. In 1967, for example, 300 
priests from three states sent a highly 
publicized letter to the bishops addressing the 
socioeconomic problem of Brazil and the 
church's role in society (Antoine 1973, pp. 
132-33). Through the CNBB, the priests, 
with their numerical strength, were able to 
influence the entire hierarchy. With the 
support and encouragement of the majority of 
the priests, the progressive bishops have been 
largely able to control the CNBB since its 
inception (Lernoux 1982, pp. 51, 415, 417; 
Veja 9 July 1986, p. 27). 

Beginning in the 1950s, the CNBB began 
to criticize the process of capitalist develop- 
ment in Brazil. Although the bishops did not 
withdraw support from the government (which 
at the time was relatively progressive), 
widespread and deepening economic inequal- 
ities became the focus of increasing attacks 
(Antoine 1973, pp. 52—56). While this was 
threatening enough, CNBB rhetoric was 
backed up with the establishment and support 
of various organizations working for social 
change. One of the most important of these 

, was the Base Educational Movement (Movi- 
mento de Educação de Base), which adopted 
Paulo Freire's method of “conscientization” 
(Bruneau 1974, p. 79; Antoine 1973, pp. 
39-43; Marcio Alves 1978, pp. 151-73; 
Freire 1970). 

An important factor that permitted the 
CNBB to mount its challenge to the status 
quo was its financial independence from the 
political and economic elites. Many of the 
activities of the CNBB were financed by 
Catholic groups in Western Europe (Marcio 
Alves 1978, pp. 60—61). For example, the 
financial support for the "Curso de Atuali- 
zação” referred to earlier came from Belgium 
and France (Veja 9 July 1986, p. 27). 
Because the CNBB has independent resources 
(finances and personnel), it can challenge the 
status quo without threatening its own sur- 
vival (Marcio Alves 1978, pp. 59-62; Bru- 
neau 1974, p. 113; Veja 9 July 1986, p. 25). 
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Institutional autonomy allowed the progres- 
sive bishops to maintain control of the CNBB 
despite active opposition by both the state and 
the economic elites. 

The CNBB, however, has never been 
completely unified in its oppositional stance. 
Because it is a "national council of bishops," 
all bishops are members, including those who 
maintain close ties with the conservative lay 
elite. Thus, when political circumstances 
favored the conservative minority, it has been 
able to influence CNBB statements. The 
clearest example occurred immediately follow- 
ing the 1964 coup, when the CNBB actually 
welcomed the military takeover. In short, the 
structure of the CNBB facilitates an opposi- 
tional stance while simultaneously ensuring 
that the opposition rarely will be unified. 

The presence of a dissident faction within 
the CNBB explains the varying degree of 
support given to the different oppositional 
issues. The CNBB has been most forceful and 
specific in its condemnation of human rights 
abuses because the arrest, torture, and murder 
of bishops, priests, and lay leaders are 
interpreted as an attack upon the church itself 
and therefore unite conservative and progres- 
sive bishops (Lernoux 1982, pp. 301-32; 
Maria Alves 1984, p. 81). While significant, 
the critique of socioeconomic structures has 
been couched in more general terminology as 
a result of the compromises necessary for the 
statements to be passed by the CNBB (de 
Kadt 1971, p. 29). Thus, the church hierarchy 
has supported the interests of an oppressed 
class while attempting to remain the church of 
all classes, a position that introduces appar- 
ently irresolvable tensions into the CNBB. 
The division within the church hierarchy 
serves two complementary functions. On the 
one hand, the conservatives act as a brake on 
the radicalism of the progressives. On the 
other hand, the presence of conservatives 
within the church protects the church (and the 
progressives within it) from complete state 
repression (de Kadt 1971, p. 37). 


Creation of the Ecclesiastical Base 
Communities (CEBs) 


Powerful as the oppositional stance of the 
CNBB was, it required grassroots involve- 
ment by the laity if the church's loss of 
popular allegiance was to be reversed. With 
the encouragement and support of the CNBB, 
local groups of lay Catholics were organized 
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to discuss how the liberating message of the 
gospel could be applied to the reality of their 
lives (Maria Alves 1984, pp. 82-84). The 
result of this initiative was the emergence 
within a few years of an estimated 50—80,000 
CEBs in Brazil with over four million 
participants (Lernoux 1987, p. 25; Rohter 
1979). The CEBs serve to mobilize the laity 
for social change, while "evangelizing" the 
laity despite the shortage of priests. 

On the part of the laity, the CEBs provided 
a setting in which they could participate more 
actively in both the life of. the church and 
social action. Cardinal Arns of São Paulo, 
who presides over the region of greatest 
concentration of CEBs in the country (Adri- 
ance 1986, p. 139), emphasizes the impor- 
tance of the religious nature of the base 
communities to the laity: 


People do not come to the base communities 
when there is no praying and singing. They 
come four or five times to organize practical 
things, but nothing further will come of it. 
When, however, people pray and sing, when 
they feel themselves together, when the gospel is 
read and, on this basis, concrete actions are 
organized and the national situation is analysed, 
then the groups remain united. (Emphasis added; 
Lernoux 1987, p. 26) ' 


The explanation for the rapid growth of the 
CEBs lies in the fact that a specifically 
religious critique of the status quo was 
attuned to the needs of the Brazilian lower 
class. i 
The formation of the CEBs was a crucial 
event in the institutionalization of the reli- 
gious critique of the status quo for at least two 
reasons. First, an institutional means was 
provided by which the laity could affect the 
decisions. of the hierarchy. The CEBs are the 
only place within the church where the 
hierarchy must listen to the needs, desires, 
and wisdom of the laity. Thus, the CEBs, in 
organizing the interests of the poor laity, 
serve to institutionalize the preferential option 
for the poor advocated by the CNBB. 
Second, while formally under the authority 
of the local bishop, in practice the church 
hierarchy has little control over the CEBs. 
Because the CEBs were established: in 
response to a situation in which the church 
had few resources, the communities are 
largely self-supported and self-directed. In the 
CEBs, lay leaders emerge who are responsi- 
ble for coordinating community social and 
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religious activities. In addition, since 1975 
there have been national meetings of base 
communities, resulting in an informal inter-. 
community network, which provides commu- 
nication and mutual support independent of 
the hierarchy and its resources. This means 
that if the church hierarchy, at some point, 
should revoke its oppositional stance, the 
CEBs would be able to continue their 
opposition, though they would certainly be 
more vulnerable to state repression. 

The growth of the CEBs indicates an 
increasing religious opposition to the status 
quo by the laity. This opposition was 
recognized by the military regime as a major 
threat to social stability and was often the 
focus of repressive actions (Lernoux 1982, p. 
55). In the short run, this mobilized laity 
reinforces the strength of those bishops who 
are in active opposition to the socioeconomic 
structure of Brazilian society. In the long run, 
though, the CEBs potentially represent a 
significant new force within the Brazilian 
Catholic church that is independent of the 
control of the bishops, whether progressive or 
conservative. Thus, even if Vatican pressure 
could force the CNBB in a more conservative 
direction, the base communities would still 
have sufficient autonomy to maintain an 
oppositional stance. 


THE BRAZILIAN CHURCH IN 
COMPARATIVE PERSPECTIVE 


The critical change that ultimately led to the 
institutionalization of an anti-status quo 
position within the Brazilian church has been 
identified in this paper as a decline in popular 
support. This decline resulted from a combi- 
nation of increasing religious and ideological 
competition and a lack of resources. In order 
to demonstrate that the loss of popular support 
crucially determines whether a national Cath- 
olic church will challenge the status quo, the 
level of competition and availability of 
resources in Brazil will be compared to the 
situations in Argentina, Chile, and Colombia.5 


5 When discussing the Brazilian case alone, I` 
focused on data from pre-1952, since I argue that 
the founding of the CNBB in that year was a 
response to an organizational crisis. In this 
comparative section, however, post-1952 data is 
included to demonstrate, in the cases of Argentina 
and Colombia, that the environment has yet to 
provoke similar crises for their churches. 
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Table 1. Number of Protestants in Thousands and as a Percentage of the Total Population of Four Latin American 


Countries— 1916 to 1961 


Nation 1916 1925 1938 
Argentina 
Total 6.9 11.3 38.3 
Percentage 0.08 0.11 0.28 
Brazil 
Total 50.3 101.5 241.1 
Percentage 0.20 0.35 0.61 
Chile 
Total 6.3 11.6 99.5 
Percentage 0.18 0.29 2.03 _ 
Colombia i 
Total 0.4 3.4 3.0 
Percentage 0.01 0.05 0.03 


1949 1952 1957 1961 
229.1 251.7 364.4 414.3 
1.37 1.42 1.91 2.00 
1657.5 1601.0 1755.9 4071.6 
3.27 2.91 2.75 5.66 
264.7 370.0 370.4 834.8 
4.44 5.87 5.19 10.76 
25.7 27.4 45.4 92.7 
0.23 0.23 0.32 0.58 





Sources: Damboriena 1963, pp. 16, 25; Wilkie 1987, pp. 74-76. 


These four countries have been chosen 
because their churches represent the extremes 
on a continuum ranging from support to 
opposition toward the status quo. Brazil and 
Chile generally are regarded as the two most 
progressive churches in Latin America, while 
Argentina and Colombia are considered the 
most conservative (e.g., Lernoux 1982, pp. 
‘415-16; MacEoin and Riley 1980, p. 11). 

Perhaps the most obvious threat to the 
Catholic church’s religious monopoly was the 
growth of rival Protestant churches. Table 1 
contains data on the absolute and relative 
number of Protestant inhabitants in the four 
countries between 1916 and 1961. Through- 
out the entire period, the Protestant churches 
in Brazil and Chile were significantly larger 
than those in Argentina and Colombia. In 
1952, the year of the founding of the CNBB, 
there were proportionately twice as many 
Protestants in Brazil as Argentina and nearly 
15 times as many as in Colombia. In turn, the 
relative number of Protestants in Chile 
generally has been about double that of 
Brazil. Thus, the level of Protestant threat 
was significantly greater in the two countries 
that produced progressive Catholic church 
movements. 

Leftist political movements also competed 
with the church for adherents and influence in 
Latin America. Before its legality was 
revoked in 1947, the Brazilian Communist 
Party was the largest on the continent with 
150,000 members (Bouvier 1983, p. 91). 
Though official party membership subse- 
quently declined, the growth of peasant 
leagues and the ever more leftist orientation 
of President Goulart during the 1960s were 
sufficiently threatening to provoke a military 


coup in 1964. In Chile, a coalition of 
Communists and socialists came within a few 
percentage points of winning the presidential 
election in 1958. In 1970, the Marxist 
Salvador Allende, with a plurality of the vote, 
did win the presidency, which provoked a 
military takeover in 1973. In Colombia, 
however, the political battles were waged to 
the right of center between the Conservative 
and Liberal parties. The years of “La 
Violencia," ended in 1955 with a power- 
Sharing agreement between the two parties, 
effectively eliminating radical groups from 
Colombian politics. In Argentina, the major 
political threat to the status quo did not come 
from the left, but rather from the Peronists, 
whose populist corporatism found support 
among the urban working class. Thus, the 
churches of Brazil and Chile experienced 
significantly higher levels of competition 
from leftist political movements than did 
those of Argentina and Colombia. 

Two resource variables are important. The 
first is the proportion of priests in the 
population. Traditionally, priests were the 
only Catholics qualified to carry out evange- 
listic activities. Thus, the proportion of 
priests in the population critically determines 
whether the church can successfully counter 
the prosyletizing efforts of other groups, 
without either restructuring itself or changing 
its stance toward the status quo. The second 
variable is the nature of the church/state 
relationship. The crucial dimension is the 
extent to which the state provides resources 
(legitimation, finances, legal advantages, 
etc.) to the church. When the state supplies 
such resources, the church is less likely to 
seek alternatives to the status quo, because 
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any attempt to alter the status quo potentially 
can restrict the flow of state resources. In 
their absence, the church is pressed to change 
its structure to generate greater resources 
and/or to alter its traditional support of the 
status quo in order to counter the appeal of 
competing groups. 

Brazil proportionally had the fewest avail- 
able priests to carry out evangelism, while 
Chile and Colombia had the most (see Table 
2). When change is considered, Argentina 
and Colombia witnessed a decrease in the 
number of people served by each priest. In 
Brazil and Chile, on the other hand, the 
number served by each priest grew signifi- 
cantly. The churches in Brazil and Chile, 
then, were faced with the steadily decreasing 
availability of priestly resources to meet 
institutional threats, while Argentina and 
Colombia had actually improved their posi- 
tions. 

In order to assess the nature of the church’s 
relationship with its respective state, it is 
necessary to provide brief historical sketches 
for each country. Obviously, much important 
detail must be omitted. 

While the Brazilian church was about to 
experience separation from the state, the 
Colombian church had its official status as the 
national religion restored in the Concordat 
signed by the Vatican and the Colombian 
State in 1888. The Concordat also granted 
privileges that allowed the church to become 
the primary educator in the country. By the 
1950s, this educational system was the largest 
in Latin America with the church owning 
nearly 75 percent of all secondary schools in 
Colombia. In 1953, two-thirds of the national 
territory was designated Catholic Missions 
Areas. In these regions, the only legal 
religious activities were ones pertaining to the 
Catholic church, and state officials ordered all 
non-Catholic groups to cease functioning. In 
addition, President Gomez publicly linked 
Protestantism and communism, and during 
the years 1948-53, 54 Protestant schools 
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were closed by the government (Mecham 
1966, pp. 115-38). Thus, the Colombian 
state not only provided resources that strength- 
ened the Catholic church, but also acted to 
weaken its competitors. 

In Argentina, the church has neither been 
established as the state religion nor explicitly 
separated from the state. Until about 1943, 
the Argentine state steadily decreased the 
flow of resources to the church. With the 
coming to power of the Peronists, however, 
the situation was reversed. Most significantly, 
the right to provide religious instruction in 
federally funded schools was returned to the 
church. In return, the church hierarchy urged 
faithful Catholics not to vote for anti-Peronist 
parties in the 1946 and 1948 elections. The 
church/state alliance was an uneasy one, 
though, with Peron fearful of church ties to 
the Conservative oligarchy and the church 
leery of Peron’s appeal among the working 
class. After 1954, Peron reversed himself and 
launched an aggressive attack on the church, 
culminating with a proposal to officially 
disestablish the Catholic church. The church 
counterattacked by excommunicating Peron 
and his top officials. Catholic opposition was 
instrumental in forcing Peron to abdicate in 
1955, which increased the church’s status. 
The new military government rewarded the 
church for its anti-Peron activities with new 
laws increasing the church’s role in the 
educational system (Mecham 1966, pp. 
245-51; Herring 1961, p. 691). The Argen- 
tine church has remained closely tied to the 
military, which has held the dominant politi- 
cal power during the last 20 years. Military 
chaplains draw two salaries, one as a priest 
and one as a military officer. This material 
incentive has helped produce one military 
chaplain for every 1850 military personnel, 
while the ratio is approximately one for every 
5400 inhabitants for the population as a whole 
(LARR 12 March 1987, p. 2; Wilkie 1987, p. 
203). In addition, military governments in 
1977 and 1979 authorized salary increases for 


Table 2. Inhabitants (in thousands) per Catholic Priest in Four Latin American Nations— 1912 to 1966 


Nation 1912 1945 1950 
Argentina 4.5 5.1 4.3 
Brazil 5.7 6.8 6.9 
Chile 2.1 3.2 3.3 
Colombia 3.8 3.8 3.7 


Source: Labelle and Estrada 1964, p. 79. 


% Change 
1955 1960 1966 1912-1966 
4.3 4.3 43 —4.496 
6.8 6.4 7.1 24.6% 
3.5 3.1 3.6 71.495 
3.7 3.6 3.3 — 14.3% 
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bishops and granted them government pen- 
sions (LARR 12 March 1987, p. 2). 

Disestablishment occurred in Chile with 
church support. As in Brazil, many Chilean 
Catholics, including the archbishop, believed 
that separation of church and state would 
benefit both institutions. That the church 
offered so little resistance to separation 
reflects the Jow level of state resources it was 
receiving. One important result of disestablish- 
ment was the breaking of the traditional link 
between the church and the Conservative 
party. Financial independence and the break 
with the Conservative Party permitted the 
church to become the first Latin American 
Catholic church to pursue reformist social 
policies (Mecham 1966, pp. 217-24). 

Thus, as measured both by level of 
institutional threats and by institutional re- 
sources to meet those threats, the Catholic 
churches of Brazil and Chile were signifi- 
cantly more vulnerable than the churches in 
Argentina and Colombia. Brazil and Chile 
had large Protestant populations and growing 
leftist political threats. At the same time, the 
Brazilian and Chilean churches had diminish- 
ing personnel and were receiving little state 
support. The situation in Argentina and 
Colombia was very different. In both coun- 
tries, the churches had high levels of 
personnel and state resources to meet rela- 
tively low levels of threats from Protestantism 
and leftist political groups. Thus, high levels 
of institutional threats and low levels of 
resources in Brazil and Chile are correlated 
with progressive Catholic churches, while 
low levels of threats and high levels of 
resources in Argentina and Colombia are 
correlated with conservative churches. This 
confirms a principal argument of this paper, 
that the church's increasing inability to 
maintain its level of societal influence was the 
critical change that prompted it to seek 
innovation, which ultimately led to the 
institutionalization of a religious critique of 
the status quo. 


SUMMARY AND CONCLUSIONS 


This paper argues that the Catholic church in 
Brazil began to radicalize when a sector of the 
church elite realized that support for the status 
quo was producing a steady decline in popular 
allegiance to the church. This institutional 
threat provided greater freedom for the 
hierarchy to innovate in order to marshal the 
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resources necessary to defend its position. 
Bishops who previously were critical of the 
status quo on religious grounds were able to 
seize the opportunity to institutionalize their 
concerns to make them part of the official 
position of the whole church. 

The two most important changes that 
institutionalized a religious opposition to the 
status quo were the establishment of the 
CNBB in 1952 and the CEBs beginning in the 
1960s. Both provided the independence and 
resources necessary to support those groups 
(from all levels—bishops, priests, and laity) 
which believed that the religious values of 
community, caring, and economic justice 
were threatened by capitalist development. 
This religious critique was formalized as the 
“preferential option for the poor" and pro- 
moted as "liberation theology." 

The religious critique of the status quo was 
not a new phenomenon. What was new was 
that opposition had replaced legitimation as 
the defining characteristic of the church's 
relation to society. This change occurred 
when (1) continuance of the status quo 
threatened organizational maintenance, and 
(2) an oppositional stance became institution- 
alized within the church. 

The first condition was tested by compar- 
ing the Brazilian situation to that of Argen- 
tina, Chile, and Colombia. This analysis 
confirmed that where high levels of competi- 
tion and low levels of resources combined to 
threaten the church (Brazil and Chile), the 
church adopted an oppositional stance. Con- 
versely, where low levels of competition and 
high levels of resources existed (Argentina 
and Colombia), little pressure was put on the 
church to abandon its support of the status 
quo. 

The second condition, though significant, 
has not been tested. In neither Brazil nor 
Chile were conservative members of the 
church able to prevent the institutionalization 
of an oppositional stance when the position of 
the church was threatened. Obviously, such 
organizational rigidity would hinder the 
church’s ability to adapt and thrive in 
changing social circumstances. One might 
expect that if strong competing groups should 
emerge in Argentina or Colombia, the 
church's strong ties to the state could help 
conservative elements to block the institution- 
alization of a religious opposition to the status 
quo. It may have been just this kind of tie to 
the status quo structures that prevented the 
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European Catholic church from radicalizing 
during the Protestant Reformation. 

One important question is the extent to 
which this process of organizational redirec- 
tion toward an oppositional stance is general- 
izable over time, space, and type of organiza- 
tion. For example, will the oppositional 
stance of the Catholic church in Brazil and 
elsewhere diffuse across national boundaries, 
so that other Catholic churches might radica- 
lize in an entirely different process? Organi- 
zations that do not contain an inherent critique 
of the status quo will find it very difficult to 
legitimate a move from a supportive to an 
oppositional stance toward the status quo. 

There are other pragmatic, but no less 
theoretically interesting questions, such as 
how far the radicalization of the Brazilian 
church will proceed; under what conditions 
the oppositional stance could be reversed; and 
what the implications would be for the church 
as a whole if a division should develop 
between the CNBB and the CEBs. In Brazil, 
the close cooperation between CEBs and such 
organized oppositional movements as the 
Partido dos Trabalhadores (Workers’ Party) 
raises a more fundamental question. How is 
the church to balance the need to act 
concretely on its oppositional stance with the 
desire to be the church of all classes? Will 
future organizational transformation be in a 
conservative direction or will the church 
deepen its oppositional stance? Or will the 
church attempt to follow some middle path 
that risks alienating sectors of Brazilian 
society at both ends of the political spectrum? 
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In this ecological study we show that the particular curvilinear formulation of the 
relationship between density and the failing rate of organizations proposed by 
Hannan and Freeman does not hold for the California wine industry between 1940 
and 1985. We demonstrate instead that an alternative model focusing on prior 
foundings and prior failings gives a superior account of the failing rate in this 
population. We argue that Hannan and Freeman’s density-dependent model of 
organizational mortality systematically overstates the negative effects of 
illegitimacy on business organizations. We argue further that their model 
underestimates business organizations’ ability to avoid elimination by migrating to 
neighboring niches and by enlarging their initial niche. 


INTRODUCTION 


Research in organizational ecology has been 
faulted for its lack of generalizability. In an 
effort to demonstrate the generalizing poten- 
tial of the field, Hannan and Freeman have 
recently set an agenda that ascribes great 
importance to the concept of population 
density. They argue in favor of a very specific 
formulation of density dependence applicable 
to organizational populations in general (1988, 
p. 20). In their formulation, both foundings 
and failings of organizations are a complex 
function of the changing “density” of the 
population, its size relative to the carrying 
capacity of the pertinent niche.! In this 
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! We use the term "density" the same way that 
Hannan and Freeman do. They do not distinguish 
clearly between population size and density as 
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paper, we examine the relationship between 
density, on the one hand, and the failing rate 
(or “death” rate) of organizations on the 
other. Our investigation of the California 
wine industry indicates that this particular 
formulation of density dependence is not 
universal; it obscures dynamic processes of 
prior foundings and prior failings which 
together better account for the observed 
failing rate of wineries. Based on this 
instance— which we think is exemplary — we 
argue that the formulation of density 
dependence advocated by Hannan and 
Freeman is inapplicable to business organiza- 
tions in general. We propose other directions 
more suitable to the shared objective of 
extending the generalizability of organiza- 
tional ecology. 


commonly used to denote a ratio. This departure 
from common usage may turn out to be theoretically 
important to Hannan and Freeman’s formulation. 
When they describe the acquisition of legitimacy 
by a new organizational form, only population 
size seems to matter. This is because legitimacy 
is not a finite quantity. However, when they 
discuss the effect of numbers in the context 
of resource competition, they seem to refer to 
density in its conventional meaning because 
resources are, by definition, finite. Hence, the 

of their U-shape curve, describing 
the relationship between “density” and the failing 
rate, may be undergirthed by terminological 
slippage. à 
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Organizational ecologists’ interest in density 
dependence is traceable to animal ecology 
(MacArthur and Wilson 1967) though it also 
has roots in sociology (Hawley 1950 and 
1986, pp. 24-25; Stinchcombe 1965, p. 153). 
Hannan and Freeman (1986b, 1988, and 
forthcoming) report a sturdy relationship 
between density and deaths (or failings) of 
social organizations. Rather than the usual 
monotonic positive relationship in biological 
ecology and commercial competition, they 
find a curvilinear relationship with a changing 
U-pattern of signs from negative to positive. 
They report this consistent curvilinear pattern 
for three populations of American labor 
unions, newspapers, and semi-conductor firms, 
observed from their historical origins. They 
argue that at low levels of density, each 
increment in density lowers the probability of 
failing while at high levels of density, each 
increment in density raises the probability of 
failing. Operationally, this formulation im- 
plies a regression equation incorporating both 
a linear term and a second-order quadratic 
term of the density variable. The coefficient 
attached to the linear term must be negative 
and significant and larger than the coefficient 
associated with the quadratic term which must 
be positive and significant. Moreover, the 
magnitude of these coefficients must fall 
within a reasonable range. That is, after 
controling for the joint effect of other 
variables on the failing rate, the effect of 
density must be consonant with the observed 
range of values on this variable. In this paper, 
we mostly dispense with considerations of the 
relative magnitude of the coefficients associ- 
ated with the two density terms because the 
results do not pass the prior tests of direction 
and of significance of effect. 

The second, ascending segment of the 
Hannan and Freeman curve is essentially 
unproblematic. It is explained by the opera- 
tion of competitive processes in an environ- 
ment with a finite carrying capacity. To 
explain the descending segment of the curve, 
Hannan and Freeman borrow from institution- 
alization theory (Meyer and Rowan 1977; 
Scott and Meyer 1983). They describe novel 
kinds of organizations as lacking “legit- 
imacy" in the sense that there is little 
congruence between their activities and the 
expectations of relevant parties (Dowling and 
Pfeffer 1975). They argue further that “growth 
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in numbers of organizations gives force to 
claims of institutional standing" (Hannan and 
Freeman 1988, p. 25). In support of this 
thesis, Singh, Tucker, and House (1986) find 
a negative effect of density on the failing rate 
of voluntary service organizations which they 
interpret as fitting the descending segment of 
this U-shaped curve. 

In an ecological context, the Hannan and 
Freeman legitimacy thesis requires a shift of 
attention from concrete organizations to the 
historical abstraction of organizational form 
(the latter term is developed in Hannan and 
Freeman 1986a). The idea that new organiza- 
tional forms lack legitimacy —also prefigured 
in Simon (1957) and in Stinchcombe (1965) — 
is forcefully illustrated in their first report of a 
study of American labor unions: 


as the organizational means of . . . disadvan- 
taged groups, they have special problems of 
legitimacy. . . . [they spark] intense opposition 
and repression. . . . when unions are few, the 
collective voice of leaders of the labor move- 
ment is diminutive. (Hannan and Freeman 
1986b, p. 10) 


ORGANIZATIONAL LEGITIMACY 
AND NUMBERS 


Hannan and Freeman's formulation of the 
relationship between the fate of organizations 
and their numbers is attractive on -two 
grounds. First, it contributes to our understand- 
ing of the replacement of old organizational 
forms by new ones over time, a phenomenon 
largely neglected by the economic and 
business disciplines. Secondly, this formula- 
tion breaks with the simpleminded econo- 
mism that dominates conceptualizations of the 
relationship between similar organizations. 
The idea that a new organizational form 
initially lacks legitimacy, intuitively self- 
evident for 19th-century labor unions, may 
seemingly be extended to all types of 
organizations, including noncontroversial ones. 
This idea is reducible to two nested proposi- 
tions. First, to subsist, organizations must be 
allowed to extract resources from their 
environment, even against environmental 
resistance. Secondly, this ability usually 
presupposes that organizations are legally 
permitted to exist. In most societies, organi- 
zations may not be created without the 
explicit assent of civil authorities. Further- 
more, in market-oriented societies, credit is 
often a precondition of continued existence. 
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In such societies, organizations of a new kind 
may find borrowing difficult because the risk 
associated with their operations cannot be 
assessed. Finally, gaining enough cooperation 
from individuals to recruit into and to staff 
novel kinds of organizations is a problem if 
they do not understand the function of such 
‘organizations. Thus, in modem societies, 
most organizations must obtain a state charter 
and financial as well as human resources. 
Private, capitalist firms may be more depen- 
` dent on credit, while public agencies and 
‘ voluntary associations may be more con- 
strained by the obligation to obtain official 
validation. In Hannan and Freeman's scheme, 
semiconductor firms (Brittain and Freeman 
1980) and nonprofit social service organiza- 
tions (Singh et al. 1986) need social legiti- 
macy no less than did 19th-century labor 
unions and newspapers. They argue (1986b, 
1988, and forthcoming) that mere increases in 
numbers heighten the visibility of new 
organizational forms, thus making them more 
familiar and augmenting their political clout. 
Both processes lead to an improved capability 
to extract resources from the environment and 
thence to enhanced survival chances for 
individual organizations of the new form. 
However attractive this conceptualization 
of organizational populations, as first fertil- 
ized by legitimacy and then decimated by 
overcrowding, it may not be pertinent to most 
business organizations in modern, market- 
based societies. We suspect that Hannan and 
Freeman overestimate both the decisiveness 
with which organizations lacking in legiti- 
macy are stricken and the fixedness of niches 
or resource spaces. These overestimations 
lead thém to two doubtful conclusions. On the 
one hand, force of numbers overcomes 
alleged environmental resistance to the emer- 
gence of new forms; on the other hand, once 
an organizational form has become estab- 
lished, the pressure of numbers against an 
inflexible environment causes massive mortal- 
ity in the relevant population. This scenario 
may constitute a blinder because density 
conceals dynamic processes of entries into 
and exits from the population. As noted by 
Carroll (1984, pp. 75-85), the frequency of 
such entries and exits is often considerable. 
The population-level consequences of such 
population dynamics of foundings and fail- 
ings are obscured by the focusing on (raw) 
density measures that Hannan and Freeman 
advocate. The subsuming of numbers of 
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entries and numbers of exits into a single 
density measure masks possible causal asym- 
metries in the two processes (i.e., for a given 
density count, one organization not dying is 
treated as equivalent to an organizational 
birth). i 

The second problem pertains to the likely 
fate of organizations threatened with being 
crowded out of their initial niche. They may 
be simply eliminated, in which case growing 
density raises the death rate as the second 
segment of Hannan and Freeman’s curve 
implies. However, elimination is not the only 
possible outcome of overcrowding. Organiza- 
tions may respond to the pressure of numbers 
by altering slightly the nature of their 
activities, thereby collectively enlarging the 
initial organizational niche (Hawley 1986, p. 
56), or by migrating laterally to neighboring 
niches (Swaminathan and Delacroix 1988). 
The more this response occurs, the more the 
relationship between density and the death 
rate must approximate zero. We argue that 
modern societies constitute flexible enough 
environments that this response to rising 
numbers is as common as elimination. 


The Legitimacy of Business 
Organizational Forms 


Contrary to Hannan and Freeman’s imagery, 
in developed, market-oriented societies, gain- 
ing state legitimation is largely a routine 
matter for most business organizations. Some 
activity sectors are defined as illegitimate by 
law (e.g., prostitution). Others are closed to 
private parties by law or custom or both (e.g., 
the military, telecommunications in most 
countries). The rules of the game are fairly 
clear. Legal legitimation is usually not an 
obstacle to the emergence of new business 
forms because of the diffuse belief that 
profit-seeking activity is valid, unless other- 
wise specified. The institution of the market 
itself legitimizes such endeavors a priori. The 
question of legitimacy relative to labor and 
financial markets is only slightly less tracta- 
ble. We must first realize that to initiate a 
novel organizational form, it is not necessary 
that numerous instances of the new form 
should appear. The first few instances can be 
supported by private resource networks (e.g., 
uncle’s savings and brother-in-law’s labor; 
see Rao and Delacroix 1988). The likelihood 
of occasional success at tapping public or 
institutionalized networks also must not be. 
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underrated. (The same banks that lent vast 
sums to the Argentine generals with a straight 
face might, occasionally, extend a loan to a 
well thought-out business plan, even though it 
may pertain to a novel idea.) As for staffing 
new kinds of organizations, in all modern 
societies, there are a few individuals who will 
go to work for a new project because it 
appears risky. In short, the nature of 
institutionalized resource networks does not 
mandate a low probability of business organi- 
zational innovation. Once a few instances 
have emerged, imitation does or does not take 
place on a large scale (DiMaggio and Powell 
1983). The new form may not survive, but 
then it may. 

The plausibility of this quasi-random sce- 
nario is obfuscated by the simple fact that all 
ecological studies to date bear the stigma of 
selection on the dependent variable of sur- 
vival of the form. The organizations we study 
have endured in large numbers for a long 
time. They have all passed the crucial test of 
the birth of the organizational form. Because 
many ephemeral and failed organizational 
forms are hidden from view, we are wont to 
underestimate the randomness of the pro- 
cesses by which new forms are generated and 
retained. Hannan and Freeman’s legitimacy 
formulation could be rooted in a tendency to 
‘rationalize what may be a fairly random 
process, given an institutionally broadly 
favorable environment. In this connection, we 
note that of the three populations on which 
they base their formulation, one (labor 
unions) clearly does not benefit from this 
broad-based market-conferred legitimacy. An- 
other (newspapers) emerges in one of the 
most societally chaotic times and places 
(post-gold rush San Francisco), hence under 
circumstances where normal ^legitimacy- 
granting processes may have been disrupted 
or not yet established. Thus, these two 
populations may constitute fairly exceptional 
instances in which a dearth of legitimacy is 
overcome through numbers. 


Lateral Migration to Neighboring Niches or 
Niche Enlargement 


As they underestimate the institutional favor- 
ableness of the environment for business 
organizations, Hannan and Freeman may 
overestimate the rigidity of developed socie- 
ties’ economies. It is easy to visualize the 
carrying capacity of a particular niche as 
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strictly limited, in the short run. The very 
process of imitation described above would 
be enough to guarantee such boundedness, 
again in the short run. Hence, in the absence 
of obstacles due to low legitimacy, a new 
niche, as initially defined, should quickly 
become overcrowded. However, competitors 
are seldom obligated to remain in their place 
in the original niche. There are three escape 
routes: an organization can outgrow the 
competition, or it can move laterally to a 
neighboring niche, or it can contribute to the 
enlargement of the initial niche. To assume 
the obligation to remain in place is to affirm 
the scarcity of neighboring niches, or a 
fundamental inability of organizations to 
effect the switch, or the impossibility of 
enlarging a niche. We argue that neither this 
scarcity nor this inability obtains widely for 
business organizations in developed societies 
and that initial niches are routinely enlarged. 

Most business organizations supply prod- 
ucts and services whose prices are small 
relative to the purchasing power of potential 
buyers, a six-dollar bottle of wine, for 
example. Similarly, office computers are 
inexpensive, relative to many companies’ 
sales volumes. Moreover, much spending is 
discretionary or nearly so. (It is unclear 
whether many of the companies that bought 
computers in the past few years need them or 
even use them.) One may purchase a bottle of 
wine every day, every month, or every year. 
The demand for products embedded in an 
industrial production chain may be more 
rigid. However, establishments thus special- 
ized constitute a small percentage of all 
business organizations (U.S. Bureau of Cen- 
sus 1982). A high degree of expenditure 
indifference and the shiftiness of demand, 
together with the frequent emergence of new 
markets (Stinchcombe 1965, p. 153) evoke 
environmental looseness rather than the envi- 
ronmental rigidity implicit in the Hannan and 
Freeman (1986b, 1988) model. Environmen- 
tal looseness, in turn, is propitious to the 
formation of new niches and the extension of 
existing niches. In either case, organizations 
may escape the fatal effects of overcrowding 
in the initial niche by engaging in slightly 
different activities, that is, unless their own 
rigidity forbids such a course. 

In their past writings, Hannan and Freeman 
(1977, 1984) have argued forcefully in 
support of such rigidity. Following Stinch- 
combe (1965), they maintain that structural 
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inertia severely limits lateral mobility and that 
the very process of ecological selection 
promotes structural inertia (1984). This line 
of argument was born in reaction to the 
dominant school of Strategic Management 
which implicitly presents business organiza- 
tions as sagaciously adapting under the firm 
hand of omniscient, omnipotent, and unrea- 
sonably rational executives... But to argue 
against structural inertia is not to argue for 
voluntarism. A third approach views lateral 
moves to a neighboring niche and niche 
enlargement as the probabilistic results of 
random groping, along the lines of the 
“garbage can” model of decision making 
(Cohen, March, and Olsen 1972). 

If the environment is rich in neighboring 
niches, overcrowding in one niche should 
promote random agitation, some of which 
must result in the chance encounter of an 
organizational capacity with an appropriate 
niche. To argue otherwise is to affirm a strict 
one-to-one correspondence between niche and 
organizational characteristics and, thereby, to 
endow organizations with an initial high 
degree of specialization they are unlikely to 
possess (unless they are designed by omni- 
scient, omnipotent, and unreasonably rational 
executives). Our alternative formulation thus 
requires some organizational flexibility, in 
terms of policy and in terms of structure, for 
random groping to result in viable matches. 
Such variability is not uncommon. A knit 
manufacturer can switch from ski hats to leg 
warmers without much travail. Such a move 
does not involve tampering with its “technical 
core” (Thompson 1967). In the population of 
California wineries, 40 percent of all estab- 
lishments begin with a capability to deliver 
two or more distinct products (e.g., fortified 
wines, sparkling wines) to the market. 
Probabilistically, some business organizations 
must be able to migrate from an overcrowded 
niche to one of many neighboring ones 
without lethal damage. Hence, a low proba- 
bility of success for each attempted lateral 
move is compatible with an overall relaxation 
of crowding in the original niche. This 
relaxation of crowding, in turn, means that 
the positive effect of rising density on the 
propensity to fail is lowered. The above 
discussion is couched mostly in terms of 
organizations’ ability to migrate to neighbor- 
ing (separate) niches. The implications are 
identical if we think, instead, in terms of an 
enlarged and diversified initial niche. 
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LEGITIMACY AND COMPETITION IN 
THE CALIFORNIA WINE INDUSTRY 


We propose that for business organizations, 
lack of legitimacy is seldom a factor of 
mortality and that competition is as likely to 
lead to lateral migration as to elimination. 
The wine industry is a favorable locale for the 
study of this alternative formulation, from the 
dual standpoint of legitimating and of compet- 
itive processes. 

A wine industry existed in the United 
States at least from the middle of the 19th 
century (Adams 1978, p. 579). The imposi- 
tion of national Prohibition from 1919 to 1934 
destroyed that industry in a way especially 
germane to the Hannan and Freeman formu- 
lation. Prohibition constituted a massive and 
nearly total withdrawal of legal legitimacy. 
The rescinding of Prohibition after 15 years 
did not imply the overnight restoration of 
industry legitimacy. Some fraction of the 
American population continued to consider 
wine evil, as evidenced by the fact that 
numerous localities voted to remain dry. 
According to Adams (1978, p. 9) as late as 
the 1970s, more than half the states “made it 
difficult” to buy wine through sumptuary 
taxes and artificial obstacles to distribution. 
Thus, in the wine industry, we have a test 
organizational population, a population which 
undeniably suffered a dearth of legitimacy 
during the period of its rebirth. If it were the 
case that rising numbers in a new business 
population lower the death rate by improving 
legitimacy, such an effect should be espe- 
cially evident for the wine industry. 

This industry is also a favorable locale to 
test the thesis that business organizations 
escape competitive pressures through lateral 
moves into neighboring niches (or, equiva- 
lently, through expansion of the initial niche) 
because there is substantive evidence for the 
existence of a neighboring niche (or for niche 
expansion) in that industry. During Prohibi- 
tion, though wineries went out of business, 
Americans drank homemade wine in vast 
quantities. The products developed by home 
fermenters tended to be sweet and high in 
alcohol (Teiser and Harroun 1984, p. 69). 
During the period following the repeal of 
Prohibition and to date, however, a strikingly 
new consumption pattern emerges. At the 
beginning of the period, Americans drink 
mostly dessert wines and fortified wines 
(including vermouth), and sparkling wine 
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consumption is low. By the end of the period, 
consumption of all categories of wines has 
increased sixfold, but some categories have 
increased disproportionately. National con- 
sumption figures are indistinct, but import 
figures show clearly this qualitative trend: in 
1940 (tart) table wines constituted only 16 
percent of total volume, in 1985 nearly 85 
percent; in 1940 fortified wines accounted for 
77 percent of imports, and in 1985 they had 
declined to less than 4 percent. Yet, it would 
not be correct to say that one kind of 
consumption replaced another. Again, import 
figures tell another story: in spite of their 
precipitous relative decline, fortified wine 
imports increased by 360 percent in absolute 
terms during this period (Delacroix and Solt 
1987). Domestic consumption figures for 
sparkling wines show a similar trend: the ratio 
of sparkling to all still wines increased 
fortyfold during the same period (Delacroix 
and Solt 1987; Teiser and Harroun 1984, p. 
78). 

In effect, a new way to make a living or a 
new niche appeared for wineries that was 
simply a previously nonexistent market seg- 
ment. It did not proceed from any voluntaris- 
tic action taken by the industry collectively 
but from exogenous factors and especially 
from the influence imports exert on the 
quality of consumer demand (Delacroix and 
Solt 1987, 1988). Exploitation of the new 
niche depended on the same basic technology 
as exploitation of the old niche. Fixed 
investment in particular grape varieties may 
have impeded wineries’ ability to move into 
the new niche to some extent. However, at 
least 30 percent of establishments have no 
investment in vineyard. Hence, for many 
wineries, a lateral move to the new neighbor- 
ing niche involved neither disruption of their 
“core” (Singh et al. 1986; Scott 1987, p. 
211) nor a major overhaul of their competen- 
cies (McKelvey 1982). 

Delacroix and Solt (1987, 1988) show that 
the emergence of the new, import-induced 
niche has a positive effect on the propensity 
to found California wineries. In this formula- 
tion, spells of winery foundings signal the 
emergence of the new niche. Since it is 
relatively easy for wineries to switch from the 
old to the new niche, the emergence of this 
niche should also improve the longevity of 
existing wineries, thereby decreasing the 
failing rate of the whole population. Then, 
any negative relationship between foundings 
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and the failing rate is indirect evidence that 
migration to the new niche decreases the 
failing rate. In Figure 1 we provide evidence 
in support of this position. The bottom curve 
shows the log survivor plot of all California 
wineries that do not produce either table wine 
or sparkling wine. The top curve shows the 
log survivor plot of establishments that do 
produce table wine and/or sparkling wine. In 
our formulation the latter categories effected 
the lateral switch to a new, import-led 
neighboring niche. Accordingly, wineries in 
this category exhibit higher longevity, as 
would be the case if lateral moves afforded 
escape from overcrowding. If this alternative 
formulation for business organizations were 
correct, we should also find that this 
relationship is stronger than any of the density 
effects predicted by Hannan and Freeman. 


THE CALIFORNIA WINE INDUSTRY AS 
AN ECOLOGICAL POPULATION 


This study covers the whole California wine 
industry from 1940 to 1985. The sometimes 
prosperous industries of other states have not 
been considered in order to sidestep the 
complicating effects of 49 additional jurisdic- 
tions in exchange for a small gain in 
population size. At all times, California 
accounts for a very high proportion of the 
national wine industry. According to the 
industry expert Moulton (1984, p. 382), in 
1982 California had 88 percent of all 
American wine vineyard acreage and 92 
percent of its production. The latter figure is 
in line with Adams’s estimate (1978, pp. 
39-40) that California produced 86 percent of 
all U.S. wine in the 1950s. The remaining 
production was extremely fragmented with 
the runner-up state, New York, accounting 
for only 7 percent of total U.S. production. 
The beginning of our period of observation 
is 1940, rather than 1934, the first year 
following the end of Prohibition. This is 
imposed by data limitations. Complete records 
containing the kind of detailed, exhaustive, 
industry wide data our study requires begin 
only in 1940. The loss of six years is not 
significant because that is the duration 
required to plant a vineyard of selected wine 
grape varieties, to harvest them, and to 
ferment, age, and commercialize wine (Dela- 
croix and Solt 1988). Nevertheless, evidence 
shows that many wine producers were ready 
immediately upon repeal. This six-year gap 
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—*— DENOTES ALL OTHER ESTABLISHMENTS 
Figure 1. Log Survivor Plot for Establishments that Produce neither Table Wine nor Sparkling Wine vs. Others; 


(dashes indicate 95% confidence interval). 


poses a potential problem we discuss in a 
subsequent section. 

Since there were wineries in California 
before Prohibition, the question arises whether 
we are observing a new population (as we 
claim) or an old one stricken by a catastrophe. 
We believe that the wine industry that 
emerged rapidly upon repeal was a new 
population on‘ the triple grounds of market 
interruption, technological discontinuity, and 
profound loss of social legitimacy. 

In the 15 years between 1919 and 1934, 
distribution systems had disappeared (Teiser 


and Harroun 1984, p. 75). More seriously, 
although American consumers had imbibed 
vast quantities of (home fermented) wine 
during Prohibition (Teiser and Harroun 1984, 
pp. 50-51), their taste had shifted drastically 
to sweet, high-alcohol wines (Adams 1978, 
pp. 30-31; Teiser and Harroun 1984, p. 69). 
Post-1934 vintners were caught unprepared 
for this change (Adams 1978, p. 31). From a 
technological standpoint, the interruption was 
even more pronounced. There was a severe 
shortage of cooperage (Teiser and Harroun 
1984, p. 68). The best grape varieties had 
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been lost and American wine technology was 
badly out of date (Adams 1978, p. 31; Teiser 
and Harroun 1984, p. 69); wine studies had 
been dropped and literature on professional 
wine making had practically disappeared 
(Teiser and Harroun 1984, p. 72). A "new 
generation of wine men" had to arise; even 
new wine hybrids had to be created (Teiser 
and Harroun 1984, pp. 72-73). After 1934, 
the California wine industry took up an 
entirely new scientifically based path, diver- 
gent from that of both its predecessor and its 
European counterparts (Adams 1978, p. 1). 

Finally, and perhaps most importantly for 
the Hannan and Freeman formulation, Prohi- 
bition decisively withdrew social legitimacy 
from the industry. First, there was an obvious 
denial of legitimacy in declaring a productive 
activity illegal. This denial lasted longer than 
the dates pertaining to national Prohibition 
indicate. The latter was preceded by waves of 
state prohibitions going back to the end of the 
19th century: by 1915, 11 states were already 
dry (Friedrich and Bull 1976, p. 309). 
Secondly, the active and thriving vineyard 
operations which often become the bases of 
new wineries had been sullied by massive 
.involvement with organized crime. (Teiser 
and Harroun 1984, pp. 56-62). Thirdly, as 
home-brewed and illegal wine came to 
replace liquor during Prohibition, and as 
fortified wine gained ground, wine became 
associated in the public mind with "the 
derelicts called ‘winos’ on the skid rows of 
the nation" (Adams 1978, p. 31). 

Hutchison (1984, p. 48) states that Prohibi- 
tion "blighted" the wine industry. Adams 
(1978, p. 32) summarizes its situation in 
1935: “It was making the wrong kind of wine 
from the wrong kind of grapes for the wrong 
kind of consumers in a whiskey-drinking 
nation with guilt feelings about imbibing in 
general and a confused attitude toward wine 
in particular.” We will return later to the 
issues of correct time specification because it 
is central to the testing of the Hannan and 
Freeman formulation. 


DATA AND ESTIMATION METHODS 


The population of interest is the complete set 
of California establishments producing wine 
from grapes for commercial purposes between 
1940 and 1985. There were 1,415 separate 
establishments in activity at one time or an- 
other during this period, with a high of 629 
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and a low of 221 in simultaneous existence. 
Approximately 70 may have been cellars en- 
gaged principally in the aging of wine (a phase 
of production). Approximately 70 percent of 
the establishments owned vineyard acreage out- 
right. The population excludes grape growers 
not engaged in pressing and fermenting. 

Following Hannan and Freeman (1986b, 
forthcoming), our dependent variable is the 
death rate or failing rate of wineries. We 
consider each disappearance from the record a 
failing. There were 2 mergers and 66 
acquisitions of which 1 and 26 respectively 
entered into our estimations; of these, 1 and 
17 respectively resulted in disappearance 
from the record. Since our population unit is 
the independently producing establishment, 
this treatment is without consequence for the 
results discussed here. The coding of the 
dependent variable is further discussed in the 
Data Appendix. 

The covariates of theoretical interest are 
population density and prior foundings. Pop- 
ulation density is the number of wineries in 
existence at the end of the previous year or at 
the beginning of the current year. The density 
variable is used in both its linear and its 
quadratic forms, to follow Hannan and 
Freeman (1986b, 1988, forthcoming). Num- 
ber of foundings (t— 1) is the number of new 
wineries appearing in the record in the year 
preceding the current year. All data pertaining 
to wineries are from Wines and Vines Annual 
Directory and Wineries and Vineyard Indus- 
try Suppliers of North America, both industry 
publications by the Hiaring Company of San 
Rafael, California. 

Several fairly self-evident control variables 
are explained as they are introduced, and fur- 
ther described in the Data Appendix. In keep- 
ing with previous studies of the same genre, 
there are organization-level variables such as 
size and age, population-level variables such 
as number of failings, and environmental vari- 
ables such as volume of wine consumption, 
Gross National Product, Per Capita Personal 
Consumption Expenditures, number of busi- 
ness failures in the concurrent year of winery 
failing, and Investment in Nonresidential Pro- 
ducers Equipment, a measure of industrial in- 
vestment. We also employ a measure of chro- 
nology explained in the text. 

Following Hannan and Freeman (forth- 
coming), we estimate the effects of these 
covariates on the failing rate with maximum 
likelihood estimation techniques. 


DENSITY DEPENDENCE 


The relationship between the covariates and 
the failing rate is expressed as a Gompertz 
function with the following form: 


rát) = exp[BX<s)] exp [y t] 


where rt) is the failing rate for firm i at age 
t, X; is a vector of covariates for each firm for 
each year, B is a vector of unknown 
regression coefficients and y is the coefficient 
for organizational age. 

In the several models discussed below, the 
explanatory power of each model is repre- 
sented by a global Chi-square statistic with 
degrees of freedom equal to the number of 
covariates in the model. This statistic tests the 
hypothesis that the regression coefficients in 
the model, taken together, are not different 
from zero. In addition, an F-test against the 
hypothesis that each regression coefficient 
considered separately is not different from 
zero is computed. A positive coefficient 
implies that the associated covariate has a 
positive effect on the failing rate; a negative 
coefficient indicates the opposite. Significant 
coefficients.are at the .05 level or better. 

All the models discussed in this paper were 


253 


estimated with the RATE version 2.0 statisti- 
cal package developed by Tuma (1979). 


MODELS OF FAILING RATES IN THE 
CALIFORNIA WINE INDUSTRY 


Testing for Curvilinear Density Dependence 


We estimate first a simple version of Hannan 
and Freeman’s curvilinear density depen- 
dence model, where the failing rate is a 
function of a linear term and of a quadratic 
term of population density. This version, 
shown in Model 1, also includes a measure of 
establishment age in years at the time of 
failing, following Stinchcombe (1965) and 
several findings which indicate that age is 
inversely associated with failing (Carroll and 
Delacroix 1982; Freeman 1982; Carroll 1983; 
Freeman, Carroll, and Hannan 1983; Singh et 
al. 1986; Carroll and Huo 1988). A general- 
ized measure of establishment size, storage 
capacity measured in thousands of wine 
gallons, is also included because previous 
research suggests that size may be negatively 
associated with failing (Freeman et al. 1983; 


Table 1. Effects of Winery Size and Age, of Niche Capacity and Calendar Time, Density, Number of Foundings, and 
Number of Failings on the Failing Rate of Wineries 


Model 
Independent Variables 1 2 3 4 5 6 
Constant —3.612* — 0.2565 — 2.738 — 0.3914 0.4098 —0.0107 
(.7255) (1.70) (1.776) (1.799) (.5581) (.9764) 
Size —0.0599* —0.0621* —0.0622* —0.0626* —0.0629* —0.0626* 
(.0198) (.0203) (.0204) (.0204) (.0205) (.0205) 
Age — 0.0044 —0.0013 — 0.0026 —0.0052 —0.0057 — 0.0053 
(.0050) (.0053) (.0054) (.0054) (.0053) (.0054) 
Density 0.0041 — 0.0063 0.0010 0.0018 0.0005 
(.0038) (.0055) (.0056) (.0056) (.0010) 
(Density)?/1000 — 0.0069 0.0063 — 0.0006 —0.0017 
(.0050) (.0069) (.0068) (.0068) 
Wine consumption 0.0020 0.0015 0.0079* 0.0089* 0.0082* 
(.0024) (.0025) (.0026) (.0019) (.0024) 
Calendar time —0.0302 —0.0104 —0.0592* —0.0686* —0.0621* 
(.0205) (.0210) (.0222) (.0144) (.0190) 
No. of foundings 1-1 —0.0146* — 0.0230* —0.0219* —0.0229* 
(.0034) (.0037) (.0032) (.0037) 
No. of failings t-1 —0.0353* —0.0353* —0.0353* 
(.0041) (.0041) (.0041) 
No. of spells ` 11550 11550 11550 11550 11550 11550 
No. of events 459 459 459 459 459 459 
Chi-square 
Likelihood ratio 24.41 34.42 53.38 134.02 133.68 133.96 
Degrees of freedom 4 6 7 8 6 7 


Note: Asymptotic Standard Errors are shown in parentheses. 


* p<.05 
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Marple 1982; Aldrich 1979, p. 43), though 
somewhat inconclusively.? 

In the simple Model 1, the coefficients 
associated with the linear term and the 
quadratic term of, density are opposite the 
direction predicted by Hannan and Freeman 
and nonsignificant. This model also confirms 
that large organizations are hardy: the coeffi- 
cient associated with size is large, negative, 
and significant. This will remain true in all 
subsequent models and we will refrain from 
commenting on this effect which is abun- 
dantly discussed elsewhere (Carroll 1983, 
1984). The coefficient associated with age is 
negative but not significant. This will remain 
true below and we will refrain from discuss- 
ing this finding which is the subject of an 
intensive exploration by Aldrich and Auster 
(1986). The findings associated with organi- 
zational size and age simply confirm that our 
winery population is ordinary, from an 
ecological standpoint, rather than idiosyn- 
cratic in some unspecified way. 

In Model 2, we refine our search by adding 
a measure of national wine consumption in 
millions of gallons. This is done to control for 
the fact that the concept of density depen- 
dence is explicitly rooted in the idea that the 
environment has a limited carrying capacity 
(Carroll 1984, p. 85). We reason that the 
more wine Americans drink, the greater the 
resource space available to the population of 
wine-producing establishments, by analogy 
with Nielsen and Hannan (1977) and Hannan 
and Freeman (1978). In Model 2, we also 
introduce a variable designed to capture the 
idea that the environment of wineries may 
change over time in unspecified ways. 
Following Snyder and Kick (1979), we 
construct this variable, called “calendar 
time," as an ordinal scale with values (19)40, 
41, 42, to (19)85. Depending on how the 
initial value, (19)40, is construed, this 
variable may also be interpreted as a measure 
of organizational form maturity. 

Neither wine consumption nor calendar 
time has a significant effect in Model 2. 
Nevertheless, the introduction of these two 
variables improves the explanatory power of 
the model as the associated Chi-square jumps 
from 24.41 in Model 1 to 34.42 in Model 2, a 





? For discussions of empirical findings on this 
issue, see Freeman et al. 1983. Aldrich and Auster 
(1986) discuss the necessity to separate size and 
age effects. 
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statistically significant increase. In this im- 
proved model, the signs of the coefficients 
associated with the density terms switch to the 
pattern (—, +) required by the Hannan and 
Freeman formulation. However, both coeffi- 
cients remain statistically nonsignificant. They 
are also of equal size, contrary to the 
requirements of the curvilinear relationship 
specified by Hanran and Freeman (1986b) 
and further explicated by Carroll and Hannan 
(1988, p. 10). 


Exploring Population Dynamics Models 


In Model 3, we allow our alternative 
formulation to compete directly with the 
Hannan and Freeman formulation of density 
dependence in mortality rates. We do this by 
introducing a measure of prior foundings 
which, we argued above, proxies for the 
existence of a new niche into which wineries 
may migrate to avoid elimination. 

In Model 3, the signs associated with the 
density terms switch again, against the (—, 
+) pattern predicted by Hannan and Free- 
man, and their coefficients remain nonsignif- 
icant. Any temptation to assign meaning to 
the density coefficients in spite of their 
nonsignificance should be discouraged by the 
instability of the signs attached to them. The 
coefficient associated with number of prior 
foundings is high, significant, and negative, 
as would be the case if this variable indicated 
a process by which migration to a lateral 
niche decreased the failing rate. The Chi- 
square attached in Model 3 shows a very 
significant improvement in predictive power 
over Model 2. Hence, there are reasons to 
believe that Model 3, which shows simulta- 
neously the weakness of the density depen- 
dence formulation and the plausibility of a 
population dynamics/lateral migration formu- 
lation, is the best model presented thus far. 

In a population where both failing and 
founding rates are high, density at time f is 
partly a function of failings and foundings at 
time t—1. The California wine industry 
constitutes such a population with a mean 
annual failing rate of 5 percent and a 
mean founding rate of about 6 percent 
(implying more than a 50 percent turn- 
over between 1940 and 1985). We just saw 
that the introduction of a foundings measure 
alters radically a model constructed initially 
around a notion of density dependence. 
Hence, it is worth exploring the consequences 
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of introducing a measure of prior failings. 
We are also interested, for substantive 
reasons, in the effect of prior failings on the 
failing rate. 

In the wine industry as in many other 
industries, when an establishment fails, the 
resources it frees up may be recirculated into 
the environment and utilized by others. Each 
failing may thus enhance the viability of 
survivors and, therefore, lower the population 
failing rate in the next period (Delacroix and 
Carroll 1983). In those industries where much 
recycling takes place, failings may improve 
systematically the longevity of survivors, an 
intrapopulation dynamic process capable per- 
haps of distorting all models of population 
death rates that privilege exogenous variables. 
Recycling is important in the wine industry, 
giving us an opportunity to examine an 
extreme case of this process. Capital invest- 
ment in presses, vats, barrels, and bottling 
equipment can hardly be converted to alterna- 
tive uses. Wine making is a little codified and 
narrow skill not readily applicable to other 
endeavors. Vineyards may be turned to other 
agricultural uses, but this is a difficult-to- 
reverse process that seems to induce 
hesitation.? These arguments notwithstand- 
ing, it is not a foregone conclusion that the 
net effect of prior failings must be to lower 
the current failing rate. Business failings, 
within some industries, occur in epidemic- 
like waves. In our population, failings at time 
t—1 may denote the occurrence of such 
a wave of mortality, still operative at time t 
and caused by environmental factors left 
unmeasured in our model. We adjudicate 
between these two formulations by adding in 
Model 4 a variable measured as numbers of 
failings in the year preceding the year of 
observation. 

The introduction of this variable improves 
dramatically Model 3. The Chi-square leaps 
to a very high 134.02, and the wine 
consumption and calendar time variables 
become significant. The negative coefficient 
associated with calendar time indicates that 





? Of the 38 years for which we have data on 
change in total acreage under cultivation, 6 years 
witnessed little or no change. In 14 of the 38 years, 
the total acreage under cultivation increased by an 
average of 24 percent over the preceding year, and 
in the remaining 18 years, the total acreage 
decreased by an average of 6.4 percent over the 
preceding year. 
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winery mortality declines over time. Why this 
should be may be suggested indirectly by the 
positive coefficient attached to wine consump- 
tion. Growing wine consumption may dispro- 
portionately favor large wineries which are 
more likely to profit from economies of scale. 
Thus, somewhat paradoxically, the terms of 
competition may become unfavorable for 
smaller wineries consequently which die first. 
The separate, negative effect of establishment 
size tends to support such a scenario of 
industry consolidation.* Mean establishment 
size as measured by capacity increases 
regularly from 406 gallons in 1940 to 3510 
gallons in 1973, that is, for 75 percent of our 
period of observation. Thereafter, the emer- 
gence of boutique wineries reverses this trend 
and the population again grows in.size, as 
shown in Figure 2. The industry consolidation 
process expressed by the wine consumption is 
fully revealed when number of failings is 
introduced into the equation. We speculate 
that increasing failings of small wineries 
create loose resources which are recycled to 
the benefit of the largest establishments. 
However, we cannot rule out completely the 
possibility that collinearity between wine 
consumption and calendar time affects the 
findings pertaining to the first variable. 
Hence, the speculation concerning its effect 
must be considered cautiously. This, how- 
ever, has no consequences for the finding of 
no curvilinear density effect. 

In Model 4, the significant negative effect 
of prior foundings is strengthened. The effect 
of prior failings itself is negative and 
significant, in conformity with the recycling 
hypothesis. The coefficients associated with 
density remain nonsignificant with signs 
opposite those required by the Hannan and 
Freeman formulation. 

The inadequacy of a density dependence 
formulation in explaining the failing rate of 
wineries is demonstrated in Model 5 where 
the two density terms are removed from the 
equation. In this most parsimonious model, 
nothing changes in comparison to Model 4 
except that model 5, with a Chi-square of 
133.68 for six degrees of freedom, is 
significantly more powerful than Model 4, 
with a Chi-square of 134.02 for eight degrees 
of freedom. 





* We are indebted to an anonymous reviewer for 
this interpretation. 
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Figure 2. Total Number of Establishments in the California Wine Industry. 


TESTING THE BEST MODEL WITH 
ENVIRONMENTAL VARIABLES 


Models 8, 9, 10, 11 in Table 2 report on some 
commonsense attempts to control for general- 
ized macroeconomic processes pertaining to 
the commercial environment of wineries. 
These four models are estimated over their 
common set of spells for comparability. This 
common set of spells is smaller than the set 
pertaining to Models 1 through 5. In Model 7, 
we duplicate Model 5 over this smaller set to 
ensure that variation in numbers of spells 
(i.e., cases) does not impede comparability 
between Tables 1 and 2. Model 7 is in every 
way analogous to Model 5, indicating that 
this question does not arise. 


In Model 8, we control for the growing size 
of the national economy with a measure of 
Gross National Product. This variable has no 
effect on the failing rate. Other coefficients 
remain essentially as in Models 7 and 5. In 
Model 9 we try to refine the putative effect 
that the size of the national economy may 
have on the failing rate of wineries by 
introducing a variable more closely related to 
the idea of a winery niche than is GNP. This 
measure, Per Capita Personal Consumption 
Expenditure, captures the growing ability of 
Americans to purchase consumer products, 
including wine. The fact that the effect 
associated with this variable is negative and 
significant suggests that the variable does 
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Table 2. Model 5 Controlling for Gross National Product, Per Capita Personal Consumption Expenditures, Number of 
Business Failures, and Investment in Nonresidential Producers Durable Equipment 





Model 
Independent Variables 7 8 9 10 11 
Constant 0.2191 — 0.0229 0.2851 0.4079 0.1794 
(.5743) (.6266) (.5838) (.5958) (.6412) 
Size —0.0589* —0.0593* —0.0593* —0.0589* ~0.0589* 
(.0203) (.0203) (.0203) (.0203) (.0203) 
Age —0.0056 — 0.0065 —0.0059 — 0.0053 ~ 0.0057 
(.0058) (.0058) (.0058) (.0058) (.0058) 
Wine consumption 0.0083* 0.0107* 0.0124* 0.0086* 0.0086* 
(.0020) (.0031) (.0028) (.0020) (.0030) 
Calendar time —0.0644* — 0.0640* —0.0271 —0.0715* —0.0643* 
(.0148) (.0149) (.0235) (.0158) (.0149) 
No. of foundings t-1 —0.0186* —0.0172* —0.0175* —0.0173* —0.0183* 
(.0039) (.0041) (.0039) (.0040) (.0043) 
No. of failings t-1 —0.0345* —0.0341* —0.0358* —0.0353* —0.0344* 
(.0042) (.0042) (.0043) (.0043) (.0043) 
GNP — 0.2864 
(2830) 
Per capita personal —1.114* 
consumption expenditures (.5429) 
No. of business failures 0.0156 
(.0123) 
Investment in nonresidential —0.5186 
producers durable equipment (3.703) 
No. of spells 10085 10085 10085 10085 10085 
No. of events 429 . 429 429 429 429 
Chi-square 
Likelihood ratio 99.98 101.01 104.28 101.58 100.0 
Degrees of freedom 6 7 7 7 7 


Note: Asymptotic Standard Errors are shown in parentheses. 


* p«..05. 


capture this dimension of the winery niche. 
The Chi-square associated with Model 9 
indicates a significant improvement over 
Model 7. However, none of the other 
coefficients change. Hence Model 9 is a 
better model which confirms the population 
dynamics formulation expressed by Model 5. 
Models 10 and 11 capture other features of 
the business climate through the introduction 
of a measure of total number of business 
failures and of Investment in Nonresidential 
Producers Durable Equipment, respectively. 
Both variables are measured at the time of the 
failing of the focal winery. The Chi-square 
attached to these models indicates that they 
have no greater explanatory power than 
Model 7. The two macroeconomic variables 
exhibit effects in the intuitively correct 
directions. However, both coefficients fail to 
reach statistical significance. The coefficients 
associated with the other covariates remain 
unchanged as compared to Models 5 and 7. 
These macroeconomic tests together simply 
confirm the sturdiness of the population 


dynamics formulation. Below, we turn again 
to the question of the correctness of the 
chronological specification of all the models 
thus far examined. 


TESTING FOR CHRONOLOGICAL 
MISSPECIFICATION IN THE 
BEST MODEL 


Is the Post-Prohibition California Wine 
Industry a New Population? 


Our period of observation begins in 1940. 
That year is very close to the origin of the 
present California wine population, we think, 
because it follows the end of Prohibition by 
only six years. However, sources recount that 
many vineyardists and other entrepreneurs 
bad anticipated the repeal. According to 
Teiser and Harroun (1984, p. 73), there were 
804 wineries in California in 1934, soon after 
Prohibition was repealed. Since neither Prohi- 
bition nor its repeal was ever an object of 
political unanimity, a degree of illegitimacy 
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must have adhered to the wine business after 
repeal. If Hannan and Freeman’s theoretical 
view were correct, such a massive entry of 
wineries should have overcome this obstacle 
to the growth of the population. Correspond- 
ingly, numbers—that is, density—should 
have increased in the ensuing period. Instead, 
the 1934 density was the highest ever. The 
number of wineries declined to 465 within 10 
years (Teiser and Harroun 1984, p. 73) and 
reached an all-time low of 221 in 1973. This 
40-year decline does not bespeak a process of 
establishment of legitimacy but, instead, one 
of straightforward commercial competition 
and industry consolidation. 

It is known that wineries existed in Califor- 
nia since at least the mid-1850s. Thus, the 
present California wine industry might be con- 
sidered contemporaneous of Hannan and Free- 
man’s labor unions. In this interpretation, the 
population of this study would constitute a late 
sample of a population in existence for about 
130 years. It is unlikely that legitimacy has 
been a continuing problem for this putatively 
uninterrupted industry for such a long period. 
Consequently, the 1940-85 population of our 
study should have found itself on the upward 
segment of Hannan and Freeman's (—, +) U- 
curve, and the coefficient associated with a lin- 
ear density term should be positive. We test the 
plausibility of this reasoning in Model 6, Table 
1. In this model, the parsimonious, population 
dynamics-centered Model 5 is reestimated with 
a linear term of density included to test this 
formulation. 

The coefficient associated with this linear 
term of density in Model 6 is positive, but 
miniscule and nonsignificant. Moreover, the 
Chi-square associated with Model 6, which 
has seven degrees of freedom, is virtually 
identical with the Chi-square associated with 
Model 5, which has only six degrees of 
freedom, indicating that the predictive power 
of Model 6 is inferior to that of Model 5. 
Therefore, the view that Prohibition failed to 
destroy the original California wine industry 
is not supported. Hence the findings in Model 
5 stand. 


Testing for Period Sensitivity 


We address another potential source of 
weakness in our findings. Model 5, which 
sets limits on the usefulness of the concept of 
density dependence, might rest on idiosyn- 
cratic features of the California wine industry. 
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In particular, the findings in Model 5 might 
result from the peculiar shape of our density: 
variable over time. It describes a V-curve 
(Figure 2) rather than the more familiar 
ascending straight line or inverted U-curve of 
Hannan and Freeman’s and other studies 
(Carroll 1984; Carroll and Huo 1986; Barnett 
and Carroll 1988). 

We tested for the potentially distorting 
effects of this peculiarity by introducing a 
period dummy variable in an equation of the 
Model 5 form. The dummy variable is coded 
“0” for any observation occurring during the 
subperiod (1940-1972) when the population 
density was decreasing and “1” for the next 
subperiod, when density was increasing. In 
this test (not shown), the dummy period 
variable is not statistically significant and the 
strong negative, significant effects of found- 
ings and failings are unaltered. Model 5 is not 
period-sensitive. 


DISCUSSION AND CONCLUSION 


The essence of our thesis is expressed in the 
comparison between Models 2 and 4, on the 
one hand, and Model 5, on the other, 
summarized in Figure 1. The main defect of 
our estimation is, paradoxically, a surfeit of 
explained variance in Model 4 where the 
inclusion of one variable, number of failings, 
increases the Chi-square by 80 points. One 
may question whether tbe recycling of the 
resources of failed wineries reasonably ac- 
counts for such a large increment in survival 
probability. Beyond this, the generalizability 
of this particular finding is dubious. Even if it 
is the case that recycling is responsible for 
much increased survival chances of California 
wineries, it is not likely that the same 
relationship obtains in many industries. Nev- 
ertheless, our demonstration does not rest on 
this particular finding. The increase in 
explanatory power due to the inclusion of 
number of foundings (Models 2 and 3) would 
be considered very large if it were not 
dwarfed by the even greater increase pursuant 
to the introduction of a variable correspond- 
ing to the number of failings. 

Overall, our findings demonstrate that the 
rate of failing of California wineries is not 
density-dependent in the manner described by 
Hannan and Freeman. We showed that 
California wineries are not destroyed by 
illegitimacy. We provided findings consonant 
with the view that they escape overcrowding 
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by migrating laterally to a neighboring niche 
(or by enlarging the initial niche). However, 
this is not the end of the task. 

Ecological studies must avoid thinly dis- 
guised historicism (Perrow 1979; Betton and 
Dess 1985; Fombrun 1988). Instead, we must 
ascertain whether our findings apply only to 
one small industry during a relatively short 
period or whether they can be generalized. 
Hannan and Freeman advance the claim that 
they have identified “a generic organizational 
process” (1988, p. 31). Logically, a single 
deviant case should dent this claim. Beyond 
this, we think that our counterexample has 
broader theoretical implications. 

Hannan and Freeman state that at first, 
organizations of a new form must struggle 
against the limitations imposed by an environ- 
ment that apportions legitimacy sparingly. 
Once the new form overcomes this initial 
obstacle, individual organizations multiply 
and, as a result, relevant resources become 
scarce. Here again, environmental stinginess 
is the link between density and failing. The 
theoretical underpinnings of density depen- 
dence must therefore vanish in all circum- 
stances where the environment is indifferent 
to matters of legitimacy and munificent with 
economic resources. We think that this 
indifference and this munificence are more 
common than Hannan and Freeman’s theoret- 
ical formulation allows, because they are the 
rule insofar as business organizations are 
concerned. 

However, nonprofit organizations such as 
labor unions, political organizations such as 
newspapers (Carroll 1987, pp. 134-35), and 
all government-sponsored organizations whose 
survival is threatened by illegitimacy may see 
their failing rate decrease as rising numbers 
enhance their visibility. More interestingly, 
the ability of such organizations to migrate to 
a neighboring niche or to enlarge their initial 
niche is bound by their original charter. Each 
attempted move and each try at broadening 
the initial niche will pose anew the question 
of legitimacy, possibly with positive effects 
on the failing rate. Hence, we think that the 
Hannan and Freeman formulation is quite 
compatible with our own, for all nonbusiness 
organizations. 

California wineries between 1940 and 1985 
are ordinary business organizations. They are 
more typical of that broad class of organiza- 
tions than are labor unions and even San 
Francisco area newspapers after the Gold 
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Rush. Wineries may not be more representa- 
tive than semiconductor firms, the third 
population on which Hannan and Freeman 
base their findings. However, there is some 
indication that the semiconductor industry 
began its existence largely as a creature of the 
federal government (Rao and Delacroix 1988). 
Such a genesis may have rendered this 
industry more sensitive to matters of legiti- 
macy and more restricted to a single niche of 
fixed dimensions than is the case for 
populations of business organizations in 
general. This interpretation is in keeping- with 
the theoretical formulation of Meyer and 
Rowan (1977, p. 353), a major influence in 
Hannan and Freeman’s approach. Hence, 
Hannan and Freeman may have constructed a 
theory of density dependence in failing rates 
that fits well all institutionally constrained 
organizations only. More studies taking into 
account the distinction between business and 
nonbusiness organizations will tell whether it 
deserves to be taken seriously. Already, 
Barnett and Carroll (1988) report that they 
have discovered no evidence of density 
dependence in the failing rate of telephone 
companies. 

We conclude by stating what this paper 
does not claim. First, we do not address at all 
the issue of density dependence in the 
founding rate of organizations (Hannan and 
Freeman 1987). Secondly, we do not question 
Hannan and Freeman’s formulation as it 
pertains to nonbusiness organizations. Thirdly, 
this study should not be construed as a 
repudiation of organizational ecology theory 
but as a contribution to it. This paper is not an 
act of apostasy but, at most, of heresy. 
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DATA APPENDIX 


Our dependent variable is measured at yearly inter- 
vals while some of our organizational and population- 
level covariates are available only at two- and three- 
year intervals. We dealt with this measurement 
problem with the simplifying assumption that the 
value of these covariates changes as a step function 
at the beginning of each interval. (This technique is 
discussed in Tuma and Hannan 1984, pp. 245-50.) 
With this approach, we obtain one case for every 
year in which an organization is in existence for a 
total of 17,167 cases, or “spells.” 

The limited availability of data on the covariates 
over time introduces an element of imperfection 
into the study. However, we obtained and included 
the covariates below which were available for a 
substantial number of firms over a long enough 
period of time relative to the period under 
consideration. 

The organizational-level covariates used in this 
study are the following: 


AGE— Age in years of each organization relative 
to the year of first appearance in the record. Data 
available for all organizations over the entire 
period. 

SIZE—Storage capacity in thousands of gallons. 
Data available for 1,079 establishments which 
provided a total of 12,534 observations. 


The population-level covariates included in the 
analysis are the following: 


POPULATION DENSITY — Total number of win- 
eries in existence at the end of the previous year or 
the beginning of the current year. Data available 
for the period 1941—86. 

NUMBER OF FOUNDINGS :— 1 — Total number 
of wineries founded in the preceding year. Data 
available for the period 1942-85. 
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NUMBER OF FAILINGS 1— 1 — Total number of 
wineries that failed in the preceding year. Data 
available for the period 1942-85. 

WINE CONSUMPTION- Total withdrawal of 
wines from bonded warehouses in the preceding 
year. This variable is constructed in units of 
millions of tax gallons. 

CALENDAR TIME--Year of observation minus 
1900. The substantive meaning of this variable is 
explained in the text. 


Data on the following environmental or macro- 
economic covariates are from the January 1956 and 
February 1985 issues of the "Economic Report of 
the President" and relevant issues of the "Survey 
of Current Business" (published by the U.S. 
Department of Commerce, Bureau of Economic 
Analysis). Page numbers refer to specific parts in 
either the 1956 or the 1985 edition of the 
"Economic Report of the President." 


GNP —Gross National Product, pages 165 (1956) 
and 232 (1985) coded in trillions of (1972) dollars. 
Data available over 1940—84. 

PER CAPITAL PERSONAL CONSUMPTION 
EXPENDITURE —Coded in thousands of (1972) 
dollars. Calculated on a ratio of aggregate 
consumption expenditures (pages 178 [1956] 
and 261 [1985] to U.S. population (source: 
Bureau of Census Reports). Data available over 
1940-84. 

TOTAL NUMBER OF BUSINESS FAILURES — 
This number has been constructed by the addition 
of two variables: Number of business failures 
(liabilities under $100,000) and number of busi- 
ness failures (liabilities over $100,000), pages 231 
(1956) and 337 (1985). This variable is constructed 
in units of thousands of failures. Data available 
over 1940-83. 

INVESTMENT IN NONRESIDENTIAL PRO- 
DUCERS DURABLE EQUIPMENT—Coded in 
trillions of (1972) dollars. Pages 165 (1956) and 
232 (1985). This variable, which is a measure of 
aggregate investment, is reported under “gross 
private domestic (fixed) investment." Data avail- 
able for the period 1940—84. 
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Merton's theory of strain and crime has withstood half a century of theoretical 
controversy, but recent disillusionment with its empirical verification has led many 
to reject it as a viable explanation for delinquency. We question that the theory has 
been falsified by evidence to this date, on the grounds that conceptual 
reinterpretations have differed from Merton’s original statement. We examine the 
empirical and theoretical implications of the reconceptualization of, strain implied 
in its current operationalization. Analyses of self-reported juvenile data indicate 
that the overall predictive validity of a measure based on the dysjunction of 
economic goals and educational means surpasses the more common measure of a 
dysjunction between educational aspirations and expectations. The theoretical 
implications of this measurement issue are illustrated in the context of ongoing 
controversy about the redundancy of strain, when commitment alone (a partial 


component of strain) seems adequate to predict delinquency. 


INTRODUCTION 


In the annals of modern criminology, few 
theories have achieved the saliency of Mer- 
ton’s (1983) theory of strain and deviance. Its 
preeminence for half a century has withstood 
a substantial literature critical of its theoreti- 
cal premises.! Recent disillusionment with its 
empirical vertification, however, has led 
many to reject it as a viable explanation for 
delinquency (Hirschi 1969; Johnson 1979; 
Kornhauser 1978; Liska 1971). 

Since strain theory incorporates both psy- 
chological and structural explanations for 
crime, its dismissal would be a serious loss to 
criminology. Combined with recent doubts 
about the importance of social class in the 
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genesis of crime (e.g., Tittle, Villemez, and 
Smith 1978), the rejection of Merton’s theory 
of structurally induced strain could portend a 
paradigmatic shift toward theories of individ- 
ual behavior devoid of structural context. The . 
historical importance and unique contribution 
of strain theory merit a reassessment of the 
falsifying evidence, prior to its final rejection. 

With this premise, we explore the possibil- 
ity that research interpretations differing 
conceptually from Merton's original state- 
ment contribute to the recent empirical failure 
of strain theory. Indeed, we hypothesize that 
the operationalization of strain in a manner 
consistent with its original conceptual defini- . 
tion will produce results different from and 
more supportive than those recently reported. 
If so, then a wholesale rejection of strain 
theory for its failure to explain delinquency is 
premature. 


THEORETICAL BACKGROUND 


The proximate causes of deviance were 
described by Merton as individual adaptations 
to perceived strain, but the origins of that 
Strain were traced to contradictions in the 
structure and cultural goals of modern indus- 
trial society. On a societal level, Merton 
described the meritocratic ideal that universal 
economic success is possible through effort 
and ability. The meritocratic ideal is belied by 
the reality of inequality in the social structure, 
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which precludes universal economic success. 
Thus cultural goals of success are proposed 
for all members of society, but not all groups 
have equal access to the means for their 
attainment. This dysjunction between cultural 
prescriptions and access to desired goals can 
create an acute sense of strain on the 
individual level. Merton proposed that indi- 
vidual strain is most likely among lower-class 
members who internalize cultural goals of 
wealth and status but recognize blocks to 
conventional means for their attainment 
(Merton 1938). 

Criminologists have long recognized the 
promise of this logical explanation for the 
Statistical association between social class and 
official crime rates (e.g., Lander 1954; Short 
1964). Enthusiasm for the theory was tem- 
pered, however, by reservations about some 
of its basic assumptions. The reality of 
inequality in our society has been well 
documented, but the assumption of universal 
success goals was countered with reference to 
our society’s cultural diversity (e.g., Hyman 
1953). Merton’s original exposition of strain 
was also faulted for its theoretical ambiguity 
(Cohen 1955; Lindesmith and Gagnon 1964). 
For example, Merton gave illustrations of 
deviance probably associated with different 
modes of adaptation but did not provide 
propositional statements concerning the pro- 
cesses by which each adaptive mode might 
' effect different deviant outcomes (Clinard 
1964a). . 

The ramifications of this kind of impreci- 
sion are evident in attempts to operationalize 
delinquency for the study of strain effects on 
juveniles. The theory seems to imply that 
innovation leads to utilitarian types of delin- 
quency (those with material payoff) but does 
not specify whether strain explains common 
types of juvenile lawbreaking such as vandal- 
ism or personal offenses of a nonutilitarian 
nature (Gibbons and Jones 1975; Thio 1975). 
The theory is silent as to whether strain 
should predict delinquency prevalence or 
frequency or both, or serious versus non- 
serious types. Ambiguity also surrounds the 
conceptualization and operationalization of 
strain in the study of juveniles: 


OPERATIONALIZING STRAIN 
FOR JUVENILES 


Merton described strain as an individual’s 
reaction to a dysjunction between universal 
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goals and access to the institutionalized 
means for their attainment. Merton’s earliest 
writings (1938; 1957, pp. 131-94) seem to 
specify unequivocally that economic wealth is 
a dominant goal in our meritocratic society 
and that education is the conventional means 
for attaining wealth.? Currently, for example, 
a college degree is commonly viewed as a 
minimal prerequisite for entry to a good job 
or professional career. Strain would be likely 
when a person is strongly committed to 
making a lot of money but views college as 
beyond attainment. We believe that structur- 
ally induced strain among juveniles would 
therefore be measured appropriately as the 
dysjunction between economic goals and 
expectations for completing college. 

In contrast, the favored operationalization 
of strain in delinquency studies has been the 
difference between educational aspirations 
and expectations (e.g., Hirschi 1969; Liska 
1971; Quicker 1974).3 The argument for 
using this measure is that job prospects are 
less useful as goals for juveniles because 
these prospects are too far removed from their 
conscious concems (Agnew 1986, 1984; 
Elliott, Huizinga, and Ageton 1985).* This 
commonly used measure departs substantially 


? We acknowledge that this is not the only 
logical interpretation of Merton’s theory, as the 
theoretical literature attests (e.g., Bernard 1984; 
Cohen 1965; Cloward and Ohlin 1960; Kornhauser 
1978). Our goal here is neither to enter the 
theoretical fray concerning the "best" interpreta- 
tion, nor to claim unique fidelity to Merton's 
unstated intentions underlying theoretical ambigu- 
ities. Rather, the theoretical issues central to this 


. study are embedded in empirical research and more 


appropriately addressed in analyses than in further 
debate. j 

3 This is the current measurement convention. 
Early analyses operationalized strain variously as 
community SES, unemployment rates, educational 
goals alone, social status, tract characteristics, 
alienation or anomie, relative deprivation, father’s 
occupation, or the dysjunction between occupa- 
tional aspirations and expectations. For a sampling 
of these measurement approaches, see Clinard 
(1964b). 

* As Quicker (1974, p. 77) observed, research- 
ers “seem to be making the assumption that 
delinquent adolescents are in a sense miniature 
adults, since it is the frustrated occupational goals 
that are the prime ccntributors to delinquency. .. . 
they have reduced the importance frustrated 
educational goals may have on the adolescent’s 
life.” 
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from Merton’s concept. If strain is recast 
entirely in the educational realm, the educa- 
tional means in Merton’s original theory 
become both goals and means, and the central 
theoretical importance of economic goals is 
lost. 

The rationale for this reconceptualization of 
Strain for juveniles is problematic. Even 
though juveniles may have difficulty in 
thinking about future jobs, their monetary 
ambitions may be strong and clear. For both 
theoretical and logical reasons, therefore, we 
propose the operationalization of juvenile 
strain as the dysfunction between economic 
goals and educational expectations rather than 
as a dysjunction between educational aspira- 
tions and expectations. 


RELATED STUDIES 


The empirical evidence indicates that the 
predictive efficiency of strain theory is better 
when strain is represented as an educational 
aspirations/expectations dysjunction rather than 
as a dysjunction based on occupational 
measures (Bordua 1961; Hirschi 1969; Quicker 
1974). But even when educational aspirations 
and expectations are used, reviews of the 
literature suggest weak support for strain 
theory (e.g., Elliott et al. 1985; Johnson 
1979). Thus the educational dysjunction 
measure does improve prediction when com- 
pared with the occupational dysjunction. Two 
influential studies have revealed, however, 
that educational aspirations alone, without 
reference to educational expectations, account 
for the observed association between delin- 
quency and strain (Hirschi 1969; Liska 1971). 
These findings have been interpreted to 
suggest that strain is redundant as an 
explanation of delinquency since goal commit- 
ment alone suffices without reference to 
differential means for goal attainment (Johnson 
1979; Kornhauser 1978). 


Evidence for the Redundancy Thesis 


As early as 1969, Hirschi suggested within a 
control perspective that high aspirations to 
conventional goals acted as constraints on 
delinquency (1969, p. 162) and that the 
addition of a measure of strain would not 
improve the explanatory efficiency of commit- 
ment alone. Countering the strain position 
that high aspirations in the presence of low 
expectations increase the likelihood of delin- 
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quency, Hirschi (1969, p. 9) hypothesized 
that "the (negative) relation between aspira- 
tions and delinquency" (supportive of control 
theory) "does not reverse when expectations 
are held constant” (as in the measurement of 
strain). His analysis using educational aspira- 
tions and expectations demonstrated that 
while higher aspirations decreased the proba- 
bility of delinquency in his sample, 


discrepancies in educational aspirations and 
expectations are not important in the causation 
of delinquency for two reasons: few boys in the 
sample have aspirations greatly in excess of their 
expectations; and those boys whose aspirations 
exceed their expectations are no more likely to 
be delinquent than those boys whose aspirations 

and expectations are identical. (1969, p. 172, 

emphasis in the original) 

Further analyses by Liska (1971) with 
multiple data sets reinforced Hirschi's conclu- 
sion. Like Hirschi, Liska measured juvenile 
strain as the dysjunction between educational 
aspirations and expectations and reported 
findings indicating that Merton's "stress 
proposition" might be explained more simply 
by commitment or control theory. Thus the 
most devastating contemporary criticism of 
strain theory concerns its apparent failure in 
empirical study, particularly its failure rela- 
tive to control theory (Johnson 1979; Korn- 
hauser 1978). 

On the other band, most of the research 
supporting such conclusions neglected the 
centrality of economic success goals in 
generating strain (Bernard 1984). Hirschi 
acknowledged the possible value of income 
aspirations in testing control and strain 
propositions (1969, p. 180). His and Liska's 
refutations of strain theory, however, relied 
on the measurement of goals and means as 
educational aspirations and expectations. 


IMPLICATIONS FOR THE 
PRESENT STUDY 


The purpose of this study is to consider the 
impact of different conceptualization and 
operationalization strategies on empirical find- 
ings. Comprehensive reviews of empirical 
studies seem to assume that strain, regardless 
of the realm in which it is measured, has the 
same implications for delinquency. This is 
consistent with Stinchcombe's (1964) articu- 
lation thesis that juveniles who are currently 
unsuccessful in school anticipate future fail- 
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. ure as adults. If this is the case, then different 
measures of strain are empirically interchange- 
able and the findings from studies using 
various operationalization procedures can be 
summarized in a cumulative way. 

To date, however, researchers have not 
explored systematically the alternative possi- 
bility that measurement matters. Neither have 
they examined the possibility that a return to 
Merton’s conceptual definition of strain might 
produce better results than those reported 
above. We propose that the lack of support 
for strain theory may stem from the implicit 
revisions to the theory evident in empirical 
studies. Thus the primary objective of the 
present study is to provide an empirical basis 
for assessing the theoretical significance of 
alternative strategies for operationalizing strain. 


THE PRESENT STUDY 
Data and Sample 


The analysis was conducted with secondary 
data collected in Seattle, Washington, in 
1978—79. Stratified disproportionate random 


sampling procedures were used to oversample ` 


youths at a high risk for delinquency. Of the 
1614 respondents aged 15 to 18, 75.2 percent 
were male, 30.5 percent were nonwhite, and 
64.4 percent resided in lower socioeconomic 
areas of the city when the survey was 
conducted. A complete description of the 
sample and data is provided by Michael J. 
Hindelang, Travis Hirschi, and Joseph G. 
Weis (1981).5 These data include a broad 
range of survey items about self-reported 
delinquency and youths’ attitudes, plans, and 
ambitions that is well suited to the present 
study. : 


Measurement 


Table 1 describes five sets of variables for the 
analysis: goals and means; strain measured as 
a dysjunction between economic goals and 
educational means; strain as the dysjunction 
between educational aspirations and expecta- 
tions; last-year prevalence of self-reported 
delinquency; and last-year frequencies of 


? For the purposes of their analyses, Hindelang 
et al. (1981) oversampled official delinquents. 
Thus delinquency distributions in the present study 
do not generalize to a normal population without 
appropriate weighting. 
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delinquency among those reporting any in- 
volvement in the types of delinquency 
examined here. 

Strain. Strain is measured first as the 
dysjunction between (1) economic goals and 
expectations for completing college (Hi 
$Goals, No Coll. Exp.), and (2) the extent to 
which financial goals exceed educational 
expectations in general ($Goals-School Exp.). 
Juvenile strain has been measured in the past 
as the dysjunction between educational aspi- 
rations and expectations. Dysjunctions of this 
type are replicated with two variables: a 
dichotomy that identifies those whose educa- 
tional goals exceed their expectations (School 
Asp. GT School Exp.); and a measure of the 
extent to which levels of educational aspira- 
tion exceed educational expectations (School 
Asp.-School Exp.).$ 

Measuring delinquency. Merton's theory 
has been widely invoked to explain why 
economically deprived juveniles commit util- 
itarian crimes: those to which material 
rewards are attached. Many offenses by 
juveniles, however, are of a nonutilitarian 
type, with no apparent material payoff. Thus, 
the generality of strain as a theory of juvenile 
delinquency is limited to the extent that strain 
is not relevant for such common types as 
assault and vandalism, which are nonutilita- 
rian in that they do not suggest obvious 
material rewards. 

The theory is unclear concerning the extent 
to which strain is associated with serious 
versus nonserious crimes, or with any 


$ We thank Robert Agnew for suggesting that 
the component variables constituting strain might 
be better measured as degrees of commitment than 
as simple dichotomies. Analyses.not reported here 
suggested that measures indicating degrees of 
strain were somewhat stronger in their association 
with dichotomous delinquent variables, as might 
be expected. The significance of the association, 
however, was in all cases identical when either 
type of strain measure was employed and substan- 
tive differences were negligible. Depending upon 
the analysis procedure used, we therefore em- 
ployed either dichotomies or categorical variables 
indicating degree as interchangeable measures of 
the same concepts. 

7 A distinction between serious and nonserious 
types of delinquency is appropriate in view of the 
acknowledged limitations of self-reported data in 
this regard. The distinction is a relative one, and 
even our comparatively sérious self-reported items 
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Table 1. Variables for the Analysis (N= 1614)* 


Variable 


Economic aspirations 
School aspirations 
School expectations 


College aspirations 
College expectations 


Hi $Goais, 
No. Coll Exp. 
$Goals —School Exp. 


School Asp. 
GT School Exp. 


School Asp. —School Exp. 


Prevalence, nonserious 
Utilitarian delinquency 

Prevalence, serious 
Utilitarian delinquency 

Prevalence, nonserious 
Nonutilitarian delinquency 


Prevalence, serious 
Nonutilitarian delinquency 


Frequency, nonserious 
Utilitarian delinquency 
(N= 827) 

Frequency, serious 
Utilitarian delinquency 
(N= 655) 


Frequency, nonserious 
Nonutilitarian delinquency 
(N= 1206) 


Frequency, serious 
Nonutilitarian delinquency 
(N= 467) 


E) 


material payoff. 
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Meaning ; Value Range 
Goals and Means 
Commitment to financial goals. Agreement with: “I want to make 0-4 
lots of money.” Higher scores indicate greater commitment. 
Response to: “How much schooling would you like to get 0-35 
eventually?" . 
Response to: "How much schooling do you expect to get Q-3* 
eventually?" 
Aspirations for college completion. 0-1 
Expectations for college completion. 0-1 
Dysjunction between Economic Goals and Educational Means 
High commitment to financial goals, no expectation for college 0-1 
degree. 
Economic aspirations minus school expectations. 0-4 
Educational Aspirations/Expectations Dysjunction 
Educational aspirations greater than educational expectations. 0-1 
Educational aspirations minus educational expectations. 0-2 
. Last Year Prevalence, Delinquency 
Any relatively nonserious delinquency with material payoff. 0-1 
Any relatively serious delinquency with material payoff. 0-1 
Any relatively nonserious delinquency with no apparent material 0-1 
payoff. 
Any relatively serious delinquency with no apparent material 0-1 
payoff. 
Losed Frequencies, Last Year Delinquency (Offenders Only) 
Frequency of relatively nonserious delinquency with material 0 to 4.87 
payoff. 
Frequency of relatively serious delinquency with material payoff. 0 to 5.08 
Frequency of relatively nonserious delinquency with no apparent 0 to 5.15 
material payoff. 
Frequency of relatively serious delinquency with no apparent 0 to 4.63 





* Table A.1 presents descriptive statistics. 


> Where 0=some high School; 1=high school graduation, on-the-job apprenticeship, trade or vocational school; 
2 = some college or junior college; and 3 = college graduation. 


involvement (prevalence) versus the fre- 
quency of offending. Yet these measurement 
distinctions are substantively important. A 
one-time offender involved in rather trivial 


may produce results different from those of 
analyses with official data (Bernard 1984). 


lawbreaking behavior is qualitatively different 
from the serious habitual offender whose 
delinquency persists and perhaps escalates 
over the span of a lengthy criminal career. 
Past empirical studies of strain have not 
considered all three of these important 
dimensions of delinquency in a single design. 
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Some have differentiated among types and 
seriousness of delinquency (e.g., Agnew 
1984; Datesman, Scarpitti, and Stephenson 
1975), but others have employed an omnibus 
measure that does not make these distinctions 
(e.g., Hirschi 1969; Liska 1971). 

The delinquency measures used here were 
based on self-reported delinquency for the 
twelve-month period preceding the survey. 
For the purposes of this analysis, eight 
delinquency variables were constructed on the 
basis of the three cross-cutting dimensions 
described above: type (utilitarian or nonutili- 
tarian); seriousness; and prevalence versus 
frequency. To examine involvement in delin- 
quency, dichotomous measures of prevalence 
were constructed for each serious and nonse- 
rious type of delinquency. Analyses of 
frequencies were conducted using the natural 
logs of each frequency measure, in view of 
considerable skewness in their distributions. 
In Table A.l of the appendix we report 
descriptive statistics for the variables in 
Table 1.8 


Analytic Procedures 


Merton presented strain theory in a typology 
of adaptive modes rather than in a series of 
deductive propositions expressing association 
or causal relationships among concepts. For 
this reason, the value of the theory to generate 
causal analyses has been widely questioned 
(Dubin 1959; Simon and Gagnon 1976; 
Cohen 1965; Cloward and Ohlin 1960). 
Another limitation of a causal modeling 
approach stems from the categorical nature of 


3 A list of the survey items on which each scale 
is based is available upon request from the first 
author. 
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the variables implied in the concept of strain. 
In the present study. for example, strain 
reflecting two different perspectives is opera- 
tionalized first as dichotomies (Hi $Goals, 
No Coll. Exp.; School Asp. GT School Exp.) 
and then as categorical variables with a 
modest range of values ($Goals-School Exp.; 
School Asp.-School Exp.). 

The questions guiding this analysis, how- 
ever, concern prediction and are testable 
without recourse to multivariate causal mod- 
eling approaches. We therefore employed 
nonparametric statistical procedures (chi- 
square analyses) to test for independence 
between noninterval measures of strain and 
the prevalence of delinquency (Siegel 1956). 
Parallel analyses with frequency measures of 
delinquency were conducted using one-way 
analyses of variance (ANOVA) with dichoto- 
mous measures of strain as factors. 

The question of primary interest concerns 
the impact on empirical results of different 
strategies for operationalizing strain. We first 
compared the predictive efficiency of dichot- 
omous measures based on the dysjunction 
between financial goals and educational 
expectations (Hi $Goals, No Coll. Exp.), 
versus the dysjunction between educational 
expectations and aspirations (School Asp. GT 
School Exp.), for prevalence (Table 2). Next 
we addressed this question with delinquency 
measures of frequency substituted for preva- 
lence (Table 3). : 

Tests of partial association (HILOGLI- 
NEAR) assessed the "redundancy" hypothe- 
sis that commitment alone is adequate to 
predict delinquency. With this procedure, we 
tested for the association between delin- 
quency and strain measures, including both 
goals and means, while allowing for the 
relationship of goals alone to each delin- 


Table 2. Association between Different Dichotomous Strain Measures and Prevalence of Delinquency (N= 1614) 





Hi $ Goals, No Coll. Exp. 


Yes No 
N 524 1090 
Prevalence, nonserious 
Utilitarian delinquency 59% 48% 
Prevalence, serious 
Utilitarian delinquency 56 33 
Prevalence, nonserious 
Nonutilitarian delinquency 84 . 70 
Prevalence, serious 
Nonutilitarian delinquency 4l 23 





* p<.05: 


School Asp. GT School Exp. 


Yes No 
Chi-Sq. 279 1335 Chi-Sq. 
18.10* 53% 51% .36 
72.86* 49 39 10.59* 
35.86* 82 73 9.20* 
50.94* 35 28 5.93* 
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Table 3. Mean Frequency of Self-Reported Delinquency by Dichotomized Strain Measures 
- Frequency of Frequency of Frequency of Frequency of 
Nonserious Serious Nonserious Serious 
Utilitarian Utilitarian Nonutilitarian Nonutilitarian 
Delinquency Delinquency Delinquency Delinquency 
(N—827) (N= 655) (N= 1206) (N= 467) 
N Mean N Mean N Mean N Mean 
Hi $Goals, 0 518 1.34 363 1.71 765 1.89 254 .88 
No. Coll. Exp. 1 309 1,58 292 2.00 441 2.24 213 1.18 
F 7.78* 9.15* 22.68* 9.68* 
Probability <.01 <.01 «.001 <.01 
School Asp. 0 679 1.39 517 1.83 977 1.97 369 .98 
GT School Exp. 1 148 1.60 138 1.88 229 2.22 98 1.16 
3.84* 20 7.72* 2.33 
Probability .05 NS «01 NS 
* p<or=.05. 


quency outcome (Table 4). These analyses 
included variables reflecting the extent to 
which financial goals differed from educa- 
tional means ($Goals-School Exp.) and the 
extent to which educational aspirations ex- 


ceeded expectations (School Asp.-School . 


Exp.). 
In the framework of our conceptual argu- 
ment, we hypothesized that: 


H, DEL/Hi $Goals, No Coll. Exp. > DEL/ 
School Asp. GT. School Exp., 


where DEL = the prevalence or frequency of 
different types of delinquency; Hi $Goals, 
No. Coll. Exp. = a dysjunction between 
commitment to financial goals and college 
expectations; and School Asp. GT School 
Exp. = a dysjunction between educational 
aspirations and educational expectations. 

The results reported in Table 4 assess the 
implications of measurement for the “redun- 
dancy thesis” that goal commitment alone 
sufficiently predicts delinquency without re- 


course to the interplay between goals and 
means. If strain variables improve upon the 
predictive efficiency of control variables 
alone, regardless of how strain is measured, 
we would expect that: 


H2 DEL/School Asp.-School Exp. > DEL/ 

School Aspirations and 

H3 DEL/$Goals-School Exp. > DEL/Economic 
. Aspirations, 


where DEL = the prevalence of different 
types of delinquency; School Asp.-School 
Exp. = the degree of dysjunction between 
educational aspirations and expectations; 
School Aspirations = educational aspirations 
alone; $Goals-School Exp. = the degree of 
dysjunction between financial goals and 
educational expectations; and Economic Aspi- 
rations = the degree of commitment to 
financial goals. 

The redundancy thesis predicts support for 
the null hypotheses for H; and H; respec- 
tively: 


Table 4. Commitment Versus Strain: Predictive Efficiency of Goals Alone Versus Dysfunction between Goals 


and Means 














School 
Aspirations 
Prevalence, nonserious 
Utilitarian delinquency 34.03* 
Prevalence, serious 
Utilitarian delinquency 71.33* 
Prevalence, nonserious 
Nonutilitarian delinquency 31.14* 
Prevalence, serious 
Nonutilitarian delinquency 73.31* 


* p<.05. 


Partial Chi-Square Values* 


School Asp- Economic $Goals— 
School Exp. Aspirations School Exp. 
3.14 4.15 24.49* 
23.88* 3.11 74.10" 
16.95* 1.80 29.14* 
18.72* 3.44 
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H4 DEL/School Aspirations = or > DEL/ 
School Asp.-School Exp., and 
H; DEL/Economic Aspirations = or > 


DEL/$Goals-School Exp. 


If past operationalization strategies for mea- 
suring strain are implicated in the failure of 
strain relative to contro] theory, then support 
for H, does not preclude nonsupport for Hs. 
In the context of the arguments underlying 
this study, we hypothesize support for H4, as 
in past research, and nonsupport for Hs. 


Results 


Table A.1 in the appendix indicates that the 
measures of goals and of strain are somewhat 
skewed in their distribution, but our empirical 
tests are not invalidated because of the 
absence of “strained” individuals (Agnew 
1986).? The first question for research 
concerns the relative predictive efficiency of 
different measures of strain. 

Strain and prevalence. H; predicts that 
strain measured as the dysjunction between 
financial goals and college expectations will 
be a better predictor of delinquency than 
strain measured as the dysjunction between 
educational aspirations and expectations. This 
hypothesis receives some support in analyses 
of prevalence (Table 2). The chi-square 
values from analyses with the variable based 
on a dysjunction between educational aspira- 
tions and expectations are significant for all of 
the delinquency measures except nonserious 
utilitarian offenses. The results are significant 
for all of the outcomes when strain is 
measured as the dysjunction between eco- 
nomic goals and college expectations. The 
results differ somewhat as a function of the 
type of delinquency examined. For the more 
serious forms of each type of delinquency, 
both measures were more effective in predict- 
ing the prevalence of utilitarian than of 
nonutilitarian delinquency. For the less seri- 
ous forms, both measures were better predic- 
tors of nonutilitarian delinquency. 

Strain and the frequency of delinquency. A 
comparison of the results from analyses of 
variance reported in Table 3 indicates clearer 


? This may be related to the nature of the sample 
analyzed. See footnote 5, supra. It is also 
interesting to note that a strong commitment to 
economic goals is the mode but is not universal in 
the sample (Table A. 1). 
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Table A.1. Descriptive Statistics: Distribution of Varia- 


Name Values N (%) 
Economic aspirations 0 25 (1.5) 
1 60 (3.7) 
2 115 (7.1) 
3 520 (322) 
4 894 (55.4) 
School aspirations 0 10 (6) 
1 433 (26.8) 
2 307 (19.0) 
3 864 (53.5) 
School expectations 0 26 (1.6) 
1 501 (31.0) 
2 373  Q3.D 
3 714 (44.2) 
College aspirations 0 750 — (46.5) 
1 864 (53.5) 
College expectations 0 901 (55.8) 
1 73 (442) 
Hi $Goals, No Coll. Exp. 0 1090 — (67.5) 
1 524 (32.5) 
$Goals—School Exp. 0 916 (56.8) 
1 343 (21.3) 
2 324 (20.1) 
3 20 (12) 
4 li (.7) 
School Asp. GT School Exp. 0 1335 (82.7) 
1 279 (17.3) 
School Asp. —School Exp. 0 1335 (82.7) 
1 216 (13.4) 
2 63 (3.9) 
Prevalence, nonserious 0 787 (48.8) 
Utilitarian delinquency 1 827 (51.2) 
Prevalence, serious 0 959 (59.4) 
Utilitarian delinguency 1 655 (40.6) 
Prevalence, nonserious 0 408 (25.3) 
Nonutilitarian delinquency 1 1206 (73.7) 
Prevalence, serious 0 1147 (701.0 
Nonutilitarian delinquency 1 467 = (28.9) 
Mean STD 
Frequency, nonserious 
Utilitarian delinquency 
(N= 827) 0 to 4.87 1.43 1.19 
Frequency, serious 
Utilitarian delinquency 
(N= 655) 0to 5.08 1.84 1.24 
Frequency, nonserious 
Nonutilitarian delinquency 
(N= 1206) 0105.15 2.02 1.24 
Frequency, serious 
Nonutilitarian delinquency 
(N= 467) 0 to 4.63 1.02 1.07 


* Missing values were assigned to the model categories. 


support for H, when delinquency is measured 
as frequency rather than prevalence. Strain as 
the dysjunction between economic goals and 
college means is significant for all four types 
of delinquency. The school aspirations/expec- 
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tations dysjunction is significant only for the 
nonserious forms of utilitarian and nonutilita- 
rian delinquency. 

The findings reported in Tables 2 and 3 
suggest that .past studies that relied on 
educational aspirations and expectations to 
evaluate strain theory, to the neglect of the 
dysjunction between financial goals and 
college expectations, underestimated the sig- 
nificance and strength of the association 
between strain and delinquency. 

The redundancy of strain. The redundancy 
thesis proposes, however, that the findings in 
Tables .2 and 3 may be irrelevant, if 
aspirations alone are adequate to predict 
delinquency without recourse to a measure of 
strain. The findings in Table 4 indicate 
support for this hypothesis when strain is 
operationalized as the educational aspiration/ 
expectation dysjunction. School aspirations 
alone are significant for all four types of 
. delinquency. Strain as the educational aspira- 
tions/expectations dysjunction is not signifi- 
cantly associated with nonserious utilitarian 
delinquency, and the partial chi-square values 
are in all cases lower than for school 
aspirations alone. These analyses represent a 
quasi replication of earlier studies by Hirschi 
(1969) and Liska (1971), and the results are 
consistent with the findings from those earlier 
studies. H, rather than H3 is supported, 
indicating that the social control variable is a 
better predictor of delinquency than is strain 
and that the use of a measure of strain is 
redundant when commitment alone suffices to 
predict delinquency. 

These results are reversed, however, when 
commitment is measured as economic aspira- 
tions and strain is measured as the dysjunction 
between economic goals and educational 
expectations. Partial chi-square values indi- 
cate that commitment to financial goals is not 
in itself significantly associated with any type 
of delinquency. But when educational expec- 
tations are combined with this goal commit- 
ment measure to represent strain, the partial 
chi-square values are significant for all types 
of delinquency. Support for H3 indicates that 
strain is a better predictor of delinquency than 
is commitment. The failure of Hs indicates 
nonsupport for the redundancy thesis when 
strain is reconceptualized to consider prone 
goals and educational means. !° 


10 To examine redundancy thesis, we used tests 
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SUMMARY AND DISCUSSION : 


These findings support our broad hypothesis 
that operationalization strategies affect the 
nature of findings concerning the predictive 
value of strain for delinquency. Tests of the 
pragmatic validity of different measures of 
strain, with the prevalence and frequency of 
self-reported delinquency as criterion vari- 
ables, indicated that the better and more 
consistent measures were those based on 4 
dysjunction between commitment to eco- 
nomic goals and expectations for educational 
attainment. This effect was not attributable to 
commitment alone. Strain measured in this 
way was a better predictor of delinquent 
involvement than was commitment to eco- 
nomic goals alone. As in the past study, some 
effect of the dysjunction between educational 
aspirations and expectations was identified, 
and educational aspirations by itself was a 
better predictor of delinquent involvement 
than was educational strain. 

Strain measured as the gap between 
economic goals and educational expectations 
was more effective in predicting the preva- 
lence of serious utilitarian than serious 
nonutilitarian delinquency. This is consistent 
with general expectations concerning the 
theory's particular value for different types of 
offending. In all cases, however, this measure 
of strain was also significant for nonutilitarian 
delinquency and was equally useful in 
predicting the frequency of serious offenses 
of both types. It was also a better predictor of 
both the prevalence and frequency of nonse- 
rious nonutilitarian than of nonserious utilitar- 
ian types. Thus the criticism concerning the 
limited generality of strain to predict types of 
delinquency other than those involving mate- 
rial payoff found little support in this 
analysis. 


of partial association. Zero-order correlations (not 
reported here) that address a slightly different 
question were also informative. The substantive 
implications for the redundancy thesis when we 
used different measures were the same as in Table 
4. But while educational aspirations were nega- 
tively related to delinquency prevalence, the 
association between economic aspirations . and 
delinquency was positive. This positive effect of 
economic commitment was intensified (rather than 
reversed as with educational commitment) . when 
educational expectations was also considered. 
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Implications for Theory 
In its long history in criminology, Merton’s 


theory has generated several works that 
attempted explicit revisions or elaborations 


(e.g., Agnew 1985; Cohen 1955; Cloward . 


and Ohlin 1960). Key concepts have also 
been altered over time in operationalization 
strategies that attempted to tailor the theory to 
the concerns of juveniles. These conceptual 
revisions have sometimes been implicit in the 
design of research. Whether implicit or 
explicit, they have been crucial nonetheless in 
their implications for reformulating the the- 
ory. 

On the basis of findings from past research 
that included various reconceptualizations of 
strain, researchers have advocated the out- 
tight abandonment of strain theory (Korn- 
hauser 1978), its truncation in integrative 
attempts (Elliott et al. 1985; Johnson 1979), 
or revisions that remove the theory from its 
social structural context (Agnew 1985). The 
findings from this study suggest, however, 
that slippage between the original theoretical 
statement of strain theory and its expression 
in research attenuates the criticisms that 
derive from empirical findings. Indeed, on 
the basis of our findings, we suggest that the 
apparent failure of strain theory in recent 
empirical study might well be a function of 
inappropriate operationalization. Thus we 
propose that empirical findings to date are not 
sufficient to falsify the basic postulates -of 
Merton’s theory of strain and deviance. 

But neither should our findings concerning 
the relative efficiency of single measures of 
strain versus single measures of commitment 
be construed as a falsification of control 
theory. Rather, our findings inform ongoing 
debate about the relative value of either 
control or strain variables in the study of 
delinquency, and underscore the theory-laden 
nature of the commitment concept. We found 
that juveniles’ commitment to conventional 
educational goals is significant in its inhibi- 
tion of delinquency, as Hirschi (1969) and 
other control theorists have proposed. In 
contrast, strain is a better predictor of 
delinquency than financial goals alone when 
commitment is operationalized in a way that 
is consistent with a strain perspective. Both 
theories are therefore useful for the study of 
delinquency but pertain to theoretically dis- 
tinct referents for the measurement of commit- 
ment. 
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While clearly moving beyond our data and 
findings, we might speculate on the implica- 
tions of these findings for integrating strain 
with control theory. The fact that commit- 
ment can be measured as either economic or 
educational goals, with different implications 
for delinquency, suggests that their combina- 
tion in a single model would maximize the 
variance explained in a prediction model. It 
has been argued elsewhere, however, that a 
logical integration of theories must accommo- 
date more than the combination of single 
variables in research models, and that theory 
integration ought to accomplish more than 
simple predictive efficiency (Farnworth forth- 
coming). Others have observed that strain and 
control theories differ in their beginning 
assumptions about the nature of man and of 
society (Hirschi 1969; Kornhauser 1978). To 
this argument against integration, we add that 
each theory directs researchers to quite 
different operationalization strategies to mea- 
sure commitment. 


Directions for Research 


The analysis results provide rather compelling 
evidence for the cogency of our initial 
premise: operationalization procedures in the 
analysis of strain models are nontrivial in 
their implications for theory and for the 
falsification of theory in empirical testing. 
Along these lines, further study might explore 
the promise of the strain variable we 
introduced, compared with measures reflect- 
ing the diversity of cultural goals in our 
society. Future study might also examine the 
implications of other modes of adaptation 
(e.g., ritualism or retreatism) for different 
types of delinquency and crime. 

Longitudinal study could extend our cross- 
sectional analyses to examine strain effects on 
delinquency as a dynamic process. At present, 
little is known about the stability of strain 
perceptions over time, or the possibility that 
individuals’ modes of adaptation shift as a 
function of maturation and social develop- 
ment. 

Finally, our earlier discussion proposed 
that strain theory makes a unique contribution 
to criminology by hypothesizing both struc- 
tural and psychological influences on crime. 
Our analyses with a demographically undiffe- 
rentiated sample suggest a promising alterna- 
tive for measuring psychological strain. We 
did not, however, address the structural 
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origins of that strain in a society that 
differentially “rewards” individuals on the 
basis of social background, race, or sex. A 
fuller test of strain theory should accommo- 
date social class differences. In addition, a 
determination of the strain effects that stem 
from sex or racial inequality would serve to 
update this classical theory first promulgated 
in 1938, when structurally induced strain was 
assumed to be a white male phenomenon. 

In conclusion, we acknowledge that our 
analyses have not resolved or even begun to 
address all of the theoretical and methodolog- 
ical issues surrounding strain theory. To the 
extent that those issues lend themselves to 
empirical expression and resolution, how- 
ever, we maintain that Merton’s theory of 
strain and deviance remains now, as in the 
past, a viable and promising theory of 
delinquency and crime. 
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This research examines the campaign contributions of 629 members of 100 wealthy 
capitalist families to the presidential campaign of 1972, the last campaign 
conducted without any limits on individual contributions. The analysis reveals that 
roughly half of the members of these families contributed little or nothing to this 
presidential campaign, but other family members were often major contributors. 
Family members who were most visible as corporate directors or foundation 
trustees or who were listed in Who’s Who in America contributed the most to this 
campaign. In addition, wealthy entrepreneurs contributed more to the campaign 
than members of subsequent generations of wealthy capitalist families. Members of 
wealthy Jewish and Southern families contributed slightly more to the Republican 
party than the Democratic party, but they contributed much more to the 
Democratic party than did the members of other wealthy capitalist families. In 
addition, the members of families that were major stockholders in oil companies or 
companies with large government contacts contributed more to the campaign than 
did the members of other wealthy capitalist families. Conversely, the members of 
families that were major stockholders in media companies contributed relatively 


little to this campaign. 


STATEMENT OF THE PROBLEM 


One of the most important and enduring 
issues in sociology involves the distribution 
of power in society. Indeed, this issue 
literally defines the field of political sociol- 
ogy. The theoretical controversies surround- 
ing this issue can be distilled into a few major 
paradigms (Alford and Friedland 1985). 
These formulations typically assume that an 
elite exercises inordinate power within Amer- 
: ican society, even though considerable contro- 
versy remains over the size and composition 
of this “elite” as well as the extent and nature 
of its “power.” On the one hand, elite 
theorists (Mills 1956) maintain that this elite 
comprises institutional leaders such as the 
chief executive officers. of major corpora- 
tions, senior military officers, and influential 
politicians. Marxist theorists (Miliband 1969), 
on the other hand, maintain that this elite is 





* A preliminary version of this paper was 
presented at the Annual Meeting of the Pacific 
Sociological Association, Las Vegas, Nevada, 
April 1988. The authors are indebted to Herbert E. 
Alexander for his assistance and to G. William 
Domhoff, John Campbell, Maurice Zeitlin, Dan 
Clawson, Val Burris, Terrence Cook, and J. Scott 
Long for their comments and suggestions. 


American Sociological Review, 1989, Vol. 54 (April:275-287) 


composed largely of the members of the 
capitalist class and their representatives. 
Recent research on this issue has focused not 
only on the composition of this elite, but also 
on the mechanisms that the members of this 
elite use to maintain their power. For 
example, a series of empirical studies by 
Domhoff (1967, 1970, 1983) have provided 
substantial evidence for the proposition that a . 
relatively small "power elite," composed 
largely of various segments of the capitalist 
class, effectively dominates the American 
political system. Among other things, he 
argues that wealthy individuals exercise 
inordinate power as a result of their ability to 
influence the selection of candidates for 
political office at the nomination stage 
through their campaign contributions (1983, 
pp. 116-29). 

There have been, of course, numerous 
studies which have demonstrated that wealthy 
individuals have long been among the major 
contributors to presidential campaigns (Lund- 
berg 1937; Overacker 1941; 1945; Heard 
1960; Alexander 1971, 1976). Much of this 
research, however, has been descriptive and 
atheoretical. Only a few researchers have 
attempted to use campaign contribution data 
to examine specific theoretical propositions 
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concerning the political activities of different 
segments of the capitalist class (Domhoff 
1972; Nichols 1974). Moreover, almost all of 
these studies have proceeded without an 
adequate sample of the members of the 
capitalist class. Indeed, most researchers have 
inadvertently sampled on the dependent 
variable of interest by studying only large 
contributors to political campaigns. Conse- 
quently, they have not been able to determine 
even what proportion of the members of 
wealthy capitalist families ever contribute to 
political campaigns. As a result, very little is 
known about the campaign contributions of 
most members of the wealthy families that 
compose the core of the capitalist class. This 
study attempts to redress this problem by 
examining the contributions of 629 members 
of 100 wealthy capitalist families to the 
presidential campaign of 1972. It focuses 
exclusively on campaign contributions to 
presidential candidates because the presi- 
dency is the single most powerful position in 
American government. Furthermore, it exam- 
ines the presidential election of 1972 because 
that was the first year in which there was 
virtually complete disclosure of major cam- 
paign contributions and the last year in which 
there were no limitations on the magnitude of 
individual contributions to presidential cam- 
paigns. 


REVIEW OF THE LITERATURE 


Over the past several decades, there has been 
a series of studies of campaign finance, 
especially of contributions to presidential 
campaigns. These studies have devoted spe- 
cial attention to the contributions of wealthy 
individuals because they have typically been 
among the largest campaign contributors. 
Lundberg (1937), for example, revealed that 
wealthy entrepreneurs and their descendants 
were among the largest contributors to 
virtually every presidential campaign from 
1896 to 1936. Later, a series of investigations 
conducted between 1940 and 1956 revealed 
that many of the largest contributors to 
presidential campaigns were members of 
wealthy capitalist families (Overacker 1941, 
1945; U.S. Senate 1957). On the basis of 
these investigations, Heard concluded that, 
“as a class, America’s very rich are regular 
campaign donors" (1960, p. 136). He also 
examined the extent to which the members of 
wealthy capitalist families coordinated their 
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campaign contributions. Although a few 
families appeared to coordinate their cam- 
paign contributions, most of them did not 
evidence any clear pattern of coordination. 
Overall, Heard concluded that "the dates, 
amounts, and destinations of political gifts by 
members of the same family often suggest a 
lack of family cohesion" (1960, p. 138). Ina 
series of subsequent studies, Alexander (1971, 
1976) confirmed that members of these same 
wealthy capitalist families remained among 
the major contributors to presidential cam- 
paigns between 1960 and 1972. 

The research on the presidential campaign 
contributions of wealthy capitalists was readily 
incorporated into the work of elite theorists. 
For example, the importance of contributions 
to political campaigns by the members of 
wealthy capitalist families, as a mechanism of 
elite domination, was first discussed by Mills 
in his seminal analysis of the “power elite" 
(1956). He observed that wealthy entrepre- 
neurs and their families had long been major 
contributors to presidential campaigns. At the 
same time, he noted that the full extent of 
campaign contributions by these families was 
largely unknown because many contributions 
were made by family members with different 
names. As the result of these campaign 
contributions and other, less direct, forms of 
political influence, Mills concluded that 
"money allows the economic power of its 
possessor to be translated directly into 
political party causes" (1956, p. 166). This 
same argument was later elaborated upon by 
Domhoff in his study of “governing class" 
(1967). He refined the elitist argument by 
noting that presidential candidates often 
depend upon large campaign contributions 
from wealthy individuals in order to gain the 
nomination of their party. Specifically, Dom- 
hoff concluded, “Unless a person has large 
financial reserves or the backing of wealthy 
men, he cannot hope to develop a national 
following or compete in party primaries" (p. 
85). In short, wealthy individuals are able to 
exercise an inordinate influence on the 
electoral process by contributing large amounts 
of money in the early stages of a campaign. 

Wealthy individuals contribute to political 
campaigns for a variety of reasons. To begin 
with, large contributors often expect to gain 
"access" to a candidate in return for their 
campaign contributions. As Thayer put it, 
“More than anything, regardless of whether 
or not a contributor has anything specific in 
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mind, serious money buys access” (1973, p. 
136). Similarly, Adamany suggests that large 
contributors gain more direct access to 
candidates than do small contributors because 
they are typically solicited for funds by 
individuals who are close advisors to the 
candidate (1969). As a result, “the large 
contributor has noteworthy advantages in 
exercising his influence within the political 
system” (Adamany 1969, p. 212). At the 
same time, the members of wealthy capitalist 
families are often solicited for large contribu- 
tions by candidates and their fund raisers. 
According to Heard, many wealthy individu- 
als almost inadvertently become major cam- 
paign contributors simply because “they are 
more likely to be asked and are more able to 
give than other citizens” (1960, p. 136). For 
example, in his analysis of the large contrib- 
utors in one major metropolitan area, Nichols 
discovered that most of these contributors 
were directors of corporations (1974, pp. 
71—18). The members of wealthy capitalist 
families who serve as directors of family 
corporations or trustees of family foundations 
may contribute to political campaigns because 
they have deliberately chosen to become 
actively involved in civic, economic, and 
political affairs. Alternatively, these individu- 
als may contribute to political campaigns 
simply because they receive more solicita- 
tions for funds. 

In this same vein, some researchers (Dom- 
hoff 1972; Thayer 1973) have suggested that 
the members of different generations within 
wealthy capitalist families evidence different 
patterns of campaign contributions. There is 
some evidence, for example, that wealthy 
entrepreneurs, who accumulated great wealth, 
are more likely to be major campaign 
contributors than the children and grandchil- 
dren of wealthy entrepreneurs, who simply 
inherited great wealth. For example, Thayer 
concluded that most large contributions do 
not come from the members of established 
wealthy families (1974, pp. 134—35). Instead, 
he observed that "the more typical big 
contributor is usually self-made and absolute 


' or near-absolute ruler of his own business 


empire" (p. 135). This pattern may be 
attributable to the fact that much of the wealth 
of the descendants of wealthy entrepreneurs is 
held in trusts in order to avoid progressive gift 
and estate taxes (Allen 1987). Consequently, 
the children and grandchildren of wealthy 
entrepreneurs do not always have complete 
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control over their share of the family fortune. 
Conversely, Domhoff discovered that some of 
the largest contributors to the Democratic 
party in recent years have been scions of 
established wealthy capitalist families (1972, 
pp. 63-66). One implication of this pattern is 
that individuals who have inherited great 
wealth are likely to be less politically 
conservative than those who have amassed 
wealth on their own. Although some of the 
distant descendants of wealthy entrepreneurs 
may contribute to the Democratic party, most 
of the members of wealthy capitalist families 
are likely to contribute to the Republican 


A few of the more recent studies of 
campaign contributions by wealthy individu- 
als have sought to identify the political 
affiliations of particular segments of the 
power elite or the capitalist class. In an early 
study, Domhoff (1972) examined the involve- 
ment of wealthy individuals as major cam- 
paign contributors within the Democratic 
party. Even though the overwhelming major- 
ity of the wealthy have traditionally allied 
themselves with the Republican party, a 
significant minority have long identified with 
the Democratic party. In particular, Domhoff 
confirmed that wealthy Jews and wealthy 
Southerners have been among the major 
contributors to the Democratic party in recent 
years. This pattern may persist because the 
Republican party has remained largely under 
the control of wealthy individuals who are 
Northern Protestants (Domhoff 1972, pp. 
489-55). This interpretation conforms to the 
“investment theory” of electoral politics 
developed by Ferguson and Rogers (1986) in 
which political parties and their candidates 
respond to the interests of major campaign 
contributors. As Ferguson and Rogers put it, 
“Such investors generally have good and 
clear reasons for investing to control the state, 
and the resources necessary to sustain the 
costs of such an effort” (1986, pp. 45-46). 
Although Ferguson and Rogers consider only 
economic interests as a basis for campaign 
contributions (Domhoff 1988), it can be 
argued that many Jews and Southerners 
“invested” in the Democratic party as a 
means of gaining political influence, primar- 
ily because of their somewhat marginal status 
within the national elite structure. 

Elite researchers have long recognized that 
neither the power elite nor the capitalist class 
is monolithic. Baltzell (1964) first suggested 
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the importance of religion and ethnicity as a 
source of differentiation in the upper class. 
He concluded, nevertheless, that wealthy 
Jews are eventually compelled to put their 
class interests ahead of their ethnic identity, 
even though they are not fully integrated into 
the upper class (pp. 62-63). In a more recent 
study of Jews in the national elite, Zweigen- 
haft and Domhoff conclüded that "class 
identifications have become of primary impor- 
tance" (1982, p. 110). These observations 
may appear to be inconsistent with the 
assertion that wealthy Jews are among the 
largest contributors to the Democratic party. 
However, this pattern is not as paradoxical as 
it might seem at first. To begin with, Jews 
have voted overwhelmingly for Democratic 
candidates in virtually every election since 
1928 (Isaacs 1974, pp. 151-53). The alle- 
giance of Jews to.the Democratic party stems 
from their belief that it is more concerned 
with minority group rights than is the 
Republican party. Wealthy Jews have also 
been able to enhance their social status and 
political power by becoming major contribu- 
tors to the Democratic party. For its part, the 
Republican party has not been especially 
solicitous of wealthy Jews because it has been 
able to rely upon wealthy Protestants for large 
contributions (Isaacs 1974, pp. 125-26). In 
short, it appears that many wealthy Jews have 
“invested” in the Democratic party in order to 
maximize their national political influence. 
Another source of differentiation in the 
upper class is regionalism. Domhoff (1972, 
pp. 48-55) discovered that members of 
wealthy capitalist families in the South and 
the Southwest have been among the largest 
contributors to the Democratic party. For a 
variety of historical and economic reasons, 
most Southerners and, to a lesser extent, most 
Southwesterners have traditionally identified 
with the Democratic party. Despite recent 
victories by Republican presidential candi- 
dates, the South has remained a stronghold of 
- the Democratic party since the Reconstruction 
era. Although many Southerners are conser- 
vatives, the association between political 
ideology and party identification remains 
weak because of Democratic party strength at 
the state and local levels in the South (Black 
and Black 1987, p. 251). The party has also 
been dominant in the Southwest. As late as 
1972, for example, almost two-thirds of the 
members of the House and Senate from the 15 
Southern and Southwestern states were Dem- 
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ocrats (Sale 1975, p. 124). Moreover, these 
“Southern Rim” states rely heavily upon 
federal agricultural subsidies and defense 
contracts (Sale 1975, pp. 59-67). It seems 
that wealthy Southerners and Southwestern- 
ers, like wealthy Jews, have sought to 
enhance their social status and national 
political influence by contributing to the 
Democratic party. 

A final source of differentiation within the 
capitalist class stems from the specific 
economic interests of particular families. 
Some wealthy capitalist families are major 
stockholders in corporations that are subject 
to extensive government regulation or are 
dependent upon government contracts. Drug 
companies, which produce prescription drugs, 
and media companies, which own radio and 
television stations, are subject to federal 
licensing decisions. Similarly, chemical com- 
panies are subject to federal pollution control 
regulations. Along these same lines, oil 
companies are affected by government policy 
concerning oil import quotas and the regula- 
tion of natural gas prices. Finally, major 
defense contractors are directly dependent 
upon government for a large portion of their 
revenues and profits. There is some evidence 
for such a "rational choice" theory of 
campaign contributions. In a study of major 
contributors to the 1972 presidential cam- 
paign, Pittman (1977) found that the contribu- 
tions by officers and directors of large 
corporations were directly related to the 
dependence of these firms on federal con- 
tracts and their susceptibility to federal 
regulation. As a general rule, those families 
that are major stockholders of corporations 
that rely on government contracts or are 
subject to government regulation are probably 
more likely to contribute to presidential 
campaigns than are other families. In particu- 
lar, those families associated with corpora- 
tions that are subject to government regula- 
tion may be more likely to contribute to the 
Republican party because it has traditionally 
opposed government regulation of business. 

These empirical generalizations and theoret- 
ical formulations provide the basis for a series 
of predictions concerning the campaign con- 
tributions of the members of wealthy capital- 
ist families. To begin with, the capitalist class 
is extremely small in comparison to the other 
classes in American society and its economic 
interests are often in opposition to the 
economic interests of other major classes. 
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Consequently, the members of this class 
compensate for their numerical inferiority by 
becoming a major source of campaign finance 
to those political parties and candidates that 
promise, either explicitly or implicitly, to 
protect their economic interests. They have 
contributed primarily to the Republican party 
because it has traditionally championed the 
interests of the business community. More- 
over, those family members who are active in 
economic and civic affairs as corporate 
directors and foundation trustees are more 
likely to contribute to political campaigns 
than are those family members who have 
eschewed any civic and economic involve- 
ment. At the same time, the descendants of 
wealthy entrepreneurs, who are generally less 
involved in the affairs of family corporations, 
are also less likely to contribute to political 
campaigns of Republican candidates than are 
those entrepreneurs who founded family 
corporations. Those segments of the capitalist 
class that have often been excluded from the 
national elite and the upper class, such as 
Jews and Southerners, are more likely to 
contribute to the Democratic party than the 
Republican party. Finally, those families that 
are major stockholders in corporations that 
are recipients of government contracts or are 
subject to government regulation are more 
likely to contribute to presidential campaigns, 
and especially to Republican candidates, than 
are other wealthy capitalist families. 


SAMPLE AND MEASURES 


The primary limitation of previous studies of 
the presidential campaign contributions of 
wealthy capitalist families has been their 
inadequate sampling procedures. Almost ev- 
ery empirical study has, in effect, sampled on 
the dependent variable of interest. These 
samples typically comprised those wealthy 
individuals who had contributed to a presiden- 
tial campaign. They inadvertently excluded 
those individuals of comparable wealth who 
had not contributed to a presidential cam- 
paign. Thus, it was impossible to determine 
the campaign contributions of members of 
wealthy capitalist families as a whole. 
Perhaps the most notable exception was a 
study conducted by Alexander that examined 
the presidential campaign. contributions in 
1972 of a sample of wealthy Americans, 
using a list of 51 centimillionaires compiled 
by Fortune (Loomis 1968). However, such 
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lists are problematic as samples. because they 
are often inaccurate. Specifically, they typi- 
cally include only the most prominent mem- 
bers of the wealthiest families and ignore 
other members of these families who are not 
known to the public at large. - Similar 
methodological limitations have afflicted those 
few studies that have focused on the presiden- 
tial campaign contributions of particular 
wealthy families (Heard 1960; Alexander 
1971, 1976). As Heard put it, “In many such 
analyses ‘persons have been included who 
were only remotely related to one another, if 
at all, and close relatives have been excluded 
because their surnames were different" (1960, 
p. 137). 

In order to examine the presidential cam- 
paign contributions of the members of the 
capitalist class systematically, it is necessary 
to construct a relatively large sample that is 
representative of wealthy capitalist families in 
general. The first step is .to identify the 
wealthiest capitalist families at a particular 
time. This study examines contributions to 
the presidential campaign of 1972 because 
that was the first year in which there was 
virtually complete disclosure of all major 
contributions to the presidential campaign. 
More important, 1972 was also the last year 
in which there were no limitations on the 
amount an individual could contribute to a 
candidate or party at the federal level. The 
sample employed in this study consists of 
those wealthy families that were major 
stockholders in large corporations (Zeitlin 
1974). In almost every case, one or more 
family members also served as officers or 
directors of these corporations. Specifically, 
this sample includes 100 wealthy capitalist 
families that were each worth at least $100 
million in 1972. These families were identi- 
fied from earlier studies of family control in 
large corporations (Lundberg 1968; Burch 
1972; Allen 1987). Although this sample is 
not exhaustive,’ it includes the bulk of the 


wealthiest families in America in 1972. For ` 


the purposes of this analysis, a family is 
defined as the descent group consisting of a 
wealthy entrepreneur and his or her lineal 
descendants and heirs. A list of these 100 
wealthy capitalist families, including the 
number of family members in the sample and 
their total contributions to the 1972 presiden- 
tial campaign, is presented in the Appendix. 

The main obstacle confronted by this 
research was the lack of readily accessible 
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. information on the members of wealthy 
^ capitalist families. In order to maintain their 
privacy, many wealthy individuals are pur- 
posely reclusive. Thus, only a fraction of 
these family members are listed in Who’s 
Who in America (Priest 1982). Nevertheless, 
it was possible to identify, using a range of 
genealogical sources, the most senior mem- 
bers of these families. Information from 
published sources, such as biographies, com- 
pany histories, obituaries, and biographical 
directories, was often supplemented with 
information from probate and other court 
records. All of the adult members of the most 
senior generations of each family in 1972 are 
included in the sample. In all, this sample 
includes 629 wealthy family members, most 
of whom have never been included in any 
published list of the wealthiest individuals in 
America. In many cases, the only personal 
information available on these individuals 
was their names. In fact, it was not always 
possible to confirm that the married name of a 
female family member was current for 1972. 
This research does not distinguish between an 
individual and his or her spouse because 
campaign contributions are often given in the 
names of both individuals. Moreover, it is not 
uncommon for the spouse of a wealthy 
individual to serve as a corporate director or 
foundation trustee (Tickamyer 1981). Indeed, 
in a small number of cases, the sample 
includes the surviving spouses of deceased 
family members because they have inherited a 
share of the family fortune. 

The dependent variable in this research is the 
presidential campaign contributions of each 
member of these wealthy capitalist families. The 
contributions of these individuals and their 
spouses are aggregated even if these contribu- 
tions were given under their separate names. 
Information on campaign contributions to the 
1972 presidential campaign was obtained from 
several sources. The primary source of data was 
an extensive report prepared by the Office of 
Federal Elections (U.S. General Accounting Of- 
fice 1972), which lists all of the contributions, 
loans, and other monetary transfers to presiden- 
tial and vice-presidential committees and can- 
didates in excess of $100 between April and 
December of 1972. Data on other large contri- 
butions prior to April of 1972 were obtained 
from various campaign finance disclosure state- 
ments compiled by the Citizens' Research Foun- 
dation (1972). Finally, data on large secret do- 
nations to the campaign of President Nixon were 
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compiled from a list obtained as the result of a 
lawsuit filed by Common Cause. In addition, 
information was compiled on the visibility of 
each family member who served as a corporate 
director (Standard and Poors 1973) or founda- 
tion trustee (Foundation Center 1971) or who 
was listed in Who's Who in America (Marquis 
Who's Who 1972). Certain families were iden- 
tified as Jewish on the basis of the religious 
affiliation of the family founder (Krefetz 1982). 
Other families were identified as Southern or 
Southwestern on the basis of the primary resi- 
dence of the family founder (Sale 1975). 


RESULTS 


The first issue to be resolved is the extent to 
which members of wealthy capitalist families 
contribute to presidential campaigns. Because 
of limitations in the campaign contribution 
data, it is not possible to distinguish between 
noncontributors and those who contributed 
less than $100 to any campaign. Neverthe- 
less, the frequency distribution presented in 
Table 1 reveals that roughly half of 629 
individuals from these 100 wealthy families 
contributed less than $100 to any of the 
presidential candidates in 1972. Another 6 
percent contributed at least $100 but less than 
$1,000 to this campaign. At the same time, 
however, the remaining members of these 
families were often major contributors. For 
example, about one-quarter of these family 
members gave more than $1,000 but less than 
$10,000 to the 1972 presidential campaign. 
Another tenth gave over $10,000 but less than 
$50,000. At the extreme, 6 percent of these 
wealthy individuals gave more than $50,000 
to the campaign. Altogether, the 304 mem- 


Table 1. Distribution of Contributions to 1972 Presiden- 
tial Campaign by Members of 100 Wealthy 


Capitalist Families 

Size of Contribution Number Percent 
None or less than $100 325 51.7 
$100 to $499 19 3.0 
$500 to $999 2 3.5 
$1,000 to $2,499 81 12.9 
$2,500 to $4,999 34 5.4 
$5,000 to $9,999 47 7.5 
$10,000 to $24,999 38 6.0 
$25,000 to $49,999 23 3.7 
$50,000 to $99,999 23 3.7 
Over $100,000 17 2.7 

Total sample 629 
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bers of these 100 families who contributed to 
the presidential campaign of 1972 raised a 
total of $9,072,828. Obviously, the distribu- 
tion of campaign contributions is highly 
skewed. Although the median campaign 
contribution for those who did contribute to 
this campaign was $4,000, the mean contri- 
bution for these same individuals was $29,845. 
In summary, despite the fact that most 
members of these families did not contribute 
anything to this campaign, other family 
members were a major source of funds for the 
1972 presidential campaign. 

Given this disparity in campaign contribu- 
tions, the next question concerns the effects 
of visibility on the probability of being a 
contributor. Of the 629 individuals in this 
sample, 30 percent were directors of major 
corporations, 51 percent were trustees of 
philanthropic foundations, and 27 percent 
were listed in Who's Who. Only 37 percent of 
the individuals in this sample were not visible 
inasmuch as they were not listed in Who's 
Who and did not serve as either corporate 
directors or foundation trustees. As shown on 
Table 3, almost three-quarters of the 171 
family members who were listed in Who's 
Who contributed to the 1972 presidential 
campaign, whereas only a one-quarter of the 
234 family members who were not visible 
contributed to this campaign. Of course, there 
is considerable overlap among these various 
types of visibility. For example, 82 percent of 
those family members who were corporate 
directors were also listed in Who's Who. 
Conversely, only 40 percent of those who 
were foundation trustees were so listed. 

In order to disentangle the effects of these 
variables, a series of logit analyses was 
conducted on the effects of different types of 
visibility on the odds of being a presidential 
campaign contributor. The preferred model, 
in terms of both parsimony and goodness of 
fit, contained interactions between being 
listed in Who's Who and being either a 
corporate director or a foundation trustee. As 
shown in Table 2, the odds ratios derived 
from the preferred model reveal that being a 
director, as opposed to not being a director, 
increased the odds of being a contributor to 
the 1972 presidential campaign by a factor of 
4.4 for those individuals not listed in Who's 
Who (Long 1984). Conversely, being a 
director, as opposed to not being a director, 
increased the odds of being a contributor by a 
factor of only 1.2 for those individuals listed 
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Table 2. Logit Analyses of the Effects of Type of 
Visibility on the Odds of Contributing to 1972 


Presidential Campaign 
Main Interaction 
Effects Effects 
Type of Visibility Model Model 
Corporate director 2.74 
Not Who's Who 4.40 
Who's Who 1.20 
Foundation trustee 2.27 
Not Who’s Who 2.52 
Who's Who 1.87 
Who's Who in America 2.35 
Not director or trustee 4.94 
Not director but trustee 3.67 
Not trustee but director 1.34 
Director and trustee 1.00 
Likelihood ratio 9.52 0.05 
Probability 0.05 0.98 


Note: These effects are odds ratios that represent the 
estimated odds of contributing for those within each 
category relative to the estimated odds of contributing for 
those not within that category. 


in Who’s Who. In general, the results of these 
logit analyses reveal that each of these forms 
of visibility has an independent and statisti- 
cally significant effect on the odds of 
contributing to this campaign. However, 
visibility in one capacity tends to diminish the 
effects of additional visibility in other capac- 
ities. For example, being listed in Who’s 
Who, as opposed to not being so listed, had 
no effect on the odds of being a contributor 
for those individuals who were both founda- 
tion trustees and corporate directors. 

The use of logit analysis to assess the 
effects of various types of visibility on the 
odds of being a campaign contributor ignores 
the differences in the magnitudes of these 
contributions. However, an examination of 
the actual campaign contributions is inevita- 
bly distorted by the existence of a few major 
contributors. For example, there were 17 
major contributors who gave over $100,000 
to this campaign. In order to correct for the 
effects of such major contributors, the 
campaign contributions of each individual 
were subjected to a square-root transforma- 
tion. Thus, a contribution of $1,000,000 is 
reduced to $1,000, a contribution of $10,000 
is reduced to $100, and a contribution of $100 
is reduced to $10. This transformation 
significantly reduces the skewness of the 
original distribution. As shown in Table 3, 
the results of an analysis of variance, using 
these transformed values, parallel those ob- 
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Table 3. Square Root of Contributions to 1972 Presiden- 


tial Campaign by Type of Visibility 
ion Mean Total . 

Type of Visibility Contributors Contribution 
Who's Who in America 

(N: 171) 0.737 54.87** 
Corporate director (N: 190) 0.737 53.92** 
Foundation trustee (N: 320) 0.625 45.07** 
Not visible (N: 234) 0.247 13.73** 

R squared 0.135 

Probability 0.001 


Note: The mean total contribution for each type of 
visibility is the expected value controlling for the effects 
of other types of visibility. 


tained from the logit analysis. Those mem- 
bers of wealthy capitalist families who were 
in any way visible contributed significantly 
more to the 1972 presidential campaign than 
did those family members who were not 
visible. Each of these three forms of visibility 
has an independent and statistically signifi- 
cant effect on the square root of campaign 
contributions. Overall, 13 percent of the 
variance in the square root of campaign 
contributions is explained by the four types of 
visibility among family members. 

The next issue raised by this research is the 
extent to which the members of different 
generations of wealthy capitalist families 
differ in their total campaign contributions, as 
well as their contributions to the Democratic 
party. Because entrepreneurs and their chil- 
dren are more likely to serve as corporate 
directors and foundation trustees than the 
grandchildren and great-grandchildren of en- 
trepreneurs, it is necessary to control for the 
effects of visibility. The results of an analysis 
of covariance, which gives the mean contri- 
butions for each generation controlling for the 
effects of a dichotomous visibility variable, 
are presented in Table 4. To begin with, these 
results indicate that wealthy entrepreneurs 
contributed much more to the presidential 
campaign, on average, than did the descen- 
dants of such entrepreneurs. Indeed, the 
members of each successive generation within 
these wealthy capitalist families contributed 
less, on average, than the members of the 
previous generation. The pattern of contribu- 
tions to the Democratic party among the 
members of the different generations of these 
wealthy capitalist families is similar but not 
statistically significant. 

Another research issue concerns the extent 
to which members of wealthy capitalist 
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Table 4. Square Root of Total Contributions and 
Contributions to Democratic Party in 1972 
Presidential Campaign by Generation Control- 


ling for Visibility 
Mean Mean 
Generation within Family Total Democratic 
1— Founders (N: 41) 103.54** 20.28 
[118.01] [16.96] 
2— Children (N: 194) 63.53** 12.85 
; [69.01] [14.19] 
3—Grandchildren (N: 303) 43.38** 9.58 
[40.48] [8.86] 
4—Great-grandchildren 32.69** 6.27 
(N: 31) [13.19] [1.48] 
R squared [C.V.] 0.075 0.012 
R squared [C.GV] 0.097 0.014 
Probability [(C.G(V)] 0.010 . n.s. 


Note: The mean contributions for each generation are 
expected values controlling for the effects of visibility. 
The unadjusted means are presented in brackets. 


families that are either Jewish or Southern 
contribute to the Democratic party as opposed 
to the Republican party. As shown in Table 5, 
the results of an analysis of variance indicate 
that the 75 individuals from Jewish families 
contributed somewhat more to the Republican 
party than the Democratic party during the 
1972 presidential campaign. Similarly, the 66 
individuals from Southern families contrib- 
uted only slightly more to the Republican 
party than the Democratic party. However, 
the disparity in contributions to the two major 
parties among the members of families that 
are either Jewish or Southern is not nearly as 
great as the disparity among members of other 
wealthy capitalist families. Indeed, the differ- 
ences in the mean contributions to the two 
parties for Jews and Southerners are not 
statistically significant. Conversely, members 
of wealthy capitalist families with Protestant 
or Northern origins contributed much more to 
the Republican party than the Democratic 
party. In other words, those individuals from 
wealthy Jewish and Southern families contrib- 


Table 5. Square Root of Contributions to Political 
Parties in 1972 Presidential Campaign by 


Family Origin 

] Mean Mean Difference 
Origin of Family Democratic Republican of Means 
Jewish (N: 75) 30.95 35.46 4.51 
Southern (N: 66) — 23.81 25.12 1.31 
Other (N: 488) 6.36 47.98 41.62** 

Total sample 11.12 44.09 

R squared 0.020 0.007 

Probability 0.002 0.004 
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uted more to the Democratic party and less to 
the Republican party, ori average, than those 
individuals from other wealthy capitalist 
families. Overall, the ratio of contributions to 
the Republican party as opposed to the 
Democratic party, even after the square-root 
transformation, is on the order of four to one 
among the members of these wealthy capital- 
: ist families. 

The last issue addressed by this research is 
the extent to which families that are major 
stockholders in corporations subject to federal 
regulation or dependent upon government 
contracts contribute more to the presidential 
campaign and more to the Republican party 
than do other wealthy capitalist families. The 
results of the analysis of variance presented in 
Table 6 indicate that those members of 
wealthy families that were major stockholders 
in oil companies or companies with major 
government contracts contributed more to the 
presidential campaign in general and more to 
the Republican party than did the members of 
other wealthy capitalist families. Contrary to 
expectations, however, those individuals with 
large stockholdings in drug companies and 
chemical companies did not contribute more 
to either the presidential campaign or the 
Republican party than those individuals with 
large stockholdings in other types of corpora- 
tions. Indeed, the members of families that 
were major stockholders in media companies 
contributed much less to the presidential 
campaign in general and much less to the 
Republican party than the members of other 
wealthy capitalist families. In general, these 
differences between the contributions of 
families with large stockholdings in these 
different types of corporations are statistically 
significant. The six categorical industry 
variables explain 4 percent of the variance in 


Table 6. Square Root of Total Contributions and 
Contributions to Republican Party in 1972 
Presidential Campaign by Type of Corporation 
j Mean Mean 

Type of Corporation 


Total _ Republican 
Drug (N: 34) 39.97 38.68 
Media (N: 52) 12.52** 8.40** 
Chemical (N: 128) 39.48 38.57 
Oil (N: 68) 96.94** 90.13** 
Government contractor 
(N: 13) 105.39*»* 103.62** 
Other (N: 334) 54.81 40.62 
R 0.039 0.051 
Probability 0.001 0.001 
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the total contributions of these individuals and 
5 percent of the variance in their contributions 
to the Republican party. 


CONCLUSIONS 


The results of this analysis reveal a number of 
interesting patterns in the campaign contribu- 
tions of the members of wealthy capitalist 
families. Indeed, these results serve to clarify 
many of the theoretical issues raised by earlier 
researchers. Moreover, this analysis demon- 
strates the importance of constructing samples 
that are truly representative of the population 
of interest. Because of the limitations of their 
samples, earlier researchers were simply 
unable to investigate many of the most 
important theoretical problems conceming 
campaign finance. For example, one of the 
most significant findings of this research is 
that a majority of the members of wealthy 
capitalist families did not contribute at all to 
the presidential campaign in 1972. This 
apparent political apathy is somewhat paradox- 
ical in view of the fact most of these wealthy 
individuals were undoubtedly opposed to 
many of the policies advocated by the 
Democratic candidate, George McGovern. 
One possible explanation for this finding is 
that most of these individuals did not consider 
the Democratic challenger to be a serious 
threat to the incumbent Republican president, 
Richard . Nixon. Nevertheless, a substantial 
minority of the members of these wealthy 
capitalist families were major contributors to 
this presidential campaign. At the very least, 
this finding suggests that the members of . 
wealthy : capitalist families often fail to 
exercise the full extent of their power in terms 
of their ability to finance political campaigns. 
Clearly, an enormous unused financial capac- 
ity to contribute exists among these wealthy 
capitalist families and within the capitalist 
class as a whole (Domhoff 1972). 

Another significant finding of this research 
is that the odds of being a campaign 
contributor and the magnitude of campaign 
contributions among the members of wealthy 
capitalist families are largely determined by 
their visibility to the public at large. This 
finding suggests two alternative theoretical 
explanations. On the one hand, it is possible 
that many members of wealthy capitalist 
families do not voluntarily contribute to 
political campaigns but are simply solicited 
for contributions by candidates and their 


284 


campaign staffs. On the other hand, it is 
equally possible that certain family members 
choose to become involved in public affairs 
by serving as corporate directors and founda- 
tion trustees and by becoming campaign 
contributors. In fact, individuals who contrib- 
ute to political campaigns are often involved 
in other political activities as well (Brown, 
Hedges, and Powell 1980). Unfortunately, it 
is impossible to choose between these two 
alternative explanations on the basis of the 
available data. A time series analysis, which 
examined how the emergence of wealthy 
individuals as campaign contributors was 
affected by their emergence as foundation 
trustees or corporate directors, would be 
required to distinguish between these alterna- 
tive explanations. Furthermore, it is not 
necessary to distinguish between these two 
explanations inasmuch as they are entirely 
compatible with respect to the effect of 
campaign contributions on the candidate 
selection process. 

Last, this research demonstrates that there 
are important sources of differentiation be- 
tween and even within wealthy capitalist 
families in terms of their campaign contribu- 
tions. To begin with, there are significant 
differences in the patterns of campaign 
contributions among the members of the 
different generations of wealthy capitalist 
families. It is clear that the members of each 
successive generation contributed less to this 
campaign in general than did the members of 
the previous generation. This pattern may be 
related to the fact that each succeeding 
generation of family members is less involved 
in the activities of the family corporation 
(Allen 1987). This research also reveals that 
the members of wealthy Jewish and Southern 
families contributed more to the Democratic 
party and less to the Republican party, on 
average, than did the members of other 
wealthy capitalist families (Domhoff 1972). 
These findings are significant in view of the 
fact that many wealthy Jews did not contrib- 
ute to the Democratic party in 1972 because 
George McGovern proposed changes in 
foreign policy concerning Israel. Of course, 
most members of wealthy capitalist families 
contributed overwhelmingly to the Republi- 
can party. Finally, this research discovered 
that those families that are major stockholders 
in corporations subject to government regula- 
tion did not always contribute more to 
presidential campaigns in general or Republi- 
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can candidates than did other wealthy capital- 
ist families. In fact, families that control 
media companies with lucrative broadcasting 
licenses may actually avoid overt partisan 
political activity: 

In conclusion, these findings have a 
number of important implications for future 
theory and research on the political sociology 


‘of campaign finance. Despite the fact that 


most of the members of wealthy capitalist 
families are not major campaign contributors, 
the remaining members of these families 
serve aS an important source of large 
contributions for presidential candidates. Fur- 
ther research is needed to assess the impact of 
the individual campaign contribution limits 
imposed by recent amendments to the Federal 
Election Commission Act (Alexander 1987; 
Bedlington and Powell 1986). Given the 
limitations on contributions to individual 
candidates, future research might also exam- 
ine the contributions of wealthy individuals to 
political action committees as well as congres- 
sional candidates (Ferguson and Rogers 
1986). Moreover, recent research has estab- 
lished the existence of more or less cohesive 
groups of corporations whose political action 
committees are major contributors to congres- 
sional campaigns (Burris 1987; Neustadtl and 
Clawson 1988). One issue that deserves 
attention is the relationship between the 
contributions of wealthy capitalist families 
and the contributions of political action 
committees formed by major corporations. It 
is possible that corporate political action 
committees have largely supplanted the mem- 
bers of wealthy capitalist families as sources 
of large contributions for political campaigns. 
Last but not least, it is apparent that wealthy 
capitalist families are clearly differentiated 
along ethnic, regional, and even generational 
lines (Domhoff 1972). Further research is 
needed to examine these and other sources of 
differentiation within and between wealthy 
capitalist families. 
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Appendix. Sample of 100 Wealthy Capitalist Families 
with Number of Family Members and Total 
. Contributions to 1972 Presidential Campaign 


Number Total 

of Contri- 
Family and Corporation Members butions 
Abercrombie (Cameron Iron Works) 2 13,000 
Allen (Allen & Co.) 2 42,500 
Annenberg (Triangle Publications) 8 255,000 
Barron (Dow Jones) 2 2,500 
Bechtel (Bechtel Group) 3 40,000 
Beinecke (Sperry & Hutchinson) 7 13,000 
Blaustein (Amoco) 9 15,800 
Bloch (H. & R. Block) 2 1,000 
Block (Block Drugs) 2 17,000 
Bradley (Allen-Bradley) 4 6,000 
Busch (Anheuser Busch) 15 30,816 
Cabot (Cabot Corp.) 15 14,566 
Carlson (Carlson Co.) 3 3,265 
Carver (Bandag) 1 264,933 
Chandler (Times-Mirror) 23 5,500 
Clark (Avon Products) 4 39,000 
Coors (Adolph Coors) 4 12,500 
Cox (Cox Enterprises) 3 10,504 
Crown (General Dynamics) 3 52,069 
Cullen (Quintana Production) 3 11,250 
Danforth (Ralston Purina) 7 23,173 
Dayton (Dayton Hudson) 5 39,900 
Deere (Deere & Co.) 6 11,500 
Disney (Walt Disney Productions) 5 17,660 
Doheny (Unocal) 5 15,500 
Donnelley (R.R. Donnelley & Sons) 15 43,683 
Dorrance (Campbell Soup) 14 57,600 
Dow (Dow Chemical) 17 48,450 
Duke (Duke Power) 6 6,224 
du Pont, L. 

(E.I. du Pont de Nemours) 34 170,539 
du Pont, W. 

(E.I. du Pont de Nemours) 5 242,125 
Engelhard (Engelhard Minerals) 1 0 
Firestone (Firestone Tire & Rubber) 20 357,497 
Ford (Ford Motor) 5 159,376 
Galvin (Motorola) 2 46,500 
Gates (Gates Rubber) 5 900 
Getty (Getty Oil) 5 127,000 
Haas (Rohm & Haas) 2 200 
Haas (Levi Strauss) 9 114,950 
Hall (Hallmark) 5 5,400 
Heinz (H.J. Heinz) 7 103,743 
Hess (Amerada Hess) 1 490,000 
Heublein (Heublein) 4 1,000 


Appendix. Continued 


Family and Corporation 


Hewlett (Hewlett Packard) 
Houghton (Corning Glass) 
Hunt (Placid Oil) 
Ingram (Ingram Corp.) 
Johnson (S.C. Johnson & Son) 
Johnson (Johnson & Johnson) 
Kauffman (Marion Laboratories) 
Keck (Superior Oil) 
Kerkorian (MGM) 
Kettering (General Motors) 
Knight (Knight-Ridder) 
Kirby (Alleghany) 
Lilly (Eli Lilly) 
Long (Longs Drug Stores) 
MacMillan (Cargill) 
Manoogian (Masco) 
Marriott (Marriott) 
Mars (Mars Inc.) 
McDonnell (McDonnell Douglas) 
McGraw (McGraw-Hill) 
McKnight 

(Minnesota Mining & Manuf.) 
McNeil (Johnson & Johnson) 
Medill (Tribune Co.) 
Mellon (Gulf Oil, Alcoa, etc.) 
Mott (General Motors) 
Mudd (Cyprus Mines) 
Murchison (Murchison Bros.) 
Murphy (Murphy Oil) 
Newhouse (Advance Publications) 
Ochs (New York Times) 
Olin (Olin Corp.) 
Ordway 

(Minnesota Mining & Manuf.) 
Packard (Hewlett Packard) 
Perot (Electronic Data Systems) 
Petrie (Petrie Stores) 
Pew (Sun Co.) 
Phipps (Bessemer Securities) 
Pitcairn (PPG Industries) 
Pritzker (Hyatt, Marmon, etc.) 
Reynolds (R.J. Reynolds Industries) 
Robins (A.H. Robins) 
Rockefeller (Exxon, Chevron, etc.) 
Rosenwald (Sears Roebuck) 
Rousch (Roadway Express) 
Scripps (E.W. Scripps) 
Searle (G.D. Searle) 
Stern (Hartz Mountain Products) 
Stranahan (Champion Spark Plugs) 
Stuart (Carnation) 
Taper (First Charter Financial) 
Thompson (Southland) 
Tisch (Loews) 
Upjohn (Upjohn Co.) 
Watson (IBM) 
Whitehead (Technicon) 
Whitney 

(Whitney Communications) 
Woodruff (Coca-Cola) 
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THE RELIGIOUS ECOLOGY OF DEVIANCE* 


WILLIAM SIMS BAINBRIDGE 
Illinois State University 


Although several sociological theories imply that religion deters individual deviance, 
until recently few empirical studies have tested this proposition. This article does so 
by using a unified ecological dataset that includes measures of suicide, crime, homo- 
sexuality, and cultism. Substantial negative associations between rates of church 
membership and rates of crime and cultism survive statistical controls, while the 
negative associations with suicide and homosexuality do not. The results pose a 
severe challenge to simple theories of the role of religion in society. 


A central concern of social theory is to explain 
the role religion may play in deterring devi- 
ance. Both functionalism and Marxist theory 
assert that religion reinforces prevailing moral 
norms and fosters their docile acceptance 
(Merton 1968, p. 98). Tittle and Welch (1983) 
argued that a deviance-countering role is im- 
plied by functionalism, control theory, differ- 
ential association theory, and a collection of 
social-psychological perspectives such as cog- 
nitive dissonance theory. The role of religion in 
deterring deviance provides a crucial test of any 
theory that assumes human behavior is condi- 
tioned by societal values or concepts of per- 
sonal identity. The mix of functionalism and 
symbolic interactionism that used to be pre- 
sented to undergraduates as standard sociology 
also made such assumptions. If religion were 
shown to have no power to deter deviance, most 
theories that claim ideas have behavioral con- 
sequences would be in serious trouble. 
Although the founding fathers of social 
science paid great attention to religion, only 
recently have many quantitative studies of 
religion and deviance begun to appear, and 
scholars of deviance still generally ignore the 
topic. For example, only 4 of 28 criminology 
textbooks checked by Stack and Kanavy 
(1983) mentioned religion in their indexes. 
This essay explores four varieties of deviance 
that religion presumably deters: suicide, 
crime, homosexuality, and cultism. I use an 
ecological dataset that permits examining, on 


* Direct all correspondence to William Sims 
Bainbridge, Department of Sociology, Illinois 
State University, Normal, IL 61761. 

The dataset is available from the author, in the 
form of written coding sheets or IBM-PC disk with 
data file suitable for analysis with MacroCase: 
Aggregate Analysis Software published by Cogni- 
tive Development, Inc. 
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a unified empirical base, the statistical 
associations that link these forms of deviance 
with the rate of church membership and with 
other theoretically important variables. 

Some surveys have failed to find that 
religion affects crime among young people 
(Hirschi and Stark 1969; Burkett and White 
1974), but others have suggested that religion 
does deter delinquency and have begun to 
explore factors that might render religion’s 
influence variable (Rhodes and Reiss 1970; 
Higgins and Albrecht 1977; Albrecht, Chad- 
wick, and Alcorn 1977; Jensen and Erickson 
1979). Thus, there is good reason to wonder 
whether ecological data will support the 
following hypothesis: 


Hypothesis 1. Religion tends to support the 
norms and values of society, thus 
deterring deviance. ' 


If zero-order associations emerge from our 
data, it remains possible that they reflect the 
action of some other factor incidentally 
connected to religion. From the work of 
Durkheim (1897) to the present, one of the 
most challenging questions has been whether 
the influence of religion can be reduced 
simply to the action of social bonds. There- 
fore, I must consider a second general 
hypothesis: 


Hypothesis 2. Religion is merely an 
expression of society, and thus its 
apparent power to deter deviance will 
vanish when appropriate statistical con- 
trols for social integration are employed. 


RELIGION AND DEVIANT BEHAVIOR 


Morselli ([1879] 1882) showed a connection 
between religious variation and suicide rates 
in 19th-century Europe, calling “the influ- 
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ences of religion” among “the strongest 
motive powers which act on the will of man.” 
Masaryk ([1881] 1970) argued that increasing 
suicide rates were the result of weakening 
religious faith. Studies based on SMSAs in 
1971 and cities in 1906, 1916, and 1926 
found substantial negative correlations be- 
tween church membership and suicide, robust 
enough to survive the effects of several 
control variables (Stark, Doyle, and Rushing 
1983; Bainbridge and Stark 1981). 

However, previous ecological research, 
using data from 1926, found no association 
connecting homicide with church membership 
(Bainbridge and Stark 1981), and Stark 
(1984) argued we should not expect religion 
to influence a crime of extreme passion. As 
noted, survey studies disagree whether reli- 
gion has the power to deter juvenile delin- 
quency. Quantitative studies on religion and 
adult crime are rare, but at least one found 
negative correlations (Stark, Doyle, and Kent 
1980). 

Even before the recent antigay reactions to 
the AIDS epidemic, conservative religious 
groups occasionally launched campaigns 
against homosexuality (Fischli 1981). The 
ample evidence from surveys that religion 
deters premarital and promiscuous heterosex- 
uality should apply as well to homosexuality 
(Schulz, Bohrnstedt, Borgatta, and Evans 
1977; Kelley 1978; Wuthnow 1978). 

One might argue that there should be either 
a positive or a negative correlation between 
the church member rate and the rate for 
deviant cults. If cults are a mere fringe 
phenomenon of conventional religion, then 
they should be found exactly where the 
conventional denominations are. These areas 
will be less secularized than other places, and 
thus they will have relatively large pools of 
potential converts to religions of any kind, 
both normal and deviant. However, cults 
could represent religious innovation arising in 
response to the weakness of conventional 
religion, and thus they would show a negative 
association with church membership. In areas 
where normal denominations are organization- 
ally weak, they will be unable to suppress 
religious deviance, although I acknowledge 
that such suppression is primarily informal in 
modern societies (Shupe and Bromley 1980). 
Further, where the churches are weak, a 
substantial number of people are going 
without religious fellowship and thus will be 
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open to appeals from new churches (Stark and 
Bainbridge 1985, 1987). 

All four types of deviance might be 
encouraged by instability in secular social 
relationships, and geographic mobility that 
disrupts attachments. linking congregation 
members has often been proposed as an 
explanation of low church . membership 
(Wuthnow and Christiano 1979; K. Welch 
1983; M. Welch and Baltzell 1984). There- 
fore I shall have to introduce statistical 
controls for social instability throughout my 
analysis. Like geographic mobility, divorce is 
an important variable, but there is room to 
debate whether it is independent or dependent 
with respect to religion. Strong religion may 
deter divorce (Richardson: and Weatherby 
1983). Thus divorce might be an intervening 
variable through which religion affects devi- 
ance. This means that we must be cautious 
when interpreting the effect of divorce rates 
as a control variable and consider the 
possibility that it represents the indirect effect 
of religion. 

Other factors that influence rates of each 


.kind of deviance must also be employed as 


controls, especially those that might be 
associated with religiousness. Suicide is 
stimulated by the despair that sometimes 
accompanies old age, and many kinds of 
crime may be more common where substan- 
tial portions of the population experience 
economic and social frustrations. Higher 
education encourages people to launch new 
cultural and political movements, and col- 
leges have often sponsored homosexual orga- 
nizations, so the level of college education 
may predict the gay rates. Considerable 
evidence suggests that the typical cult recruit 
has more than the average level of education 
(Stark and Bainbridge 1985). Most cults cost 
money, some setting exact fees for each 
service they provide members, so one would 
expect cults to do well in prosperous areas. 


THE 1980 RELIGIOUS 
ECOLOGY DATASET 


My data are from the 75 American metropol- 
itan areas outside New England with popula- 
tions over 500,000 in 1980, derived in part 
from the U.S. Census and other government 
publications. I tabulated data on homosexual- 
ity and religious cults from original sources, 
frequently from lists of cases that had to be 
transformed into rates for the appropriate 
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metropolitan areas. Data on church member- 
ship came from a massive private survey of 
denominations (Quinn, Anderson, Bradley, 
Goetting, and Schriver 1982). To create 
SMSAs, I combined counties or similar units 
tabulated separately in the reports, but this 
could not be done satisfactorily for New 
England. An adjustment was made for 
underreporting of black denominations (Stark 
1980), and the American Jewish Year Book 
(Himmelfarb and Singer 1980) was used to 
improve estimates for Jews. The church 
membership rate averages 53 percent across 
the 75 SMSAs, ranging from 29 to 77. 

Vital Statistics of the United States pro- 
vides suicides by SMSAs. I used rates for the 
three years 1980-1982, as well as for 1980 
alone, in order to make my results compara- 
ble with earlier studies that used either 
one-year or three-year rates. I occasionally 
work with multiple-year rates of other kinds 
of deviance as well, but often the lack of data 
or a large number of cases in a single year 
makes this either impossible or unnecessary. 

The Uniform Crime Report provides crimes 
(known to police) in eight categories: arson, 
motor vehicle theft, homicide, forcible rape, 
aggravated assault, burglary, larceny-theft, 
and robbery. The arson statistics are fragmen- 
tary. Motor vehicle theft is a poor category 
for geographical analysis; some cities have 
high rates simply because they are near large 
markets for stolen autos or parts. 

The survey of religious denominations 
included the Metropolitan Community Church 
(MCC), a gay denomination. Other data on 
the geographical distribution of homosexual 
organizations come from four guidebooks 
designed to help gay travelers find the 
subculture in cities they visit. Published 
annually for several years, these guides all 
seem to have achieved excellent coverage. I 
use the 1984 editions, because later editions 
are less reliable as a result of AIDS epidemic 
publicity. 

I tabulated all standard listings in Bob 
Damron’s Address Book, but ignored what 
Damron calls “cruisy areas" —public places 
where gays can meet informally—because 
their number may reflect the diversity of 
public facilities in a town rather than the 
number of gays. Three guides have classified 
sections: The Gay Yellow Pages, Places of 
Interest, and Gaia’s Guide. Because they 
contain duplicate listings, and the overlap 
between them is imperfect, I created a unified 
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dataset based on the telephone numbers of the 
organizations and businesses. In addition to 


‘total listings, I consider two subclassifica- 


tions: (1) bars and restaurants and (2) 
bookstores and special interest groups. Book- 
stores and groups represent the social- 
movement aspect of the gay subculture, and 
thus they have different social sources from 
the bars. 

My cult data begin with rates of Christian 
Science practitioners and healers, based on 
addresses in the Christian Science Journal. 
Because Christian Science gained significant 
acceptance, and its growth slacked off 
decades ago, we cannot be sure it still reflects 
the geographic variation of cultism as it once 
did. A second cult rate is based on addresses 
of letterwriters published in 120 issues of 
Fate magazine from 1975 through 1984. The 
central publication of the American occult 
since its first issue in 1948, Fate carries three 
letter columns, reporting readers’ experiences 
of apparitions, messages from the beyond, 
and the whole range of paranormal and occult 
phenomena. The 1974 edition of Spiritual 
Community Guide (SCG) provides excellent 
coverage of the “New Age” Movement. Over 
half of the listings are “centers” and 
“communities,” organizations providing med- 
itation and various other practices designed to 
raise spiritual consciousness. The rest are 
bookstores, foodstores, and restaurants serv- 
ing the New Age and occult subcultures. 

The religious cult with the largest number 
of American participants in the 1970s was 
probably Transcendental Meditation (TM). 
TM kept excellent computerized records of all 
initiates, primarily to ensure that local centers 
sent the proper fees to the central organiza- 
tion. With the help of Daniel H. Jackson, I 
was able to obtain the numbers initiated into 
TM for each of about 3,200 urban areas, 
1970-1977. Scientology, a cult founded by a 
science fiction writer, claims that each 
individual has hidden godlike powers that can 
be unlocked by its training and therapy 
techniques. After years of treatment, mem- 
bers can attain an honored status known as 
clear. David Aden, public relations director 
of Scientology of Boston, provided the 
number of clears in each postal zip-code area 
as of November 1985. 

Working with detailed maps, I was able to 
place gay and cult cases in the right metropol- 
itan areas and calculate reasonably accurate 
rates. The denominators for the deviance rates 
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are estimated populations for the appropriate 
years, except that TM rates are for urban pop- 
ulations of the SMSAs, and the crime rates are 
adjusted for the proportions of each SMSA from 
which the FBI had reports. 

I shall use several control variables, the 
most important of which is the proportion of 
the population who moved to a different 
house in the past five years, the most potent 
indicator of instability in social relationships 
in the 1980 census reports. Other control 
variables are the percent divorced, the percent 
of the population aged 65 and over, the 
percent of families below the official poverty 
level, the proportion of the population 
identified as black, the proportion of the 
population over 25 who completed four or 
more years of college, and the per capita 
income. 


ANALYSIS 


A substantial negative correlation between 
church membership and the suicide rate 
(Table 1) replicates earlier studies. However, 
a multiple regression analysis, merely adding 
the proportion who have moved to a different 
house in the past five years, shows a complete 
collapse of the religion effects. Because 
earlier ecological studies found that the 
negative correlation between suicide and 
church membership survived controlling for 
Social instability, it is essential to consider 
whether the different house variable permits a 


Table 1. Church Membership and Suicide Rates 


Suicide Rate for 


1980 1980-1982 
Cases 14,153 43,021 
Mean rate per 100,000 12.3 37.2 
Correlations 
Church members -.3Tee* — ,44*** 
Different house 5344k .66*** 
Divorced 64*** i** 
Age 65+ .17 .24* 
Standardized betas 
Church members —.01 .02 
Different house 3*4 .67*** 
r .28 .44 
Standardized betas 
Church members ll 13 
Different house .20 40*** 
Divorced .Sge*e 52*** 
Age 65+ .26** ,J5*9* 
p .48 .66 


Significance (1-tailed); 
*** 001 level. 


* 05 level; ** .01 level; 
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fair test. A portion of those who have moved 
in the past five years did so as the result of 
divorce, and divorce may be deterred by 
religion. Therefore, I compared the effect of 
two geographic mobility measures used in 
previous studies that do not include the 
short-distance migration often occasioned by 
divorce: the proportion born in the same state 
where the census found them, and the rate of 
population growth over the previous decade. 
Regression analyses not shown here, using 
each of these alternative measures of social 
instability, did not give substantially different 
results from those employing different house. 

The analysis at the bottom of Table 1, 
adding divorce and old age as independent 
variables, does not restore the religion 
coefficients, indeed placing them on the side 
of zero contrary to prediction. Taken to- 
gether, the secular variables provide a quite 
satisfactory explanation of variations in the 
suicide rate. It remains possible that religion 
has an indirect effect as one of the factors 
influencing the rate of divorce. But while 
religion is widely held to enhance family life, 
if it has any real power to promote societal 
norms one would expect it to operate partly 
outside the family, and thus the association 
between religion and suicide should survive a 
control for divorce. Therefore, Table 1 
appears to verify Durkheim's contention that 
religion can be reduced to social solidarity. 

In Table 2, I show that the church member 
rate achieves significant negative correlations 
with four of the six crime rates. With crime as 
with suicide, the first control variable to 
introduce is a measure of low social integra- 
tion, and only larceny shows a connection to 
religion when different house is added to the 
analysis. However, when we control for the 
effect of poverty and the blocked opportunity 
represented by the proportion of the popula- 
tion that is black, three index crimes again 
show significant negative coefficients with 
the rate of church membership: assault, 
burglary, and larceny. The association for 
robbery nearly achieves significance. Divorce 
has a powerful connection to rape, and the 
possibility that weaknesses of religion may be 
a major cause of divorce might restore a role 
for religion in preventing rape, albeit an 
indirect one. 

The fact that assault achieves significant 
negative coefficients with the rate of church 
membership weakens Stark's argument that 
the lack of a murder association is due to the 
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Table 2. Church Membership and Crime Rates 


Murder Rape 

Cases 15,217 54,025 
Mean rate per 100,000 11.0 44.9 
Correlations 

Church members .03 — .38*** 

Different house .25* 5g 

Families in poverty .62*** 37*** 

Blacks .SS*** .32"* 

Divorced .38*** T2*** 
Standardized betas 

Church members .36** .03 

Different house Ag** .60*** 

n .13 34 

Standardized betas 

Church members .06 —.11 

Different house .03 .10 

Families in poverty 39» .17* 

Blacks 3444k 33s 

Divorced ove .59*** 

r 59 .69 
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Assault Burglary Larceny Robbery 
398,283 2,303,043 4,122,960 446,720 
317.7 1938.9 3660.5 276.0 

— .23* — gare — .63*** — OL 
29** .60*** .G4** 04 
36*** 3gee ~ 02 gue 
.29** 12 —.10 AA 
.39*** .60*** .Sg*** 21* 

—.06 —.05 —.36** .04 
.25* .57*** .40*** .07 
.09 36 48 .00 

— .26* = 22" —.39** -.23 

—.08 .26* .30* -.31* 
.28* 40*** .03 pem 
.24* .03 .08 .29** 
.30* .25* 1 31i* 
32 56 49 .36 


Significance (1-tailed); * .05 level; ** .01 level; *** .001 level. 


emotional nature of this crime. The strongest 
association is for larceny, the most common 
crime and probably the one committed by the 
widest range of people. Burglary and robbery 
may more often be professional crimes, done 
by individuals committed to a deviant li- 
festyle. Thus, acts of larceny may be more 
readily deterrable by religious beliefs. In any 
case, the crime results give religion an 
independent power to deter several kinds of 
deviant behavior in which harm is done to 
other persons. 

As shown in Table 3, the church member 


rate has significant negative correlations with 
the homosexuality variables, but in the 
middle of the table the social instability 
variable (different house) reduces them to 
insignificance. The multiple regression analy- 
sis at the bottom shows that it is really college 
attendance and divorce that are responsible. 
Many listings in the gay guides have an 
obvious connection with higher education, 
notably the bookstores. Colleges have often 
permitted gay student groups to form and, 
through giving them free use of facilities, 
have effectively subsidized them. 


Table 3. Church Membership and Rates of Homosexual Groups 


MCC Damron's Three Gay Guides 
Members Guide Total Bars Groups 
Cases 18,259 2,759 5,499 2,090 1,726 
Mean rate per 100,000 13.6 2.1 3.8 1.5 1.3 
Correlations 
Church members —.45*** —.27** —.30** -.23* -.22* 
Different house 54444 35** .30** .35** .28** 
Attended college .33*9* .35** .53*** 319* 60v 
Divorced .61*** .A0*** .24* qo*** .13 
Standardized betas 
Church members —.16 —.06 —.17 .02 ~ 06 
different house 43% .31* .18 .36** .23 
.30 .13 .10 12 .08 
Standardized betas 
Church members — .06 .03 —.11 AL — 02 
Different house 01 — .07 —.15 - 01 01 
Attended college .19* .31** .52*** .26* Sg» 
Divorced .53*** .40** .18 .42** — 02 
r 42 23 31 .22 .36 





Significance (1-tailed); * .05 level; ** .01 level; *** .001 level. 


and 
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For three of the five gay rates, divorce 
achieves the highest coefficients, suggesting 
that homosexuality greatly represents a disin- 
tegration of relations between the sexes, and 
it remains possible that weakness of orga- 
nized religion has contributed to this. But 
with college education and divorce in the 
equation, religion has no direct effect on the 
gay rates. This seems to contradict the survey 
research that shows a consistent negative 
correlation between religion and deviant 
sexuality. However, the surveys generally fail 
to control for social and family disruption 
variables, and in Table 3 I do show 
significant bivariate negative correlations be- 
tween religion and the gay rates. Further- 
more, the surveys measure mere attitudes 
toward sexuality, not social behavior as my 
indicators do. 

In Table 4 I show that the five cult rates 
have substantial negative correlations with the 
rate of church membership, thus confirming 
that cults succeed where the conventional 
churches are weak. In the middle section of 
the table, the church member rate remains a 
strong predictor even when the migration rate 
is added to the equation, and for three cult 
rates church membership overpowers migra- 
tion. At the bottom of Table 4, we see that 
multiple regression equations involving all 
variables give rather different roles to the five 
predictors, depending upon which cult rate 
they are explaining. 


Table 4. Church Membership and Cult Rates 


Christi 
Science Fate 

Cases 1,653 1,182 
Mean rate per 100,000 1.3 0.9 
Correlations 

Church Members —,58*** —.62*** 

Different house .60*** 47e 

Attended college 75e .27* 

Per capita income .50*** .28** 

Divorced .58*** 4Q*** 
Standardized betas ; 

Church members -.32** —.56*** 

Different house — .38** .09 

r Al .38 

Standardized betas 

Church members —.08 — .S4*** 

Different house .42* .10 

Attended college —.16 .02 

Per capita income .A5*** .04 

Divorced .13 — .02 

r .54 : .39 
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Earlier I said I was not sure that Christian 
Science still functioned as the cult it was a 
century ago. If not, we cannot explain 
participation in it with the same theories used 
for undeniable cults like TM and Scientology. 
To explore the basis of Christian Science 
further, I tried another analysis, adding the 
proportion of the population age 65 and older 
as the sixth independent variable. And age 
turned out to be the best predictor, having a 
standardized beta of .43 in an equation with a 
multiple 7? of .69. Thus, Christian Science 
may currently function as a denomination (but 
not as a cult) offering religious healing to 
older persons. An explanation of the SCG 
pattern is harder to find. 

These results reinforce the need for more 
quantitative research, on deviant religion. 
Only when we have a solid tradition of such 
research will we understand the sources of 
anomalies—indeed only then will we know 
for sure which are the anomalies and which 
are the consistent findings. On the whole, the 
data support the thesis that conventional 
religion limits cults. 


CONCLUSION 


The results of this study remind us that 
neither religion nor deviance is a unity. 
Although many forms of crime and cultism 
are directly deterred by religion, the influence 
of religion upon suicide and homosexuality 


SCG TM Scientology 
1,471 454,328 7,888 
1.2 459.5 4.3 
= Aon -a ,SO*** — S5*** 
Sa*** .A6* Ag" 
S7*** 63*** .39*»*» 
pee 45w .38g** 
Agere :35** .5]*e* 
— .24* -,52*** — 4lte* 
.3T** .10 .20 
.32 35 .33 
—.17 -.52*** —.32* 
—-.02 -.23 —.09 
A7 .62*** .23 
—.02 —-.13 .04 
.31* Al .33* 
48 57 40 


Significance (1-tailed); * .05 level; ** .01 level; *** .001 level. 
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appears indirect, if it exists at all. Both of our 
original hypotheses are thus disconfirmed: 
religion does not directly deter some forms of 
deviance, but for other forms of deviance the 
role of religion cannot be reduced to mere 
social solidarity. Indeed, we found great 
variation in the influence of religion across 
types of deviance, and it is difficult to suggest 
a single theory that can make sense: of this 
complexity. Published survey research on 
religion and deviance has been extremely 
contradictory, offering a severe challenge to 
explanatory theory. It is now clear that 
ecological data add to that challenge. 

An intriguing possibility that cannot be 
tested with the present data is that Morselli 
and Masaryk were right to suspect a decline 
of religion was bringing an increase in suicide 
and that the process has now progressed so far 
that religion has lost its former capacity to 
deter suicide (cf. Stack 1983). Although 
previous studies are not entirely comparable, 
they appear to show declining correlations 
over the decades (Bainbridge and Stark 1981; 
Stark et al. 1983; Breault 1986). 

These conclusions are not necessarily im- 
periled by the fact that I found robust effects 
of religion on crime and deviant cults. Reli- 
. gion has several aspects, and they do not all 

change at the same pace. Years ago church 

doctrine strongly asserted that suicide was a 

sin of great magnitude, and suicides were de- 

nied burial in holy ground (Cavan 1928), but 

this is no longer the case. In contrast, mis- 

treatment of other human beings is condemned 
` by the churches as strongly as ever. 

The research reviewed in the article was 
done in the United States, and the topic cries 
out for studies to be done in distinctly 
different societies where religion and secular 
institutions have more intimate connections 
and where intellectual elites have not turned 

` their backs on religion as they largely have in 
industrialized nations. Activity in this field is 
so new that studies using adequate control 
variables are only beginning to be published 
(Blifson, Petersen, and Hadaway 1983), and 
it is still difficult to evaluate the quality both 
of particular datasets and of theoretical 
interpretations of statistical results. Panel 
studies will be especially useful, because they 
will allow us to establish time order of 
variables, and thus give us a far better picture 
of causation (Schultz et al. 1977; Peek, 
Curry, and Chalfant 1985). But we cannot 
simply compare a survey and an ecology 
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study, hoping to verify one against the other. 
Sociologists have always known that very 
different phenomena may exist at the individ- 
ual and group levels, and this certainly is the 
case for religion and deviance. 

A nagging problem for analysis of data from 
metropolitan areas is that some correlations 
are far weaker for small cities than for large 
ones, a problem that prevented me from con- 
sidering small SMSAs here. This is greatly the 
result of unstable rates for small cities, and the 
gay and cult rates are based on very small 
numerators in small cities. After we have solved 
this methodological problem, we can consider 
the possibility that small cities really do have 
lower associations between religion and devi- 
ance, perhaps in some way reflecting higher 
community solidarity. 

It is clear now that the effect of religion 
changes with the social context, and social 
conditions may vary drastically over time and 
space and from one aspect of life to another. 
Religion deters some deviant acts, but not 
others. If religion lacks a particular influence 
today in a particular community, it may have 
had an influence before and may still have an 
influence elsewhere. Perhaps religion reduces 
some social problems through inhibiting 
divorce, and we should be alert to other 
indirect effects that it might have. The 
contingent nature of religion's influence 
implies that social theorists cannot be content 
with very general statements about the 
influence of values and beliefs, but they must 
specify the social relations that render values 
and beliefs salient for particular kinds of 
behavior. 
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pology, and Social Work at Illinois State 
University. His book, The Future of Religion 
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FERTILITY AMONG WOMEN ON WELFARE: 
INCIDENCE AND DETERMINANTS* 


MARK R. RANK 


Washington University 


This article examines the incidence and determinants of fertility among women on 
welfare. The analysis reveals that welfare recipients have a relatively low fertility 
rate. This rate is below that of women in the general population and is not an 
artifact of a more favorable demographic structure. Furthermore, the longer a 
woman remains on welfare, the less likely she is to give birth. The underlying 
mechanisms behind these findings are explored in the discussion section. 


The issue of fertility among women on 
welfare has been the subject of controversy 
for many years. How often do women 
receiving welfare give birth? Is their fertility 
rate higher or lower than that of the general 
population? What factors are related to 
childbearing for women on welfare? While 
such questions have been frequently asked, 
no single study has provided satisfactory 
answers. 

This article explores the incidence and 
determinants of fertility among women on 
welfare. Longitudinal caseload data of wel- 
fare recipients is used to calculate the fertility 
rate for women aged 18—44. These rates are 
compared (and standardized) with rates in the 
general population. In addition, a logistic 
regression analysis models the determinants 
of childbearing among women on welfare. 
Finally, I explore in the discussion section the 
potential reasons behind the observed welfare 
fertility rates. 


PREVIOUS RESEARCH 


As noted, previous research has failed to 
adequately address the topic of fertility 
among women on welfare. Three reasons 
account for this failure. 

First, a number of studies have looked at 
the effect of various state welfare benefit 
levels upon illegitimacy rates (Cutwright 
1973; Ellwood and Bane 1984; Fechter and 
Greenfield 1971; Janowitz 1976; Weingarden 


* 'The author would like to thank Donald Cox, 
Marvin J. Cummins, and Paul R. Voss for their 
ideas and suggestions. In addition, the comments 
and advice of both the ASR reviewers were greatly 
appreciated. Any remaining errors are the sole 
responsibility of the author. 
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1973). While informative, this research does 
not answer the question of what the fertility 
rates of women on public assistance are. 
These analyses have focused on illegitimacy 
for the general population, rather than fertility 
for women on welfare. 

Second, research that has focused upon 
women on welfare has been severely flawed 
by data limitations. Most serious has been the 
inability to firmly establish when and for how 
long a woman was receiving welfare, and if 
her birth coincided with her spell of welfare 
recipiency. The data sets utilized have been 
either cross-sectional and consequently rely- 
ing on recall (Placek and Hendershot 1974; 
Polgar and Hiday 1975; Weingarten 1974), or 
longitudinal but lacking an exact record of 
when women were on and off the welfare 
rolls during the year (for example, the Panel 
Study of Income Dynamics). In either case, it 
is impossible to calculate with any precision 
the fertility rate of women on public assis- 
tance. In addition, samples drawn of the 
welfare population are often not random 
and/or are too small to allow for reliable 
fertility estimates (e.g., Placek and Hender- 
shot 1974). 

A final problem plaguing previous research 
has been the lack of an appropriate compari- 
son group. For example, although Keefe 
(1983) has analyzed fertility among women 
on welfare in California from 1970 to 1971, 
there is no comparison group presented in the 
analysis.! Without such a comparison, the 
reader has no way of judging whether the 
fertility rate of women on welfare is high or 
low. 


! Keefe does not actually estimate the fertility 
rate for women on welfare but rather looks at the 
determinants of their fertility. . 
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This analysis attempts to address these 
shortcomings by using a large, longitudinal 
random sample of welfare households in the 
state of Wisconsin. This data set allows us to 
establish firmly the time spent on welfare, 
and whether a birth occurred during that time. 
Wisconsin as well as national fertility rates 
for women aged 18-44 are used as an 
appropriate comparison group. In addition, 
in-depth qualitative interviews with a matched 
sample of welfare recipients provide insight 
into the processes modeled in the quantitative 
analysis. These findings are presented in the 
discussion section as a potential explanation 
for the observed fertility rates. 


METHODOLOGY 
Data 


Two separate yet complementary sources of 
data are used in this analysis. Both samples 
were drawn in a similar fashion for similar 
populations. However, the emphasis in each 
data set clearly differs. The quantitative 
sample is longitudinal and designed for 
statistical modeling of various events, while 
the qualitative sample provides greater insight 
into those events. Each is described below. 

Longitudinal Caseload Sample. Since 1980, 
administration of the AFDC, Food Stamp, 
and Medicaid programs in the state of 
Wisconsin has been computerized. Applica- 
tion records for all eligible cases are on-line 
in a central data-base system. Changes are 
made on individual case records as they 
occur. For example, a change in a family's 
earned income is entered into the system as 
Soon as it is reported. Changes in a family's 
structure, such as births, marriage, and 
divorce, are updated as well. 

A 2 percent random sample was drawn of 
all cases participating in Wisconsin as of 
September 1980, in one or more of the three 
programs. The sampling unit consisted of the 
case head (or applicant) and those individuals 
listed on the application form who were 
residing in the household of the case head. 
This resulted in a sample of 2,796 house- 
holds. The sample comprised families that 
had been receiving public assistance for 
varying lengths of time. Some represented 
new cases, others had been on for six months, 
and so on. 

The cases were then followed at 6-month 
intervals (through September 1983). The 
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sample includes records from September 
1980, March 1981, September 1981, March 
1982, September 1982, March 1983, and . 
September 1983. There is no refusal rate, 
since case records rather than actual individu- 
als were sampled. : 

A major administrative policy change 
affecting welfare recipients occurred during 
this period. From November 1981 to January 
1982, the Reagan administration’s budget 
cuts went into effect, incorporated in the 
Omnibus Budget Reconciliation Act (OBRA) 
of 1981. The regulations reduced benefit 
levels for various welfare programs and 
tightened eligibility requirements. The OBRA 
changes had an impact on the AFDC, Food 
Stamp, and Medicaid programs. For example, 
benefit levels for households on AFDC were 
cut back, and in some cases, families were no 
longer eligible. These changes were accom- 
plished largely through reducing the amount 
of allowable deductions for earnings and 
work-related expenses. Thus the incentives . 
for working while on welfare were reduced. 
The effects of these changes became apparent 
in the fourth period of observation—March 
1982. Beginning with March 1982, benefit 
levels and work incentives for the AFDC, 
Food Stamp, and Medicaid programs were 
lower than they had been before that date. . 

Once an individual leaves welfare, informa- 
tion on that case (e.g., income, family size) is 
no longer updated. Thus, the analysis is 
limited to families currently on welfare. We 
are confined to asking, “What is the inci- 
dence of childbearing among women on 
welfare, and what factors predict such an 
event?” 

In-Depth Qualitative Interviews. During 
the summer of 1986, qualitative data were 
gathered through in-depth interviews. A 
stratified random sample of welfare recipients 
was generated using a logic procedure similar : 
to that used in drawing the caseload sample. 
That is, case records were drawn randomly 
from the universe of households receiving one 
or more of three public assistance programs. 
Because the interviews were face-to-face, it 
was impractical to randomly sample the entire 
state given the cost and time constraints 
involved. A representative county was chosen 
which reflected the overall state population, 
containing urban and rural areas, occupational 
diversity, and so on. 

The response rate was 76 percent. Individ- 
uals without telephones were tracked down, 
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and several interviews were conducted in 
Spanish with the aid of an interpreter; in 
short, all avenues were used to contact 
sampled recipients. Participating respondents 
were paid $15. 

Fifty families were interviewed (in addi- 
tion, five households were interviewed in the 
pretest). The composition of the sample 
approximately mirrored that of the longitudi- 
nal data set. The interviews were conducted 
in respondents’ homes and averaged between 
one and a half to three hours long. All 
interviews were taped and transcribed. 


The interviews were open-ended and semi- . 


structured around several major topics such as 
family dynamics, employment, and the expe- 
rience of getting on and off welfare. As a 
reliability check on the answers given during 
the interviews, information from respondents’ 
caseload records (which had been made 
available by the state) was compared to 
information given during the interviews. The 
match was high, thus lending confidence in 
the interview data. 

The interviews provide insight into the 
process of and attitudes toward childbearing. 
They enable women to construct, in their 
words, their experiences and attitudes regard- 
ing fertility behavior. 

The Setting. As discussed, both the quanti- 
tative and qualitative data were gathered in 
the State of Wisconsin. Historically, Wiscon- 
sin has been one of the most generous states 
in determining and allocating welfare bene- 
fits. During the 1980s, the state has ranked in 
the top five in terms of average and maximum 
allowed payments for AFDC. In addition, if 
we calculate ratios of average welfare pay- 
ments to average earnings, it is also ranked in 
the top five. On the other hand, the overall 
demographic and socioeconomic backgrounds 
of Wisconsin recipients are fairly similar to 
national welfare samples (see Rank and Voss 
1984). 


Measurement 


Analysis is confined to women aged 18—44, 
which is one of the two standard age brackets 
used for estimating fertility rates. This age 
bracket is used rather than 15—44 because 
very few women aged 15—17 head households 
on welfare. Using either age bracket allows 
for a comparison of welfare fertility rates with 
the overall national and state rates. 

The analysis pools married and unmarried 
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women together. This is standard procedure 
for calculating and reporting overall fertility 
rates of women. However, marital status is 
taken into account in the aggregate compari- 
sons, as well as in the multivariate analysis. 

The event being modeled in the life table 
and logistic regression analyses is the first 
observed spell of childbearing. Once a birth 
has occurred, women are no longer included 
in later time intervals. The numbers of 
women experiencing a birth are extremely 
small, which prevents a detailed analysis of 
the occurrence and determinants of a second 
observed birth. In addition, analyzing repeat- 
able events raises a number of problematic 
statistical questions (see Allison 1984). I did, 
however, include such events in the total 
sample in a separate analysis. I found no 
significant differences from the results pre- 
sented here.? 

Once women exit from the welfare rolls, 
they are no longer included in the analysis 
should they subsequently reenter the welfare 
system. Including such women distorts the 
representativeness of the sample. However, 
separate analyses were also conducted includ- 
ing such women. Again, I found no signifi- 
cant differences from the results presented 
here. 

Finally, I include in the analysis all women 
who have been on welfare for at least nine 
months. The reason for this is obvious. Some 
women may have entered the welfare system 
as the result of an upcoming birth. In these 
cases, cause and effect are reversed—a 
forthcoming birth leading to welfare use, 
rather than welfare use leading to birth. By 
including only women that have been receiv- 
ing welfare for at least nine months, I 
eliminate this bias. 


RESULTS 


Three questions are addressed. First, what is 
the likelihood of childbearing among women 
on welfare? Second, how do these rates 
compare with those of the general population? 
And third, what are the determinants of 
fertility among welfare recipients? 


? In addition, the estimated fertility rate for 
women having given birth was estimated at 78.0, 
which is higher than the overall welfare rate of 
45.8. However, considerable caution is needed in 
interpreting this rate since it is based on only 17 
births. 


FERTILITY AMONG WOMEN ON WELFARE 


The Likelihood of Childbearing 


In Table 1 I present a life table analysis of 
childbearing among women on welfare aged 
18-44. The life table allows us to calculate 
the proportion of women giving birth during a 
specified period of time. Table 1 lists the six 
observed half-year intervals that women could 
enter in the analysis. The 0—6-month interval 
represents September 1980 to March 1981, 
the 6-12-month interval represents March 
1981 to September 1981, and so on. In 
column 1 I show the number of women at risk 
of childbearing. Because some women exited 
from welfare during an interval, they were no 
longer at risk of childbearing during the entire 
interval. To adjust for this problem of 
censoring, a standard approach is to assume 
that the censored observations were at risk for 
half the interval (Anderson, Auquier, Hauck, 
Oakes, Vandaele, and Weisberg 1980). Thus, 
in column 1, I have taken into account the 
number of censored cases in which a birth had 
not occurred, divided it in half, and added the 
result to the remaining uncensored cases for 
that interval. In addition, I allow women who 
had not been on welfare for nine months prior 
to September 1980 to enter the analysis 
during the 6—12- or the 12-18-month interval. 
In column 2 I present the number of births 
occurring during intervals, in column 3 the 
proportion of births among women (column 2 
divided by column 1), and in column 4 the 
cumulative proportion of births that occur 
across intervals. 

During the first observed 6-month interval, 
1.89 percent of women on welfare gave birth 
(column 3). During the 6—12-month interval, 
2.74 percent gave birth. The percentage 
giving birth during any 6-month interval is 
relatively stable at approximately 2 percent. 
In column 4, it is estimated that 4.58 percent 
of women will give birth during a one-year 
interval. The overall fertility rate for women 


Table 1. Life Table Analysis of Births Among Women 
Aged 18-44 on Welfare 











Cumulative 
Monthly Number Number Proportion Proportion 
Interval at Risk of Births of Births of Births 
0-6 795 15 .0189 .0189 
6-12 950 26 - 0274 .0458 
12-18 910 22 .0242 .0689 
18-24 717 12 .0167 .0844 
24-30 624 13 .0208 .1035 
30-36 553 7 .0127 .1149 
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on welfare is therefore 45.8. In addition, 
11.49 percent of women will bear children 
during a three-year spell on welfare.? 


Comparison with the General Population 


How do these estimates compare with the 
overall population? Our one-year fertility rate 
among women on welfare occurred from 
1980 to 1981. Based on data from the 
Wisconsin Department of Health and Social 
Services (1981) and the U.S. Bureau of the 
Census (1981), the fertility rates in Wisconsin 
and the national population in 1980 per 1,000 
women aged 18—44 were 75.3 (Wisconsin) 
and 71.1 (national population). These rates 
are considerably higher than the one-year 
fertility rate of 45.8 for women on welfare. 
Women on welfare have a substantially lower 
fertility rate than women in the general 
population. 

Yet to what extent are these differences the 
result of differences across populations? For 
example, perhaps the demographic structure 
of the welfare population lends itself to a 
lower overall fertility rate. The key demo- 
graphic differences between women on wel- 
fare and women in the general population 
(that can be controlled for) are race, marital 
status, parity, education, and age. Women on 
welfare are more likely to be black, be 
unmarried, have at least one child, possess 
less education, and be in their 20s when 
compared to women in the general popula- 
tion. 

To account for these population differ- 
ences, welfare fertility rates were standard- 
ized for the national and Wisconsin popula- 
tion compositions with regard to the above 
characteristics.^ In other words, if the welfare 


> Thus the average fertility rate over this period 
was 38.3. However, the one-year rate of 45.8 is 
used for comparison purposes instead. To use a 
three-year average distorts the representativeness 
of the sampled welfare population (e.g., longer- 
term cases are being overrepresented). 

* These characteristics were categorized as 
follows: age (18-24, 25-29, 30-34, 35-39, 
40-44); children (no children, one or more 
children); marital status (married-spouse present, 
not married); race (white, black); and education 
(less than 12 years, 12 or more years). The fertility 
rates for each of these categories were calculated 
for women on welfare. The rates were then 
multiplied by the proportion of women in each 
category for the Wisconsin and national popula- 


300 


AMERICAN SOCIOLOGICAL REVIEW 


Table 2. Welfare Fertility Rates Standardized for Population Characteristics 




















Standardized Percentage Standardized Percentage 

for National Difference for Wisconsin Difference 

Population from Population from 
Characteristics Characteristics National Rate Characteristics Wisconsin Rate 
Age 45.8 —35.6 48.3 —35.9 
Children 50.2 —29.4 50.6 -32.8 
Marital status 53.1 —25.3 53.3 —29.2 
Race 37.1 —47.8 34.3 ~ 54.4 
Education 37.7 —47.0 36.3 —51.8 


71.1— National rate 


population had the same demographic makeup 
as the general population (in terms of age, 
race, etc.), what would the fertility rate for 
women on welfare be? Table 2 indicates that 
whether we standardize for age, children, 
marital status, race, or education, the fertility 
rates of women on welfare are still consider- 
ably below those of the national and Wiscon- 
sin populations. The lower overall fertility 
rate among women receiving public assis- 
tance programs is therefore not an artifact of a 
more favorable demographic structure. Rather, 
it is clearly lower even when major demo- 
graphic compositional differences are taken 
into account. 


Determinants of Fertility 


Having looked at the likelihood of births 
among women on welfare, and a comparison 
of their rates with the general population, I 
focus on the factors that influence the 
probability of childbearing. Three sets of 
variables are used in predicting fertility 
among welfare recipients: demographic (edu- 
cation, race, employment status, and age); 
household (number of children, age of 
youngest child, and marital status); and 
variation in welfare programs (number of 
programs received, before and after OBRA, 
and length of time on welfare).5 Employment 





tions. This technique results in a direct standard- 
ization using the general population as the standard 
(see Shryock and Siegel 1976). The population 
proportions for the nation were based upon 
information from the U.S. Bureau of the Census 
(1981), while the Wisconsin proportions were 
derived from the Wisconsin Department of Health 
and Human Services (1981) and the U.S. Bureau 
of the Census. 

5'fhe variable definitions are as follows: 
education (less than 12 years/12 or more years); 
employment (employed/not employed); race (white/ 


75.3 — Wisconsin rate 


status, number of children, age of youngest 
child, and number of welfare programs 
received were lagged to avoid causality 
problems with fertility status. 

How do these factors affect the likelihood 
of fertility in a multivariate analysis? For this 
purpose I have taken an event history 
approach (Allison 1982, 1984) The distinc- 
tion between continuous and discrete time is 
an important one when such an approach is 
used. Methods which assume that the time of 
event occurrence is measured exactly are 
called continuous-time methods (Allison, 
1984, p. 14). Those that treat events as 
occurring during specific periods are known 
as discrete-time methods. This analysis relies 
on a discrete-time methodology. Births are 
observed at 6-month intervals, rather than the 
exact time of occurrence, and hence a 
discrete-time methodology is the more appro- 
priate. 

A separate observational record was cre- 
ated for each interval of time that a case was 
at risk of childbearing. As shown in Table 1, 
795 cases were at risk of giving birth during 
the first 6-month interval. At the second 
6-month interval, 950 women were at risk, 
and so on. The total sample constructed for 
women is the sum of column 1, or 4,549 





nonwhite); age (current age of recipient); number 
of children (number of children present in 
recipient's household); age of youngest child (child 
under age four in the household/no child under age 
four in the household); marital status (married — 
spouse present/not married —never married, sepa- 
rated, divorced, widowed); number of programs 
received (all three programs — AFDC, Food Stamps, 
Medicaid/one or two programs received); changes 
in regulations (before OBRA/after OBRA); length 
of welfare use in exact years and months (length of 
time on welfare since the beginning of the current 
spell, a date that may precede the beginning of the 
study). 
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records. Thus a woman who was on welfare 
for ali six intervals and did not give birth 
contributed six records, while a woman who 
gave birth during the fourth interval contrib- 
uted four records. 

Those cases that were censored (i.e., they 
exited from welfare without giving birth) 
were weighted one-half and included in the 
logistic regression equation. This procedure 
mirrors the method for dealing with censored 
cases found in Table 1, and results in an 
identical number of cases at risk (the sum of 
column 1 in Table 1). 

For each record, a dichotomous dependent 
variable was created in which 1 represented 
birth, and O represented nonbirth. The 
independent variables were assigned the value 
they took on at each interval. Finally, the 
4,549 records were pooled into a single 
sample. The dependent variable was then 
analyzed by logistic regression using the 
method of maximum likelihood. The proba- 
bility of a birth occurring during a 6-month 
interval is therefore being modeled. 

Age, length of welfare use, and number of 
children are entered as continuous, while the 
remaining variables are dichotomous. The 
antilogs of the coefficients are presented in 
parentheses, providing easily interpretable 
odds ratios. 

Table 3 indicates that several variables 
significantly affect the likelihood of childbear- 
ing among women on welfare: education, 


Table 3. Logistic Regression Model Predicting Child- 
bearing Among Women Aged 18-44 on 














Welfare 
Independent Variables Coefficients 
Demographic 
Less than 12 years education 277 (1.32)* 
Not employed —.171 (84) 
Nonwhite .638 (1.89)*** 
Age .902 (2.46)*** 
Age —.013 (.99)** 
Household 
Number of children -—.224 (.80) 
Child under age 4 .394 (1.48) 
Married .372 (1.45)* 
Welfare variation 
Receiving all 3 programs .083 (1.09) 
After OBRA 098 (1.10) 
Length of welfare use —.384 (.68)* 
Constant —]15.213*** 
N 4196 








* Significant at the .10 level. 
** Significant at the .05 level. 
*** Significant at the .01 level. 
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race, age, marital status, and length of 
welfare use. The effects of education, race, 
age, and marital status are similar to their 
aggregate effects upon fertility in the general 
population (U.S. Bureau of the Census 1981). 

Women with less than 12 years of 
education are 32 percent more likely to have a 
child than women with 12 or more years of 
education. Likewise, nonwhites are more 
likely to bear children than whites. Nonwhites 
have an 89 percent greater probability of 
giving birth than whites. 

The effect of age upon fertility is quadratic. 
That is, the odds of giving birth significantly 
increase until the early to mid-30s (the 
variable Age). At that point, the odds of 
childbearing significantly decrease as women 
get older (the variable Age?).5 In addition, 
marital status is positively related to fertility. 

Perhaps most interesting is the effect of 
length of welfare use upon fertility. The 
longer a woman remains on welfare, the less 
likely she is to give birth. Each additional 
year on welfare lowers the probability of birth 
by .68 to 1. Thus, controlling for confound- 
ing factors, length of welfare use is negatively 
correlated with the probability of childbear- 
ing. Rather than encouraging women to give 
birth, being on public assistance for longer 
periods of time reduces the rate of fertility." 


DISCUSSION 


The quantitative analysis has demonstrated 
that welfare recipients have a relatively low 
fertility rate. Their rate is considerably below 
that of women in the general population and 
is not an artifact of a more favorable 
demographic structure. Furthermore, the longer 
a woman remains on welfare, the less likely 
she is to give birth. This effect is independent 
of various demographic and household char- 
acteristics. 

This discussion section raises the question 


$ The peak of the parabola (34.7 years) for the 
quadratic relationship of age and fertility is derived 
from the following equation: 


3P/3AGE = .902 — .026(AGE) = 0 

7 Caution is clearly warranted in the interpreta- 
tion of the length of welfare variable upon fertility. 
Without a control group of poor women not 
receiving public assistance, it is difficult to 
determine whether the welfare programs them- 
selves, or some other factor(s), are lowering the 
fertility rate over time. 
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of why? Why is the rate of fertility among 
women on welfare relatively low? What 
reasons can account for these findings? 
Although far from conclusive, the qualitative 
data are suited to exploring the potential 
reasons behind the overall demographic and 
statistical patterns found in Tables 1-3. The 
interviews enabled women on welfare to 
construct their experiences and attitudes 
regarding pregnancy and childbirth. These 
experiences and attitudes shed considerable 
light on why fertility behavior appears to be 
suppressed. 

Of the 50 interviews 29 were with a female 
household head or wife aged 18—44. Ages 
ranged from 18 to 43, with most women 
being in their 20s or 30s. Two women were 
pregnant at the time of the interview. The 
majority of women had borne one or two 
children. 

Regarding fertility desires, none of the 27 
nonpregnant were contemplating having a 
child in the near future, and only a handful 
were considering having more children in the 
long term. Rather than expressing high 
fertility desires, women held extremely low 
fertility desires. This is consistent with the 
findings from the quantitative analysis. Women 
clearly wished to avoid childbearing at this 
particular time in their lives. 

Several examples illustrate these attitudes 

. and more importantly the reasons behind such 
attitudes. A 25-year-old, separated woman 
was asked, “Do you think you'll ever want to 
have any more children?" 


No. No. I don't think that I ever want to have 
another child. I think that will stop me from 
doing things that I want to do. And it won't be 
fair to me. It won't be fair for the new child. 
And it won't be fair at all for the two that I have. 


A never-married woman in her early 30s with 
one child commented, "Try to avoid the 
accidents now. I figure, well, it's bad enough 

. it’s not bad enough . . . it’s hardest to get 
by with one, let alone have another one just 
me by myself, ya’ know. Tryin’ to raise 
two.” 

Or a 19-year-old woman with one child 
was asked if she had considered having any 
more children: 


Not quite yet [laughter]. I kinda thought about 


8 Within the transcripts, the symbol, 
represents a slight pause during a sentence. 
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it, you know, the age difference. I don't want it 
to be real far, but I'm not quite ready for another 
kid, financially or . . . or mentally. I don't think 
I could handle two kids [laughter]. 


These examples illustrate several of the 
predominate feelings and attitudes of women 
that were interviewed. They provide a 
potential explanation for why the fertility rate 
of women on public assistance is relatively 
low. The economic, social, and psychological 
situation that women on welfare find them- 
selves in is simply not conducive to desiring 
more children. Becoming pregnant and hav- 
ing a child is perceived as making the 
situation worse, not better.? 

In addition, virtually all of the interviewed 
women expressed the desire to get off public 
assistance. Having an additional child was 
perceived as severely limiting the likelihood 
of escaping welfare dependence. This percep- 
tion served to further dampen women's 
fertility desires. 

For women who did become pregnant, it 
was often accidental. For example, a never- 
married women in her mid-20s was ques- 
tioned about her previous three pregnancies, 
and when asked if she had wanted to have a 
child, she responded, 


Never. After the first time, my mom kept 
drilling into me about school and the importance 
of education and all. So that stuck with me all 
those years. And I always got pregnant through 
carelessness, you know. I kept saying, "Well 
when I get married one day and settle down, and 
make sure my life is secure, then I'll have the 
children." So that's sae I always got the 
abortions. 


Analysis of the qualitative data revealed that 
many of the pregnancies and subsequent 
births occurring to women while on welfare 
had been accidental rather than planned. 
This is consistent with several of the 
predictors of childbearing found in Table 3. 
Education and race were significant factors 


? The question that arises with qualitative data is 
how representative of the sampled population are 
the quotes given? In this case, 26 of the 27 
nonpregnant women responded in a manner 
consistent with the reported attitudes in this article. 
The exception was a married woman who stated 
that although she was not planning on having any 
more children (she was in her 40s), when she was 
in her early 20s and on welfare, she had had 
several children in order to collect greater welfare 
benefits. 
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affecting the likelihood of childbearing. Both 
have been shown in previous research to be 
highly correlated with an increased likelihood 
of unplanned births (Pratt, Mosher, Bachrach, 
and Houn, 1984). 

Although we have raised the question of 
why the fertility rate for women on welfare is 
low, there is a clear need for additional 
research to further examine this relationship. 
Nevertheless, the in-depth interviews do 
provide a potential explanation. They suggest 
that the financial and social situation that 
women on public assistance find themselves 
in is not conducive to desiring more children. 
Such women would appear to be motivated by 
cost/benefit considerations. But it is the costs 
that outweigh the benefits, not vice versa. 
The economic, social, and psychological 
costs of becoming pregnant and having a 
child while on public assistance are perceived 
as clearly outweighing the benefits. It is 
argued that the economic and psychological 
stress that virtually all welfare recipients 
experience exerts a powerful effect on 
women's fertility behavior. That effect is to 
lower the overall fertility rate among women 
on public assistance. !° 


Study Limitations 


While moving beyond the current research, 
this study clearly has limitations. First, I have 
focused on only one state. Would the reported 
patterns be found nationally as well? The 
issue of generalizability is an important 
limitation. 

Second, the analysis has focused on 
fertility among women on welfare. I have not 
addressed the equally important question of 
whether public assistance programs encour- 
age women not on welfare to have a child in 
order to collect such benefits. ; 

Nor can I convincingly argue that it is the 
welfare programs themselves that are lower- 
ing the overall fertility rate of women on 
public assistance. Although our findings are 
consistent with this hypothesis, they do not 
demonstrate it. To do so, one would need a 
control group of poor women not receiving 
public assistance in order to contrast their 


10 For an expanded analysis of the qualitative 
data, as well as a methodological discussion 
regarding the blending of quantitative and qualita- 
tive data in family research, see Rank 1988. 
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behavior with poor women who were receiv- 
ing public assistance programs. As stated 
above, further research is necessary to 
establish firmly the underlying mechanisms 
behind the overall reported demographic 
patterns. 

And finally, although the sample size is 
adequate, a larger N would provide greater 
confidence in the fertility rate estimates. 


CONCLUSION 


The idea that women on welfare have a high 
fertility rate is often implicitly accepted by 
social and policy analysts (e.g., Working 
Group on the Family 1986). Such beliefs are 
simply not supported by this analysis. In fact, 
women on welfare have a relatively low rate 
of fertility, with that rate dropping with years 
of recipiency. It is argued that the dire 
Situation that welfare recipients find them- 
selves in is responsible for their low overall 
fertility rate. 

The issue of fertility among women on 
welfare has been, and is likely to remain, of 
interest to academics and policymakers alike. 
It is a question that is often raised, with little 
evidence on which to base an answer. While 
further research is clearly warranted, this 
analysis has attempted to move toward that 
answer. 


MARK R. RANK is Assistant Professor of 
Sociology at Washington University. He is 
currently completing a book that explores the 
lives of welfare recipients, using both quanti- 
tative and qualitative data. In addition, he is 
working on a monograph examining poverty . 
in America. 


REFERENCES 


Allison, Paul A. 1982. “Discrete-Time Methods 
for the Analysis of Event Histories.” Pp. 61-98 
in Sociological Methodology, edited by Samuel 
Leinhardt. San Francisco: Jossey-Bass. 

. 1984. Event History Analysis. Beverly 
Hills, CA: Sage. 

Anderson, Sharon, Ariane Auquier, Walter H. 
Hauck, David Oakes, Walter Vandaele, and 
Herbert I. Weisberg. 1980. Statistical Methods 
for Comparative Studies. New York: Wiley. 

Cutwright, Phillips. 1973. "Illegitimacy and In- 
come Supplements." Studies in Public Welfare, 
paper 12, prepared for the Joint Economic 
Committee. Washington, DC: U.S. Government 
Printing Office. 





304 


Ellwood, David T. and Mary Jo Bane. 1984. “The 
Impact of AFDC on Family Structure and Living 
Arrangements." Report for the U.S. Department 
of Health and Human Services, 1984. 

Fechter, Alan and Stuart Greenfield. 1971. “Fre- 
quency, Duration and Probability of Marriage 
and Income.” Journal of Marriage and the 
Family 33:307-17. 

Janowitz, Barbara S. 1976. “The Impact of AFDC 
on Illegitimate Birth Rates.” Journal of Mar- 
riage and the Family 38:485-94. 

Keefe, David E. 1983. “Governor Reagan, 
Welfare Reform and AFDC Fertility.” Social 
Service Review 57:234—54. 

Moore, Kristin À. and Steven B. Caldwell. 1977. 
“The Effect of Government Policy on Out-of- 
Wedlock Sex and Pregnancy." Family Planning 
Perspectives 9:164—69. 

Placek, Paul J. and Gerry E. Hendershot. 1974. 
"Public Welfare and Family Planning: An 
Empirical Study of the ‘Brood Sow’ Myth." 
Social Problems 21:658-73. 

Polgar, Steven and Virginia Hiday. 1975. “The 
Effect of an Additional Birth on Low Income 
Families." Population Studies 28:463—71. 

Pratt, William F., William D. Mosher, Christine 
A. Bachrach, and Marjorie C. Houn. 1984. 
"Understanding U.S. Fertility: Findings from 
the National Survey of Family Growth, Cycle 
III." Population Bulletin 39:1—43. 

Presser, Harriet B. and Linda S. Salsberg. 1975. 
“Public Assistance and Early Family Formation: 
Is There A Pronatalist Effect?" Social Problems 
23:226—41; 

Rank, Mark R. 1988. "The Blending of Quantita- 


AMERICAN SOCIOLOGICAL REVIEW 


tive and Qualitative Data in Family Research." 
Paper presented at the National Council on 
Family Relations Pre-Conference Workshop on 
Theory Construction and Research Methodol- 
ogy, November 11-13, Philadelphia, Pennsyl- 
vania. 

Rank, Mark R. and Paul R. Voss. 1984. 
Demographic Characteristics of Wisconsin's 
Welfare Recipients. Madison: Applied Popula- 
tion Laboratory. 

Shyrock, Henry S. and Jacob S. Siegel. 1976. The 
Methods and Materials of Demography. New 
York: Academic. 

U.S. Bureau of the Census. 1981. “Fertility of 
American Women: June 1980." Current Popula- 
tion Reports, Series P-20, No. 364. Washing- 
ton, DC: U.S. Government Printing Office. 

. 1983. "Detailed Population Characteris- 
tics: Wisconsin." 1980 Census of Population. 
Washington, DC: U.S. Government Printing 
Office. 

Weingarden, C.R. 1973. "The "Welfare Explo- 
sion’: Determinants of the Size and Recent 
Growth of the AFDC Population." American 
Journal of Economics and Sociology 25:245-56. 

. 1974. The Fertility of AFDC Women: Àn 
Econometric Analysis." Journal of Economics 
and Business 26:159-66. 

Wisconsin Department of Health and Social 
Services. 1981. “Public Health Statistics: 1980.” 
Division of Health, Bureau of Health Statistics. 

Working Group on the Family. 1986. The Family: 
Preserving America’s Future. United States 
Department of Education, Office of the Under 
Secretary, Washington, DC. 








THE EFFECT OF INTERNATIONAL DEPENDENCE ON 
INCOME INEQUALITY AND POLITICAL VIOLENCE* 
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Recent studies of collective political violence have overlooked international causal 
factors. This cross-national study of 46—51 nations (including analyses of 
influential cases) expends on earlier analyses by simultaneously examining both the 
“external” (international) and the "internal" (intranational) determinants of 
collective political violence. Regression analysis suggests that transnational 
corporate penetration, an international factor, contributes to increased political 
violence both directly and indirectly, through its effect on income inequality. We 
conclude that theoretical and empirical integration of intra- and international 
factors yields the most comprehensive explanation of the causes of collective 


political violence. 


Muller's (1985) analysis of the relationship 
between income inequality and deaths from 
political violence has generated considerable 
response (see, for example, Weede 1986; 
Dietz, Frey, and Kalof 1987; and Hartman 
and Hsiao 1988). Neither the reanalyses nor 
Muller’s own replies (1986, 1988) considered 
the possible confounding effect of interna- 
tional dependence on the purported relation- 
ship between income inequality and political 
violence. Àn explicit early statement (Hibbs 
1973) noted this possibility, while there are 
sound theoretical reasons to expect interna- 
tional factors to affect income inequality and, 
in turn, political violence (cf. Timberlake and 
Williams 1987). We will assess this possibil- 
ity by replicating Muller's key equation and, 
then, adding to it a widely used measure of 
investment dependence. 


THE ARGUMENT 


Previous research has found a strong relation- 
ship between such international factors as 
investment dependence or transnational corpo- 
rate penetration and income inequality (see 
Bornschier and Chase-Dunn 1985, esp. ch. 
8). Therefore, despite Muller's findings, 
income inequality may have little independent 
effect on collective political violence net of 
the effects of such international factors. 
Dependency theory suggests that this is 


* Direct all correspondence to Bruce London, 
Department of Sociology and Social Psychology, 
Florida Atlantic University, Boca Raton, FL 
33431. 
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indeed the relationship between transnational 
corporate penetration, income inequality, and 
collective political violence. Penetration fos- 
ters income inequality in noncore nations by 
altering certain structural conditions within 
these nations (Bornschier and Chase-Dunn 
1985, p. 121). Firms locate in peripheral 
nations in order to take advantage of the 
lower wage rate in these nations. Neverthe- 
less, the wages paid by transnational corpora- 
tions are still well above the going rate in 
peripheral countries. This results in a large 
income gap between those citizens who are 
employed by transnational firms and those 
who are not. Furthermore, the "comprador 
class" has no incentive to initiate redistribu- 
tive governmental policies (Bornschier and 
Chase-Dunn 1985, p. 120). Therefore, public 
policies are often designed to induce foreign 
capital investment by providing tax conces- 
sions together with guarantees of profit 
repatriation and lower labor costs. Such 
policies, in turn, often call for the prohibition 
or hindrance of labor unions and/or strikes 
(Bornschier and Chase-Dunn 1985, p. 121). 
There is a clear contradiction between the 
multinationals' tendency to exacerbate inequal- 
ities and their need for political stability. 
Their very presence tends to generate pre- 
cisely the sort of instability and, in turn, 
political violence that they seek to avoid (see 
London and Williams 1988). In addition, the 
transfer of resources to the core through 
repatriation of profits restricts the amount of 
capital available for investment in domestic 
(noncore) industry, thus further exacerbating 
the existing income inequality problem. 
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DATA AND METHODS 


The first task of this analysis is to replicate 
Muller’s (1985) findings for the 1968-1977 
period, specifically his equation 3.2. Follow- 
ing Muller, the dependent variable is logged 
deaths from political violence, 1973-1977 
(LDPV75). The lagged dependent variable 
covers the years 1968-1972 (LDPV70), and 
the independent variables are level of devel- 
opment/energy consumption per capita 
(LNRG70), regime repressiveness (RR) and 
its square (RR^), income inequality (UQ70), 
and governmental acts of coercion (LSANCT). 
For details, see Muller (1985) and London 
and Robinson (1989). The latter is a more 
detailed version of the present analysis 
(available on request). 

The second task, following replication, is 
to add a control for investment dependence. 
We use Bornschier and Chase-Dunn's (1985) 
indicator of transnational corporate penetra- 
tion (LPEN) (for previous uses and justifica- 
tions, see London 1987). 


FINDINGS 


Equation 1 in Table 1 is Muller's (1985) 
equation 3.2, included for comparative pur- 
poses. Equation 2 is a near, but not exact, 
replication of Muller's equation 3.2. In 
addition to the cases examined by Muller, the 
nations of Taiwan and the United Kingdom 
are included (for details, see London and 
Robinson 1989). 

The results from equation 2 are quite 
similar to those obtained by Muller in his 
equation 3.2 (1985, p. 58), as well as to those 
of equation 1.1 in his later paper (Muller 
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1986, p. 442). An important finding is the 
continued significance of both RR and RR?, 
which supports the curvilinear relationship 
found previously between regime repressive- 
ness and political violence. Also important, 
especially in the present context, is the 
significance of income inequality in explain- 
ing political violence, a finding that also 
replicates Muller (1985, p. 58; 1986, p. 442). 

Having replicated Muller's key equations, 
we next include LPEN in the analysis (see 
equation 3). The inclusion of LPEN alters the 
findings in two important ways. First, LPEN 
itself is significant, lending support to the 
hypothesis that this international factor (eco- 
nomic dependency) is associated with an 
increased likelihood of political violence. 
Second, UQ70 becomes insignificant when 
LPEN is added to the equation. This suggests 
that the previously cited relationship between 
income inequality and political violence is 
spurious. 

These findings may be an artifact of the 
deletion of the four cases with missing data 
on the LPEN variable. Equation 4 addresses 
this issue. It replicates equation 2 in Table 1, 
but excludes those four nations (Barbados, 
Gabon, East Germany, Hungary). When 
equation 4 is compared with both Muller's 
original equation (eq. 1 in Table 1) and our 
equation 2, the results are quite similar. 
Thérefore, the significance of LPEN (and the 
lack of significance of UQ70) is not an 
artifact resulting from the loss of the four 
cases. 

However, as Muller notes (1986, p. 441), 
"Regression equations should be examined 
for the sensitivity of the parameter estimates 


Table 1. Regression of Deaths from Political Violence, 1973-77 (LDPV75*) on Measures of Income Inequality and 
Selected Controls, with and without a Measure of Multinational Corporate Penetration (reported coefficients 


are betas) 














(1) 





(2) (3) (4) 





Deaths from political violence, 1968-72" (LDPV70) .33* .63* .62* .63* .60* .62* 
Energy consumption p.c., 1970* (LNRG70) 29 .42* A7 .46* 44* 47* 
Governmental acts of coercion* (LSANCT) .26 M .01 .09 —.05 —.01 
Regime repressiveness (RR) 375 1.53* 1.78* 1.62* 2.16* 1.90* 
(Regime repressiveness)* (RR?) . —1.51* —1.54* -1.51* -—1.88* -—1.64* 
Income inequality, 1970 (UQ70) 27* .30* .20 alt .10 .18 
Multinational corporate penetration* (LPEN) — — 25* | — .31* .27* 
Constant —6.16  —11.69 —16.62 -13.03 -16.52 -—16.59 
Adjusted R? 49 54 56 .52 .55 57 
N 49 51 47 47 46 47 


* Logged to correct for skewness. 





> See Muller (1985, p. 58) for an explanation of the combined RR and RR? regression coefficient. 


* B at least 2 times its standard error. 
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to one or a few influential cases.” In order to 
identify influential cases, the Cook’s distance 
(or Cook’s D) summary measure is used in 
this analysis. For equation 3 we find one case 
with a moderately high score (Zimbabwe, 
with a Cook’s D = .19). Equation 5 deletes 
Zimbabwe. The exclusion of Zimbabwe does 
not substantially alter our initial conclusions. 
In fact, LPEN's explanatory power is in- 
creased. 

The ceiling corrective procedure advocated 
by Muller (1986) is employed in equation 6 in 
order to retain Zimbabwe in the analysis. In 
equation 6, all LDPV75 scores of — 1.71 or 
greater are set at — 1.70 (approximately two 
standard deviations above the mean). The 
results of equation 6 are nearly identical with 
those of equation 3, and the adjusted R? 
reaches its highest level. Again, this strength- 
ens our confidence in the conclusions drawn 
on the basis of equation 3. 

The regression equations in Table 1 support 
the assertion that studies of income inequality 
and political violence must consider the 
possible confounding effect of international 
dependence. Much research has been done 
examining the effects of investment penetra- 
tion on income inequality (see Bronschier and 
Chase-Dunn 1985, p. 117), and most studies 
have found this and other measures of 
dependency to be related to greater income 
inequality. We attempt to confirm this 
relationship using the present data set by 
regressing UQ70 on LPEN, controlling for 
LNRG70. Our results (available on request), 
including an analysis of influential cases, 
show LPEN to be positively and significantly 
related to UQ70 (beta = .34**, N —47), net of 
the significant negative effect of LNRG70. 
These findings corroborate the results of 
previous research. 


CONCLUSION 


Taken together, the findings above lead to the 
conclusion that it is multinational corporate 
penetration, not income equality, that directly 
accounts for increased levels of collective 
political violence experienced by nations. 
Studies that found income inequality to have a 
direct effect on the level of collective political 
violence did so because they did not include a 
control for international dependency, which 
significantly affects collective political vio- 
lence both directly and indirectly (through its 
effects on income inequality). Therefore, it is 
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imperative that both intra- and international 
causal factors be examined in future studies 
focusing on collective political violence. 
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IN THE DEVELOPMENT OF SOCIAL CITIZENSHIP: 
SOCIAL RIGHTS DURING SICKNESS IN 
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The extension of social citizenship via modern social policies is a fundamental 
macrolevel social change of the past hundred years. This paper attempts to 
reorient the empirical study of social policy development from its present 
concentration on aggregated social expenditures to a focus on the multidimen- 
sional aspects of the development of welfare states, social rights, and social 
citizenship. On the basis of a new data set describing the development of citizens’ 
social rights in the main social insurance programs in 18 OECD countries since 
1930, causal hypotheses derived from pluralist industrial, neo-Marxist, popular 
protest, state autonomy, and power resources approaches are tested. On the 
crucial issue of the role of left government participation in the extension of social 
rights, the hypotheses of the power resources approach are supported. 


A salient instance of social change during the 
past century is the extension of citizens’ 
social rights via modern social policies. The 
old poor laws had often deprived citizens of 
their civil rights. In the late 19th century, 
these laws gradually began to yield to a new 
invention, social insurance, which instead 
extended the rights of citizens. This extension 
of social rights led to a development that we, 
along with Marshall (1950), term social 
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citizenship. Among the Western countries, 
the speed and degree of the development of 
social citizenship and the welfare state have 


. varied. In the social sciences the causes of 


this macrolevel political and social change 
have been vigorously debated. 

Research on the causes of welfare state 
development has revolved around three basic 
issues: the relevance of class, the nature of 
the distribution of power resources, and the 
possibilities of democratic politics in Western 
societies. The long-dominant pluralist indus- 
trialism approaches belittle the role of class in 
modern societies, tend to assume that the 
distribution of power resources among inter- 
est groups is not so unequal as to be of major 
interest, and have an optimistic view of the 
possibilities of political democracy. Function- 
alist neo-Marxist approaches are focused on 
class, assume the distribution of power 
resources to be largely given by the capitalist 
mode of production, and take a rather 
pessimistic view of the possibilities of 
political democracy. The various state auton- 
omy approaches view state officials as largely 
unconstrained by societal forces and thereby 
also question the efficacy of representative 
democratic processes. The power resources 
approach expects class to be one of the major 
determinants of conflicts of interest in West- 
ern societies, holds that inequality in the 
distribution of power resources is of central 
interest. but assumes that the degree of 
inequality can vary over time and between 
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countries, and accords political democracy an 
important role in the processing of conflicts of 
interest. 

Most causal empirical studies. on the 
development of modern social policies have 
had a cross-national design (for comprehen- 
sive surveys of this literature, cf. Alber 1982; 
Shalev 1983a, 1983b; Uusitalo 1984; Sharpe 
and Newton 1984; Therborn 1986; Skocpol 
and Amenta 1986; Quadagno 1987; O’Connor 
and Brym 1988). With a few noteworthy 
exceptions (such as Rimlinger 1971; Flora 
and Alber 1981; Alber 1982; Schmidt 1982), 
the overwhelming majority of these studies 
have been focused on a global, aggregated 
characteristic of the modern welfare state, 
namely, the proportion of total public expen- 
ditures used for social security purposes (cf. 
the above surveys). 

The heavy concentration of previous re- 
search on social expenditures cannot be 
defended on grounds that in a sociological or 
political science perspective the level of 
expenditures is the most central aspect of 
welfare state development. Instead, the con- 
centration on expenditures has been largely 
data driven, comparative expenditure data 
being easily available in published sources. 
Unfortunately, however, the bulk of compar- 
ative welfare state research has come to deal 
with a global and unidimensional indicator of 
a clearly complex and multidimensional 
reality. Thus expenditure levels have only 
indirect bearing on what is the core of the 
modern welfare state—the extent and quality 
of the social rights that constitute social 
citizenship. The significance of the shift from 
the poor laws to social insurance legislation, 
from means testing to social rights, is not 
captured by the increase in social expendi- 
tures. The global unidimensional expenditure 
indicator cannot reflect differences among 
countries in types of programs and institu- 
tional structures, nor the ways in which 
welfare states differentially affect and benefit 
various categories of citizens. In studies of 
causal factors behind welfare state develop- 
ment, this type of dependent variable thus 
excludes the possibility to analyze the ways in 
which political, economic, and other factors 
have generated differential peters and struc- 
tures of welfare states. 

The time has now come to reorient the 
study of welfare state development from 
aggregated expenditures toward a multidimen- 
sional perspective, where a number of 
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different dimensions are identified and their 
interrelationships are analyzed. In this broad- 
ening of perspectives, it is fruitful to pay 
attention to the development of social rights 
via the welfare state. Since the days of 
Bismarck and Lloyd George, in the European 
countries the concept of social rights tied to 
worklife participation or citizenship has 
played a central role in policy debates as well 
as in scholarly writings. This concept appears 
to have been.of less importance in some other 
countries, such as the United States, where 
means-tested benefits continue to play a 
comparatively large role. Yet it must be 
recognized that the break between the old 
poor laws and the new welfare state was 
based precisely on the introduction of social 
rights via social insurance. Therefore, a focus 


. on the development of social rights helps us 


to chart the diffusion of this now century-old 
innovation. A plea for a focus on the multiple 
dimensions of welfare state development does 
not exclude interest in more specific aspects 
of this development. On the contrary, in 
efforts to achieve a more comprehensive 
picture of welfare state change, separate 
studies of different aspects of social policy 
development play a key role. 

The purpose of this paper is to discuss 
some of the theoretical approaches to the 
explanation of the development of modern 
social policy and social citizenship and to 
empirically test hypotheses derived from 
these theories. The paper is based on an 
ongoing longitudinal and comparative project 
on welfare state development which includes 
the specification of the rights accorded 
citizens through four basic social insurance 
programs: sickness insurance, old age pen- 
sions, unemployment insurance, and work 
accident insurance. The data set created 
within the project covers the period since 
1930 and includes 18 OECD countries: 
Australia, Austria, Belgium, Canada, Den- 
mark, Finland, France, Germany, lreland, 
Italy, Japan, Netherlands, New Zealand, 
Norway, Sweden, Switzerland, the United 
Kingdom, and the United States. These 
countries have had a record of political 
democracy during the entire postwar period.! 





! The selection of countries is thus based on the 
"comparable-cases" or “most similar systems" 
strategy (Lijphart 1975). Two European democra- 
cies, Iceland and Luxemburg, are excluded from 
the sample because of their very small size. 
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By including a relatively large number of 
countries and by extending measurements 
back to 1930, I have made an effort to 
overcome two of the major problems in 
comparative studies: the small number of 
cases and the limitation to cross-sectional 
data. 

In this paper I focus on the development of 
one of the main social insurance programs— 
sickness insurance. Although sickness insur- 
ance is only one particular aspect of social 
rights in the welfare state, it is obviously a 
central one. Health and illness are basic 
concerns for all citizens, and sickness insur- 
ance therefore constitutes one of the pillars of 
the welfare state. The significance of sickness 
insurance is, however, not based on the fact 
that it represents a large proportion of social 
expenditures. Not everybody becomes ill and 
relatively few remain ill for long periods. But 
the risk of illness applies to all citizens, and 
for the economically active also shorter 
periods of illness may involve economic 
hazards. Since sickness cash benefits have a 
bearing on workers' participation in the labor 
market, this area is likely to generate 
significant social and political issues in all 
countries. 

In the next section I will briefly review the 
main theoretical approaches to the explana- 
tion of welfare state development. On the 
bases of these theories, I derive a set of 
hypotheses which in the following sections 
are tested through an analysis of the develop- 
ment of sickness insurance programs in the 18 
countries. 


THEORETICAL APPROACHES 


Attempts to account for the development of 
social rights and modern social policies have 
generated theoretical positions that are partly 
overlapping. These positions, however, differ 
among themselves with respect to the extent 
to which they view social policy development 
as a result of rational actions of individuals or 
collectivities, as well as with respect to the 
relative role in this process assigned to factors 
such as group and class conflict, the distribu- 
tion of power resources, political parties, 
parliamentary processes, interest organiza- 
tions, and the autonomous actions of state 
officials. 

During the early stages of research, the 
pluralist school of thought provided an 
explanation of social policy development in 
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terms of a "logic of industrialism," according 
to which welfare arrangements serve the 
function of maintaining the new type of 
differentiated labor force needed in a techno- 
logically advanced society (e.g., Kerr, Dun- 
lop, Harbinson, and Myers 1960, chap. 6). 
The basic hypothesis of the pluralist industri- 
alism approach is a dual one. It holds (a) that 
economic growth provides the basis for social 
policy expansion to satisfy needs created by 
demographic and other changes accompany- 
ing industrialization, and (b) that ideological 
factors and partisan politics are of little or no 
importance in this context. It is clearly the 
latter part of the hypothesis that is the 
controversial one. Thus, for example, Stinch- 
combe (1985, pp. 423—24) argues that "the 
central false description of social insurance is 
that it is ‘redistributive’ " and that “a second 
major misperception of social insurance is 
that it was instituted in response to trade 
unions and the labor movement.” 

The functionalist neo-Marxist explanation 
agrees with the conclusion of pluralist indus- 
trialism that in the capitalist democracies, 
partisan politics play only a minor role in 
social policy development. The functionalist 
neo-Marxists maintain, however, that the 
welfare state grows not because of its 
beneficial functions for the whole society but 
because of its functions in the stabilization 
and legitimation of the position of the 
economically dominant class in capitalist 
society (e.g., O'Connor 1973). 

The appearance and expansion of the 
welfare state have also been interpreted as 
largely a product of a “modernization pro- 
cess” involving urbanization, industrializa- 
tion, and the political mobilization of the 
working class (e.g., Flora and Alber 1981). 
This approach attempts to integrate function- 
alist and conflict models of social policy 
development and predicts that in this context 
political factors are of some relevance. 

. Some of those maintaining that partisan 
politics and the formal democratic apparatus 
are ineffective in converting popular demands 
into welfare policies have suggested that 
under certain circumstances the elites in the 
capitalist democracies can be influenced to 
concede to such demands when expressed 
outside the representative, parliamentary chan- 
nels, in the form of disruptive riots, strikes, 
or demonstrations. This “popular protest” 
hypothesis has been developed especially t 
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relation to the American experience (Piven 
and Cloward 1971). 

In recent years the role of the state and the 
autonomy of state officials have been inten- 
sively debated. Thus, among sociologists 
analysing policy making, Skocpol and her 
colleagues have argued that state officials 
must be seen as weighty actors, the goals and 
actions of whom cannot be reduced to the 
demands and preferences of any social 
interest group (Skocpol 1979; Orloff and 
Skocpol 1984; Weir and Skocpol 1985). 
Among political scientists, the central role in 
policy making of state officials acting inde- 
pendently from, or contrary to, the interests 
of powerful private actors has been stressed 
by Nordlinger (1981) among others. In 
economics a separate but largely parallel 
theoretical development has taken place. Here 
Downs’s (1957) assumptions of rational 
actions by purely self-interested politicians 
have become the centerpiece in the vigorous 
public choice stream of writing, which is 
explicitly based on the assumption that state 
officials are egoistic utility maximizers, who 
seek to maximize their budgets and the 
personnel of their bureaucratic organization 
(e.g., Mueller 1979). 

In criticizing both pluralist and neo-Marxist 
functionalist interpretations, different scholars 
have stressed the role of the distribution of 
power resources in social policy development 
in Western societies. Also here the emphasis 
clearly is on rational actors. À common 
denominator within this power resources 
approach is the assumption that the class 
structure forms a highly salient potential base 
for political cleavages in Western societies, 
and that in capitalist democracies these 
conflicting interests are channelled to a 
significant extent through electoral outcomes 
and partisan control over government. Inequal- 
ities generated by class structure and reflected 
in partisan politics are therefore expected to 
be important factors behind welfare state 
development (cf., e.g., Korpi 1974, 1978, 
1980, 1983; Castles 1978; Stephens 1979; 
Shalev 1983a; Esping-Andersen and Korpi 
1984; Esping-Andersen 1985).? 


? Selected aspects of the power resources 
approach are sometimes presented as "the working 
class strength" hypothesis or the "social demo- 
cratic model." Contrary to, for instance, Skocpol 
and Amenta (1986, p. 140), I do not interpret this 
as a one-factor theory claiming to explain welfare 
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The power resources approach to social 
policy development starts in the observation 
that in the welfare state politics is deliberately 
used to modify the play of market forces 
(Briggs 1961, p. 228). The welfare state can 
therefore be fruitfully analyzed in terms of the 
ways and the extent to which it affects the 
relative roles played by politics and markets 
in a society. Markets and politics are seen as 
institutionalized, partly alternative, partly 
overlapping strategies or arenas for the 
mobilization of resources, the distribution of 
rewards, and the steering of society. From a 
power resources perspective, a crucial assump- 
tion is that the types of power resources that 
can be mobilized and used in politics and on 
markets differ in class-related ways (Korpi 
1985, 1978). In the market, capital and 
economic resources form the basis of power, 
and such resources tend to be more or less 
unequally distributed among social classes 
and interest groups. In democratic politics, 
however, the principal power resources are 
the right to vote and the right to organize for 
collective action. While suffrage can be based 
in factors such as property and income, in 
democratic societies it is tied to citizenship 
and is thus equally distributed, as is the right 
to organize. 

Numerically large collectivities, which are 
relatively weak in terms of their market 
resources, can therefore be expected to 
attempt to use their more favorable positions 
in terms of political resources to affect the 
conditions for and outcomes of distributive 
market conflicts. Such attempts are likely to 
be resisted by actors whose market positions 
are more favorable, something which will 
create a tension between markets and politics. 
This tension between markets and politics is 
likely to be reflected in the development of 
social citizenship and the welfare state. 

Marshall (1950, p. 28) defined citizenship 
in rather general terms as "a status bestowed 
on those who are ful members of a 
community." All citizens are thus equal with 
respect to the rights and duties with which 
this status is endowed. What these rights are, 
however, is an open question. According to 
Marshall (1950, p. 11), the social element of 
citizenship can run "the whole range from the 





state development more or less exclusively in 
terms of working class or left strength. A plea for a 
multifactorial understanding of welfare state devel- 
opment is made in Korpi 1983, p. 187. 
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right to a modicum of economic welfare and 
security to the right to share to the full in the 
social heritage and to live the life of a 
civilized being according to the standards 
prevailing in the society." Social policy can 
be used by those who are relatively weak in 
terms of market resources to extend social 
rights and thereby to enrich the status of 
social citizenship.? The establishment of 
legislated social rights decreases the scope of 
markets and changes the basis of distribution 
in these areas from market power to political 
‘resources. In the areas involved in the equal 
status of citizenship, the criteria for distribu- 
tion thus shift from buying power on the 
markets toward politically based consider- 
ations of justice. ; 

In industrializing societies the interest in 
limiting the reach of the market was shared by 
many categories of citizens (Polanyi 1944). 
Relative to most other groups or collectivities 
in Western societies, wage earners and 
workers can, however, be assumed to be 
disadvantaged in terms of individual market 
resources. The power resources approach thus 
generates the hypothesis that the majority of 
wage earners can be expected to have greater 
interest than other groups in using political 
interventions to modify market processes and 
market criteria in distribution, and therefore 
in extending social rights and social citizen- 
ship. Contrary to the economic theory of 
political democracy (Downs 1957), in the 
power resources perspective, political parties 

` and other interest organizations thus tend to 
appear as the organized expressions of 
collective action geared toward safeguarding 
the diverging interests of groups differently 
located in the social structure. Therefore, 
parties can be expected to adopt differing 
political stands in areas such as social and 
welfare state policies. 

The power resources approach does not, 
however, imply that social policy develop- 
ment is based on the organizational and 
political power of the working class and left 
parties alone.^ In many countries, for exam- 





3 The concept of social rights is here used 
primarily in the sense of claim rights (cf. Weale 
1983, chap. 7). 

* Borg and Castles (1981) and Wilensky (1981) 
consider the strength of the main right-wing parties 
and of Catholic parties, respectively, to be the key 
variables behind welfare state expansions. These 
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ple, farmers' parties, conservative parties, 
and Catholic parties have been significant 
actors in this area. Instead, the power 
resources approach generates what in essence 
amounts to a game theoretical perspective on 
the analysis of interdependent actors, where 
the decisions made and the strategies adopted 
by each actor are seen as affected by the 
actor's perceptions of the relative power and 
probable choices of other actors. Therefore, 
in the study of policy development, inten- 
tional analysis must complement causal anal- 
ysis (Elster 1983, chap. 3). 

Empirical attempts to test to what extent 
"politics matter" are generally focused on the 
correlation between partisan government in- 
cumbency and policy output. The game 
theoretical perspective adopted here indicates 
that this approach is too limited. Thus the 
effects on social policy of changes in the 
distribution of power resources in favor of 
wage earners and the left need not be 
mediated via left government control. Strate- 
gic action by conservative governing elites in 
the context of mass mobilization via unions 
and the presence of a left party drawing 
significant electoral support may thus lead to 
what Duverger (1954, p. xxvii) termed 
"contagion from the Left," where demands 
from the political left are taken up by other 
parties, and the political middle ground shifts 
leftwards. This development is, however, not 
primarily a result of party competition as such 
but by party competition in the context of a 
relatively strongly mobilized left. In this 
perspective also the early social policy 
initiatives such as those of Bismarck— 
coming already when suffrage was more or 
less narrowly restricted —can be interpreted as 
strategic responses by conservative actors to 
the changing distribution of power resources 
brought about through the political mobiliza- 
tion of relatively large sectors of the less 
privileged parts of the population. 

In this context it is also important to keep 
the time dimension in mind and to remember 
that the frontiers of distributive conflicts can 
shift over time. What once were heated 
political issues may gradually come to be 
accepted by all interest groups, conflicts then 
shifting to new issues concerning the relative 
roles of markets and politics. The analysis of 


alternative formulations will be tested in another 
context. d 
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how market processes are modified by 
welfare state policies must therefore look at a 
broad range of issues and be sensitive to the 
historical context. 

The recent debate on the state involves two 
separate issues—the role of state institutions 
and the degree of autonomy of state officials. 
Contrary to the neutrality implied in much of 
pluralist writings, in the power resources 
perspective institutional arrangements appear 
as the results of investments of power 
resources and therefore as involving selectiv- 
ity in the opportunities and constraints for 
action that they introduce (Korpi 1985). 
Institutions are thus seen as structures of bias. 
This applies also to the institutions and the 
legal order that constitute the state. Thus the 
views on the role of institutional aspects of 
the state in the power resources and in the 
state-centered approaches are largely in agree- 
ment. When it comes to the question of the 
autonomy of state officials, the difference 
between these two approaches is a matter of 
degree. Contrary to an overly simplified 
interpretation sometimes presented (Weir and 
Skocpol 1985, p. 117), the power resources 
approach does not imply that politics in 
representative democracies is a simple trans- 
mission belt conveying the preferences and 
demands of various interest groups to the 
leaders, who implement them. Instead the 
relationship between democratically elected 
state officials and the rank and file member- 
ship of their base organizations is seen as one 
of interaction, role differentiation, and divi- 
sion of labor. In the power resources 
perspective, these circumstances contribute to 
grant a considerable freedom of choice to 
state officials. In the modern Western democ- 
racies, however, because of their tenuous 
control over the main basic power resources, 
the autonomy of state officials will be 
circumscribed. Public choice and state auton- 
omy approaches, of course, argue for a much 
larger role for the autonomous actions of state 
officials in policy formation. 


MEASUREMENT OF SOCIAL RIGHTS 
DURING SICKNESS 


Social citizenship refers to rights that individ- 


5 Democratically elected leaders, as well as 
career bureaucrats on whom the elected officials 
partly rely, are expected to take initiatives and to 
develop plans, generally within broadly defined 
mandates from their constituents. 
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uals have as an outflow of their status as 
citizens. These social rights can be claimed 
without citizens having to demonstrate eco- 
nomic need (for example via a means test). 
Instead, claims are accepted if the individual 
falls in a specific category, for example in 
terms of age, health, or employment. 

When exercised, social rights established 
through legislation generate public expendi- 
tures. While expenditures, especially in 
disaggregated forms (e.g., Flora 1986, 1987) 
clearly are of interest in themselves, the two 
commonly used sources of aggregated social 
expenditures are only indirect, partial, and 
partly misleading indicators of the develop- 
ment of social rights.$ Variations in social 
expenditures are the result of changes in a 
number of factors — social rights, but also, for 
example, the demographic composition of the 
population, needs, and economic conditions. 
Increasing social expenditures thus need not 
reflect an enrichment of the status of social 
citizenship. If we are interested in the 
development of social citizenship, it is 
therefore clearly necessary to complement 
figures on social expenditures with direct 
measures of social rights. 

The data on social rights analyzed here 
concern sickness cash benefits and reflect the 
extent to which the normal living standards of 
an average industrial worker are protected 
through legislated rights to cash benefits 
during illness.? (For details concerning empir- 
ical data and construction of indices, see the 
Methodological Appendix.) My data thus 
reflect to what degree social rights insulate 
such a worker from the pressures to partici- 
pate in the labor market, by enabling the 
worker to maintain the customary standard of 
living in case of sickness. Since I am focusing 
on social rights, I do not here include rights to 
cash benefits acquired through market mech- 
anisms such as private insurance or collec- 
tively bargained insurance. 

Social rights during sickness can be 
described in terms of several aspects referring 
to levels of and conditions for income 
replacement. To make possible meaningful 
comparisons across countries and over time, I 
have chosen to calculate the right to cash 
benefits of a full-time industrial worker 
earning the average wage of industrial 


$ ILO, various years, and OECD (1985). 
7 The health-care and hospitalization parts of 
sickness insurance are excluded here. 
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workers during each year and in each country, 
respectively, and to relate the cash benefits to 
this average wage. While sickness cash 
benefits used to be nontaxed income, in 
recent years the benefits have often been 
made taxable. It has therefore been necessary 
here to compute net benefits (after taxes and 
social security contributions) and to relate 
them to net earnings. Since replacement rates 
may vary between different types of house- 
holds, I describe them for two traditional 
household types—a single person, and a 
four-person family with one wage earner. It is 
also necessary to take into account that 
replacement levels may vary over the duration 
of illness. Therefore, I describe benefit levels 
during short-term (one week) and long-term 
(26 weeks) illness. Among the conditions for 
the right to sickness cash benefits, I have here 
included waiting days (i.e., the number of 
days at the beginning of an illness during 
which benefits are not paid).® 

Each one of the above five aspects of 
sickness cash insurance has been separately 
regressed on the independent variables. In 
general these five measures were found to 
have similar relationships to the independent 
variables. They are also intercorrelated among 
themselves. I have therefore combined them 
into an Index of Social Rights which thus 
consists of the following components: 


1. Replacement level for a single worker, which 
indicates the proportion that the posttax 
benefit (after waiting days) makes up of the 
posttax wage of an average production 
worker, during (a) the first week of illness; 
(b) 26 weeks of illness and 26 weeks of 
normal earnings. ` 

2. Replacement level for a couple with two 
minor children with posttax benefits (after 
waiting days) related to the posttax wage of 
an average production worker during (a) the 
first week of illness; (b) 26 weeks of illness 
and 26 weeks of normal earnings. 

3. The number of waiting days before a worker 
can claim benefits. To indicate change in 
social rights in the same direction as the 
replacement levels, the number of waiting 
days has been reflected, that is, taken as the 
maximum number of waiting days (in this set 


8 Because sickness insurance programs are 
sometimes phased into invalidity programs, it has 
been difficult here to define a valid measure of the 
duration of sickness benefits. Duration is therefore 
not included in the present definition of social 
rights during sickness. 
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of countries during the whole period) minus 
the observed number of waiting days. 


In the combined’ index, each of the four 
components 1a, 1b, 2a, and 2b (benefit 
levels) is given one-sixth of the total weight, 
while component 3 (waiting days) is given 
one-third of the total weight. 

The second index is the Coverage of 
Sickness Insurance, that is, the number of 
insured persons with the right to cash benefits 
in relation to the population 15—64 years of 
age. The right to cash benefits must, 
however, be seen as a matter of degree. In the 
Index of Social Rights, I have therefore 
included income-tested programs such as 
those found in Australia and New Zealand. 
Since the social insurance concept does not 
apply to income-tested programs, I have, 
however, excluded them from the coverage 
measures. 


MODEL SPECIFICATION 
AND ESTIMATION 


A problem that plagues comparative research 
is the difficulty in finding valid and reliable 
empirical indicators of the independent vari- 
ables that enter into the hypotheses one wants 
to test. While this is a problem for qualitative 
as well as quantitative comparative studies, it 
is more likely to be made explicit in the 
quantitative ones. Because of this lack of 
fully satisfactory indicators of relevant vari- 
ables, it is difficult to make clear-cut tests 
between competing hypotheses. 

The capacity aspect of the pluralist indus- 
trial hypothesis will here be specified in terms 
of the economic growth rate, that is, the rate 
of change of GDP per capita.? It has not been 
possible to devise a conceptually satisfying 
indicator of need for which quantitative data 


? Since the GDP/capita variable reflects a 
conglomerate of different societal characteristics 
and since both the index of social rights and 
GDP/capita show clearly increasing trends over 
time (the correlation between GDP/capita and year 
of observation is 0.82 for the period 1930-1980), it 
is not possible to interpret the relationship between 
economic level and social rights in causal terms. 
For the period 1930-1980, the correlation of social 
rights with GDP/capita is positive, as expected, but 
low (r = 0.22). The partial correlation between 
GDP/capita and the extent of social rights, with 
time (year) held constant, is low with an 
unexpected negative sign ( —0.27). 
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are available. This omission is, however, not 
of crucial importance, since the idea that in 
some sense needs can be of relevance is 
hardly contested by the competing ap- 
proaches. The controversial part of the 
pluralist industrial hypothesis is instead the 
assumption that partisan politics will be of 
little or no significance in this context. 

A good specification of the power re- 
sources approach would need to consider the 
distribution. of important power resources 
among major interest groups or classes in 
society. Because of the glaring lack of 
relevant and comparable empirical data on 
such issues, I am here forced to limit myself 
to indicators of the relative strength of left 
parties (traditional social democratic parties 
and parties to their left) in the electorate, and 
in the cabinets, as well as to the cabinet share 
of the left weighted by its share of seats in the 
legislature, and the proportion of time during 
which left parties have held seats in the 
governments.!° Also union density (i.e., the 
number of union members in relation to the 
size of the nonagricultural labor force) will 
here be used as an indicator of left power. 

As noted above, the power resources 
approach predicts that the effects of the 
relative power of the left can be expected to 
follow different paths. Thus, evidently the 
extent, weight, and duration of left parties in 
government position are expected to improve 
sickness cash insurance. But the increasing 
strength of the left outside cabinets, as 
indicated by an increasing number of votes 
for left parties of the level of union density, 
can be expected to lead to a “contagion from 
the Left,” thus generating strategic action to 
improve social policy among nonleft govern- 
ment parties as well. To the extent that this 
occurs, the correlation between government 
partisan complexion and policy output will be 
attenuated. The rate of economic growth can 
be expected to affect the ease with which 
social reforms can be introduced. Here the 
alternative hypothesis, suggested by function- 
alist theories of both pluralist and neo- 
Marxian provenance, is that the above 


10 The social democratic parties are defined as 
those which are members of the Socialist Interna- 
tional. According to this definition, the United 
States has no major left party. In Italy the very 
small social democratic party (PSDI) has not been 
included in determining the length of left participa- 
tion in government. ; 
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political variables will be of little or no 
importance for the development of sickness 
insurance. 

The state autonomy and public choice 
approaches are not easy to specify.!! As noted 
above, one of the basic assumptions within 
the public choice and state-autonomy ap- 
proaches is that it is in the self-interest of state 
officials to increase not only their budgets but 
also the size of public bureaucracies. If we 
take this assumption seriously, we would 
expect the relative number of central govern- 
ment personnel to reflect the extent to which 
state officials have been able to use their 
autonomy to expand their bureaucracies and 
budgets. A possible indirect indicator of state 
autonomy would therefore appear to be the 
numeric strength of central government per- 
sonnel relative to the population.!? It is, 
however, also likely that the number of state 
personnel is affected by other factors, for 
example, by welfare state expansion, and that 
this indicator therefore cannot fully distin- 
guish between the determinants of welfare 
state growth hypothesized in the state auton- 
omy approach, and in the power resources 
and other approaches. 

A key assumption in state autonomy 
theories is that “one hidden (or overt) feature 
of all autonomous state action will be the 
prerogatives of state officials” (Skocpol 
1985, p. 15). This assumption is here taken as 
a basis for a second indirect indicator of state 
autonomy. The proportion of all social 
expenditures that state officials have managed 
to direct to programs exclusively benefiting 
civil servants is taken as a measure of “civil 
service prerogatives,” and thus as an indica- 
tor of the extent to which state officials have 
had opportunities for autonomous action. This 
indicator is, however, available only for the 
postwar years. 

The rationale for both the above indicators 
of the latent variable “state autonomy” is the 
same. I assume that ceteris paribus the 
self-seeking drive among state officials is 
constant, and can therefore take the extent to 
which they have been able to realize their 


H Skocpol (1985, p. 14) accepts that cross- 
national research should be able to indicate in 
general terms whether a state has stronger or 
weaker tendencies towards autonomous action. 

12 Of course only a minor part of the central 
government personnel as defined here will partici- 
pate in policy-making. 
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interests in increasing the size of state 
bureaucracies and in safeguarding their spe- 
cific interests within the social insurance 
system as indirect indicators of their auton- 
omy from societal constraints. Increasing 
autonomy of state officials reflected in 
increasing size of state bureaucracies and in 
increasing prerogatives of civil servants is 
hypothesized to increase social rights and 
coverage in sickness insurance. 

The popular protest approach is also 
difficult to specify in a comparative study. As 
a proxy for the intensity of protest, I have 
here taken the volume of industrial conflict, 
that is, the average number of working days 
used in industrial conflicts during a year, 
related to the size of the nonagricultural labor 
force. This index is especially sensitive to 
large and prolonged conflicts. With increas- 
ing volumes of industrial conflict, social 
rights and coverage of sickness insurance are 
thus expected to increase. 

The size and complexity of the present data 
set make it advantageous to use quantitative, 
statistical methods of analysis. In comparative 
studies of this type, however, it is not a 
realistic goal to attempt to determine more 
precisely the relative roles played by different 
factors in the historical processes of interest 
here. At best, regression analyses can give a 
relatively objective basis for interpretations of 
the presence or absence of effects of potential 
causal variables and for plausible accounts of 
causal processes in this context. 

To make more effective use of my 
empirical information, I will pool cross- 
section and time series data and apply 
regression analysis to the combined data set. 
In so doing, we run into two partly related 
problems. Thus, because of the impossibility 
of including all relevant causal variables into 
regression analyses of comparative, historical 
data sets, specification errors are bound to 
occur. In addition, time series data tend to 
exhibit autocorrelation in residuals, and error 
variances may differ among cross-sectional 
units. If ordinary least squares (OLS) assump- 
tions are not fulfilled, the OLS parameter 
estimator remains unbiased and consistent but 
it is no longer efficient. Furthermore, the 
estimator of error variances of parameter 
estimates will be biased. 

A number of statistical techniques have 
been developed to handle these problems 
(e.g., Kmenta 1986, chap. 12; Johnston 
1984, chap. 10—13; Stimson 1985). Most of 
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the commonly used techniques are based on 
the assumption that slope coefficients are the 
same for all cross-sectional units, but the 
models differ in their assumptions about 
intercepts and characteristics of residuals. An 
important question here is whether we can 
assume that observations in different units are 
samples from the same population, that is, 
whether units have the same intercept. If this 
question is answered positively, we would 
typically as a first step apply àn OLS model to 
the pooled data, and in second and further 
steps transform data.to make residuals 
approach OLS assumptions. 

Quite often, however, cross-sectional units 
differ in terms of levels of the dependent 
variable. In such a case a model with equal 
intercepts would be a misspecification, which 
generates high serial correlation of a type that 
does not decay with increasing lags. One 
common cause for autocorrelation is that 
relevant causal variables have been omitted. 
In the presence of high autocorrelations stable 
over time, the first step should therefore be an 
attempt to include relevant causal variables in 
the model rather than variable transformations 
to correct for autocorrelation or other devia- 
tions from OLS assumptions. In such situa- 
tions it is possible to introduce varying 
intercepts for units (and/or time periods) to 
cover up for our ignorance of relevant causal 
variables. Depending upon our assumptions 
about these intercepts, we can construct two 
different but partly related models. _ 

If intercepts are assumed to be constants 
(dummy variables) that differ among coun- 
tries, we arrive at the fixed effects, Least 
Squares with Dummy Variables (LSDV) or 
covariance model. This model utilizes only 
the within-unit portion of the total variance 
and relegates between-unit-variance to the 
intercept dummies. These dummies are ex- 
pected to account for differences between 
countries caused by unmeasured factors that 
are stable over time. Here errors are typically 
assumed to follow OLS specifications. Al- 
though it is seldom done, if the structure of 
residuals deviates from these specifications, it 
is, however, also possible to construct 
Generalized Least Squares (GLS) estimators, 
which, for example, correct for autocorrela- 
tion in residuals (Johnston 1984, p. 400). 

Another generalized least square model, 
the random effects or error component model 
(GLSE), assumes intercepts to be random 
variables, the average of which constitutes the 
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common intercept. In one version of this 
model, in each unit a positive or negative 
random increment is assumed to be added to 
the common intercept. These differential 
random increments are merged with the 
disturbance term, which thus is assumed to 
have two independent components. The 
presence of the unit intercept component 
produces a correlation between residuals of 
the same units at different points of time. This 
serial correlation is assumed to be stable and 
to be generated only by the unit-specific 
random increments to the common intercept; 
that is, no “genuine” autocorrelation is 
present. 

As noted above, in contrast to the standard 
OLS model using both within- and between- 
unit variation, the LSDV model uses only 
variation within cross-sectional units and 
disregards the between-unit component of the 
total variance. The error component model 
was originally developed to utilize also some 
of the between-unit variation. It is thus a 
compromise between the OLS and the LSDV 
model and uses a weighted average of the 
within- and the between-unit parts of the total 
variance. It has been shown that LSDV 
estimators asymptotically approach GLS esti- 
mators and that “covariance estimators come 
off with a surprisingly clear bill of health” 
(Wallace and Hussain 1969, p. 68). Where 
independent variables are correlated with 
intercepts, the error components model should 
not be used. 

Since it is not obvious which of these 
models should be chosen here, I opt for 
testing both types of models. While these 
models thus differ with respect to assump- 
tions concerning country intercepts, they both 
assume slope coefficients to be the same in all 
countries. Where autocorrelation remains in 
the residuals from the LSDV models, vari- 
ables will be transformed in an attempt to 
correct for serial correlation. 

A serious problem with regression analysis 
of comparative data is that because of the 
relatively small number of observations, 
when the number of independent variables 
increases, the estimated coefficients will often 
be based on rather few empirical observations 
(Shalev 1985; Ragin 1987, chap. 4). Even 
when we pool cross-section and time series 
data, the number of independent observations 
often is limited. This circumstance adds to the 
general difficulties involved in the use of 
statistical controls in nonexperimental re- 
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search (Lieberson 1985). Furthermore, in the 
present data set the three indicators of left 
party government participation are ‘highly 
intercorrelated, something which introduces 
problems of multicollinearity. I will therefore 
limit the number of independent variables in 
the regression models, and government vari- 
ables will be introduced only one at a time. ` 
Since both the dependent and some of the 
independent variables exhibit increasing trends 
over time, a linear trend variable (year) will 
be intróduced here to guard against erroneous 
interpretations of spurious effects.!? 

Because of the very long time period 
studied here—half a century — it is likely that 
causal processes have changed over time. 
Therefore, separate models are estimated for 
two different subperiods. One rather natural 
subperiod covers 1950—1980 and is thus 
dominated by the "golden years" of postwar 
economic growth but involves also the 
beginning of the present worldwide recession. 
The remaining period, 1930—1947, starts in 
the years of the Great Depression and ends in 
the immediate aftermath of World War II. 

The hypotheses discussed above generate 
the following basic regression model, where 
the signs of the independent variables indicate 
the expected direction of their effects on the 
dependent variable—social rights in sickness 
insurance: 


SOCRIGHTS;, = ag + b; GDPGROWTH,, 
+ b; INDCONFLICT,, 
+ b STATEPERS,, 
+ b, LEFTVOTE; 
+ bs LEFTCAB, + 
+ bsUNIONDENSITY,, 
+ bITREND; + ug. 


Here subscript i refers to countries (i = 1, 
FIO ,18) and subscript ¢ to years of 
observation (t = 1930, 1933, 1939, and 1947 
for one time period, and 1950, 1955, 1960, 
1965, 1970, 1975, and 1980 for the other 
period), and 


SOCRIGHTS = Social rights in sickness 
insurance as defined above, 


13 Specifications with time (year) dummies have 
been tested and yield results that are very similar to 
the ones reported here, however with slightly 
lower R?. 
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GDPGROWTH = Growth of gross domestic 
product per capita, 

INDCONFLICT = Number of working days 
used for industrial conflicts per 10,000 persons 
in the nonagricultural labor force, 


STATEPERS = Central state personnel as 
proportion of the total population (for the 
postwar period also Civil Service Prerogatives, 
i.e., the proportion of total social expenditures 
going to programs exclusively directed to civil 
servants), 


LEFTVOTE = Proportion of votes in general 
elections cast for left parties, 


LEFTCAB = Proportion of cabinet seats held 
by left parties, alternatively Weighted Left 
Cabinet Share, that is, the proportion of left 
cabinet seats weighted by the proportion of left 
seats in the parliament, or Time in Government 
of Left Parties, that is, proportion of time with 
left parties in government, 


UNION DENSITY = Number of union mem- 
bers in relation to the size of the nonagricultural 
labor force, 


TREND = Linear time trend (year of observa- 
tions), 


a) = Common intercept, and 
ug = Error term. 


For each of the two time periods, three 
models that differ in terms of the left cabinet 
variables included (cf. above) will be esti- 
mated. In equations dealing with the postwar 
period, the index of civil service prerogatives 
is used as an alternative indicator of state 
autonomy. Similar equations will also be 
estimated with the coverage of sickness cash 
benefit insurance as the dependent variable. 

Since we do not have continuous overtime 
observations on the dependent variables, the 
. values of the independent variables refer to 
averages for the (usually) five-year period 
preceding the observation on the dependent 
variable. The indicators referring to political 
conditions, industrial conflict, and economic 
growth rates are lagged by one year.!^ As an 


14 Lags with 0,1 and 2 years were tested: For the 
lagged variables, annual observations are avail- 
able. The state autonomy indicators are indirect 
measures and I expect that the degree of autonomy 
will be simultaneously reflected in the size of state 
personnel, the extent of civil service prerogatives, 
and social rights in sickness insurance. The state 
autonomy variables have therefore not been 
lagged. They change relatively slowly. 
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example, the extent of social rights observed 
in a country in 1980 is assumed to be affected 
by the average values of the lagged indepen- 
dent variables for the period 1974—1979 and 
by the average values for 1975 and 1980 of 
the state autonomy variables. This averaging 
of the values of the independent variables will 
decrease their variation and thus their poten- 
tial effects. In this connection it should also 
be noted that while the hypotheses predict 
monotonic relationships between the political 
variables and social rights, the present 
specifications make the rather strong assump- 
tion that the effects of these variables are 
linear. 

Although the observations do not constitute 
a random sample, significance tests are used 
as a heuristic device in the discussion of 
results.5 Since the hypotheses predict the 
direction of the effects on the dependent 
variable, one-tailed tests are applied. OLS 
estimations of these models result in strong 
autocorrelations among residuals.!6 Inspec- 
tion of data as well as statistical tests indicates 
that country intercepts clearly differ. I will 
therefore use LSDV and error component 
models to estimate the above equations.!7 

I begin the statistical analyses by looking at 
the determinants of social rights in sickness 
insurance during the period 1930-1947 (Table 
13.18 Since residuals from the LSDV models 
do not indicate serial correlation, no correc- 
tion for autocorrelation is necessary here (cf. 
Methodological Appendix). On the whole, 
the results from the LSDV and the error 
component models are very similar. Thus, in 
the LSDV as well as in the error component 
models, the rate of economic growth gives the 
expected positive coefficients which, how- 
ever, are far from significant. Also the 


15 The interpretation of significance tests is here 
open to discussion, since no sampling variation is 
present and, at least for the political variables, 
measurement errors are small. 

16 Median values of rho in different countries 
range from 0.70 to 0.80. 

17 This means that in the LSDV models, 17 
country dummies are added. Since the models to 
be estimated here contain a common intercept, to 
avoid collinearity one of the country dummies 
(France) is omitted. The common intercept thus is 
the intercept for this unit. 

18 Because of the presence of nondemocratic 
Bovernments, observations for Austria and Ger- 
many in 1939 and for Italy and Japan 1930-1939 
have been omitted in these regressions. 
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Table 1. Unstandardized Coefficients for Estimates of Effects on Social Rights in Sickness Insurance in 18 OECD 


Countries, 1930-1947 


























(1) (2) (3) (4) (5) (6) 
LSDV GLSE LSDV GLSE LSDV GLSE 
Economic 0.731 0.499 0.607 0.431 0.455 0.265 
growth rate (3.458) (3.175) (3.325) (3.043) (3.645) (3.346) 
Volume of 0.543* 0.526* 0.582* 0.567* 0.410 0.400 
industrial conflict (0.304) (0.281) (0.292) (0.269) (0.313) (0.288) 
Central state —0.200 —0.001 —0.280 —0.093 —0.048 0.131 
personnel (0.528) (0.443) (0.509) (0.430) (0.550) (0.457) 
Left votes 3.136 2.619 2.011 1.710 4.968 4.106 
(3.180) (2.688) (3.117) (2.637) (3.217) (2.688) 
Left cabinet share 2.014** 1.872** 
(0.806) (0.741) 
Weighted left 4.983** 4.729*** 
cabinet share (1.579) (1.438) 
Time in government 0.773 0.690 
of left parties (0.564) (0.523) 
Union density —0.447 —0.484 —0.245 —0.368 —1.041 —0.874 
(2.432) (2.091) (2.340) (2.018) (2.545) (2.175) 
Trend 0.044 0.038 0.041 0.036 0.045 0.041 
(0.030) (0.026) (0.028) (0.025) (0.031) (0.027) 
Constant — 75.796 — 68.582 —71.066 — 64.598 — 79.034 — 73.254 
(56.279) (50.175) (54.164) (48.334) (59.333) (52.563) 
df 39 56 39 56 39 56 
R? (adjusted) 0.896 0.035 0.904 0.349 0.884 0.252 
Standard error 0.892 0.831 0.858 0.796 0.939 0.875 
Country components 
Netherlands 3.192 3.382 2.995 3.185 3.453 3.577 
Japan 2.807 2.724 2.374 2.365 3.384 3.147 
Austria 1.662 1,595 1.707 1.632 1.614 1.544 
Treland 0.354 0.338 0.023 0.046 0.763 0.657 
Switzerland 0.330 0.474 0.202 0.344 0.355 0.478 
Germany — 0.069 0.040 —0.265 —0.143 0.144 0.198 
Italy —0.769 —0.584 —0.706 —0.568 —0.439 —0.243 
Belgium — 1.174 —0.977 —1.211 — 1.037 — 1.074 — 0.887 
Norway —1.501 — 1.349 — 1.669 — 1.532 — 1.039 —0.916 
United Kingdom — 2.081 — 1.704 —2.080 — 1.747 — 1.616 — 2.292 
Sweden ~ 2.870 — 2.663 —3.046 —2.878 —2.556 — 2,354 
New Zealand — 3.304 — 3.470 — 3.535 — 3.689 — 3.297 —3.447 
Denmark ~ 3.786 — 3.399 —3.882 —2.825 —3.078 —2.749 
USA —4.861 —4.621 — 5.373 — 5.096 —4.149 — 4.037 
Canada — 4.950 — 4.646 — 5.436 —5.105 —4.285 — 4.100 
Australia — 5.739 — 5.280 — 5.950 —5.534 —5.289 — 3.689 
Finland —6.120 —5.708 —6.184 — 5.826 —6.131 —5.684 
* px0.05. ** ps0.01. *** ps0.001. 


volume of industrial conflict has positive 
effects, four of which are significant. The 
relative size of central state personnel re- 
ceives weak and mostly negative coefficients. 
The effects of left votes are positive but not 
significant. Coefficients for left cabinet share 
and weighted left cabinet share are positive 
and clearly significant, while the coefficients 
for time of left parties in government are 
positive but not quite significant. The effects 
of union density are consistently negative and 
weak. The time trend during this period is 
positive but not very marked. 
Residuals in the LSDV models approach 


normality without outliers, but some degree 
of heteroscedasticity remains. According to 
the Hausman specification test (Kmenta 1986, 
pp. 634-35), in the error component models 
independent variables are not correlated with 
country intercepts. The error components 
model results in standard errors of estimates 
that are somewhat lower than for the LSDV 
models. As a result of the presence of country 
dummies, the proportion of variance ex- 
plained is, however, much higher for the 
LSDV models than for the error components 
models. 

When the LSDV models are estimated for 
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the 1950-1980 period, the originally very 
high levels of serial correlation in OLS 
residuals are considerably decreased. How- 
ever, some, perhaps nonnegligible, amount of 
autocorrelation remains. Therefore, also a 
GLS-LSDV model is estimated, where vari- 
ables are transformed to attempt to correct for 
serial correlation. The estimates based on 
LSDV and error component models are again 
very similar, while the results from the 
GLS-LSDV model occasionally differ some- 
what from the others. 

Also for the postwar period, the effects of 
economic growth rates on social rights in 
sickness insurance are consistently positive 
but now they are not significant (Table 2). 
Volume of industrial conflict shows weak 
effects with mixed signs. Estimates for civil 
service prerogatives are mixed—negative but 
insignificant in the LSDV model, negative 
and significant in the GLS-LSDV model, and 
positive but very weak in the error compo- 
nents model.!9 The effects of left votes are 
generally weak, positive in the LSDV and 


,error components model but negative in the 


GLS-LSDV model. All models do, however, 
indicate positive and significant effects of the 
three government composition variables — left 
cabinet share, weighted left cabinet share, 
and time of left parties in government. Also 
the effects of union density are now positive 
and clearly significant. During the postwar 
period, time trends are marked. 

In the LSDV models, residuals closely 
approach a normal distribution. No outliers 
are present, but some degree of heteroscedasti- 
city between countries remains. In the error 
component model the Hausman specification 
test does not indicate correlations between 
country intercepts and the independent vari- 
ables. Standard errors of estimates are 
roughly similar for all models, but again 
because of the presence of separate country 
dummies, the LSDV models account for a 
larger proportion of total variance than do the 
error component models. 

As indicated above, country dummies are 
expected to pick up differences between 
countries which are the result of omitted but 
time-invariant causal variables. To bring out 
possible patterns among countries, in the 





19 When the indicator of civil service preroga- 
tives is replaced by the measure of central state 
personnel, the coefficients of these models remain 
far from significance. 


tables countries have been ordered in terms of 
decreasing size of dummies in the first 
equations. Country components in the error 
component models as well as dummies in the 
GLS-LSDV models roughly follow the same 
rank order. It must be remembered here that 
the sign, size, and significance of intercepts 
depend upon the choice of unit for which the 
intercept is omitted (here France) The 
relative size of the country intercepts can, 
however, be taken as a starting point for 
speculations about the nature of omitted 
causal variables. They indicate that different 
types of political factors and state institutions 
may be involved in the omitted variables. 

For the 1950-1980 period, my models give 
the largest overestimates of the extent of 
social rights for the United States, Canada, 
Finland, and Australia. These countries have 
introduced sickness cash insurance relatively 
late or not at all. The United States, Canada, 
and Australia are all federal states, where 
constitutional factors may have acted as a 
hindrance to social policy legislation. Finland 
is the Protestant country that has had the most 
severe split among the left parties during the 
postwar period. This split may have made it 
difficult for its labor movement to utilize its 
numeric strength in political conflicts. 

On the other hand, my models underesti- 
mate the level of social rights primarily in 
Germany, the Netherlands, Japan, and Bel- 
gium. Three of these are countries where 
Catholic mass parties have been important 
during the postwar period and may have 
served as channels for wage-earner demands. 
In Japan a conservative, patriarchal, and state 
interventionist pattern of policy making has 
dominated since the end of the 19th century. 
An important part of this tradition is an 
attempt to gain the allegiance of the more 
privileged sections of the working class 
tnrough selective favors. The sickness cash 
benefits registered here may be results of such 
a political strategy. With the exception of 
Germany and Belgium, the country intercepts 
Show relatively similar patterns during the 
1930-1947 period.?° 

My models do not fare weil when it comes 
to predicting the coverage of sickness cash 
benefit insurance in the population 15—64 
years of age. Figures not given here indicate 


20 Since periods of nondemocratic government 
are excluded, in Japan the 1930-47 intercept is 
based only on the 1947 observation. 
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that in LSDV specifications of the same type 
as used above, in the 1950-1980 period only 
union density comes out with significant 
coefficients and the expected positive signs. 
Coefficients for the volume of industrial 
conflict are significant but negative. The three 
indicators of left government participation are 
positive and insignificant. Coefficients for the 
proportion of left votes and for civil service 
prerogatives are insignificant with mixed 
signs. For the period 1930-1947, none of the 
variables of interest yields significant coeffi- 
cients. 


DISCUSSION 


As noted above, research on welfare state 
development has revolved around the issues 
of the relative importance of class, the 
distribution of power resources, and the 
possibility of political democracy in this 
process. In this debate the question of the role 
played by class-based, left political parties in 
the development of social rights is a contro- 
versial and crucial one. 

The analyses in the present paper indicate 
that as far as the development of social rights 
in sickness insurance is concerned, left party 
government participation clearly has been an 
important factor during both the prewar and 
the postwar periods. Since some of the 
consequences of political factors may have 
been captured by the country dummies, it is 
not possible to specify the relative size of the 
effects of left parties. It goes without saying, 
however, that they have not been the only 
factors of importance here. Yet, in contrast to 
the highly contradictory results of previous 
comparative studies on the determinants of 
social expenditures, the present findings 
give rather unequivocal support for the 
assumption of the significance of left govern- 
ment participation in the development of 
social policy. 

The present results are, however, some- 
what less clear when it comes to the relevance 
of “contagion from the Left,” strategic action 
by nonsocialist governments, and party com- 
petition in this development. Once we control 
for left party government participation, varia- 
tion in left voting strength does not appear to 
have much effect on social rights during 
sickness. Yet, with the exception of the also 
otherwise somewhat deviant GLS-LSDV spec- 
ifications, the effects of left vote are positive 
during the prewar as well as the postwar 
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period. In addition, union density receives 
strong positive coefficients during the postwar 
period. In view of the relatively strong 
intercorrelations between indicators of left 
strength and the limited number of available 
observations, it is not realistic to hope to 
clearly isolate the separate effects of these 
factors. On the whole, however, the data do 
not contradict the hypothesis that variations in 
left strength outside cabinets have tended to 
generate strategic action to extend social 
rights. In this area future research should 
contrast the outcomes in individual countries, 
or groups of countries, with differing patterns 
of political development. 

The game theoretical perspective adopted 
here suggests that the popular protest hypoth- 
esis on the role of civil unrest comes in as a 
special case of what I above have termed 
strategic action in relations to mass mobiliza- 
tion. My specification of the extraparliamen- 
tary input into the political processes in terms 
of the volume of industrial conflict does, 
however, receive support only for the 
1930-1947 period. This difference between 
time periods may indicate a macropolitical 
shift from a situation where political conflicts 
are partly fought out on the labor market to a 
situation where in several countries macro- 
level “societal bargaining” between govern- 
ment, unions, and business interests has 
relegated industrial conflicts to the sidelines 
(Korpi 1983, chap. 2 and 8). Within political 
democracies, during the postwar period, the 
effects of social unrest on policy-making may 
be limited to specific countries, specific time 
periods, and more spectacular forms of mass 
action rather than being a more general and 
recurrent pattern. With the exception of strike 
waves such as those in France in 1936 and 
1968 and during the “hot autumn” in Italy in 
1969, industrial action in the Western coun- 
tries appears to have become limited to issues 
concerning wages and working conditions. it 
is also possible that the extent to which 
political demands are voiced outside the 
parliamentary channels is indicative of the 
responsiveness of the party system to the 
demands and interests of the electorate. 

In the debate on the role of economic 
development in welfare state expansion, two 
different factors are intertwined—on the one 
hand, the level of economic development, 
and on the other hand, the rate of economic 
growth. In the Western nations, both the level 
of GDP and the quality of social insurance 
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programs have markedly increased during the 
past half century. It is not possible, however, 
to take this correlation as the basis for the 
conclusion that economic development has 
caused the expansion of the welfare state. 
While it appears reasonable to assume that the 
rate of growth of economic resources is of 
relevance for the opportunities to enact social 
reforms, the analyses here—with consistently 
positive but insignificant coefficients—give 
limited support for this hypothesis. 

As generally the case in comparative 
studies, the operationalization of latent vari- 
ables is limited by available data. This is also 
the case here, not the least for the state 
autonomy variable. It goes without saying 
that the present analysis does not claim to be a 
full test of Skocpol’s theory of autonomous 
state action. Yet, if we start from the basic 
assumptions in public choice and state 
autonomy writings that increasing the size of 
bureaucracies and acting in the self-interest of 
the collectivity of state officials are basic goal 
parameters of state officials, it can be argued 
that both the relative size of central state 
personnel and the relative role of civil service 
prerogatives in social insurance are interesting 
indicators of the extent to which state officials 
have ‘been able to exercise their autonomous 
role. 

The empirical analyses carried out here do 
not, however, give any support for the state 
autonomy hypothesis in the present context. 
These results may thus indicate that in 
Western democracies, state officials do not 
control enough independent power resources 
to make it possible for them to define and to 
act on their own self-interests largely indepen- 
dently from their constituents. My results are, 
however, compatible with the assumptions 
that state capacities as well as state institu- 
tions (such as federalism) are of relevance for 
policy output. And they do not deny the 
possibility that, as, for example, studies of 
social policy development in the United 
States indicate (Skocpol and Ikenberry 1983; 
Quadagno 1984), appointed state officials can 
play an active role within the limits set by the 
societal power structure. But they indicate 
that the hypothesis of autonomous state action 
needs to be specified in terms of direction as 
well as with respect to the areas into which 
self-interested action is most likely to be 
directed. The extension of social rights of the 
citizenry needs probably not be the first 
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choice of egoistic state officials acting to 
safeguard their private interests. 

Granted that in the last instance only 
empirical tests can verify the fruitfulness of a 
hypothesis, it remains as much for the 
proponents of the state autonomy hypothesis 
as for others to bring forth new data that can 
test to what extent state officials act in their 
own self-interest. That they do so to some 
extent is beyond dispute. The question 
remains, however, whether this is a universal 
predominant feature in Western democracies, 
as state autonomy proponents claim. The fact 
that state officials have only limited control 
over the main power resources in capitalist 
democracies throws doubts on the realism of 
such claims. Heclo’s (1974) study of social 
policy development in Britain and Sweden is 
the empirical study most often claimed to 
support this hypothesis. However, Heclo’s 
interpretation that civil servants probably 
have been the most influential agents in 
promoting welfare state expansion is not well 
supported.?! The position of the state auton- 
omy proponents would be strengthened if 
they would not limit themselves to deductive 
theorizing, as in much of public choice 
writings, or not concentrate on qualitative 
case studies which leave ample room for 
flexible ad hoc explanation, but also make 
attempts at quantitative comparative studies. 
While such studies cannot compete with the 
rigor of deductive theorizing nor with the 
richness of life in detailed case studies, they 
have a greater capacity for theoretical disillu- 
sionment via confrontations with specific 
aspects of reality. In these respects the 
different methodological approaches can com- 
plement each other. 

In few of the statistical models estimated 
here has coverage of sickness cash benefit 


?! Because of his highly restrictive criteria for 
viewing elections as influencing social policy 
development, Heclo concludes that, for example, 
the exceptionally protracted and very intensive 
political conflicts around the introduction of 
superannuation in Sweden during the late 1950s 
were of little relevance for changing pension 
legislation. His neglect of the role of institutional 
arrangements in social insurance leads him to 
interpret the preference of union leaders for a union 
controlled unemployment insurance over a national 
insurance scheme as lack of interest in this area. 
Heclo also appears to have misunderstood the role 
of civil servants in Swedish parliamentary commis- 
sions (cf. Meijer 1969). 
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insurance turned out to be affected by my 
independent variables. These results can 
perhaps be interpreted as a case of the 
changing frontiers of political conflicts in the 
development of social rights. In the great 
majority of these countries, sickness insur- 
ance was introduced at a relatively early 
stage, and since then conflicts have centered 
around the enrichment of the status of social 
citizenship in this area. 

On the three controversial issues in the long 
debate on the development of social rights, 
the empirical analyses carried out here are 
generally in accord with hypotheses derived 
from the power resources approach. Thus, 
contrary to pluralist assumptions about the 
increasing irrelevance of class, and fears of 
functionalist neo-Marxists and state autonomy 
proponents about the capacity of representa- 
tive democratic processes, class-based left 
parties appear to have played a significant 
role in the development of social rights within 
the area studied here. It is a task for future 
research to analyze to what extent and in what 
respects the democratic class struggle has 
affected the multidimensional aspects of the 
development of welfare states and social 
policies. 
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His interests include political sociology, class 
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METHODOLOGICAL APPENDIX 


Data sources. Figures on GDP per capita used to 
calculate average annual rates of economic growth 
are primarily from Maddison (1979) and have been 
complemented for Ireland and New Zealand from 
Summers and Heston (1984) and Clark (1951). 
Data on central state personnel are largely from 
Flora, Alber, Eichenberg, Kohl, Kraus, Pfennig, 
and Seebohm (1981), complemented by linear 
interpolations and (for Ireland, Norway, and 
Japan, as well as for all countries for 1980) with 
data from national sources. Information on social 
security expenditures for civil servants is from 
ILO, “The Cost of Social Security” (various 
years). Data on election outcomes and the 
composition of legislatures are from Mackie and 
Rose (1974) and the annual updates by these 
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authors in the European Journal of Political 
Research. Figures on cabinet participation were 
obtained from a data file at the Department of 
Political Science, University of Minnesota, and 
complemented by information from Keesing’s 
Contemporary Archives. 

The information on legislated or mandated social 
insurance programs has been collected by a 
research team based at the Swedish Institute for 
Social Research in Stockholm. Our data describe 
the “central tendency” in the programs in terms of 
the social rights of an average industrial worker, 
and the range of variation in terms of minimum 
and maximum conditions. Only programs on the 
national level are included. Where several different 
programs have existed, the “central tendency” in 
social rights has been coded in terms of the 
program covering the largest proportion of indus- 
trial (metal) workers. Our data are largely derived 
from national sources. Country experts and civil 
servants in relevant administrative branches have 
frequently been consulted. International compila- 
tions, such as U. S. Department of Health and 
Human Services, Social Security Programs 
Throughout the World (various years), have been 
used primarily as reference works. The decision to 
start the time series in 1930 and 1933 was 
prompted by the availability of descriptive data for 
these years in ILO (1933, 1936). 

Figures on average wages of industrial workers 
are based on ILO, Yearbook of Labour Statistics 
(various years), and Swedish Employers’ Confed- 
eration, Wages and Total Labour Costs (various 
years), complemented by national sources. Hourly 
wages for adult male and female workers in 
manufacturing (including overtime pay and bo- 
nuses but excluding vacation pay) have been 
multiplied by normal hours of work for full-time 
workers to achieve a weekly wage, 52 weekly 
wages making up the annual wage. Information on 
taxes and social security contributions for 1975 and 
1980 is from OECD (1986). For earlier years they 
are based on information supplied by the Bureau of 
Fiscal Documentation in Amsterdam, or from 
national sources. 

The data collection has been done in a careful 
and conscientious way and has therefore been 
work-intensive and time-consuming, involving a 
team of four to seven persons roughly on a 
half-time basis over a nine-year period. To achieve 
uniformity and reliability in the coding process, 
emerging problems have been continuously dis- 
cussed within the working group and codings have 
generally been double-checked. Detailed coding 
instructions have been worked out and are 
available from the author. 

Index of Social Rights. The five components of 
the Index of Social Rights have been combined on 
the basis of their similar relationships to the 
independent variables, and on the basis of their 
intercorrelations given below: 
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2 3 4 5 6 

1. Replacement, 
single, Ist week 

2. Replacement, 


34 96 88 67 43 


family, 1st week 1.00 .94 95 .69 .48 
3. Replacement, 

single, 26 weeks 1.00 .95. 65 .52 
4. Replacement, 

family, 26 weeks 1.00 .65 .57 
5. Waiting days 

(reflected) 1.00 .18 
6. Year 1.00 


3 

When combining the above five components 
into an additive scale, one has to equalize their 
standard deviations. To avoid introducing a 
negative correlation between the weight of a 
component and the extent to which the component 
has changed over time (i.e., its trend over time), I 
have regressed each component on year of 
observation and have divided each observation 
with its standard deviation around this regression 
line (rather than with its total standard deviation). 
This amounts to dividing each observation with an 
expression arrived at by multiplying the total 
standard deviation with the co nding coeffi- 
cient of alienation, that is /1— 7? where r is the 
correlation coefficient between the respective 
component and year. These correlations are given 
in the table above. As can be seen, this operation 
does not have any major consequences for the 
relative weights. The decision was then made to 
give each of the four replacement variables 
one-sixth of the total weight of the index, while 
waiting days were given one-third of the total 
weight. In the total index, the number of waiting 
days has been reflected. 
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In two countries, Italy and Japan, the social 
rights in existing sickness insurance programs from 
1930 to 1939 have been disregarded, since their 
coverage was extremely low (below 10 percent). 
Here, as well as in other countries and periods 
without sickness insurance programs, variations in 
the social rights indicator are caused only by the 
component containing 26 weeks of normal income. 

Variables. To give the reader some information 

on the characteristics of the data set, the following 
table gives means and standard deviations for the 
variables (with the values for the 1930—1947 
period on line I and for 1950-1980 on line II), and 
their correlations with social rights, insurance 
coverage, and year. 
Regression Analyses. Because of the small number 
of observations in the time series, to avoid values 
larger than unity autocorrelations have been 
estimated as Pearson correlations (Kmenta 1986, 
p. 619). Since only a small number of observations 
are available for each country, an overall correla- 
tion coefficient based on data for all countries was 
computed here (Krnenta 1986, p. 621). For the 
LSDV equations in Table 1, estimates of rho range 
from —0.04 to 0.06. In Table 2 they range from 
0.42 to 0.49. Standard methods (y, — p y,-1; x, — 
p %—1) have been used to transform variables 
(including dummy variables). In this context, 
however, the standard method of transforming first 
observations (yj /T—p* x,\/1—p*) appears to 
generate some problems. Thus, after transforma- 
tions, within each ‘country the first observations 
deviate much more clearly from the following ones 
than they did originally. 


Correlations with 





Variable Mean Std. Dev. Social rights Coverage Year 
Social rights I 7.93 2.76 1.00 0.55 0.17 
II 10.13 2.85 1.00 0.57 0.34 
Coverage I 0.28 0.22 0.55 1.00 0.10 
H 0.43 0.29 0.57 1.00 0.25 
Economic growth I 0.006 0.038 —0.14 0.01 —0.10 
I 0.035 0.022 0.31 0.10 —0.00 
Industrial conflict I 0.47 0.60 0.05 0.11 0.33 
0 0.29 0.32 —0.36 —0.33 0.00 
Central state I 1.79 0.97 0.37 0.13 0.33 
personnel 0 — — — — — 
Civil service I — — — — — , 
prerogatives nu 117.09 57.80 0.37 0.06 —0.23 
Union density I 0.24 0.13 0.21 0.34 0.10 
I 0.41 0.14 0.27 0.39 0.12 
Left votes I 0.31 0.16 0.24 0.35 0.09 
n 0.37 0.14 0.42 0.36 —0.02 
Left cabinet I 0.20 0.26 0.08 0.29 0.36 
b 0.31 0.34 0.25 0.42 —0.03 
Weighted left I 0.09 0.13 0.08 0.19 0.38 
cabinet 0 0.15 0.18 0.20 0.38 ~0.06 
Left government I 0.24 0.33 0.03 0.23 0.33 
time 0 0.46 0.42 0.34 0.47 —0.08 
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CONCEPTUALIZATION OF TERRORISM* 


JACK P. GIBBS 
Vanderbilt University 


Many issues and problems surround the conceptualization of terrorism. Most 
definitions of the term are indefensible if only because they do not speak to those 
issues and problems. An assessment of contending definitions can transcend purely 
personal opinions; and an assessment can be undertaken without a theory, even 
‘though an impressive theory is the ultimate justification of its constituent 
definitions. The present conceptualization goes beyond a definition of terrorism by 
emphasizing the definition’s bearing on five major conceptual questions, each of 
which introduces a major issue and/or problem. Then it is argued that thinking of 
terrorism and other sociological phenomena in terms of control promotes 
recognition of logical connections and/or empirical associations, each of which 


could become a component of a theory. 


Definitions of terrorism are controversial for 
reasons other than conceptual issues and 
problems. Because labeling actions as “terror- 
ism” promotes condemnation of the actors, a 
definition may reflect ideological or political 
bias (for lengthy elaboration, see Rubenstein 
1987). Given such considerations, all of 
which discourage attempts to define terror- 
ism, it is not surprising that Laqueur (1977, 
p. 5) argued that 


a comprehensive definition of terrorism . . . 
does not exist nor will it be found in the 
foreseeable future. To argue that terrorism 
cannot be studied without such a definition is 
manifestly absurd. 


Even granting what Laqueur implies—that 
terrorism is somehow out there awaiting 
definition—it is no less “manifestly absurd" 
to pretend to study terrorism without at least 
some kind of definition of it. Leaving the 
definition implicit is the road to obscu- 
rantism. 

Even if sociologists should overcome their 
ostensible reluctance to study terrorism (for a 
rare exception, see Lee 1983), they are 
unlikely to contribute to its conceptualization. 
The situation has been described succinctly 
by Tallman (1984, p. 1121): “Efforts to 
explicate key concepts in sociology have been 
met with stifling indifference by members of 
our discipline.” 

There are at least two reasons why 


* Work on this paper was supported in part by a 
grant from the University Research Council of 
Vanderbilt University. The author is indebted to 
two diligent reviewers. 
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sociologists commonly appear indifferent to 
conceptualizations. First, Weber and Parsons 
gave the work a bad name in the eyes of those 
sociclogists who insist (rightly) on a distinc- 
tion between substantive theory and concep- 
tua] analysis. Second, conclusive resolutions 
of conceptual issues are improbable because 
the ultimate justification of any definition is 
an impressive theory that incorporates the 
definition. Nonetheless, it is crippling to 
assume that productive research and impres- 
sive theories are possible without confronting 
conceptual issues and problems. The argu- 
ment is not just that theorizing without 
definitions is sterile, nor merely recognition 
that theory construction and conceptualization 
should go hand in hand. Additionally, one 
can assess definitions without descending to 
purely personal opinion, even when not 
guided by a theory. 

Sysiematic tests of a theory require defini- 
tions of at least some of the theory's 
constituent terms; but test findings, even 
those based on the same units of comparison, 
will diverge if each definition's empirical 
applicability is negligible, meaning if indepen- 
dent observers disagree when applying the 
definition to identify events or things. To 
illustrate, contemplate a question about any 
definition of terrorism: How much do inde- 
pendent observers agree in judging whether or 
not President Kennedy's assassination was 
terrorism in light of the definition? As 
subsequent illustrations show, simple defini- 
tions may promote agreement in answers to 
the Kennedy question and yet be objectiona- 
ble for theoretical reasons; but the immediate 
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point is that an empirically applicable defini- 
tion does not require a theory. By contrast, 
given evidence that a definition promises 
negligible empirical applicability, no theory 
can justify that definition. 

Still another “atheoretical” criterion is the 
definition’s consistency with convention. That 
criterion cannot be decisive, because it would 
preclude novel definitions; but it is important 
when the field’s professionals must rely on 
“outsiders” for data and, hence, presume 
appreciable congruence between their defini- 
tions and those of the outsiders. That 
consideration is particularly relevant here, 
because in analyzing terrorism social scien- 
tists often rely on reports of government 
officials, journalists, and historians. 

Conceptual issues and problems haunt 
virtually all major terms in the social and 
behavioral sciences, and any definition is 
ambiguous if it does not answer questions 
bearing on those issues and problems. There 
are at least five such questions about 
terrorism. First, is terrorism necessarily 
illegal (a crime)? Second, is terrorism neces- 
sarily undertaken to realize some particular 
type of goal and, if so, what is it? Third, how 
does terrorism necessarily differ from conven- 
tional military operations in a war, a civil 
war, or so-called guerrilla warfare? Fourth, is 
it necessarily the case that only opponents of 
the government engage in terrorism? Fifth, is 
terrorism necessarily a distinctive strategy in 
the use of violence and, if so, what is that 
Strategy? 

The questions are answered in light of a 
subsequent definition of terrorism, but more 
than a definition is needed. The pursuit of a 
theory about terrorism will be furthered by 
describing and thinking about terrorism and 
all other sociological phenomena in terms of 
one particular notion, thereby promoting the 
recognition of logical and empirical associa- 
tions. The most appropriate notion is identi- 
fied subsequently as "control," but a defense 
of that identification requires a definition of 
terrorism (not of "terror"). 


A DEFINITION OF TERRORISM 


Terrorism is'illegal violence or threatened 
violence directed against human or nonhuman 
objects, provided that it: 


(1) was undertaken or ordered with a view to 
altering or maintaining at least one putative 
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norm in at least one particular territorial unit 
or population; 

(2) had secretive, furtive, and/or clandestine 
features that were expected by the partici- 
pants to conceal their personal identity 
and/or their future location; 

(3) was not undertaken or ordered to further the 
permanent defense of some area; 

(4) was not conventional warfare and because of 
their concealed personal identity, conceal- 
ment of their future location, their threats, 
and/or their spatial mobility, the participants 
perceived themselves as less vulnerable to 
conventional military action; and 

(5) was perceived by the participants as contrib- 
uting to the normative goal previously 
described (supra) by inculcating fear of 
violence in persons (perhaps an indefinite 
category of them) other than the immediate 
target of the actual or threatened violence 
and/or by publicizing some cause. 


Clarification, Issues, and Problems 


In keeping with a social science tradition, 
most definitions of terrorism are set forth in a 
fairly brief sentence (see, e.g., surveys by 
Oots 1986, pp. 5-8, and Schmid and 
Jongman 1988, pp. 32-38). Such definitions 
do not tax the reader's intellect or patience, 
but it is inconsistent to grant that human 
behavior is complex and then demand simple 
definitions of behavioral types. 

The illegality of terrorism. Rubenstein's 
definition (1987, p. 31) is noteworthy if only 
because it makes no reference to crime or 
illegality: “I use the term ‘terrorism’ . . . to 
denote acts of small-group violence for which 
arguable claims of mass representation can 
be made." However, even granting that 
terrorism is an illegal action, there are two 
contending conceptions of crime, one empha- 
sizing the reactions of officials as the 
criterion and the other emphasizing normative 
considerations (e.g., statutory law). Because 
of space limitations, it is not feasible to go 
much beyond recognizing the two contending 
conceptions. It must suffice to point out that 
an action may be illegal or criminal (in light 
of statutes and/or reactions by state officials) 
because of (1) where it was planned; (2) 
where it commenced; and/or (3) where it 
continued, especially in connection with 
crossing a political boundary. Such distinc- 
tíons are relevant even when contemplating 
the incidence of terrorism. 

One likely reaction: But why is terrorism 
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necessarily a crime? The question suggests 
that classes of events or things exist indepen- 
dently of definitions. Thus, it may appear that 
“stones” and “humans” denote ontologically 
given classes, but in the context of gravita- 
tional theory stones and humans are not 
different. However, to insist that all defini- 
tions are nominal is not to imply that 
conventional usage should be ignored; and, 
again, the point takes on special significance 
when defining terrorism. The initial (unnum- 
bered) part of the present definition is 
consistent with most other definitions and 
also with this claim: most journalists, offi- 
cials, and historians who label an action as 
“terrorism” evidently regard the action as 
illegal or criminal. However, it is not denied 
that two populations may differ sharply as to 
whether or not a particular action was a 
crime. As a necessary condition for an action 
to be terrorism, only the statutes and/or 
reactions of officials in the political unit 
where the action was planned or took place 
(in whole or in part) need identify the action 
as criminal or illegal. 

Violence and terrorism. Something like the 
phrase “violence or threatened violence” 
appears in most definitions of terrorism (see 
Schmid and Jongman 1988, p. 5). As in those 
definitions, the phrase’s key terms are here 
left as primitives; and whether they must be 
defined to realize sufficient empirical applica- 
bility can be determined only by actual 
attempts to apply the definition. 

Despite consensus about violence as a 
necessary feature of terrorism, there is a 
related issue. Writers often suggest that only 
humans can be targets of violence, but many 
journalists, officials, and historians have 
identified instances of destruction or damage 
of nonhuman objects (e.g., buildings, domes- 
ticated animals, crops) as terrorism. More- 
over, terrorists pursue their ultimate goal 
through inculcation of fear and humans do 
fear damage or destruction: of particular 
nonhuman objects. 

The ultimate goal of terrorists. The present 
definition indicates that terrorists necessarily 
have a goal. Even though it is difficult to 
think of a human action that is not goal 
oriented, the consideration is controversial for 
two reasons. One reason is the allegation that 
terrorists are irrational or mentally ill (see, 
e.g., Livingston 1978, pp. 224-39; and 
Livingstone’s commentary, 1982, p. 31 on 
Parry), which raises doubts as to whether 
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terrorists have identifiable goals. The second 
reasons why part 1 of the definition is 
controversial: many sociologists, especially 
Durkheimians, do not emphasize the purpo- 
sive quality of human behavior, perhaps 
because they view the emphasis as reduction- 
ism. In any case, a defensible definition of 
virtually any term in sociology’s vocabulary 
requires recognition of the relevance of 
internal behavior (e.g., perception, beliefs, 
purpose). Thus, without part 1 of the present 
definition, the distinction between terrorism 
and the typical robbery becomes obscure. The 
typical robber does not threaten violence to 
maintain or alter a putative norm; he or she is 
concerned only with behavioral control in a 
particular situation. 

A defensible definition of a norm is not 
presumed (see Gibbs 1981, pp. 9-18, for a 
litany of difficulties). Rather, it is necessary 
only that at least one of the participants (those 
who undertake the violent action or order it) 
view the action as contributing to the 
maintenance or alteration of some law, 
policy, arrangement, practice, institution, or 
shared belief. 

Part 1 of the definition is unconventional 
only in that goals of terrorists are not 
necessarily political. Many definitions create 
the impression that all terrorism is political 
(for a contrary view, see Wilkinson 1986, p. 
51), but the very term “political terrorism” 
suggests at least two types.! The concern of 
social scientists with terrorism typologies is 
premature (see. e.g., the commentary by Oots 
[1986, pp. 11, 30] on Mickolus's notions of 
international, transnational, domestic, and 
interstate terrorism). No terrorism typology 
amounts to a generic definition (see the 
survey in Schmid and Jongman 1988, pp. 


! As pointed out by Laqueur (1987, pp. 19, 
118), much of the terrorism in the American labor 
movement (e.g., the bombing of the Los Angeles 
Times building in 1910) was attacked even by the 
left as "commercial, not idealistic"; and "there 
was no intention of overthrowing the government, 
killing the political leadership: or changing the 
political system." To insist that an effort (violent 
or otherwise) to alter working conditions or wages 
in a particular context (e.g., a publishing corpora- 
tion) is "political" only illustrates indiscriminate 
use of that term. To be sure, it may be that an 
impressive theory about terrorism must be limited 
to "political terrorism," but the necessity for such 
limitation cannot be known a priori. 
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39-59), and without the latter the former is- 


bound to be unsatisfactory. 

Military operations and terrorism. To 
repeat a previous question: How does terror- 
ism necessarily differ, if at all, from 
conventional military operations in a war, 
civil war, or so-called guerrilla warfare? The 
question cannot be answered readily because 
there are no clearly accepted definitions of 
conventional military operation, war, civil 
war, and guerrilla warfare.? “Guerrilla” is 
. especially troublesome because journalists are 
prone to use the word without defining it but 
such as to suggest that it is synonymous with 
terrorism (a usage emphatically rejected by 
Laqueur 1987 and Wilkinson 1986). 

Conventional military operations differ 
from terrorism along the lines indicated by 
parts 2, 3, and 4 of the definition. However, 
the definition does not preclude the possibility 
of a transition from terrorism to civil war. 
One tragic instance was the Easter Rising in 
Ireland (1916), when rather than perpetuate 
the terrorism tradition, a small group of Irish 
seized and attempted a permanent defense of 
government buildings in Dublin, vainly hop- 
ing that the populace would join them in open 
warfare. Today, it is terrorism rather than 
civil war that haunts Northern Ireland, and the 
term “guerrilla warfare” has no descriptive 
utility in that context. 

Terrorism as a special strategy. One 
feature of terrorism makes it a distinctive 
(though not unique) strategy in violence. That 
feature is described in part 5 of the definition. 


? The question is nor how terrorists differ from 
military personnel, insurgents, rebels, revolution- 
aries, or guerrillas. The distinction is irrelevant for 
present purposes because the concern is not with 
defining "a terrorist." The terms "terrorist" and 
“terrorists” are used occasionally in this paper in 
the loose sense of "an individual or individuals 
who have engaged in terrorism," but it is 
recognized that a more elaborate definition is 
needed. 

3 The secretive, furtive, and/or clandestine 
features of terrorism (part 2 of the present 
definition) are not limited to the violent action 
itself. They also pertain to previous and subsequent 
actions (nonviolent), even the lifestyle of the 
participants. Consider Clark's observation (1986, 
p. 300) on members of a terrorist organization 
dedicated to Basque separatism: "The great 
majority of the members of ETA continue to live at 
home, either with their parents or (if they were 
married) with their spouses and children, and to 
work at their regular employment. . . ." 
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Part 5 is controversial primarily because it 
would exclude action such as this threat: 
"Senator, if you vote for that bill, it will be 
your death warrant." Why would such a 
threat not be terrorism? A more theoretically 
significant answer is given subsequently. 
Here it must suffice to point out that scores of 
writers have emphasized "third-party" or 
"general" intimidation as an essential feature 
of terrorism; and journalists, officials, or 
historians only rarely identify “dyadic intimi- 
dation” (X acts violently toward Y but not to 
control Z's behavior) as terrorism. 

"State terrorism" as a special issue. 
Zinam's definition (1978, pp. 244—45) illus- 
trates one of many reasons why definitions of 
terrorism are so disputable: "[Terrorism is] 
the use or threat of violence by individuals or 
organized groups to evoke fear and submis- 
sion to obtain some economic, political, 
sociopsychological, ideological, or other ob- 
jective." Because the definition would extend 
to the imposition of legal punishments by 
government officials to prevent crimes through 
general deterrence, in virtually all jurisdic- 
tions (see Morris 1966, p. 631) some aspects 
of criminal justice would qualify as terrorism; 
and Zinam's definition provides no basis for 
denying that it would be “state terrorism.” 
Even granting that a state agent or employee 
acts for the state only when acting at the 
direction or with the consent of a superordi- 


* For example, Oots (1986, p. 81) makes 
intimidation central in his definition of terrorism, 
but, numerous writers suggest that "seeking 
publicity" is also an essential strategy in terrorism. 
Hence, reference is made in the present definition 
(the last part of it) to “publicizing some cause.” 
Actually, the two strategies—intimidation and 
publicization—are virtually inseparable. 

5 Should it be argued (see, e.g., Wilkinson 
1986, p. 23, and Zinam 1978, p. 241) that 
violence is by definition illegal, what of a killing in 
an Anglo-American case of undisputed justifiable 
homicide? To deny that the killing was violence 
would be arbitrary in the extreme and contrary to 
conventional use of the term “violence.” Indeed, 
why submit to an unconventional usage that makes 
"illegitimate violence" redundant and “legitimate 
violence” contradictory? Perhaps more impor- 
tantly, what term is the appropriate descriptive 
label for undisputed justifiable homicide or the 
legitimate use of force by a police officer? If the 
answer is “coercion,” there is no corresponding 
convention; and when kidnappers bind their 
victims, surely that action is coercion. 
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nate, there is still no ostensible difference 
between the use or threat of violence in law 
enforcement and Zinam’s terrorism. 

Had Zinam defined terrorism as being 
necessarily illegal or criminal, then many 
instances of violence by a state agent or 
employee at the direction or with the consent 
of a superordinate would not be terrorism. 
However, think of the numerous killings in 
Nazi Germany (Ernst Roehm, the Storm 
Troop head being a well-known victim) 
during the Night of the Long Knives (June 
30, 1934). Hitler ordered the slaughter, and at 
the time the killings were illegal in light of 
German statues; but Hitler publicly acknowl- 
edged responsibility, and the only conceal- 
ment was that perceived as necessary to 
surprise the victims.$ Surely there is a 
significant difference between such open, 
blatant use of coercion by a state official 
(dictator or not) and the situation where 
regime opponents are assassinated but offi- 


cials disavow responsibility and the murders. 


are so secretive that official complicity is 
difficult to prove. The "rule of terror" of 
Shaka, the famous Zulu chief, is also 
relevant. Shaka frequently ordered the execu- 
tion of tribal members on a seemingly 
whimsical basis, but the orders were glaringly 
public (see Walter 1969). Shaka's regime 
illustrates another point: in some social units 
there may be no obvious “Jaw” other than the 
will of a despot, in which case there is no 
basis to describe the despot's violence as 
illegal. The general point: because various 
aspects of government may be public vio- 
lence, to label all of those aspects "terrorism" 
is to deny that terrorism has any secretive, 
furtive, or clandestine features. 

Given the conceptual issues and problems 
that haunt the notion of state terrorism, it is 
hardly surprising that some writers attribute 
great significance to the notion, while others 
(e.g., Laqueur 1987, pp. 145-46) seem to 
reject it. The notion is not rejected here, and 
the following definition does not make it an 


$ The suggestion is not that Nazi state terrorism 
ended with the Night of the Long Knives (June 30, 
1934). Note, however, that writers on "Nazi 
terror" (e.g., Noakes 1986) are prone to avoid an 
explicit definition of terrorism. Such phrases as 
“use of terror" (Noakes 1986, p. 67) and “seige of 
terror" (Walter 1969, p. 7) should not be equated 
with "terrorism," and they are conducive to 
misunderstandings. 
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extremely rare phenomenon. State terrorism 
occurs when and only when a government 
official (or agent or employee) engages in 
terrorism, as previously defined, at the 
direction or with the consent of a superordi- 
nate, but one who does not publicly acknowl- 
edge such direction or consent. 

The foregoing notwithstanding, for theoret- 
ical reasons it may prove desirable to limit the 
proposed definition of terrorism (supra) to 
nonstate terrorism and to seek a quite 
different definition of state terrorism. Even 
So, it will not do to presume that all violence 
by state agents is terrorism. The immediate 
reason is that the presumption blurs the 
distinction between terrorism and various 
kinds or aspects of law enforcement. More- 
over, it is grossly unrealistic to assume that 
all instances of genocide or persecution along 
racial, ethnic, religious, or class lines by state 
agents (including the military) are terrorism 
regardless of the means, goals, or circum- 
stances. Nor is it defensible to speak of 
particular regimes (e.g., Stalin's, Hitler's, 
Pol Pot's) as though all of the related violence . 
must have been state terrorism. For that 
matter, granted that the regimes were mon- 
strous bloodbaths, it does not follow that the 
state agents in question made no effort 
whatever to conceal any of their activities 
and/or their identity." Readers who reject the 
argument should confer with American jour- 
nalists who attempted to cover Stalin's Soviet 
Union, Hitler's Germany, or Pol Pot's 
Cambodia. Similarly, it is pointless to deny 
that secretive, clandestine, or furtive actions 
have been characteristic of "death squads" 
(many allegedly “state”) in numerous Latin 
American countries over recent decades. It is 
commonly very difficult to prove that such 
groups murder with the knowledge and/or 
consent of state officials; but the difficulty is 
one justification for identifying the murders 
as terrorism, even though the state-nonstate 
distinction may be debatable in particular 
instances. 


7 When perpetrators of violence attempt to 
conceal their personal identity, the attempt alone is 
indicative of illegality; and the secretive, furtive, 
and/or clandestine features of terrorism (part 2 of 
the definition) are more decisive than conjectures 
about legality. When there are doubts on the part 
of observers as to the legality of some violent act, 
the concealment of the personal identity of the 
actor or actors should be treated as evidence of 


illegality. 
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Difficulties in Empirical Application 


One likely objection to the present definition 
of terrorism is its complexity; but, again, 
demands for simplicity are inconsistent with 
human behavior’s complexity. Nonetheless, 
application of the definition does call for 
kinds of information that may not be readily 
available. Reconsider a previous question: 
Was President Kennedy’s assassination terror- 
ism? The present definition does not permit 
an unequivocal answer, largely because there 
are doubts about the goals of the assassination 
and whether or not it was intimidation. If 
terrorism were defined as simply “the illegal 
use ‘or threat of violence,” an affirmative 
answer to the Kennedy question could be 
given; but the definition would also admit 
(inter alia) all robberies and many child 
abuses. Similarly, the phrase “for political 
purposes” would justify an affirmative an- 
swer to the Kennedy question; but the 
implication would be a tacit denial of 
apolitical terrorism, and divergent interpreta- 
tions of “political” are legion. Finally, 
although a definition that specifically includes 
“murder of a state official” would maximize 
confidence in an affirmative answer to the 
Kennedy question, there must be doubts 
about the feasibility of such an “enumer- 
ative” definition of terrorism. And what 
would one make of the murder of a sheriff by 
his or her spouse? 

The general point is that a simple definition 
of terrorism tends to delimit a class of events 
so broad as to defy valid generalizations about 
it (reconsider mixing presidential assassina- 
tions, robberies, and child abuses) or so 
vague that its empirical applicability is 
negligible. In the latter connection, the 
Kennedy illustration indicates the need to 
grant this methodological principle: the con- 
gruence dimension (but not the feasibility 
dimension) of a definition’s empirical appli- 
cability is enhanced when independent observ- 
ers agree that the definition cannot be applied 
in a particular instance because requisite 
information is not available. If that principle 
is not granted, sociologists will try to make 
do with simple definitions and whatever data 
are readily available. 

Presumptive and possible terrorism. Com- 
parative research on terrorism commonly is 
based on the use of the term “terrorism” by 
journalists or officials. Hence, insofar as the 
use of data on presumptive terrorism can be 
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justified, a definition’s utility is enhanced by 
its correspondence with the use of the term 
“terrorism” by journalists and officials. 
Although only potentially demonstrable, my 
claim is that the present definition corre- 
sponds more with such use of the term than 
does any simpler definition, such as: terror- 
ism is illegal violence. 

Even when terrorism research is based on 
descriptions of violent events, as in newspa- 
per stories, there may be cases that can be 
designated as possible terrorism even though 
the information is not complete; and a 
definition’s empirical applicability can be 
assessed in terms of agreement among 
independent observers in such designations. 
In that connection, the present definition 
points to the kind of information needed for 
truly defensible research on terrorism, which 
is not the case when investigators try to make 
do with a much simpler definition, or no 
definition at all. 


TOWARD A THEORY OF TERRORISM 


The present definition of terrorism does not 
answer any of a multitude of questions, such 
as: Why does the incidence of terrorism vary 
among political units and over time? Al- 
though it is an illusion to suppose that any 
definition answers empirical questions,® a 
definition may be much more conducive than 
are alternatives to thinking about phenomena; 
if so, the definition furthers the pursuit of a 
theory. 


Recognizing Relations 


Unlike an isolated proposition, a theory 
requires preliminary observations and consid- 
erable thought. The observations depend on 
the way the phenomenon in question has been 
conceptualized, anc some conceptualizations 
facilitate recognition of logical connections 
and/or possible empirical associations. 

When a definition comprises several dis- 
tinct parts, it is commonly all the more 
difficult to recognize relations between the 
phenomenon defined and other phenomena. 
The solution is to think about all parts of the 


8 [t is also an illusion to suppose that social ` 
Scientists have anything even approaching an 
adequate theory of terrorism (see commentaries by 
Laqueur 1987, p. 165: Schmid and Jongman 1988, 
p. 61; and Wilkinson 1986, p. 96). 
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definition in terms of a particular notion, one 
that can be used to think also about diverse 
phenomena in the field’s subject matter. 
Explication of the strategy is furthered by this 
diagram: X<—--Y--->Z, where X is the 
phenomenon defined, Z is any other phenom- 
enon in the field’s subject matter, and Y is the 
notion used to think about both X and Z. 
Thinking about X and Z serves no purpose 
unless it suggests a relation between them. If 
the relation is a logical connection, it furthers 
the field’s conceptual unification; but substan- 
tive theory is advanced primarily by recogni- 
tion of a possible empirical association, 
especially one having explanatory implica- 
tions. Whether there are explanatory implica- 
tions depends not just on the way that the two 
phenomena have been defined and on the 
choice of the notion but also on the 
explanatory mechanism. 


Strategic Explanatory Mechanisms 
for Sociology 


In formulating theories sociologists rarely 
identify the type of explanatory mechanism, 
and the relative merits of contenders are 
rarely debated. Unfortunately, space limita- 
tions permit only a few observations on three 
major possibilities. 

Strict causation. Possibly excluding the 
period when functionalism was dominant, 
strict causation has been sociology’s most 
common explanatory mechanism. It is also 
the most difficult to describe, due in part to 
debates (particularly from Hume onward) 
over the nature of causation. So a simple 
residual definition must suffice: strict causa- 
tion is the mechanism if the explanation 
neither makes reference to selective survival 
nor emphasizes the purposive quality of 
human behavior. As such, strict causation 
includes direct, indirect or sequential (i.e., 
intervening variables), multiple, and recipro- 
cal causation. 

Doubts about strict causation as sociology’s 
sole explanatory mechanism grow when one 
contemplates variation in the incidence of 
terrorism. Consider two illustrative asser- 
tions: (1) an increase in urbanization causes 
an increase in terrorism; and (2) an increase in 
stratification causes an increase in terrorism. 
Both assertions tax credulity; and credulity 
would not be furthered by substituting other 
structural variables, nor by invoking multiple, 
sequential, or reciprocal causation rather than 
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direct causation (see surveys by Laqueur 
1987, pp. 72-173; Schmid and Jongman 
1988, pp. 61-135; and Wilkinson 1986, pp. 
93, 102, 197, 213): 

Selective survival. Contemplate Durk- 
heim’s assertion (1949, especially p. 257) 
that an increase in material (population) 
density results in an increase in the division of 
labor. How could the relation be strict 
causation? One answer: it is not strict 
causation; rather, insofar as a positive empir- 
ical association holds between the two 
variables, it is through selective survival. 
Specifically, the probability of a society's 
survival is greater if an increase in material 
density is accompanied or followed by an 
increase in the division of labor, even though 
the association was not anticipated (i.e., it 
was not purposive). 

A "selective survival" explanation can be 
described this way: some patterns or unifor- 
mities exist because exceptions to them tend 
to be eliminated. Although the explanatory 
mechanism requires no reference to the 
internal behavior (perception, intention, etc.) 
of the participants, the term "eliminated" (or 
"survival") is not limited to the purely 
biological sphere. After all, no one is 
confused when it is said that a particular 
marriage did not survive or that various 
l9th-century U.S. occupations have been 
eliminated during this century. 

Functionalist theories in sociology are 
studies in implicit resort to selective survival 
as the explanatory mechanism; but there is no 
mystery as to why the mechanism is com- 
monly left implicit, nor why Davis (1959) 
saw fit to defend functionalism by emphasiz- 
ing "functional analysis" rather than "func- 
tional explanation." The notion of a func- 
tional explanation cannot be clarified and 
made distinctive without invoking selective 
survival; but most functionalist theories ap- 
pear incredible when translated something 
like this: the institution or practice in question 
(i.e., the explicandum) is necessary for the 
survival of the larger system. Credulity would 
be sorely taxed by a functional theory of 
terrorism. Imagine someone even suggesting 
that terrorism is necessary for a country's 
survival. 

The purposive quality of human behavior. 
Any theory that emphasizes the purposive 
quality of human behavior is likely to be 
criticized as being “teleological.” That label 
should be avoided if only because it gives rise 
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to extreme arguments, as when Catton (1966, _ 


pp. 5, 11) dismisses teleological theories or 
explanations on the grounds that they have 
some future state (a goal) causing present 
behavior. However, the term  "purpo- 
siveness” is not used here as a synonym for 
“teleological”; instead, it is used to denote all 
major types of internal behavior, such as 
perception ànd belief, the argument being that 
such behaviors enter into the pursuit of goals. 

Identification of "purposiveness" as an 
explanatory mechanism is consistent with 
symbolic interactionism, one of sociology's 
perennial major perspectives. Yet the subjec- 
tivism of some versions of symbolic interac- 
tionism is so extreme as to suggest that only 
one explanatory mechanism is relevant for 
sociology. To the contrary, the purposiveness 
mechanism can be combined with the other 
two, and defensible sociological theories may 
require such combinations. Consider, for 
example, an explanation of "armed and 
organized groups," which are identified as 
police or military in English-speaking social 
units. The international ubiquity of such 
groups suggests that a country's survival is 
jeopardized by civil war or conquest without 
armed and organized opposition to militant 
secessionists or invaders. Even so, such 
groups are studies in purposiveness. As for 
combining strict causation and purposiveness, 
environmental features make certain human 
practices difficult; but the consequences may 
depend on whether and how the relation is 
perceived. Marvin Harris (1979, p. 105) 
unwittingly supplied an illustration: "Rainfall 
agriculture leads to dispersed, multicentered 
forms of production. Hence it is doubtful that 
any pristine state even developed on a rainfall 
base." Harris could not bring himself to 
recognize that would-be rulers may perceive 
the difficulty of controlling a dispersed 
population. Yet an extreme position need not 
be taken. A  population's actual spatial 
distribution is a causal factor in perception, 
and imperceptive would-be rulers may be 
eliminated. 


Conceptualization of Control 


In light of the foregoing arguments, there is a 
need for a notion that (1) facilitates describing 
and thinking about not only terrorism but also 
any sociological subject and (2) is compatible 
with all three explanatory mechanisms. Con- 
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trol is the most promising candidate, but its 
conceptualization is crucial. 

The immediate issue is the choice of the 
term "control" rather than "social control." 
The latter is only a subclass of the class 
“control over human behavior"; and unless 
control is defined so as to include not only 
that class but also biotic control (e.g., 
domestication of plants) and inanimate con- 
tro] (e.g., making or using tools), it is 
doubtful whether the notion facilitates describ- 
ing and thinking about sociology's subject 
matter. 

The prevailing conception of social control, 
the "counteraction of deviance" conception, 
is conducive to thinking of terrorists as 
objects but not also as agents of social 
control. Moreover, well-known advocates 
(e.g., Parsons 1951, pp. 297, 321) of the 
conception deny the relevance of internal 
behavior. Thus, if the practice of wearing a 
wedding ring is conducive to marital fidelity 
and infidelity is deviant, then the practice is 
social control even if the connection is 
recognized only by a sociologist observer. So 
the counteraction-of-deviance conception of 
social control is alien to terrorism's purposive 
quality and to attempts to suppress terrorism. 

A generic definition of attempted control. If 
only because sociologists should study both 
successes and failures in control, attempted 
control is the key term. That point takes on 
special significance in the case of terrorism. 
Describing and thinking about terrorism 
require recognition more of what terrorists 
attempt to control than what they actually 
control. 

Defined generically, attempted control is 
overt behavior by a human in the belief that 
(1) the behavior increases or decreases the 
probability of some subsequent condition and 
(2) the increase or decrease is desirable. To 
clarify, the commission or omission of an act 
is overt behavior; and "a subsequent condi- 
tion" may be an organism's behavior (ex- 
ternal or internal) or the existence, location, 
composition, color, size, weight, shape, 
odor, temperature, or texture of some object 
or substance, be it animate or inanimate, 
observable or unobservable. 

Durkheim's disciples will be prone to nurse 
this reservation: the definition makes inten- 
tion relevant. The objection ignores the point 
that sociologists use an army of terms that 
imply intention, such as reaching, turning, 
and saluting. For that matter, while some 
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reference to internal behavior is essential to 
maintain the distinction between success and 
failure in control (according to the 
counteraction-of-deviance conception, there 
are no failures in social control), the present 
definition does not limit attempted control to 
intentional behavior in the sense of conscious 
and deliberate. To illustrate, while drivers 
ordinarily are unaware of holding the steering 
wheel, who would deny that they hold in the 
belief that it reduces the probability of an 
undesirable subsequent condition? Recogni- 
tion of an "affective" quality (i.e., desirable 
vs. undesirable) will antagonize both extreme 
behaviorists and Durkheimians, but consider 
the consequences of ignoring it. When 
someone robs a bank, he or she presumably 
acts in the belief that his or her behavior 
increases the probability of being injured, 
which is a cognitive belief. So when a man 
backs out of a bank with gun in hand and is 
shot by a police officer, did the gunman 
"control" the police officer? To answer 
affirmatively is to embrace an absurdity, the 
inevitable outcome of avoiding reference to 
internal behavior (in this case, an affective 
belief) when defining types of behavior. 

Types of human control. For reasons given 
later, social control is a very important type 
of control when describing or thinking about 
terrorism. However, that is the case only if 
the counteraction-of-deviance conception of 
social control is rejected, and it is imperative 
to distinguish social control from other types 
of control over human behavior. 

In attempting self-control and individual 
acts in the belief that the action increases the 
probability that his or her subsequent behav- 
ior will be as desired (e.g., perhaps greater 
diligence at work) or decreases the probability 
of undesirable behavior (e.g., perhaps smok- 
ing). Although that definition is consistent 
with the "challenge" conception of self- 
control (overcoming fears or vices), various 
mundane acts, such as lifting the phone 
receiver before dialing or setting an alarm 
clock, are also attempted self-control. 

Attempted proximate control most com- 
monly takes the form of a command or a 
request, but coercion and certain kinds of 
threats are also proximate control; and they 
are especially relevant in analyzing terrorism. 
However, even though the target of proximate 
control may be an aggregate, as when a 
terrorist leader shouts an order to a bomb 
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squad, there is no third party (i.e., no human 
intermediary, no reference to someone). 

Attempted sequential control is a command 
or request by one human to another in the 
belief that (1) it increases the probability of a 
subsequent command or request by another 
human to still other humans and (2) the 
increase is desirable. A chain of commands is 
the most common form of sequential control, 
and sequential control warrants recognition if 
only because that form is virtually a defining 
characteristic of an organization. For that 
reason alone, sequential control is relevant in 
analyzing terrorist groups and governmental 
agencies that attempt to suppress terrorism. 

Attempted social control is overt behavior 
by a human, the first party, in the belief that 
(1) the behavior increases or decreases the 
probability of a change in the behavior of 
another human or humans, the second party in 
either case; (2) the overt behavior involves a 
third party but not in the way of a sequential 
control; and (3) the increase or decrease is 
desirable. The definition is clarified by 
subsequent observations on terrorism; but 
some clarification can be realized at this point 
by considering one of the five inclusive types 
of social control (Gibbs 1981, pp. 77-109), 
because those types are distinguished primar- 
ily in terms of how a third party is involved. 
In all instances of attempted vicarious social 
control, the first party attempts to punish the 
third party, reward the third party, or 
somehow rectify the third party's behavior, 
always presuming that such action will 
influence the second party's behavior. Vicar- 
ious social control is the basis of general 
deterrence, which enters into criminal justice 
policy in virtually all jurisdictions (see Morris 
1966, p. 631). Less obvious, terrorists also 
often resort to deterrent vicarious social 
control as an integral component of their 
intimidation strategy. 


Some Logical Connections and Possible 
Empirical Associations 


. The initial (unnumbered) part of the definition 


(supra) suggests this question: Why is 


terrorism illegal? Terrorism is a violent act, 
but state officials seek a monopoly on 
violence, especially violence with a negligible 
probability or retaliation (see Weber 1978, p. 
314, though note that Weber ignored the 
probability of retaliation). So the.-question’s 
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dizes the control exercised by superordinate 
state officials (legislators, monarchs, despots, 
or others) or is an attempt by those officials to 
realize a goal that they perceive as realizable 
through legal means, if at all, by incurring the 
loss of something they value more than the 
goal. 

Why do terrorists —state or nonstate— 
resort to secretive, furtive, and/or clandestine 
violence? Because they seek goals that they 
perceive as realizable only through such 
violence or only through legal means that 
entail unacceptable losses. What are such 
goals? Why do terrorists pursue them? Why 
do terrorists perceive secretive, furtive, and/ 
or clandestine violence as an essential means 
of countercontrol? A theory is needed to 
answer ‘such questions; but a theory will not 
be realized unless social scientists take the 
questions seriously, and no theory will be 
defensible if inconsistent with Walter’s state- 
ment (1969, p. 13) about violence: “the 
proximate aim is to instill terror; the ultimate 
end is control.” 

More on goals. In seeking to maintain or 
alter some putative norm, dissidents may 
have so little popular support in the country as 
a whole (e.g., the United Kingdom in the 
case of England, Wales, Scotland, and 
Northern Ireland) and/or such determined 
opposition from state officials that the dissi- 
dents come to perceive violence as the only 
means of realizing that goal. However, the 
amount of popular support and official 
opposition depend not just on the evalutive 
standards of the public and/or officials but on 
the extent to which they view the dissidents’ 
goal as realistic. They will not view the goal 
as realistic if they are baffled by related 
statements. Contemplate the characterization 
of terrorists in West Germany (see Becker 
1988, p. 24) of themselves as fighters for “the 
uprooted masses” of the Third World. Even if 
officials should agree that West Germany is 
responsible for the plight of Third World 
countries, they are unlikely to know what 
would satisfy the terrorists. 

Even if the “acceptability” and “realiza- 
bility” of dissident goals partially determine 
whether the dissidents become terrorists, 
those considerations are not to be judged by 
social scientists. It is entirely a matter of the 
way that the public and/or officials perceive 
those goals. Yet sociologists can further the 
pursuit of a theory about terrorism by 
undertaking research on this question: What 
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conditions promote or impede perceptions by 
the public and officials of the goals of: 
dissidents as acceptable and realizable? 

Another major question about violence. 
Parts 2, 3, and 4 of the definition indicate 
how terrorism differs from lethal conflicts 
commonly identified as wars or civil wars. 
But why is “covertness” so characteristic of 
terrorism? What writers commonly label as 
terrorist groups or organizations (nonstate) 
rarely comprise more than a few hundred 
members, which precludes more than elec- 
toral success. Because nonstate terrorists 
resort to violence, they are certain to be 
targets of attempted control by the police 
and/or the military; and their small number 
alone precludes successful countercontrol 
measures akin to open warfare. Various 
tactics of concealment—all types of counter- 
control—offer the only hope of survival, and 
various features of “international terrorism” 
are manifestations of those tactics. 

Officials engaged in state terrorism are not 
endangered by conventional military or police 
action, but they are concerned with conceal- 
ing their personal identity. Although a theory 
is needed to specify the conditions in which 
state officials resort to terrorism, their reli- 
ance on concealment is not puzzling. Even a 
homicidal despot is likely to recognize that 
the appearance of legitimacy may be essen- 
tial for the regime’s survival. 

Concealment is purposive behavior, and no 
notion rivals control when it comes to forcing 
recognition of purposiveness. Yet the notion 
does not lead to an extreme position as far as 
explanatory mechanisms are concerned. 
Should the concealment tactics of terrorists 
fail or should their bravado lead them to 
extremely reckless behavior, they are virtu- 
ally certain to be killed or incapacitated; but 
terrorists do not just happen to be killed or 
incapacitated, which is to say that the notion 
of control remains relevant. The related 
question for sociology is thus: What condi- 
tions (e.g., degree of urbanization) influence 
the efficacy of the concealment tactics of 
terrorists? 

The strategy of terrorism reconsidered. 
When terrorists inflict injuries or destroy 
property, they aim.to promote fear throughout 
some population (e.g., legislators, factory 
owners), and thereby control that population's 
behavior. To what end? The answer depends. 
on the putative norm that the terrorists are 
attempting to alter or maintain; but whatever 
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it may be, the terrorists employ deterrent 
vicarious social control. 

Why do terrorists engage in that type of 
control? Their small number and vulnerability 
to retaliation make attempts at proximate 
control ineffective; indeed, social control is 
distinctive in that it offers a means for the few 
to control the many. Sequential control is not 
an alternative because the “normative posi- 
tion” of terrorists severely limits the range of 
their authority. Normative considerations are 
also relevant in contemplating this question: 
Why deterrent vicarious social control rather 
than some other subtype or type? Because 
violence (including related coercion, physical 
punishment, etc.) is the principal alternative 
when there is no normative basis (e.g., 
authority, appeals to evalutive standards) for 
control. Should it be objected that assassina- 
tion may be a means to a political goal 
without the element of intimidation, the 
objection ignores a multitude of definitions 
that make intimidation a necessary feature of 
terrorism. Moreover, such a definition need 
not even suggest that terrorism is the only 
means to a political goal, not even the only 
violent means. Finally, by what logic are all 
assassinations instances of terrorism? 

What do terrorists hope to gain through 
deterrent control? One common answer is 
“concessions” (see, e.g., Oots 1986, p. 81), 
but that answer ignores a strategy that several 
writers have attributed to terrorists (see, e.g., 
Laqueur 1987; Wilkinson 1986). Briefly, 
terrorists aim to provoke officials to such 
extreme repressive measures (e.g., censor- 
ship, preventive detention) that the govern- 
ment loses popular support and falls. The 
“provocational” strategy is based on modula- 
tive social control, wherein the first party 
(terrorists in this case) uses the influence of 
the third party (the public at large in this case) 
on the second party (government officials in 
this case). The immediate significance of the 
provocative strategy is the possibility of its 
failure. In employing the strategy, terrorists 
evidently assume that the government will fall 
because it increasingly departs from the rule 
of law, but some powerful faction may 
consider the rule of law secondary to 
suppression of terrorism and stage a coup 
d'etat because of the government's “underre- 
action” to terrorism. Such was the case when 
the military toppled Uruguay’s liberal demo- 
cratic government, crushed the terrorists (the 
Tupamaros), and remained in power. 
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The provocational strategy has implications 
for a theory concerning variation in the 
incidence of terrorism. A failure in the 
strategy may be more important than success. 
Specifically, if the outcome is an authoritar- 
ian regime, the incidence of terrorism may 
decline because repressive measures become 
more effective. That possibility poses a 
sociological question, but some sociologists 
will not be inclined to do research on the 
question because they evidently think of 
officials as thumb-twiddling spectators when 
the status quo is challenged violently (see, 
e.g., Skocpol 1979). To the contrary, in 
numerous countries officials have responded 


effectively to terrorism (see, e.g., Laqueur 


1987). Hence, a theory’s validity is jeopar- 
dized if it does not recognize that variation in 
terrorism may to some extent reflect variation 
in the effectiveness of attempts to control it. 
Indeed, where there is scarcely any rule of 
law, why would terrorists employ the provo- 
cational strategy? That question is relevant in 
contemplating the ostensible rarity of terror- 
ism in Marxist countries (see Laqueur 1987, 
p. 302). 


SUMMARY AND CONCLUSION 


An impressive theory of terrorism requires 
more than a conceptualization that confronts 
issues and problems. A definition of terrorism 
must promise empirical applicability and 
facilitate recognition of logical connections 
and possible empirical associations. Such 
recognition requires a notion that facilitates 
describing and thinking about terrorism; and 
the notion must be compatible with each of 
three possible explanatory mechanisms: strict 
causation, selective survival, and purposive- 
ness. 

The notion of control is the most promising 
candidate. Although that notion has no equal 
when it comes to underscoring human behav- 
ior’s purposive quality, it is not alien to any 
particular explanatory mechanism. All of 
sociology’s subject matter can be described 
and thought of in terms of control (at least as 
it has been conceptualized here), and the 
notion is particularly relevant in the study of 
terrorism. That phenomenon and attempts to 
prevent it are nothing less than one vast 
attempt at control. 


JACK P. GIBBS is Centennial Professor and 
Acting Chair of the Department of Sociology 
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to the sociology of deviance, social control, 
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next book, Control: Sociology’s Central 
Notion, will be published by the University of 
Illinois Press in the Spring of 1989. 
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. LIMITS ON TECHNOCRATIZATION.OF THE LAW: 
THE ELIMINATION OF THE NATIONAL LABOR RELATIONS 
BOARD'S DIVISION OF ECONOMIC RESEARCH* 


ROBIN STRYKER 
University of Iowa 


Because class- and state-centered perspectives converge to predict the increasing 
use of science by capitalist states and because social scientists have increasingly 
pervaded U.S. governmental agencies, the elimination of the Division of Economic 
Research in the National Labor Relations Board in 1935-40 represents a 
theoretical and historical anomaly. This case study of the Division demonstrates 
that a theory of technocratization must account for both the reductions as well as 
the increases in the government’s reliance on science. Explaining variation in state 
agencies’ reliance on social science expertise requires an examination of the 
relationships among agency actors, state actors outside the agency, and class 
actors outside the state. Analysis must consider (1) social scientists’ contribution to 
agency effectiveness; (2) agency impact on the class organization of society along 
with agency deviation from the ideology of the dominant class; (3) the balance of 
power in Congress and the Executive; (4) agency internal politics and its resource 
dependence on the external environment. 


Both class-centered and state-centered theo- 
ries suggest that state reliance on science is 
important and that the advanced capitalist 
State will increasingly incorporate scientific- 
technical expertise.! Congress’ elimination of 


* An earlier version of this paper was presented 
at the 1987 annual meetings of the Law and 
Society Association. I would like to thank the 
NLRB for allowing me access to agency internal 
documents, and I thank Edward Lawler, Barry 
Markovsky, Fred Pampel, Sheldon Stryker, Arthur 
Stinchcombe, Larry Griffin, Kathy Stone, Karen 
Orren, Craig Becker, and the anonymous review- 
ers for their constructive criticism of earlier drafts 
of the paper. In its final stages, the paper benefitted 
from the scrutiny of the Iowa Workshop on Theo- 
retical Analysis, under the direction of Professor 
Guillermina Jasso. I also gratefully acknowledge 
comments on earlier versions of some of this pa- 
per's arguments from Erik Olin Wright, William H. 
Sewell, J. Willard Hurst, and Peter Carstensen. This 
research was in part supported by an Old Gold Iowa 
Summer Fellowship. 

! There is also a literature on socialist states' 
incorporation of scientific-technical expertise (e.g., 
Bailes 1978; Gouldner 1979; Konrad and Szelenyi 
1979). Because differing political economies 
dictate a different content for class conflict, state 
functions, political institutions, and ideologies in 
capitalism and socialism, a theory of technocrati- 
zation applicable to both capitalist and socialist 
states is beyond the scope of this paper. However, 
in showing the import of interplay among class and 
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the National Labor Relations Board's 
(NLRB's) social science division in 1940 is 
an anomaly on two grounds: theoretically 
because it contradicts the predictions of these 
theories, historically because social scientists 
have persisted in other regulatory agencies. 
The NLRB case is strategic because it points 
to limits on the process of incorporating 
scientific expertise by the capitalist state and 
provides an opportunity to search for the 
explanation of such limits. 

This paper uses a deviant case to examine 
technocratization. It combines insights from 
class-analytic, state-centered, and organiza- 
tional approaches to focus on the relationships 
among agency actors, state actors outside the 
agency, and class actors outside the state. The 
paper begins with the concept of technocrati- 
zation, reviews the convergence of past 
theories on increased technocratization, and 
provides historical background for the NLRB 
case. A brief methods discussion prefaces 
case analysis. Theoretical lessons are drawn 
to improve the theory of technocratization and 


organizational conflicts, politics and cultural val- 
ues, Bailes's (1978) study of applied physical 
science and technology in the Soviet Union 
accords with this analysis. In socialism as in 
capitalism neither class theories nor bureaucratiza- 
tion alone can explain variation in state reliance on 
scientific-technical expertise. 
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suggest hypotheses for future historical- 
comparative work. 


THEORETICAL AND | 
HISTORICAL BACKGROUND 


Technocratization 

The technocratization of law is a complex, 
multidimensional process by which organiza- 
tional and reasoning elements characteristic of 
technocratic state intervention become in- 


serted into a legal framework. Offe (1975) 
links modes of capitalist state economic 


intervention with the logic of state policy-. 


making. Bureaucracy is suited to and suffi- 
cient for distributing state resources such as 
money and justice. Technocratic rationality 
becomes more adequate when the state 
responds to disturbances in accumulation by 
producing goods and services. Routinized 
application of inputs like law cannot answer 
such questions as “What should be produced?" 
and “How can it be produced efficiently?" 
Rather, use of scientific knowledge to reach 
flexibly chosen ends is more suited to 
producing output. 

Therborn (1978, pp. 40-54) implies that 
both bureaucracy and technocracy have orga- 
nizational and reasoning dimensions. States 
have "organizational technologies" that ex- 
press their class character. Following Weber, 


legal-rational bureaucracy in liberal capital- , 


ism rests on application of rules, disregarding 
content, persons, and ends served, by trained 
officials in hierarchically ordered positions. 
Whereas bureaucracy suits liberal capital- 
ism's legal regulation of the market, monop- 
oly capitalism's market intervention requires 
managerial technocracy in state executive 
agencies. Technocracy relies on specializa- 
tion, impersonality, and stratified monopoli- 
zation of knowledge, but not on rules and 
fixed hierarchies. Its rationality is substan- 
tive, involving calculation based on the 
content of cause-effect relations. 

Legal and technocratic activity are two 
distinct modes of state intervention. Legal 
reasoning focuses on rule creation, interpreta- 
tion, and application grounded in common or 
statutory law. It relies on manipulating legal 
rules according to procedures such as adjudi- 
cative reasoning based on precedent, the 
technique of distinguishing cases, and statu- 
tory interpretation based on legislative his- 
tory. Legal organization considers training in 
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law vital for occupying authoritative and 
strategic organizational positions. It provides 
hierarchical settings (court or administrative 
agency) and formalized procedures for apply- 
ing legal reasoning, and subjects decisions to 
judicial review. Technocratic reasoning oc- 
curs in relatively informal, nonhierarchical 
organizational settings and involves making 
policy rather than substantive legal rules.? 
Scientific experts given jurisdictional respon- 
sibility for technocratic policy-making pro- 
duce policy on an ad hoc, discretionary basis, 
in response to readings on scientific variables 
and in accord with established cause-effect 
relationships. Substantive provisions of tech- 
nocratic policy are not ordinarily subject to 
judicial review. 

Technocratization is the movement toward 
exclusive use of causal reasoning by scientific- 
technical experts to make and administer state 
policy. Law in any area can provide a 
framework of rules to be interpreted, applied, 
or even created by scientific-technical experts 
establishing and applying general cause-effect 
relations, rather than by lawyers using legal 
reasoning.? Although state action can be 
based on logics other than legal or scientific 
rationality (e.g., force, politics), a specifica- 
tion of all alternatives is not necessary here. 
But it is necessary to appreciate the implica- 
tions of defining technocratization as a 
process in which scientific-technical reason- 
ing and expertise are substituted for alterna- 
tive state logics. On a latent level, opposition 
to technocratization always exists based on, 
or justifiable by, competing logics. Implement- 
ing one or another logic depends on particular 
circumstances. " 

This concept of technocratization, implying 
the implementation of scientific-technical 
logic in the context of conflict, will help 
structure the case analysis and guide theory 
construction. The concept calls for investigat- 
ing the presence or absence of scientific 


? The terms used here to differentiate the 
“socio-logic” of technocratic and legal interven- 
tions are not to be confused with equivalent terms, 
used as legal terms of art in interpreting particular 
statutes such as the Administrative Procedure Act. 
See Stryker (1986) for extensive development of 
both concepts and examples of legal and techno- 
cratic rationality. 

3 See Stryker (1986) for thorough treatment of 
ali reasoning and organizational elements of 
technocratization of law. 
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reasoning in government policy-making and 
administration. For technocratization to oc- 
cur, personnel trained in such reasoning must 
be present. Their presence consequently 
becomes a key indicator of technocratization 
and of technocratization of law. Thus, both 
case analysis and subsequent theoretical 
development focus on the role of scientific 
experts in regulatory agencies. Both are 
sensitive to conflict over state reliance on 
science. 


Past Theory on Increased Technocratization 


Although otherwise diverging, class- and 
state-centered scholars converge in expecting 
the welfare state to rely increasingly on 
scientific-technical expertise. The state- 
centered approach treats government as a 
coercive, administrative and legal organiza- 
tion, analytically independent of its economic 
and social environment, with interests, goals, 
and capacities of its own. Although state- 
centered theory does not formally conceptual- 
ize the distinctive nature of scientific- 
technical as opposed to legal rationality, 
Skocpol (e.g., Skocpol 1985; Skocpol and 
Finegold 1982) explicitly presents scientific 
expertise as a basis for expanded state 
capacity. The state incorporates scientists 
because scientific rationality increases state 
capacity to form and execute policy autono- 
mous from the demands and interests of 
classes and groups outside the state. Scientific- 
technical research is a resource enhancing 
the state’s ability to effectively accomplish its 
goals. Thus, state-centered theory leads us 
to expect government to expand recruitment 
of, and reliance on, scientific-technical exper- 
tise. 

The class-centered approach suggests mech- 
anisms by which liberal and welfare states 
reproduce capitalism. The liberal state does 
so by constituting workers as isolated juridi- 
cal subjects, then organizing them into “the 
public” whose interests the state purports to 
serve (Poulantzas 1975). The liberal state also 
masks capitalism’s class character by juridi- 
cally separating economic and political 
spheres, allowing apparently fair market 
exchanges to become ideologically effective 
(Habermas 1975). 

In the face of an organizing and radicaliz- 
ing working class, state intervention contains 
the market's destructive side effects by 
establishing, for example, a minimum wage 
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and pension programs (Sweezy 1942; Ther- 
born 1978). Capitalism is relegitimated in the 
short run, but at a price. Class-centered 
theory forsees an eventual legitimation crisis 
wrought by state intervention because such 
intervention politicizes economic decisions 
and conflicts by focusing them in and on the 
state (Wright 1978). It also creates new 
administrative structures needing legitimation 
and reduces the efficacy of traditional bour- 
geois ideology claiming a self-regulated 
economy (Habermas 1975). 

'State reliance on scientific expertise re- 
lieves the legitimation crisis. As scientists 
provide knowledge to manipulate the econ- 
omy, their rationality "take[s] on the role of 
an ideology" (Habermas 1970, p. 104). With 
the logic of scientific progress apparently 
determining system development (Habermas 
1970, p. 105), capitalist domination is 
masked. Notions of technical exigency depo- 
liticize, converting questions of policy goals 
to questions of efficacy of means. Citizens 
acquiesce in the conditions of their domina- 
tion, glad to relegate day-to-day administra- 
tion to experts presumed to carry out 
electorally chosen ends. State reliance on 
scientific "laws" to justify its interventions 
enables it to be seen as a neutral representa- 
tion of societal interests discovered and 
served by scientific experts (Sarfatti-Larson 
1977; Gouldner 1979). 

In this argument, underlying conflict be- 
tween capital and subordinate groups, espe- 
cially labor, fuels state need for scientific 
rationality. The legitimation function of 
science does not require that it eliminate 
conflict, but only that it contain or mask the 
true magnitude of class conflict (see Offe 
1985). Despite recognizing class conflict over 
state policy, the argument projects increasing 
state dependence on scientific expertise be- 
cause increasing numbers of activities become 
subject to potentially politicized state interven- 
tion. Class-centered arguments relating scien- 
tific rationality to expanded capacity for 
essential welfare state functions (Therborn 
1978; Off 1975, 1985) also suggest increased 
state reliance on scientific experts.* Elimina- 


* For a more nuanced discussion including 
tensions and inconsistencies in the class-centered 
approach to state reliance on science, see Stryker 
(1987). Since Offe and Therborn treat bureaucracy 
and technocracy as distinct, even competing state 
rationalities, and assume that no state action in 
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tion of the NLRB economic division shows 
that the process of technocratization of law 
has limits, and suggests that neither the class- 
nor state-centered approach alone offers a full 
explanation of the process. 


Historical Background 


The National Labor Relations Act of 1935 
administered by the NLRB was basic to the 
U.S. transition from competitive to monopoly 
capitalism and from liberal to interventionist 
state. Encouraging labor organization and 
collective bargaining, the NLRA removed the 
price of labor from strict market determina- 
tion. The NLRB originally employed eco- 
nomic experts to aid in statutory enforcement. 
Its Division of Economic Research was in the 
thick of regulatory enforcement action during 
the Board’s first five years (Gross 1970). 
Economists’ participation in NLRB adjudica- 
tive rule-making continued a labor law 
tradition, since litigants contesting issues 
arising under the common law governing 
labor relations prior to the NLRA routinely 
used social science evidence in appellate 
court briefs (Ziskind 1939). Because the 
NLRA was based on legislative assumptions 
of cause-effect in social and economic 
processes (Fleming 1957), the NLRB liber- 
ally used social science evidence, for exam- 
ple, statistical analyses of the causes and 
effects of strikes (NLRB Division of Eco- 
nomic Research 1936), in litigating the Act’s 
constitutionality. To this day, occasions for 
use of social science experts, models, and 
research abound in adjudicative interpretation 
of the amended NLRA (see Stryker 1986, 

1987 for examples). 

: Recognizing its relevance, Board critics 
promote agency use of social science research 
(e.g., Gross 1981, pp. 265-67; Getman, 
Goldberg, and Herman 1976; Roomkin and 
Abrams 1977). However, contemporary NLRB 
decisions commonly point to agency experi- 
ence in labor relations as authoritative. Thus, 
the agency relies on "unverified behavioral 
assumptions [and] untested suppositions" 
(Gross 1981, p. 265), rather than on social 
science theory or research which might 
support or deny NLRB assumptions of 


capitalism can operate without contradictions, their 
concepts are building blocks for my concept of 
technocratization. 
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cause-effect in the processes it regulates.^ 
Contemporary critics argue that the NLRB 
should conduct empirical studies to assess the 
impact of its decisions (Gross 1970) and 
should base decisions “on the best available 
data" (Bok 1976, p. xii). Instead, even when 
the Board infrequently attempts to take 
advantage of social science research, the 
agency is poorly equipped to evaluate it (see, 
e.g., details provided by Gross 1970). Such 
shortcomings, day the critics, are largely 
attributable to the unwise dismantling of the 
NLRB's Division of Economic Research 
(e.g., Gross 1970). 

Congress unceremoniously terminated this 
unit on October 9, 1940, in a rider to a 
supplemental appropriations bill (Public Law 
812). The Taft-Hartley amendments to the 
NLRA ratified the 1940 decision without 
debate by inserting into the amended NLRA a 
little-noted, unprecedented provision stating 
that nothing in the act shall be construed to 
authorize the Board to appoint individuals for 
“economic analysis” (61 Stat. 136 [1947]). 
Although Taft-Hartley's language leaves legal 
room for some sort of social science research 
division (Roomkin and Abrams 1977), both 
"economic analysis" and a social science unit 
remain banished from the Board. 

As indicated by the presence of social 
Scientific-technical experts within agency 
decision making, technocratization was ubig- 
uitous by the end of the New Deal. The New 
Deal solidified and routinized American 
Progressivism's pattern of legal intervention 


5 See, for example, Sewell Manufacturing Co., 
138 N.L.R.B. 66 (1962) at 69, 71, which relies, 
without evidence, on an assumption that racial 
messages create a "powerful emotional force" 
impeding “a reasoned, untrammeled choice”; and 
Peerless Plywood Co., 107 N.L.R.B. 427 (1953) 
at 429, which proscribes “captive audience” 
speeches by union or employer within 24 hours of 
an election because of “our considered view, based 
on experience with conducting representation 
elections, that last minute speeches by either 
employers or unions delivered to massed assem- 
blies of employees on company time have an 
unwholesome and unsettling effect and tend to 
interfere with that sober and thoughtful choice 
which a free election is designed to reflect.” 
Dissenting opinions in Peerless and a companion 
case criticize the Board majority for ignoring 
relevant psychological studies of persuasion and 
propaganda, sociological studies of political behav- 
ior, and research in the field of industrial relations. 
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by administrative body. It created or signifi- 
cantly expanded, in addition to the NLRB, for 
example, the Securities and Exchange, Inter- 
state Commerce, Federal Trade, and Federal 
Power Commissions (Sanders 1982). With 
124 new agencies created and a near doubling 
of state civilian employees between 1933 and 
1940, New Deal regulation provided growing 
employment for social scientists and lawyers 
(1982). 

At the time Congress debated removing the 
economic division, the FTC, ICC, and SEC 
all employed economic experts doing work 
similar to that done by NLRB economists, 
including participating in law enforcement as 
expert witnesses (86 Cong. Record Appendix 
2928; FTC Annual Reports 1935—40; Govern- 
ment Manuals 1935-40). Such witnesses are 
persons with scientific or technical training 
and credentials outside of law, called upon by 
counsel and certified by the court as experts 
capable of drawing causal inferences relevant 
to factual issues in a case. Other federal 
agencies and executive departments of the 
1930s continue to employ economic experts. 
Today, the Bureau of Labor Statistics does 
economic and statistical analyses for the 
Department of Labor, the Bureau of Agricul- 
tural Economics (later renamed) does similar 
work for the Department of Agriculture, the 
Federal Trade and International Trade (for- 
merly U.S. Tariff) Commissions maintain 
their economic units, and the Securities and 
Exchange Commission still employs econo- 
mists and accountants. “[T]he NLRB remains 
today the only administrative agency forbid- 
den to seek . . . economic information on its 
own" (Gross 1981, pp. 164—65). 


METHODS 


The case study draws extensively on primary 
material, including NLRB internal docu- 
ments, public documents such as the Congres- 
sional Record, relevant Congressional hear- 
ings and reports, NLRB and court adjudicative 
decisions, NLRB annual reports, research 
bulletins and memoranda of the NLRB's 
economic division, and transcripts of in-depth 
interviews with NLRB personnel. Internal 
documents include letters to and from agency 
personnel, NLRB memoranda, and research 
memoranda, questionnaires, and outlines.$ 


$ Space limitations preclude thorough citation of 
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The study also uses secondary historical and 
legal literature. 
Taft-Hartley’s prohibition on “economic 


_analysis” is treated as an epilogue to analysis 


spanning 1935-40. Congress did not recon- 
sider the economic research issue in 1947. 
Indeed, the 1939-40 Smith Committee inves- 
tigation of the NLRB was largely responsible 
for the entire content of the Taft-Hartley bill 
(Millis and Brown 1950; Gross 1981). 


THE CASE STUDY: EXPLAINING THE 
ANOMALY OF THE NLRB 


Economic Division Role in 
Effective Enforcement 


The NLRB effectively enforced the NLRA, win- 
ning cases under judicial review by appellate 
courts and successfully accomplishing its stat- 
utory mission to promote unionization and col- 
lective bargaining. The agency’s economic di- 
vision facilitated effective enforcement by 
providing social science data and analyses.” The 


primary sources used in the case analysis. Citation 
to relevant NLRB internal memoranda has been 
eliminated, as has almost all citation to relevant 
NLRB and court opinions. Complete documenta- 
tion of secondary and primary material used in the 
case analysis is available in Stryker (1987, 1986) 
as is a complete historiography and detailed 
account of the legal context that helped determine 
research questions for NLRB economists. Oral 
interviews with NLRB personnel used in the 
analysis may be examined in the Labor- 
Management Documentation’ Center, New York 
State School of Industrial Relations, Cornell 
University. 

7 The field of economics in the 1930s was not 
the same as it is today (McNulty 1980; Cooter and 
Rappoport 1984). Economic theory was in disarray 
because of competing visions of its purpose, 
scope, and method. There was a close relationship 
between the importance of the institutionalist 
school in economics as a whole during the 1920s 
and the dominance of institutionalism in the labor 
economics of the interwar period. Institutional 
economists criticized conventional economic the- 
ory for its limited analytical framework based on 
the market. They borrowed heavily from the 
concepts, analytic tools and methods of other 
social sciences, including sociology and political 
science. Much theoretical and empirical work in 
labor economics attempted an integrative, social- 
scientific approach involving markets and other 
institutions. Thus, the terms economic and social- 
scientific are used interchangeably with reference 
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latter provides the initial clue to explain the 
Division’s elimination. 

With the NLRA, the federal government 
promoted workers’ rights to organize and join 
labor organizations, bargain collectively, and 
engage in other collective action including 
strikes. NLRB conducted representation elec- 
tions and vigorous Board prosecution of 
employer unfair labor practices prohibited by 
the Act implemented workers’ rights. Be- 
tween Board formation and June 1940, the 
NLRB held 3,386 elections in which about 84 
percent of eligible workers voted. Unions 
won 94 percent of the vote in 1937, the year 
of their greatest success (NLRB Annual 
Report 1947, pp. 89-90). By summer 1940, 
unions represented about nine million employ- 
ees, roughly twice the number represented in 
1935. Collective bargaining contracts covered 
well over a million workers in such basic 
industries as iron and steel, automobile, and 
electrical manufacturing, where few such 
contracts existed prior to the Act. 

The early NLRB aggressively and success- 
fully prosecuted employer unfair labor prac- 
tices, requiring employers to cease company- 
dominated unions, refusals to bargain in good 
faith, and mass discriminatory discharges for 
union activity (NLRB Annual Report 1947; 
Millis and Brown 1950). Newspaper publicity 
and the NLRB strategy of prosecuting cases 
arising in large companies and in vital 
industries, and cases in which employer 
unfair practices were flagrant and far-reaching 
ensured the high visibility of NLRB work. In 
1938-40 alone, Board action forced employ- 
ers to reinstate about 190,000 workers who 
had been discriminatorily discharged for 
union or strike action, to pay back-pay of 
millions of dollars, to disestablish hundreds 
of company unions, and to begin collective 
bargaining to remedy unfair practices in 
almost 2,000 cases (Annual Reports 1938—40, 
1947). The NLRB often went to court to 
establish the scope of remedial orders and 


to the NLRB. The scientific character of NLRB 
economists’ work rests not on its adoption of a 
particular economic or sociological theory, nor on 
its use of particular research methods, but on using 
various types of data and analytical tools to make 
empirical generalizations and causal inferences. 
Economists in other state agencies of the 1930s did 
similar tasks using similar, that is, mainly 
historical-institutional and statistical, data (see, 
e.g., FTC Annual Reports 1930—40). 
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secure employer compliance. Its excellent 
early court record further evidences effective- 
ness (Millis and Brown 1950; Annual Reports 
1936-40). Administrative law expert Louis 
Jaffe called the first five years of NLRA 
administration “the most high powered and 
effective law enforcement in our history” 
(1953, pp. 79-80). Whereas Griffin, Wal- 
lace, and Rubin (1986) show that the National 
Association of Manufacturers’ antiunioniza- 
tion strategies were effective in the 1902-28 
period, unpublished analyses by Griffin show 
that by 1941 the NLRA had destroyed the 
efficacy of the NAM's antiunion activity. 

From the start, NLRB economists were 
integral to effective Wagner Act enforcement. 
The Board considered plans to organize 
economic research in mid-1935 (Gross 1974). 
By February 1936, David J. Saposs, former 
student of John R. Commons at the Univer- 
sity of Wisconsin, was NLRB Chief Econo- 
mist. In 1936-40, he supervised from 4~16 
economic experts. Past president of the 
American Economic Association and Univer- 
sity of Chicago economics professor Harry 
Millis recommended Saposs to the NLRB 
(Gross 1974). 

Saposs found that the Board’s first priority 
was to get an Act provoking enormous 
industry opposition declared constitutional. 
But Supreme Court precedent weighed heavily 
against finding constitutionality. Despite uni- 
fied capitalist opposition, an extraordinary 
conjuncture of economic depression, the 1934 
strike wave, failure of a National Industrial 
Recovery Act declared unconstitutional by the 
Supreme Court, constituency building among 
labor by liberal congressmen, and democratic 
electoral victories in 1934 made the NLRA 
possible (Gross 1974; Skocpol 1980; Bern- 
stein 1950). Passage of the Act did not 
dampen employers’ resistance. The NAM 
urged noncompliance (Millis and Brown 
1950). The National Lawyers’ Committee of 
the American Liberty League, representing 
major industrial and financial interests oppos- 
ing the New Deal, argued that the NLRA was 
unconstitutional (Gross 1974). Nationwide, 
lawyers used a League brief to file suit in 
federal district court to enjoin Board hearings. 
The NLRB could not function as Congress 
intended while defending against about 100 
geographically scattered injunction suits. Thus, 
the Board’s “overriding objective” became 
establishing constitutionality (1974, p. 3). 

In meeting this objective, NLRB econo- 


LIMITS ON TECHNOCRATIZATION OF THE LAW 


mists’ work occasioned Supreme Court en- 
dorsement of probabilistic causal reasoning’s 
relevance to making adjudicative law (301 
U.S. 1 at 42-43). Economic unit work 
developed causal links among employer 
unfair labor practices, industrial unrest, and 
the obstruction of interstate commerce. It 
showed, for example, that employers' refusal 
to recognize unions was a major cause of 
within-state strikes, which in turn impaired 
the cross-state commercial transactions rele- 
vant to the constitutionality of federal stat- 
utes. Such causal analysis gave the Court a 
credible rationale to essentially overrule its 
precedent narrowly interpreting the constitu- 
tion's Commerce Clause and invalidating 
previous New Deai initiatives (Gross 1974). 

Economic experts presented statistical ma- 
terial and testified in the constitutional test 
cases. They prepared a scientific-technical 
bulletin given the Court during oral argument 
in Jones and Laughlin Steel Corporation, the 
leading test case (NLRB Division of Eco- 
nomic Research 1936, 1938b). Employers 
argued that evidence of general causal 
connections was inappropriate to legal deci- 
sions in any particular case. But the Supreme 
Court acknowledged the relevance and proba- 
tive value of such connections, relying in part 
on the “economic data" to conclude that the 
Wagner Act was constitutional as applied to 
manufacturing industries (Ziskind 1939). Be- 
cause its opinion explicitly affirmed the 
import for law of science's probabilistic 
causal reasoning, the Court implicitly en- 
dorsed using social scientists in regulatory 
enforcement. 

Following the constitutional test cases, 
NLRB economists continued work on jurisdic- 
tional issues and increasingly focused atten- 
tion on issues relevant to establishing em- 
ployer unfair practices. The first contributed 
to expansive government protection for la- 
bor's right to unionize and bargain collec- 
tively. The work on substantive issues of 
labor relations complemented work buttress- 
ing expansive jurisdiction. Exactly as in 
agencies such as the Federal Trade Commis- 
sion (FTC Annual Reports 1915—40), eco- 
nomic experts facilitated prosecutorial and 
adjudicative work in a variety of statutory 
areas, in this case issues arising under the 
unfair labor practice, representation, and 
remedial provisions of the NLRA. NLRB 
economists provided industrial and firm 
organizational data to help attorneys deter- 
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mine if the Board should issue complaints of 
law violations. NLRB economists also testi- 
fied as experts at, and prepared material 
suggestive of appropriate causal inferences 
for, Board hearings, and contributed statisti- 
cal data and causal arguments to appellate 
briefs (e.g., Annual Reports 1937—40). 
Division analyses of employment data for 
discriminatory discharge cases prefigured 
work that the Equal Employment Opportunity 
Commission and other social scientists and 
statisticians perform as expert witnesses in 
cases alleging race or gender discrimination 
in employment (Annual Reports 1937—40; 
Baldus and Cole 1980). Analyses of employ- 


' ment and financial records also provided data 


essential to enforce reinstatement and back- 
pay provisions without which NLRA prohibi- 
tion on discriminating against union support- 
ers would have been meaningless. Economists' 
analytic and statistical expertise was essential 
to success in prosecuting charges of discrimi- 
nation, giving the Division a high profile in 
an area of bitter conflict between unions and 
antiunion employers (NLRB Annual Reports 
1935—40). 

Division work on employer-instigated “in- 
dependent" unions and back-to-work move- 
ments facilitated formation of Board policy 
and NLRB efficacy in defending its policy in 
court. Data from a survey of independent 
unions suggested the accuracy of an empirical 
generalization of continuity between company 
unions outlawed by the NLRA and indepen- 
dents springing up after the Act's passage 
(Division of Economic Research 1938c). This 
provided material for trial attorneys in cases 
charging that independent unions were in fact 
company dominated (Annual Report 1937). 
Division empirical generalizations linking the 
activity of individual employers and their 
peak associations to development of suppos- 
edly independent back-to-work movements 
and vigilantism relied on historical research 
(Division of Economic Research 1938a). A 
bitterly contested textile case indicates that 
NLRB policy in this area converged with 
Saposs's analyses, and that the economic 
unit’s work facilitated Board success in 
protecting findings and remedial orders upon 
judicial review (Stryker 1986). Economic 
work contributing to employee reinstatements 
and disestablishing company unions aided the 
organizational capacities of the bona fide 
labor movement. 

Implementing the requirement that employ- 
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ers bargain in good faith also involved the 
economists. Here, the early NLRB’s most 
important interpretive principle required em- 
ployers to embody agreements reached with 
unions via collective bargaining in written, 
signed contracts. Economists prepared volu- 
minous material promoting this crucial adju- 
dicative rule, including material suggesting a 
strong correlation between employer refusal 
to embody agreements in writing and attempts 
to evade unionization by other means. 
Affirming the rule, the Supreme Court relied 
on a Division bulletin on written agreements 
in collective bargaining (Division of Eco- 
nomic Research 1939) and ratified causal 
analysis and citations supplied to the NLRB 
brief by Board economists (311 U.S. 514 
[1041)). 

Thus, the Division, acting in its scientific 
capacity, albeit within a framework of 
litigation and NLRB advocacy of statutory 
interpretation designed to prosecute and win 
cases, aided the aggressive, effective enforce- 
ment characterizing the Board's first five 
years. 

Participation in this enforcement alone 
would not have brought down the Division. 
This outcome was contingent on effectiveness 
on behalf of labor within a class-organizing 
regulatory framework. This, along with the 
NLRA’s profound departure from traditional 
American business ideology, a changed 
Congressional context, and intra-NLRB con- 
flict between lawyers and economists caused 
elimination of the Division of Economic 
Research. 


The NLRA as Class Organizing: Structural 
Basis for Employer Resistance 


NLRA architects intended to diminish labor 
radicalism, undercut labor militancy, and 
stabilize industry through collective bargain- 
ing (Klare 1978; Bernstein 1950), but such 
procapitalist outcomes were not assured 
during the Act's first five years. Instead, the 
enforcement facilitated by NLRB economists 
gave employers the material and ideological 
bases for their class-organized resistance to 
Board enforcement. 

In 1937—40, the NLRB-aided labor move- 
ment challenged managerial prerogative in 
labor relations (Gross 1981). Enforcement of 
unfair labor practice provisions undercut 
employers' resistance to unionization (see 
above), and the NLRA promoted unionization 
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in part to increase labor's wages (Fleming 
1957). Griffin, Wallace, and Rubin (1986) 
show that in 1902—28, union organization had 
a negative impact on the level and growth of 
industry productivity and a positive impact on 
wage levels and wage growth. They estimate 
that each unit increase in per-unit labor costs 
resulted in about 121 business failures 
annually. A positive relation between wages 
and unionization would likely hold in 
1935-40. In a growing postwar economy, 
monopoly capitalists would find institutional- 
ized collective bargaining in their interest 
(O'Connor 1973). During the Depression, big 
and small business alike could ill afford 
additional labor costs. Material interest thus 
provided foundation for capitalist solidarity 
against NLRB work. 

Ideological and political bases for class- 
organized resistance to NLRB enforcement 
also existed. Correctly, “businessmen per- 
ceived that unions and the government 
threatened their principles as well as their 
profits” (Auerbach 1966, p. 143). Aided by 
the NLRB, expansion of industrial unions 
united broader sectors of labor than ever 
before in U.S. history, increasing labor’s 
organizational capacities and power (Gross 
1981). CIO growth and militancy threatened 
capitalists’ interests. The CIO organized 
industrial unions by preaching class conflict 
against employers’ claims of industrial har- 
mony (Neuman 1982; Auerbach 1966). Many 
CIO organizers were left-wing, even commu- 
nist, and CIO organization drives relied on 
sit-down strikes (Bernstein 1970). Since 
employers thought sit-downs should be ille- 
gal, Division and Board attempts to prevent 
sit-down strikers from being fired, though 
reversed by the Supreme Court (Klare 1978), 
were guaranteed to generate employer anger. 
So, too, were cases in which the Board and its 
economists determined that AFL locals were 
company dominated. When faced with CIO 
organizing drives, employers preferred the 
more conservative and craft-based AFL (Gross 
1981). In the long term, the NLRA decreased 
labor militancy (Tomlins 1985). But in the 
short term, state protection, combined with 
employer intransigence, increased labor left- 
ism (Klare 1978). During the (pre-Taft- 
Hartley) NLRA, though arguably at no other 
time, union organization and labor militancy 
mutually stimulated each other (Rubin, Grif- 
fin and Wallace 1983). Economic unit 
publications and publicized Board enforce- 
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ment revealing class conflict beneath employ- 
ers’ claims of harmony also may have 
increased labor militancy. NLRA impact in 
its early years reflected less its framers’ intent 
to promote labor peace than it did employers’ 
fears that their way of life was unraveling 
(Auerbach 1966; Gross 1981). 

Because early NLRA enforcement orga- 
nized and radicalized labor, it stimulated 
capitalist resistance. The campaigns to defeat 
the NLRA, then to have it declared unconsti- 
tutional, show unified business resistance 
prior to the Supreme Court decision sustain- 
ing constitutionality (Millis and Brown 1950; 
Gross 1974; Bernstein 1950). The Court 
decision forced employers to new strategies 
of resistance, in which the NAM and U.S. 
Chamber of Commerce played primary roles. 

The NAM led employer resistance to 
unionization (Griffin et al. 1986), and oppo- 
sition to the NLRA and NLRB harmonized 
with the NAM's pre-New Deal policy. After 
the NLRA was declared constitutional, the 
NAM publicized strategies to alter traditional 
antiunion devices sufficiently to allow compa- 
nies to evade the Act while protecting 
themselves from labor organization (e.g., 
blueprints for orchestrating back-to-work 
movements and transforming outlawed com- 
pany unions into formally independent unions 
remaining under company control [Millis and 
Brown 1950; NAM Labor Relations Bulletin 
Nos. 20—23]). Such efforts were less success- 
ful than they might have been, in part because 
the economic division exposed and linked 
them to individual employers’ unfair prac- 
tices. NAM Counsel complained publicly 
about Saposs’s expert testimony on NAM 
practices (Senate Committee on Education 
and Labor 1939, 10:1927). 

In 1936-40, both the NAM and U.S. 
Chamber of Commerce instigated Congres- 
sional moves to investigate alleged prolabor 
bias of Board proceedings and personnel and 
to dramatically amend the NLRA, decreasing 
labor’s rights while increasing its duties to 
management (Millis and Brown 1950; Gross 
1981; NAM 1939; Chamber of Commerce 
1939). Though historical sources suggest that 
some employers’ resistance abated following 
the Court decision sustaining constitutionality 
(Millis and Brown 1950), employer opposi- 
tion to the NLRA and NLRB remained vocal 
and widespread. It continued to organize 
employers across segments of capital. Anti- 
NLRB activity of the NAM expressed the 
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sentiments of big business (see Skocpol 
1980). The Chamber more reflected opinions 
of smaller business, since there were more 
small than large businesses in the Chamber's 
voting constituency (see Herring 1929; Childs 
1930) and the Chamber's proposals to amend 
the NLRA required membership endorsement 
(Chamber of Commerce 1939). Anti-NLRB 
complaints of individual employers before 
Congressional committees provide additional 
evidence that employer resistance cut across 
segments of capital (Senate Committee on 
Education and Labor 1939, vols. 8-10). 


Ideology as a Tool for Resisting the NLRB 
and its Economic Division 


NLRA enforcement, facilitated by the eco- 
nomic division, gave employers the advan- 
tage in a propaganda war fought over the 
NLRB. Board supporters tried to frame the 
NLRA’s deviations from “competitive stan- 
dards and the individualistic tradition in terms 
of [the ideology of] social justice, humanitar- 
ianism, the need to combat big business or the 
need for economic balance” (Hawley 1966, 
p. 195). Employers relied on traditional 
business values of property rights, competitive- 
individualism, laissez-faire, and labor- 
management harmony, and were better able 
to mobilize press, public, and Congressional 
support, in large part because such values 
were the traditionally dominant American 
values. Business associations extolled tradi- 
tional virtues of American life, for example, 
“the right of the individual to seek the kind of 
gainful employment he chooses [and to] own 
private property and enjoy its use" (NAM 
1936, pp. 6-7). Employers labeled unioniza- 
tion and collective bargaining “socialistic,” 
charged the Board with pro-CIO bias, linked 
the Board and its staff, including Saposs, to 
the most radical elements of American labor, 
argued that the Board had disrupted rather 
than promoted amicable employer-employee 
relations, argued that Board enforcement 
would lead to public ownership by employ- 
ees, and asserted that the NLRB's theme was 
that employers were the enemy of workers 
(Chamber of Commerce 1939; NAM 1939; 
Senate Committee on Education and Labor 
1939). 

Some members of Congress joined opposi- 
tion to the NLRB. Senator Burke, who 
provided a clearinghouse for employer- 
proposed amendments to the NLRA, criti- 
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cized Board staff and Saposs for persecuting 
business (Gross 1981). In a statement distrib- 
uted with Republic Steel’s help, Senator Nye 
claimed state promotion of unions would 


result “in the condition that exists in Russia: 


today," and argued that Saposs was unsuited 
for his position because he endorsed “Stalin’s 
doctrine of boring from within" (1981, pp. 
49-50; 86 Cong. Record 7739). Employers 
and their Congressional allies had almost 
unanimous press support (Bernstein 1970). 
Opinion surveys showing widespread public 
hostility to the NLRA and NLRB (Gross 
1981, pp. 72-74, 187) suggest the effective- 
ness of the organized opposition. 

By 1939, Republicans had capitalized on 
anti-New Deal sentiment to gain 81 House 
and 8 Senate seats, and a coalition of 
conservative Republicans and southern Dem- 
ocrats replaced the earlier New Deal alliance 
(Neuman 1982). Congress subjected the 
Board to three investigations, including the 
hostile House Special Committee to Investi- 
gate the NLRB. Howard Smith (D.-Va.) a 
virulent opponent of the New Deal, NLRA, 
and CIO, chaired the Committee (Gross 
1981). Smith hired as Chief Counsel Edmund 
Toland, who had represented important NAM 
members and other companies known for 
vicious antiunion tactics (86 Cong. Record 
Appendix 3871—73). Through Toland, em- 
ployer groups had a direct link to the Smith 
Committee investigation, which continued 
industry's attack on the NLRB (Millis and 
Brown 1950). On some issues, industry had 
an ally in conservative AFL leaders who 
charged Board bias toward the .CIO. Despite 
great internal opposition to AFL leadership 
activity (Gross 1981), the antileftist rhetoric 
of AFL leaders fed into the employers' 
campaign (Gross 1981). 


The Smith Committee Attack on the NLRB 
and its Economic Division 


The Smith Committee investigation in 
1939-40 conducted a sustained attack on the 
economic division and Saposs.5 There is no 


3 Gross (1970) argues that the Smith Committee 

' was not concerned with the role of the economic 
division itself, but conducted a personal attack on 
Saposs. Careful study of the Smith Committee 
hearings and reports, as well as the campaign 
against the Division in appropriations and on the 
floor of Congress suggests that from the beginning 
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evidence that employers orchestrated the 
Committee attack on the Division, but it 
occurred in the context of a Committee 
campaign against the Board which reflected 
employer’s charges, garnered employers’ 
support, and served employers’ interests. 

The Committee took NLRB files including 
much economic division material (Stryker 
1986). Echoing employers’ fears and rhetoric, 
Toland and the Committee majority con- 
ducted an unceasing attack on the Board 
(Smith Committee Hearings 1940-41; Inter- 
mediate Report 1940a; Final Report 1941). 
The Committee falsely charged that Board 
Chairman Madden was a subversive, implied 
that the anticommunist Chief Economist was 
a communist, and depicted all left groups as 
antidemocratic, pro-Soviet and anti-American. 
Gross (1981, p. 15) argues that the Commit- 
tee was not really looking for communists, 
but simply for convenient weapons to force 
the Board to back off from vigorous enforce- 
ment of the NLRA. 

The Committee was concerned that the 
Board promoted a view of employer- 
employee relations as class struggle and 
foreign to a “proper” American idea of 
industrial relations. It perceived an intimate 
connection between NLRB policies it detested 
and leftist orientations of NLRB personnel, 
and an equally close relation between radical 
NLRB policies and personnel and “intensified 
class feeling” in labor relations (Final 
Report, p. 5). The Committee majority meant 
to eradicate both antiemployer policies and 
personnel that threatened increased working- 
class power and suggested that the labor- 
management relationship was one of conflict, 
rather than harmony. 

In this context, the Committee’s joint 
attack on Saposs and his division is not 
surprising. The Committee had material 
showing that in enforcement, Saposs vigor- 
ously defended labor's new legal rights. It 
understood the Division’s importance for 
winning Board cases.? 

Released March 29, 1940, the Smith 
Committee Intermediate Report mobilized 
Congressional support for the Smith bill, 
designed like the Burke bill before it to 





the Smith Committee majority and other NLRB 
opponents in Congress were attacking Saposs, his 
staff, and the economic division's work as well. 

? See Stryker (1986, 1987) for the data upon 
which this conclusion is based. 
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advance employers’ interests by amending the 
NLRA and weakening enforcement (H.R. 
8813, 1940). Both the bill and Intermediate 
Report proposed to abolish the economic 
division. The Report argued that the NLRB 
improperly confused prosecuting and adjudi- 
cative functions by basing decisions on 
economic material not introduced into the 
legal record, and that Division activity was 
unwarranted and unnecessary given the exis- 
tence of the Bureau of Labor Statistics 
(Intermediate Report, 1940a).'!° Committee 
arguments that economic expertise was not 
needed or was inconsistent with due process 
of law suggest the utility of a concept of 
technocratization implicating opposition to 
expanding scientific-technical expertise. Here, 
despite invocation of legal rationality as 
justification, the Smith Committee intended 
to rid the Board of both Saposs and his 
division because they helped vigorously 
enforce an Act whose fundamental policies 
were anathema to legislators echoing em- 
ployer resistance. NLRB supporters in Con- 
gress clearly understood this point (e.g., 86 
Cong. Record 3629). Two weeks after the 
Smith bill was introduced, a House Appropri- 
ations subcommittee recommended that the 
economic division's entire operating budget 
be cut from monies allocated to its work, in 
order to eliminate the Division. The recom- 
mendation passed (H.R. 9007 [1940]; Smith 
Committee Hearings, 24:4953—54). 


Internal NLRB Conflict 


In the context of this Congressional on- 
slaught, itself conditioned by the employers' 
campaign against the Board, intense conflict 





10 The NLRA barred the Board from duplicating 
work of the Labor Department. Commissioner 
Lubin of the BLS testified that economic division 
work did not duplicate BLS work and that the 
NLRB needed its economists to analyze relevant 
BLS data and do other scientific-technical work 
(Minority Views 1940b). The BLS could perform 
no research connected with individual Board cases 
(e.g., Letter from Acting Labor Commissioner to 
Madden, 2 August 1940). There were some 
instances of early Board access to off-the-record 
economic material (Minority Views 1940b). But a 
Brookings Institution analysis found that NLRB 
procedures conformed far better to "strict due 
process of law" than did the procedures of any 
other legal-administrative agency of the time (86 
Cong. Record 3842). 
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between NLRB lawyers and economists 
sealed the economic division’s fate. Intra- 
NLRB conflict revolved predominantly around 
the role that Saposs, his staffers, and their 
characteristic expertise would play in NLRB 
enforcement (Gross 1970; Tomlins 1985). 
Along with research interests and methods 
fairly representative of labor economics in the 
early 20th century (Barbash 1967; McNulty 
1980), Saposs took from Commons the 
“Wisconsin school” model of administrative 
agencies. The model made such agencies 
investigative and control bodies primarily 
dependent upon social science research, 
rather than primarily legal bodies with 
prosecuting and adjudicative functions (Mc- 
Nulty 1980; Gross 1970; Saposs interview 
1968). Saposs wanted a role that would 
maximize Division authority and the techno- 
cratic character of Board enforcement— 
compulsory review of drafts of Board deci- 
sions by the economic division, economists’ 
input on all decisions concerning Board 
jurisdiction, and flexible gathering and use of 
scientific-technical data even if this violated 
legal norms demanding differentiated prose- 
cuting and adjudicative functions (e.g., Gross 
1970). 

The Chief Economist’s legal counterpart 
respected economic data and economists’ 
contribution to NLRB enforcement (Fahy 
interview 1968). However, consistent with 
his training and experience in legal doctrine 
and method, General Counsel Charles Fahy 


. Sought to subordinate social science rational- 


ity and experts to the authority of legal 
expertise (Fahy interview 1968). 

Tbe NLRB's chief lawyer and chief 
economist locked horns many times. On the 
whole, the Board supported Fahy and refused 
to expand the economists’ authority or the 
number of cases for which it deemed social 
science research essential (Gross 1970). 
Comparisons of the salaries, numbers and 
lines of communication and authority of the 
economic and legal divisions show that 
despite Saposs’s initiatives, social science 
personnel were subordinated to lawyers 
(Stryker 1986, 1987). While Saposs felt that 
he arid his division’s function were underap- 
preciated, Fahy felt that Saposs and his staff 
exaggerated their division’s importance (Sa- 
poss, Fahy interviews 1968). NLRB attorneys 
esteemed the economists most during the 
constitutional battle. Afterwards, attorneys’ 
support fell (Van Arkel interview 1970). 
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Intra-agency squabbling was visible to 
Congress during its hearings on the NLRB 
(Gross 1970; Tomlins 1985). In a letter to 
Commons, former economics professor and 
Board member William Leiserson argued that 
the NLRB had neither prosecuting nor 
judicial powers, but was a fact-finding 
agency. According to Leiserson, Board attor- 
neys could not understand the notion of truth 
in scientific research and insisted that truth be 
defined by the trial process. Thus, Leiserson 
reported, their views threatened the Wiscon- 
sin model of social science investigation and 
administrative control (Smith Committee Hear- 
ings, 24:2977—78). Fahy told the Committee 
that Leiserson's views were unreasonable and 
unacceptable (25:6747—48). This type of 
exchange probably convinced the Smith 
Committee majority that the economic unit 
made a good political target. 

Political factionalism exacerbated the 
lawyer-economist conflict at the Board. 
Saposs and his staff were socialists and 
diehard anticommunists, some of the lawyers 
were left-wing sympathizers and procommun- 
ist (Gross 1981).!! 

Saposs became convinced that a procom- 
munist group was trying to block his 
division’s work (Saposs interview 1968). 
Leiserson, concerned about NLRB infiltration 
by extreme left-wingers, supported Saposs 
(Kleiler interview 1971). But NLRB Chair- 
man Madden, who at times discounted 
economic work (85 Cong. Record 3616), 
thought that the politics of his staff was a 
private matter (Madden interview 1968). 
Fahy, a political liberal like Madden, tried to 
‘stay aloof from political factionalism (Saposs 
interview 1968). 

Convergent professional and political con- 
flicts at the NLRB led the Smith committee 
minority to tacitly accept the majority’s view 
of Saposs, but the minority tried to salvage 
the economic division (Minority Views 1940b; 
Gross 1981). The strategy backfired because 
it kept Saposs and his division in the glare of 
hostile Congressional scrutiny (e.g., 86 Cong. 
Record 3454-55, 3624). Board critics, sens- 
ing the minority’s willingness to remove 





11 Other lawyers were more politically conserva- 
tive than the economists. Employers' groups and 
some Congressmen suspected all NLRB employees 
of being radical zealots because all were commit- 
ted to enforcing the NLRA (Gross 1981; Millis and 
Brown 1950). 
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Saposs, used this advantage to attack the 
Division (e.g., 86 Cong. Record 3859, 7508, 
12482, Appendix 3549). The visible lawyer- 
economist differences over Board procedures 
likely reinforced NLRB opponents' convic- 
tion that the economic division was the 
Board's weakest link. 

In the end, senior senators who supported 
Roosevelt kept the Smith bill bottled up in the 
Senate Labor Committee (Gross 1981). How- 
ever, despite the Division's unanimous sup- 
port from the AFL, CIO, and major unaffilia- 
ted unions (e.g., Cong. Record 13106-7), 
NLRB opponents eliminated the economic 
division through the appropriations process 
(Gross 1981). Buttressed by the Comptroller 
General's legal opinion, the NLRB had 
maintained the Division despite earlier cuts in 
appropriations aimed at abolishing it.!? When 
an appropriations bill explicitly abolishing the 
Division passed, the Board capitulated and 
terminated the employment of Saposs and 10 
economists (Stryker 1986). In the final drive 
against the Division, the NAM reminded 
Congress that the Board had blatantly defied 
Congress’ expressed intent to abolish the 
Division (Committee on Education and Labor 
1940, pp. 112-16). In 1947, employers 
finally obtained wholesale amendments to the 
NLRA, including ratification of the 1940 
action eliminating the Division. 

When Congress eliminated the Division, em- 
ployers' associations, anti-NLRB employers, 
and their attorneys must have been quietly con- 
fident that the effectiveness of NLRA enforce- 
ment on behalf of workers would be reduced, 
just as Congressional architects of the unit's 
elimination anticipated it would be. In fact, fail- 
ure of the NLRB to resurrect a social science 
unit able to recruit and keep social science per- 
sonnel of the caliber of Saposs and his staff has 
meant a reduction in agency effectiveness (Gross 
1970; Bok 1976). The post-1940 NLRB has 
employed a small number of low-ranking 
technical analysts to prepare caseload statistics 
for annual reports and compute back-pay for 
discriminatorily discharged employees.!? Al- 


12 Letter from Acting Comptroller to Madden, 
21 August 1940. Board old-timers surmise that the 
NLRB would not have tried to evade Congress’ 
intent without express backing from President 
Roosevelt, but no living informant has knowledge 
of such backing. 

13 See NLRB 1936-74, 1948 vol. 2, p. 1577, 86 
Cong. Record 13105-6. 
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though today the NLRB accomplishes certain 
important, routinized statistical tasks, its ability 
to evaluate, or creatively and aggressively mar- 
shall and rely on social and economic science 
for adjudicative rule-making is gone. 


DISCUSSION AND CONCLUSION 


The conjunction of four factors ensured the 
elimination of the Division of Economic 
Research: (1) Agency social siet fur- 
nished data and causal analyses facilitating 
effective regulatory enforcement which was 
prolabor and class-organizing. (2) Regulatory 
enforcement so deviated from traditional 
American values that employers had a potent 
resource to mobilize the press, the public, and 
Congress to support resistance to the NLRB. 
(3) A substantial number of Congressmen 
opposed the NLRB, and key committee 
positions were occupied by opponents. (4) In 
the context of these factors, intense conflict 
between agency attorneys and social scientists 
allowed Congress to eliminate the economic 
division, seriously curtailing Board ability to 
incorporate social scientific-technical reason- 
ing into adjudicative rule-making. 

Case analysis of the NLRB suggests the 
utility of a concept of technocratization of law 
explicitly incorporating conflict over expan- 
sion of scientific-technical expertise based on, 
or justified by, legal rationality. The analysis 
also suggests that state- and class-centered 
approaches are relevant to understanding the 
government-science relationship. But neither 
approach is sufficient to explain technocrati- 
zation or its limits. As state-centered work 
would predict, scientific expertise contributed 
to the NLRB's early enforcement effective- 
ness. However, Board economists' contribu- 
tion to enhanced effectiveness ultimately 
resulted in a much diminished role for 
scientific expertise. This suggests the need to 
£o beyond state-centered focus on the effi- 
cacy of government use of science. We must 
theoretically distinguish various kinds of 
enforcement contexts to gain leverage on 
conditions under which efficacy will result in 
limits on technocratization rather than in 
increased technocratization itself. 

As for class-centered theory, since the 
NLRA represented prolabor rather than pro- 
capital legislation, technocratization of NLRB 
enforcement could not perform the presumed 
legitimation function characteristic of 
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Science.!^ However, NLRB scientific- 
technical work could have supported a 
legitimation of capitalist domination argu- 
ment if agency social scientists had used their 
expertise to diminish, rather than enhance, 
Board rule-making's anticapitalist character. 
But, for example, instead of allaying em- 
ployer fears by arguing that sit-down strikers 
were not protected by the Act and therefore 
could be fired, NLRB economists pushed the 
Act toward its most anticapitalist possibilities. 
They used causal arguments and statistics to 
insist that the Act required reinstating sit- 
down strikers fired by employers. Class- 
centered theory must be modified to account 
for instances where the advanced ‘capitalist 
state uses science to advance, rather than 
subvert, subordinate group interests. 

It is impossible to move directly from one 
case, no matter bow strategic, to a general 
theory. Although elimination of its social 
Science division makes the NLRB historically 
anomalous, this anomaly can also be seen 
representing an extreme along a continuous 
dependent variable. Thus, some agencies 
have experienced an increase in social science 
personne] and functions, for example, eco- 
nomic analysis at the Environmental Protec- 
tion Agency (Melnick 1983). Other agencies 
have witnessed reduction, for example, the 
Bureau of Agricultural Economics in the U.S. 
Department of Agriculture during the 1940s 
(Kirkendall 1966). An adequate theory should 
explain the full range of technocratization of 
the law, for natural as well as social science. 

Reflection on the NLRB case, however, 
does point the way by suggesting hypotheses 
for future historical-comparative work. Inter- 
nal agency politics and state and class 
characteristics of an agency's external envi- 
ronment as they affect agency-environment 
resource relationships must be considered in a 
developed theory. 

First, other things equal, if powerful 
political actors are agency opponents, tech- 
nocratization should be more vulnerable to 
limits. The case study illustrates the indepen- 
dent importance of formal political conflict 
and structure for technocratization, since both 
the makeup and Chairmanship of the anti- 
Roosevelt, anti-NLRB, antieconomic division 


‘4 For an argument that data on government use 
of science can support a class-centered functional 
hypothesis, albeit one different from the legitima- 
tion function formulation, see Stryker (1988). 
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Smith Committee and the pro-Roosevelt, 
pro-NLRB Senate Labor Committee helped 
produce the 1940 outcome. State-centered 
discussion of issues removed from govern- 
ment use of science stresses the political 
context facilitating or diminishing the likeli- 
hood of particular policy outcomes within 
capitalist democracies (Orloff and Skocpol 
1984; Ruggie 1984). Thus, both the NLRB 
case and the state-centered perspective sug- 
gest that future historical-comparative work 
explore presumably variable relations be- 
tween regulatory agencies and that part of 
their environment formally within the state. 
In the United States, this includes the political 
balance in Congress, makeup of Congress' 
oversight committees, and views of agency 
relevant actors in the executive branch, for 
example, the President and, recently, the 
Office of Management and Budget (Szasz 
1986). 

Second, other things equal, technocratiza- 
tion should be most vulnerable when it 
facilitates effective enforcement of statutory 
provisions favoring subordinate actors within 
a class-organizing regulatory context. The 
case study and class theory suggest that the 
class character of regulatory winners and 
losers and class context in which social 
Science and regulatory efficacy occur are 
essential to specifying conditions limiting or 
enhancing technocratization. 

NLRB regulatory enforcement was class 
organizing because it inevitably contained 
overt class content. First, the NLRA explic- 
itly regulated a class or employment relation- 
ship. Second, the Act explicitly intended to 
collectively organize workers on the basis of 
their common class position and to increase 
their power as against that of employers. 
Thus, the regulatory winner of effective 
enforcement was the subordinate class within 
a system of class relations. In response, 
employers launched a campaign of regulatory 
resistance, in which they overcame internal 
divisions to organize as a class as well. 
Class-organized resistance by capital condi- 
tioned Congressional attacks-on NLRB econ- 
omists. 

In stark contrast to the NLRA, Federal 
Trade Commission enforcement of antitrust 
law is class-disorganizing. Antitrust law does 
not regulate a class relationship, but rather 
regulates relationships internal to capital or 
those between producers and consumers. FTC 
enforcement most often pits business against 
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business, industry against industry, or one 
segment of the capitalist class against another 
(Hawley 1966; Sullivan 1977). Normally, 
both ‘regulatory winners and losers are 
dominant class members. Thus, the FTC 
disorganizes classes by fragmenting the dom- 
inant class. The producer-consumer aspect of 
antitrust disorganizes both working and capi- 
talist classes by bringing nonclass relations to 
the fore and promoting concepts like the 
public interest. This was true for both early 
populist antitrust and newer “consumer wel- 
fare” incarnations (McGraw 1984: Sullivan 
1977). 

Between such opposites as the NLRB and 
FTC fall agencies like the Equal Employment 
Opportunity Commission and the Environmen- 
tal Protection Agency. EEOC enforcement of 
prohibitions on employment discrimination 
does regulate a class relationship. However, 
rather than organizing workers on the basis of 
common class position, it tends to fragment 
them on the basis of race and gender. It is 
generally class-disorganizing in promoting 
consciousness of gender- or minority-based 
rather than of class-based interests (Hodson, 
Schervish, and Stryker 1988). Though the 
nominal regulatory winner or loser may be 
employer or employee, the real winners or 
losers are men, women, blacks, or whites. By 
regulating producer-consumer relations, EPA 
enforcement also tends to heighten visibility 
of non-class-based conflict and interests, 
especially since employers and employees in 
ailing, pollution-generating industries share 
economic survival interests. 

If the hypothesis is correct, the vulnerabil- 
ity of technocratization at the EEOC and EPA 
should be less than at the NLRB, but more 
than at the FTC, were all else equal, 
including regulatory effectiveness on behalf 
of subordinate actors. As regulation simulta- 
neously becomes more class-organizing and 
more effective on behalf of labor, the impetus 
for capitalist class resistance increases. If 
social science expertise consistently functions 
to enhance class-organizing regulation’s effec- 
tiveness on behalf of labor, capitalist resis- 
tance will likely include social scientists 
among its targets, increasing the vulnerability 
of technocratization. 

But if class-organizing regulation on behalf 
of labor is ineffectively enforced, solidary 
capitalist resistance may remain unmobilized. 
Or, if social scientists function to lessen 
adverse regulatory impact on capitalists within 
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-a framework of class-organizing regulation on 

behalf of labor, social science experts and 
their function may be exempted from any 
capitalist antiregulatory campaign. Future 
research can examine, for example, reactions 
of capital and labor occasioned by technocra- 
tization of proworker and class-organizing 
occupational health and safety law, where 
economic analysis often mitigates adverse 
impact on employers (e.g., Szasz 1986). 

In short, the class-organizing/disorganizing 
character of regulatory enforcement, in con- 
junction with the enforcement role of social 
scientists, is considered crucial. Of all 
nonstate actors in the regulatory agency’s 
environment, the capitalist class is thought to 
be the single most important in its capacity to 
exert pressure on the agency and other state 
actors. However, under corporate capitalism, 
the American capitalist class has been espe- 
cially strong relative to its labor force and its 
counterparts in many other capitalist democ- 
racies. Thus, the hypothesis in its present 
form may not be applicable outside the U.S. 
context. 

Third, other things equal, technocratiza- 
tion should be less vulnerable when agency 
value resources are high. Case analysis 
suggests attention to value resources, and 
class theory highlights the way such resources 
likely reflect basic features of the capitalist 
system. In this country, an agency will be less 
vulnerable to limits on technocratization the 
more conformable.a statute and its enforce- 
ment are to traditional American values of 
class harmony, competitive-individualism and 
managerial prerogative. The content of tradi- 
tional values varies some depending on 
national context. However, the general hypoth- 
esis should be applicable to all capitalist 
democracies. 

In U.S. state structure, unhappy regulated 
parties seek relief through litigation and by 
attempting the influence enforcement through 
Congress or the executive branch (e.g., Szasz 
1986). Mobilizing support from the press, the 
public, members of Congress, and the 
President takes cultural as well as material 
resources. Thus, by influencing the distribu- 
tion of resources between regulatory agency 
and regulated parties, variable regulatory 
conformity to the traditional values of Amer- 
ican capitalism likely is significant. To the 
extent that regulatory activity deviates from 
traditional values, regulated parties, including 
employers, will have an increasingly effective 
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value tool for self-mobilization and the 
forging of broad antiregulatory coalitions. 
Conversely, the agency and its social scien- 
tists increase their ability to withstand pres- 
sure from regulated parties to the extent that 
regulatory action conforms with traditional 
values. 

Traditional values are differentially avail- 
able to regulatory agency or regulated parties 
depending on the content of regulatory 
statutes and enforcement. For example, in 
contrast to the NLRA, antitrust law elevates 
the principle of competitive markets and 
competitive-individualism to law (Fine 1956; 
Hawley 1966). Antidiscrimination law prior 
to affirmative action enshrined competitive-in- 
dividualism in law, attempting to create a 
labor market allocating jobs by universalistic 
criteria. Even after affirmative action, the 
conflict most implicated by EEO law is not 
class based, but is based on race or gender. 

Fourth, variation in  technocratization 
should be related to variation in internal 
agency conflict. This too should hold in all 
capitalist democracies. Both case analysis and 
organizational theory alert us.to the import of 
internal agency politics. The lawyer- 
economist conflict at the Board reflected 
interest group politics in which divergent 
professional socialization and training created 
intra-agency cleavage. Consistent with a 
concept of technocratization stressing compet- 
ing principles of state action, both lawyers 
and social scientists jockeyed on behalf of 
their characteristic rationality. Though further 
intensification of NLRB professional cleav- 
ages via overlay with differences of political 
ideology may be relatively unique, such 
professional cleavages themselves are com- 
monplace in state agencies (e.g., Melnick 
1983; Katzman 1980). 

From the point of view of the agency trying 
to manage its external environment, organiza- 
tional solidarity is a resource that can be 
mobilized in response to political attack. In 
contrast, manifest conflict between agency 
lawyers and social scientists can increase the 
resources available to agency opponents. As 
in the NLRB case, intense intra-agency 
conflict likely spills over to the wider political 
arena. If conflict is over the enforcement role 
played by social science expertise, agency 
opponents may likely infer that social scien- 
tists constitute a regulatory weak link. 

If potential conflict between agency law- 
yers and social scientists remains latent, this 
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should provide social scientists with a re- 
source when under political attack. Low 
intensity conflict may eradicate that resource 
but is unlikely to elicit signals that encourage 
agency opponents to target social scientists. 
Though “spillover” from intense internal 
conflict of any sort is likely, intense conflict 
over the function of social scientists seems 
most likely to guarantee political attack on an 
entire social science unit. Likewise, once a 
political attack is launched, diminished tech- 
nocratization should be most likely when 
internal agency conflict is intense and in- 
volves professional cleavages over the regula- 
tory role of social science. 

Theoretical specification of the relationship 
between intra-agency conflict and relevant 
agency environmental factors remains to 
future historical-comparative research. More- 
over, because the class content attached to 
hypotheses 2 and 3 may be controversial, it 
also is important that future research gather 


evidence relevant to those hypotheses and to: 


the way in which the class organization/ 
disorganization and value factors may be 
related. 

Finally, the NLRB case may not capture all 
explanatory factors relevant to constructing a 
theory of technocratization and its limits in 
capitalist states. For example, the longer a 
social science unit has been part of an agency, 
the more difficult it may be to dislodge. Or, 
institutional factors involved in the formation 
of new agencies may affect the resources 
available to professional groups under attack. 
The early Federal Trade Commission incorpo- 
rated the Bureau of Corporations and that 
Bureau's general investigative function (FTC 
Annual Report 1916). FTC economists may 
have drawn legitimacy from this. But NLRB 
economists could not draw legitimacy from 
this. But NLRB economists could not draw 
legitimacy from incorporating the Bureau of 
Labor Statistics’ function, because the BLS 
remained separate in structure and function 
from the NLRB. Apart from intra-agency 
conflict, but consistent with organizational 
approaches, future research should gather 
comparative data on agency and agency social 
scientists’ responses to attack. This would 
allow systematic examination of possible 
survival tactics and their outcomes. 

The construction of hypotheses based on 
one case, no matter how strategic, must be 
tentative. The’ next step is to subject the 
suggested hypotheses to additional historical 


AMERICAN SOCIOLOGICAL REVIEW 


cases for further specification. Ideally, future 
comparative work will involve other U.S. 
agencies and agencies in other countries. 


ROBIN STRYKER is Assistant Professor of 
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structing a theory of state reliance on science. 
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GENERATIONS AND COLLECTIVE MEMORIES* 
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A national sample of adult Americans was asked to report “the national or world 
events or changes over the past 50 years” that seemed to them especially 
important, and then to explain the reasons for their choices. The resulting data are 
used both quantitatively and qualitatively to explore hypotheses related to 
generational effects, life course, and collective memory. Broadly speaking, 
different cohorts recall different events or changes, and these memories come 
especially from adolescence and early adulthood. The reasons for mentioning 
various events and changes also differ across cohorts in ways that indicate that 
generational effects are the result of the intersection of personal and national 


history. 


That each generation receives a distinctive 
imprint from the social and political events of 
its youth is an old idea, most often associated 
today with the name of Karl Mannheim. 
Mannheim ([1928] 1952) did not define 
generation with any precision, and in fact he 
emphasized that a generation is a social 
creation rather than a biological necessity. 
Where novel events are rare and change is 
slow, as in traditional peasant societies, 
distinct generations may not appear (1952, p. 
309). Only where events occur in such a 
manner as to demarcate a cohort in terms of 
its “historical-social” consciousness, should 
we speak of a true generation.! 
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variables, with documentation) can be obtained 
from the Inter-University Consortium for Political 
and Social Research, P.O. Box 1248, University 
Of Michigan, Institute for Social Research, Ann 
Arbor, MI 48106. 

1 We will use the term cohort descriptively to 
refer to "the aggregate of individuals (within some 
population definition) who experienced the same 
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Although Mannheim emphasized the so- 
cially constructed nature of a generation, 
implicit in his discussion was also the concept 
of "life course." His essay is not entirely 
clear on the point at which a cohort begins to 
develop a unique generational character, but 
he seems to specify "the age of 17, 
sometimes a little earlier and sometimes a 
little later" (Mannheim 1952, p. 300). He 
further suggests that age 25 may well mark 
the terminal point of major generational 
formation. Thus Mannheim assumed, as have 
almost all later writers, that "late adolescence 
and early adulthood are the formative years 
during which a distinctive personal outlook 
on politics emerges" (Rintala 1968, p. 93). 

Subsequent interest in the generational 
concept has been motivated largely by a belief 
in its potential explanatory power for under- 
standing individual and collective political 
behavior. In simplest terms, the generational 
character created by the events a cohort 
experiences during its youth is assumed to 
exert an important, even decisive, influence 


event within the same time interval" (Ryder 1965, 
p. 845), but treat cohort effect and generation as 
having the additional implication of long-lasting, if 
not permanent, change in the cohort. In Mannhe- 
im's usage, generation has the still further 
implication of ideological distinctiveness, whereas 
a cohort effect can include social or demographic 
changes that may or may not have direct 
ideological effects (e.g., see the illustrations 
offered by Riley, Foner, and Waring 1988, pp. 
256-59). Generation is sometimes: limited to 
lineage relations, but in this paper the wider 
meaning of the term is employed; see Cutler 1977, 
and for a concise discussion of this and other 
relevant distinctions, Knoke 1984. 
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on the later attitudes and actions of its 
members. An analogy to effects from social 
class was developed by Mannheim and is 
often assumed by other writers attracted to the 
generational concept (e.g., Berger 1971). 
Using generation as a variable to predict 
future behavior has met with mixed success in 
systematic, as distinct from anecdotal, re- 
ports. When samples are drawn of very 
specific generational activist groups—similar 
to but more narrowly defined than Mann- 
heim’s “generational units,” which were 
conceived as two or more antagonistic 
political movements—they do seem to carry 
some continuity of thought and action (e.g., 
Jennings 1987; Marwell, Aiken, and Deme- 
rath 1987). However, when cohorts are 
identified across a broader population, there 
is little evidence that past experiences have 
permanent ideological effects on later politi- 
cal attitudes or actions (e.g., Barnes 1972; 
Converse 1987; Holsti and Rosenau 1980; 
Weil 1987). Even when current political 
attitudes or behavior can be traced to a 
particular past period, the connection often 
extends over the entire population, or over a 
subpopulation defined in other than cohort 
terms (e.g., by race or gender or religion). 
However, the attempt to go directly from 
the formal delineation of cohorts in terms of 
age to the prediction of later behavior skips an 


important step: that of identifying what earlier . 


experience is carried forward in memory by a 
particular cohort. The possibility of uncon- 
scious cohort effects must also be admitted, 
and in fact Mannheim's discussion of the 
“natural world" of early childhood would 
allow for subtle cohort effects on taken-for- 
granted elemental social behaviors (e.g., the 
type described by Elias [1939] 1978). But the 
generational hypothesis emphasized by Mann- 
heim and most others, and the one explored 
here, is of experience from adolescence and 
early adulthood that is carried forward with at 
least some self-awareness. Thus the belief 
that the Depression generation will be frugal 
or the Vietnam generation wary of foreign 
military intervention usually assumes memo- 
ries that are at least partly conscious. 

In our research we focused directly on the 
formative events and changes that cohorts are 
assumed to have experienced in the past and 
to carry with them into the present. We ask 
whether there are memories that distinguish 
one cohort from another, and whether cohorts 
defined initially in arbitrary age terms can be 
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redefined generationally by the qualitatively 
distinct events and changes that are currently 
predominant in memory (Rosow 1978). This 
then is a study of the collective political and 
societal memories of a cross-section sample 
of the American population, and our initial 
hypothesis is that these memories will be 
structured along the age dimension in ways 
that point to important cohort effects. We will 
also explore possible interactions between 
generational effects and major stratification 
variables (education, race, and gender), since 
it is possible that effects are limited to or 
magnified in certain social divisions of the 
population (Rosow 1978). 

Operationally, we work backwards and 
first determine in an open-ended way the 
main "national or world events or changes" 
remembered as important from the past 50 
years by the American population. Next we 
consider whether the age patterning of 
particular memories is consistent with the 
hypothesis of generational effects and, if so, 
whether the patterning points to one or more 
age ranges where there is maximum impact of 
events. Finally, as explained further below, 
we attempt to go beyond the nominal events 
and changes that are remembered to their 
meanings for the individuals holding them, in 
order to determine whether these meanings 
are also connected to cohort. In all of these 
inquires, we deliberately focus on the half 
century preceding our date of inquiry (1985), 
so that we have a period that coincides with 
the lives of a large part of the population, but 
also includes years that preceded the lives of a 
significant part of the sample as well (i.e., 
those younger than 50 will not have been 
alive during part of the period). 

A more specific hypothesis tested is that 
the events and changes that have maximum 
impact in terms of memorableness occur 
during a cohort's adolescence and young 
adulthood, often referred to as "youth." 
There are three types of consideration that 
point to this period as the critical one for 
generational effects. First and most obvi- 
ously, we assume that people will tend not to 
recall as important those events and changes 
that preceded their own lifetime. As Mann- 
heim ([1928] 1952, p. 296) stated it, only 
knowledge "personally gained in real situa- 
tions . . Sticks," and thus even very 
important political events and changes that 
preceded one's life should not have registered 
very clearly. Halbwachs ([1950] 1980) makes 
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a similar point in distinguishing between 
autobiographical and historical memory, the 
former richer and more personally meaningful 
than the latter, and recent evidence on the 
attitude-behavior connection provides further 
indirect support for the importance of direct 
experience (Fazio 1985). 

Second, although’ by elementary school 
age, children are fairly well developed in 
terms of language and other abilities, there is 
considerable evidence that they have only a 
very superficial grasp of the political world 
beyond their family and other personal 
relationships (Gallatin 1980). Apparently an 
awareness of larger political events and 
changes does not appear until early adoles- 
cence (Sigel and Hoskin 1977). The extent to 
which this is largely a result of intellectual 
maturation (e.g., Inhelder and Piaget 1958), 
of psychosocial development of personal 
identities (Erikson 1968), or of changes in 
social expectations and opportunities (White 
and Siegel 1984), is a matter of dispute; but 
consistent with Mannheim’s assumption, de- 
velopmental psychologists view youth as a 
kind of “critical period” for learning about 
the larger society, almost in the same sense 
that earlier years are critical for other 
developmental tasks, for example, the acqui- 
sition of language (Braungart 1984). 

Third, we expect that most people will tend 
not to recall as important those events and 
changes that occur after their early adulthood. 
The events that register most strongly during 
adolescence and early adulthood have the 
great advantage of primacy, and research in 
experimental settings suggests that primacy 
effects are especially strong on impression 
formation (Markus and Zajonc 1985), More- 
over, primacy should be much more impor- 
tant in the life course than in standard 
laboratory experiments, since striking politi- 
cal events that occur during adolescence or 
early adulthood are primary in a more 
fundamental sense, both disrupting the taken- 
for-granted natural world of childhood and 
providing an induction into the larger political 
and social world. In Mannheim’s view there 
is a “fresh” encounter with the political world 
at that point that can seldom be duplicated in 
later life. More dramatically, Davis (1979, p. 
102) speaks of noteworthy historic events 
during adolescence that rip “the larger 
existential fabric of our being-in-the-world,” 
and thus leave an indelible impression in 
memory. (Recency effects also tend to occur 
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with memory, and we attempt to assess this 
possibility as well in our research design—see 
below, note 2.) 

The hypotheses stated thus far focus on the 
importance of a critical: period in the life 
course for creating the possibility of genera- 
tional effects. However, both the “objective” 
importance of events and the particular 
interests of individuals are likely to modify 
life course factors and lead to some events 
being seen as important by persons who were 
not adolescents or:young adults at the time. 
For example, the magnitude of World War II, 
even though learned secondhand from books 
or parents, may compete strongly with 
youthful experience during the Vietnam War 
in determining memorableness. However, 
even in cases where the surface memory of an 
event does not vary by age, we hypothesize 
that the meaning of the event will neverthe- 
less be different for different cohorts. As 
Mannheim suggested, those from an older 
generation are likely to interpret the event in 
terms of their previously well developed view 
of the world. Likewise, those too young to 
have experienced the event firsthand should 
interpret it in terms of the world they 
themselves experienced during their own 
adolescence. Thus, we are interested not only 
in collective memories of events and changes, 
but also in the interpretive content of those 
memories as a further possible factor in 
generational differences. 

In summary, our paper reports investiga- 
tions of three closely connected hypotbeses. 
First, after identifying in an open-ended 
manner major collective memories about the 
past half century, we test whether these 
memories are structured by age in a way that 
points clearly to generational differences. 
Second, if such generational effects occur, we 
ask whether they fit the model of adolescence 
and early adulthood as the primary source of 
political and social memories. Third, we 
compare the meaning of such youthful 
memories with those that occur from other 
periods of the life course, with the expecta- 
tion that even the latter memories can best be 
understood by taking into account the adoles- 
cent and early -adult experiences of the 
rememberers. 

In describing our investigation as one 
dealing with "collective memories," we make 
use of a term advanced by Halbwachs ([1950] 
1980) to describe memories of a shared past 
that are retained by members of a group, large 
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or small, that experienced it. The concept is 
both suggestive and difficult to specify 
clearly. Initially, we use it to refer to shared 
memories of societal-level events, but in our 
concluding section we draw on our results to 
distinguish among several possible meanings 
of “collective memories.” We will also at 
that point discuss what our findings about 
memories of the past suggest about the value 
of distinguishing "generations" in order to 
predict current and future social actions. 


METHOD AND DATA 


We pursued these ideas in 1985 ‘by asking a 
probability sample of 1,410 Americans, 18 
years and older, to think of “national or world 
events or changes” that have occurred over 
the past 50 years and to name "one or two 
. . . that seem to you to have been especially 
important." Both "events" and "changes" 
were always mentioned together, since we 
wished to include both time-bound occur- 
rences and more general social movements or 
changes, and each respondent was encour- 
aged to give two such events or changes. (The 
full question is presented in the footnote to 
Table 1.) We then asked each respondent why 
he or she chose each event or change, as 
discussed at a later period.? 

In evaluating our hypotheses we draw on 
graphic presentations of memories by age 
(and implicitly by cohort), on logistic analy- 
ses using age and three stratification variables 
(education, race, and gender) for purposes of 


? Our questions were asked as part of a 
random-digit-dial telephone survey during four 
months (April, May, August, September) of 1985. 
The break between May and August was deliber- 
ate, in order to determine whether either recent 
events or commemorations of earlier events 
affected our findings about the past; small but 
statistically significant increases occur for two 
categories (civil rights and nuclear weapons), but 
in these and all other cases the results for relations 
to age are similar in the two seasons. (The increase 
for nuclear weapons was probably due to media 
commemorations of the atomic bombing of 
Hiroshima 40 years earlier on August 6; no similar 
explanation is apparent for the rise in civil rights 
mentions.) The main response rate for the four 
telephone surveys was 71 percent and another 4 
percent broke off the interview before reaching our 
20-minute set of questions, which occurred in the 
middle of a longer questionnaire dealing with 
consumer attitudes unrelated to our section. 
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control, comparison, and tests of interactions, 
and on content analysis of explanations of 
memories, which can help us see more clearly 
the different perspectives on the past that 
people have. 


MAJOR EVENTS AND CHANGES 
MENTIONED AS IMPORTANT 


The half century marked off by the period of 
approximately 1930 to 1985 was a momen- 
tous one for the United States. It included 
three major wars, important movements to 
effect changes in race and gender relations, 
the development of nuclear weapons, re- 
peated acts of political terrorism abroad and 
assassination at home, and . . . . One could go 
on and on. Certainly it was a half century full 
of both specific events and broader changes 
that might be remembered by Americans. Our 
initial interest is in which ones are remem- 
bered as important. (Although we will use 
words like "recall" and "remember," some 
respondents who mention an early event or 
change will be too young to have experienced 
it personally; what is presumably remembered 
is having heard or read about it.) 

World War II and Vietnam are clearly the 
most frequently recalled events or changes 
from the past five decades, as shown in Table 
1, with 29 percent and 22 percent, respec- 
tively, mentioning one or the other of these 
two wars.? Along with the two wars, we will 
include in our primary analysis the next listed 
10 events and changes as well. Six of these 
were mentioned by more than 5 percent of the 
sample: space exploration, the assassination 
of John F. Kennedy, civil rights, the nuclear 
threat, advances in communication and trans- 


? The base for percentages in the first column in 
Table 1 is the 1,253 respondents who mentioned at 
least one event or change in answer to our 
question. (We deal below with the 157 respondents 
included in our sample who were unable to 
mention any event or change but who answered 
other questions that were part of our study.) In the 
last column the same base of 1,253 is used, but the 
percentage in each row is for those who mentioned 
each event, whether as a first mention or a second 
mention, as against all those who did not mention 
the event. For example, 29.3 percent of the 1.253 
respondents mentioned World War II and 70.7 
percent did not mention it. The percentages in this 
last column are not mutually exclusive, since those 
who gave two responses appear in two categories. 
Thus the column does not add to 100 percent. 
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Table 1. Highly Mentioned Events and Changes 




















Number Number % of 
% Ist Ist 2nd Combined Respondents 
Event Mention Mention Mention Number Mentioning* 
* World War H 21.3% 267 100 367 29.3 
* Vietnam War 11.6 145 131 276 22.0 
* Space exploration 7.4 93 66 159 12.7 
* Kennedy assassination 5.0 63 48 111 8.8 
* Civil rights 6.1 TI 30 107 8.5 
* Nuclear war, threat of 4.5 56 ' 42 98 7.8 
* Communication/transportation 3.7 46 31 TI 6.1 
* Depression 4.6 58 12 70 5.6 
* Computers 1.8 23 26 49 3.9 
* Terrorism 1.5 19 24 43 3.4 
* Moral decline 2.9 28 13 41 3.3 
* Women's rights 1.6 20 17 37 3.0 
Reagan's presidency 1.6 20 15 35 2.8 
Nixon (Watergate) 0.7 9 26 35 2.8 
Inflation 1.6 20 14 34 2.7 
Medical advances 1.1 14 19 33 ^ 2.6 
Social security 1.8 23 6 29 2.3 
Nuclear power 1.2 15 14 29 2.3 
Unemployment problems 1.2 15 11 26 2.1 
Korean War 1.0 12 14 26 2.1 
War in general 1.0 13 11 24 1.9 
Soviet Union, Cold War 1.0 12 9 21 1.7 
World hunger 1.1 14 6 20 1.6 
Education, better/worse 1.0 12 4 16 1.3 
F.D. Roosevelt 0.9 11 5 16 1.3 
Israel, creation of 0.7 9 4 13 1.0 
Peace movement 0.6 8 5 13 1.0 
Farm problems 0.6 7 5 12 1.0 
Middle East 0.4 5 7 12 1.0 
Central America 0.6 7. 4 1l 9 
Economic improvement 0.6 7 3 10 8 
Immigration 0.5 6 4 10 8 
Miscellaneous 9.5 119 96 201 16.0 
Total 100.0% 1253 840 — — 
Base N (1253) (1253) 


* Major events and changes. 


* Each row represents a dichotomy of those mentioning the event at all divided by the total (N — 1253) mentioning 


any event. 
Event/Change Question 


The next questions concern how people think about the past. There have been a lot of national and world events and 
changes over the past 50 years —say, from about 1930 right up until today. Would you mention one or two such events 
or changes that seem to you to have been especially important. There aren't any right or wrong answers to the 
question —just whatever national or world events or changes over the past 50 years that come to mind as important to 
you. 

(IF ONLY ONE MENTION, ASK: Is there any other national or world event or change over the past 50 years that you 


feel was especially important?) 


portation, and the 1930s Depression. In 
addition, we include the next four most 
frequent mentions; although not quite making 
the 5 percent threshold, they are close to it 
and all are of considerable intrinsic interest— 
the development of the computer, terrorism 
(mainly the 1979 hostage-taking in Iran), 
moral decline, and women's rights. 
Altogether, 82 percent of the 1253 respon- 
dents who were able to mention any event or 


change mentioned at least one that fell into 
these 12 categories. We shall therefore 
concentrate our analysis on these “major” 
categories, which range from time-bound 
events such as World War Il to broader 
changes like civil rights that are difficult to 
describe in terms of precise dates. Since there 
is a high correlation between an event or 
change being mentioned at all and its being 
mentioned first, our analysis will treat each 
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major category dichotomously: mentioned at 


all or not mentioned. Other analysis using. 


only first mentions produces results for age 
generally quite consistent with what will be 
presented here (see Appendix A for first 
mentions by age). 

The 12 major events and changes are listed 
in Table 1 in a straightforward way, but 
judgment is necessarily involved in creating 
and using such categories. Even a well- 
demarcated "event" such as World War II 
consisted of a complex series of more specific 
events (the attack on Pearl Harbor, the 
invasion of North Africa, the surrender of 
Germany, etc.), and the placement of these 
under the label “World War II" is an act of 
conceptualization, since historical reality is an 
undivided stream. In the case of World War 
Ii, the conceptualization is provided by the 
larger culture, and a reexamination of the 367 
respondents coded into that category shows 
that 310 answered by using the exact words 
“World War IL." However, we have also 
included within the category respondents 
whose answers referred to “Pearl Harbor," to 
“the end of World War II," or in a few cases 
to some other event that seemed to fit best 
there. Thus, even for the simplest event 
categories, some judgments were needed both 
in creating the code and in actual coding. 

Other categories in Table 1 are less tightly 
constrained by specific dates and cultural 
definitions; for example, Women's Rights 
includes positive responses that refer to 
greater employment opportunities for women, 
positive references to the Equal Rights 
Amendment (ERA), mentions of the Wo- 
men's Movement, and similar answers. In 
this case, more conceptualization was re- 
quired on our part than was true for World 
War II. In our subsequent analysis we have 
tried to remain sensitive to variation within, 
as well as between, categories, and at points 
we will note tests carried out to make certain 
that the labels in Table 1 do not become so 
reified as to prevent discovery of important 
relationships at other levels of conceptu- 
alization.* 


* Coding was done by professional coders 
unaware of the specific hypotheses of the study. 
For the events and changes question, agreement 
between original and check-coders averaged 95 
percent for this question. For the more complex 
coding of reasons why the events or changes were 
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Other events and changes. Before focusing 
on the 12 major categories, we briefly note 
three types of response that are not included 
among them. First, in Table 1 we show a 
number of specific categories that fall below 
the threshold for a major event or change yet 
are mentioned by at least 10 persons—the 
figure we used to admit a specific category 
into Table 1. None of these other categories 
seems frivolous or irrelevant to the question, 
and we omit them from our analysis because 
they are small in frequency and the task of 
dealing separately with each would become 
overwhelming. 

Gathered together under “Miscellaneous” 
at the bottom of Table 1 are all other 
substantive responses given by those who 
answered the event/change question. These 
responses range from some not very different 
from the categories already included in the 
table (e.g., the Oil Crisis with nine responses 
just misses the threshold for separate listing) 
to a variety of more specialized or exotic 
answers (e.g., “Alaska became a state,” 
“Cincinnati Reds won the World Series,” 
“improvements in water and sewage,” “Rock 
’n Roll"). Such responses serve as a reminder 
that a random sample of Americans yields a 
vast variety of memories and concerns, and 
the last 50 years is viewed by members of the 
population in many diverse ways. 

A third grouping that requires note is not 
represented at all in Table 1: the 157 
individuals out of our total sample of 1,410 
who answered other questions in our study 
but were unable (or conceivably unwilling) to 
mention even one event or change over the 
past 50 years that seemed important to them. 
By far the most powerful background factor 
that accounts for the lack of historical 
memory is education. Most of the people with 
no apparent memory of events or changes are 
located among those without college educa- 
tion: 17 percent of those with no college 
experience, as against 4 percent of those with 
any college experience at all.5 


chosen, discussed at a later point, coder/check- 
coder agreement averaged 74 percent. 

5 When education, age, race, and sex are 
included in a single logit analysis of failure to 
mention any event or change, education remains 
the clearest explanatory factor (18.11, p<.001), 
age shows no relation, and race (blacks) and sex 
(women) are both significantly related (p<.05) to 
not mentioning any event or change. 
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GENERATIONAL EFFECTS 


Our first interest is in determining whether 
generation, operationalized in the form of age 
categories, helps explain mentions of events 
and changes as important in-response to our 
initial question. The simplest form of the 
generational hypothesis—that people of all 
ages will tend to report events and changes 
from their youth—is supported remarkably 
well for the majority of the 12 major events 
and changes, as will be seen in Figures 1 to 5, 
to be discussed in detail. (The exact percent- 
ages that are graphed in Figures 1 to 5 are 
available on request.) The figures present 
bivariate relations between age and each 
major category, but the relations have also 
been tested using logistic regressions that 
included education, gender, and race; the 
results from such tests are reported in Table 
2, both for control purposes and in order to 
provide comparisons of the sizes of the age 
effects with those attributable to the other 
three variables. In addition, since for events 
that occurred toward the middle of the 
50-year period, the generational hypothesis 
about youth predicts curvilinearity, we regu- 
larly included in the logistic analysis a 
quadratic term for age. In no case do the 
controls for education, gender, and race alter 
substantially the main effect of age, and for 
most events on which age has a significant 
effect at all, it is the strongest of the 
predictors, often with both linear and curvilin- 
ear trends registering as significant. We also 
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explored interactions between age and educa- 


tion, gender, and race; the four such interac- 


tions discovered will be noted at later points. 

Wars. We begin with two major wars in 
which the United States has been engaged 
over the past 50 years. Figure 1 shows that 
nominations of World War II as especially 
important are relatively high and sharply 
demarcated among those in their 50s and 60s 
in 1985. Nominations of the Vietnam War are 
high among those 18 to about 44, and 
especially among those in their 30s and early 
40s, but decline rapidly at later ages. 

If we transpose the present peak ages to the 
ages of the respondents at the beginning and 
end of each war, we find that the highest 
proportions of mentions occur as follows: 


Beginning End 
World War 11 (1941-1945): 16-20 20-24 
Vietnam War (1965-1973): 15-19 23-27. 


There is the clear beginning date of 1941 for 
World War II, and we use 1965 as the year in 
which the Vietnam War “could be considered 
as having started as far as the American 
public was concerned” (Mueller 1973, p. 37). 
The transposed peaks of 16 to 24 for World 
War If and of 15 to 27 for Vietnam are 
remarkably close to the 17 to 25 age range 
identified by Mannheim as critical for gener- 
ational formation. 

Whatever credence we give to these exact 
estimates, it is apparent from Figure 1 that 
memories are strongest for those in their 


Table 2. Relations of Major Event/Change Categories and Age, Education, Gender, and Race: Significant t-Ratios* 


Comm. 

World Viet- Civil Ken-  Nu- &  Depres- Compu- Moral Terror- Women’s 

Wari nam Space Rights nedy clear Transp. sion ters Decline ism Rights 
Education 5.63 (1.89) — 2.35 — — — 2.00 274 — — — 
Gender —3.12 — — — 2.27 -2.30 — — — — — 3.66 
Race —3.66 —3.70 (—1.90) 10.99 — — — — — — — — 
Age 
(linear) 6.83 —8.26 — —> -3.65 —2.93 3.52 4.63 — —  —408 -241r 
Age 
(squared) (—1.68) —3.82 — -4$ —452 — — 2.22 — = (1.75) — 


* Based on logistic analysis of each major event or change using four predictors: age (6 categories), education (6 
categories), gender (1 Men, 2= Women), race (1 = White, 2= Black). The cell figures are statistically significant 
(p.05) r-ratios (coefficient/standard error), with those in parentheses of borderline significance (.10>p>.05). Each 
analysis was done with and without an additional term for age squared to test for curvilinearity; if the age-squared term 
was not significant, results are shown only for the model omitting it. The sample size for these analyses was 1165, a 


number smaller than that for Table 1 because only whites 
two-tailed statistical significance levels for this table are: f= 
p<.001. 


and blacks are included for the race variable. Nominal 
1.64, p<.10; t= 1.96, p<.05; r= 2.58, p<.01; t=3.29, 


> The ratios for age for blacks and whites separately on Civil Rights are 2.42 and .27, respectively. 
* The i-ratios for men and women separately on Women's Rights are —.48 and — 2.06, respectively. 
* The t-ratio for age squared for blacks on Civil Rights is — 1.89. 
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(407) (182) (143) (174) (009) (90 (90) (77) (91) (76) (62) (49) 
Age 

===- Vietnam —— World War Il 


Figures in parentheses are base N's. 


Fig. 1. World War H and Vietnam Mentions by Age 


youth at the time of the event. This finding 
arises, however, from two different sources. 
On the one hand, those now too young to 
have directly experienced a war during their 
adolescence (below age 50 in the case of 
World War II) are less likely to mention the 
event, presumably because it was simply not 
part of the world they knew personally. On 
the other hand, those who were beyond their 
youth at the time of an event —beyond present 
age 45 with reference to the Vietnam War— 
are also less likely to mention it, probably 
because it is overshadowed by earlier events 
that dominate their memories. We should 
emphasize that since respondents were encour- 
aged to name two events or changes, the fact 
that a person mentioned World War II in no 
way precluded their mentioning Vietnam 
also. Yet of those 50 to 74 years in age who 
gave World War II and who gave one other 
event/change, only 11 percent mentioned 
Vietnam, a percentage essentially the same as 
the figure (12 percent) for those of the same 
age range who did not give World War II as 
one of their two mentions. Thus, it is not only 
that older people tend to remember World 
War II, but also that such people tend nor to 
mention a war that occurred after their youth. 

Other clearly datable events. Three other 
clearly datable categories are the Depression, 


the assassination of John F. Kennedy in 1963, 
and terrorism (consisting almost entirely of 
responses about the 1979 Iran hostage-taking 
and subsequent terrorist incidents). All three 
of these events show clear relations to age, as 
indicated in Figure 2.5 

Not many respondents mention the Depres- 
sion, but the modal age of those who do is in: 
the 70s and over category, so that these 
people are even older than those mentioning 
World War II, as they should be according to 
the generational hypothesis. (However, be- 
cause of small samples at the oldest ages, it is 
not practical to identify a highest age range 
for mentions in this case.) 

John Kennedy's assassination is given 
especially by people now in their late 30s and 
their 40s, which means teens to early and 
mid-20s in 1963 when the assassination 
occurred. One might also expect still younger 
persons to mention this particular event 
because recognition and idealization of the 
president appears earlier than other political 
awareness (Greenstein 1965), and in addition 


Note that the scale used for the ordinate in 
Figure 2 differs from that in Figure 1. We have set 
the scale to a 20 percent maximum for most of the 
figures, but Figures 1 and 4 require higher 
maximums. 
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Fig. 2. Other Datable Events by Age: Depression, JFK's Death, and Terrorism 


television brought the dramatic impact of 
Kennedy's assassination directly into most 
American homes. Although the peak age of 
mentions is by individuals who were 15 in 
1963, persons then 8 to 12 years old are also 
relatively high in naming the assassination as 
important. 

Finally, terrorist incidents, which captured 
public .attention in late 1979 with the 
hostage-taking in Iran and continued into the 
1980s, are mentioned most often by the 
youngest members of our sample—that is, by 
those who had not experienced most of the 
other major events and changes but who were 
at least entering their teens when the Iran 
hostage incident began. In sum, these three 
events, like the two major wars, are recalled 
most readily by those in a narrow age band of 
teens to middle 20s when each occurred. 

Broader changes. Two changes less easy to 
connect with specific dates but nevertheless 
amenable to age-related interpretation are 
shown in Figure 3. First, advances in 
communication and transportation include 
mentions such as the development of radio 
and television and of the jet airplane. Not 
surprisingly, these are reported as important 
changes disproportionately by older Ameri- 
cans who witnessed such extraordinary ad- 
vances in their own lifetimes. For younger 
Americans, television and jet planes are part 
of their natural world and not something to be 
remarked on. 


One might have expected a somewhat 
similar relation of age to the category 
“nuclear war,” since the initial impact of the 
atomic and hydrogen bomb developments go 
back to 1945 and 1950, respectively, and the 
1950 date was also caught up in growing 
U.S./Soviet hostilities. However, Figure 3 
shows almost the opposite: responses about 
nuclear war, which include both nuclear 
weapon developments and nuclear disarma- 
ment talks (but not nuclear power), are 
associated with younger ages. Our interpreta- 
tion here cannot be as straightforward in 
terms of simple age-related experience as for 
previous categories, but it is likely that a 
general increase in concern over nuclear 
destruction over the recent past is responsible 
in part for this age effect. Various antinuclear 
movements gained considerable ground in the 
years just preceding 1985, and these both 
result from and have further raised national 
consciousness on the nuclear issue. Thus, to 
the extent that the issue has been reborn as a 
new one, like all new issues it has impressed 
the young most of all, if only because it does 
not compete in their minds with “old” events 
and changes. From this perspective, mentions 
of nuclear weapons are similar to mentions of 
terrorism. In addition, however, it seems 
possible that there is a life cycle effect also, in 
that it is the young who will be particularly 
anxious about growing up in a world, and 
bringing children into a world, that may be 
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destroyed by nuclear war. Older people are 
probably less concerned about such apocalyp- 
tic visions because their worries center on 
aging and illness, the loss of loved ones, and 
other more personal harbingers of mortality 
(Bengston and Black 1973).7 

Civil rights and women's rights. For these 
two social changes, it is only the group most 
directly affected — blacks and women, respec- 
tively — who show a clear relation to age (see 
Figures 4 and 5, which use only six age 
breaks because of the smaller Ns when racial 
and gender subsamples are graphed). On civil 
rights, it is older blacks who are most likely 
to offer civil rights as important (although not 
the very oldest if the small case base of blacks 
70 and over can be trusted). The drop among 
younger blacks can readily be attributed to the 
lack of recent civil rights activities, since they 


7 A Gallup Poll (1982, p. 168) found a large age 
difference in 1981 in the percentage of Americans 
favoring the proposal that "every new house in the 
United States be required to have a bomb shelter, 
with the federal government paying most of the 
costs": 47 percent of those 18-29, 37 percent of 
those 30-49, and 27 percent of those 50 and over. 
Similar age trends occur in Gallup’s “most 
important problem" question for the code category 
"fear of war/international tensions" (The Gallup 
Report, 1987, 260, pp. 6-7). 


were too young to experience the height of 
such activity in the 1954-1965 period. (The 
generally high levels of mentions among 
blacks at all ages no doubt reflect the breadth 
of impact across the black population of the 
civil rights movement.) It is difficult to 
explain the null age relationship for whites, 
especially since the proportion of whites 
mentioning civil rights (5.2 percent) would 
place it fairly high in Table 1 even with black 
respondents omitted. We have no satisfactory 
interpretation of the puzzling absence of an 
age trend in this case. 

In the case of women's rights, the number of 
mentions by men is so small (n= 5) that the 
lack of an age relation does not merit specula- 
tion. The number of mentions by women is also 
not large (n — 32), but there is a marginally sig- 
nificant relation to age (p<.05), with younger 
women more likely to mention the women’s 
movement or associated responses, readily un- 
derstandable in terms of its more recent origin 
as compared with the civil rights movement. 
The unusual nonlinearity beyond age 49 must be 
interpreted with caution (neither a quadratic nor 
a cubic term is significant in the logistic regres- 
sion); however, it is worth noting that women 
now in their 60s were in their 20s during Worid 
War II, a point at which women were briefly 
encouraged to move into male occupations, 
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Space exploration and the development of 
computers, Next we turn to two types of 
scientific change that might have been 
expected to be related to age—especially for 
those who associate technological innovation 
with youth (e.g., Ryder 1965)—but are not. 
(The absence of relations to age holds not 
only for the sample as a whole, but for such 
major sample components as high- and 
low-educated respondents.) We thought that 
in the case of space exploration this might be 
due to the heterogeneous content of the 
category, and therefore carried out a separate 
analysis that focused on only those respon- 
dents (49 out of 159) who specifically 
mentioned man’s first landing on the moon in 
1969. Despite our successful isolation of this 
specific event, as starkly dramatic on televi- 
sion as was the assassination of President 
Kennedy six years earlier, there is not a hint 
of any relation to age. 

Mentions of the computer also show no 
sign of a relation to age. In this case there is 
no single dramatic point in time, but we are 
able to separate the sample into those who, to 
a later question, report having used a 
computer in some way (41 percent of our total 
sample) and those reporting no use (59 
percent): neither set of respondents shows any 
relation of mentions of computers to age, and 
indeed there is only an insignificant trend for 
those who have used computers to mention 
them as important more frequently than do 
non-computer users. (However, there is other 
evidence that in the case of computers the 
open question may not have adequately 
encouraged spontaneous mentions; see Schu- 
man and Scott 1987.) 

The fact that so many events and changes 
are related clearly and meaningfully to age 
suggests that we ought to take seriously the 
absence of such a relationship for space 
exploration and for computers. Apparently 
people at all ages attend more or less equally 
to space exploits—the third most highly 
mentioned category in Table 1—and all ages 
must also have been influenced at least 
vaguely by the emergence on the scene of the 
computer. Perhaps for the old, these nonpoli- 
tical categories do not face interference from 
earlier events such as World War If, while at 
the same time the continuing exploits in each 
area prevent the young from taking either for 
granted. However, when we come to the 
reasons that people give for mentioning space 
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exploration, we will see that cohorts do differ 
in their perception of the same event. 

Moral decline. 'The category Moral Decline 
is different in character from all other political 
and technological changes discussed thus far. 
It covers a variety of responses referring to 
crime, abortion, drugs, sexual promiscuity, 
alcohol, loss of religion, or other similar 
problems. Since in many instances respon- 
dents mentioned several together, or them- 
selves used a general term like “moral 
decline," we created a single category with 
this label. It seemed likely that such concerns 
would be voiced more often by older or at 
least middle-aged Americans, but surpris- 
ingly, there is no sign of such a relation 
overall. A plausible interpretation is that a 
rising concern with such issues as abortion 
and drugs has affected the young in a way that 
balances the presumed tendency for older 
people to be troubled over a more general 
long-term change in values from the era in 
which they grew up. If purely present 
concerns (e.g., “too much use of drugs and 
sex”) are separated out from those that make 
an explicit contrast with the past (e.g., "the 
growing lack of religious respect among the 
younger people"), the trend is for the former 
to be given by younger respondents and the 
latter by older respondents (gammas of — .20 
and +.25, respectively, in the associations 
with age). The number of cases, however, is 
too small for this finding to be more than 
suggestive and the interaction of age by 
mention (past versus present concern) is not 


significant (p.10). 


Other Background Factors: Education, 
Gender, Race 


Our presentation has focused on the bivariate 
relations of age to memories of events and 
changes. Not only are these relations main- 
tained when education, gender, and race are 
included in the logistic regressions reported in 
Table 2, but age is the most frequent and for 
most memories the strongest predictor in the 
table. Some of the other predictors may be of 
independent interest (e.g., World War II is 
mentioned more often by highly educated 
than by less educated respondents), but 
extended comment on these taken alone is not 
relevant to our present concern with genera- 
tional effects. Furthermore, in none of the 12 
cases does education interact significantly 
with age in yielding memories of events, and 
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in only two instances each do race and gender 
produce such interactions. Only blacks show 
an age relation to mentions of civil rights, as 
noted in connection with Figure 4, and in 
addition only whites show the relation for 
mentions of Vietnam presented in Figure 1, 
perhaps because of black preoccupation with 
civil rights during much the same years. Only 
women show the age relation to women’s 
rights, as noted in connection with Figure 5; 
and moral decline is mentioned more by older 
women (gamma=.23) and by younger men 
(—.34), but this difference is too close to 
borderline in significance to justify specula- 
tive interpretation here. In sum, age is clearly 
the most general predictor of memories for 
events and changes over the past 50 years, 
and the graphing of the age relations provides 
strong evidence that in all or almost all such 
cases, age represents cohort effects, which in 
turn have their origins in adolescence and 
early adulthood. 


THE REASONS FOR MENTIONING 
EVENTS AND CHANGES 


Although our search for connections between 


cohorts and remembered events has been 
generally successful, even where such differ- 
ences are sharpest there are always a fair 
number of people outside the modal cohorts 
who nevertheless mention an event or change 
as especially important to them. For example, 
although more than 40 percent of those now 
in their 50s and 60s (and then in their teens 
and 20s) cite World War II as especially 
important, the same war is also mentioned by 
approximately 20 percent of those who were 
not even born when the war ended in 1945. 

Or is it the same war that they mention? 
The fact that these two different age groups 
cite the same event, even use the same words, 
does not mean that they think about World 
War II in the same way, or indeed that they 
“perceive” it in the same way. It is this issue 
of possible generational differences in concep- 
tion and perception that we now turn to. The 
same issue presents itself even more force- 
fully for events such as space exploration that 
do not show a generational effect in number 
of mentions. 

Immediately after our sample of Americans 
named one or preferably two events or 
changes as important, we asked them to 
explain the reasons for each of their choices: 
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What was it about that makes it seem 


especially important to you? 





We expected to find generational differences 
in the reasons given, and from these to infer 
differences in the events themselves as 
subjectively known. For earlier events, older 
people can base their choice on direct 
experience, not necessarily of some core 
event or change itself—even soldiers were not 
necessarily in the thick of battle in World War 
II—but of living through the “real time” of 
its happening. They can have what are 
correctly termed autobiographical memories 
(Halbwachs [1950] 1980; Rubin 1986). 
Younger people, on the other hand, must base 
their knowledge of earlier events on what they 
have heard or read, which may have the 
advantage of greater perspective but is less 
likely to be personal or concrete. When we 
turn to recent events, however, these should 
be seen by younger people directly and with a 
fresh eye, whereas older Americans will bring 
to the same events the world of their youth, 
with a tendency either to assimilate or to 
contrast the recent events with personal 
experience from their earlier years. This was 
the guiding hypothesis that we explored, 
though as it turns out the evidence leads 
toward more varied formulations. 

Problems in reasons analysis. The analysis 
of reasons is more difficult than the analysis 
of events themselves because we start from 
the number of people who gave each major 
event, not the number of people who gave 
any event. Even with a frequently chosen 
event like World War II, the base is only 364 
cases, and with most other events it is much 
smaller. Our discussion of reasons will 
therefore concentrate on the four events with 
at least 100 mentions — World War H, Viet- 
nam, Space Exploration, and the Kennedy 
Assassination.? 


* Although slightly more than 100 respondents 
are coded as mentioning civil rights, the need to 
analyze blacks and whites separately reduces the 
effective numbers well below 100. It is also 
necessary to limit consideration to more frequently 
mentioned "reason" code categories, and those 
categories with less than 20 responses are omitted 
from Table 3. The categories listed in Table 3 do 
capture at least one of the reasons mentioned by 
the great majority of those who chose each event 
(82 percent for WW II, 89 percent for Vietnam, 
and 88 percent for Space Exploration) The 
proportion is smaller for Kennedy's assassination 
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Table 3. Reasons Given for Event Choices by Age, Education, and Gender 


Age Categories Logistic f-ratios 
18-29 30-39 40-49 50-59 60-69 70+ Age Educ Gender 

World War IT N: (55) (70) (58 (69) (73) (37) 
% War Experience (99) 4 9 14 45 44 54 6.63*** — 1.69" = 
% Wartime Shortages (20) 2 0 12 9 4 8 133 = 2.48** 
% Lives Lost (28) 6 7 5 6 1n 5 044 —2.01** — 
96 Economic Prosperity (66) 18 26 23 20 6 14 -2ll** — = 
% Patriotism (27) 11 10 7 4 6 3  —1.74* — — 
% Winning the Good War (62) 13 30 21 12 14 8 -236** ——2418* -22]** 
% Created World 

Structure (54) 20 29 14 10 10 3 -—2.88***  404*** —3,11*** 
% Large Impact (vague) (52) 18 23 21 10 7 0 -323* — = 
JFK Assassination N: (19 (45) (30) (8) (89 (0) 
% Flashbulb and Other 

Memories (28) 26 36 20 12 0 0 -1.94* — = 
% If He Had Lived . . . (23) 26 9 2 25 50 0 177 m m: 
% Personal Grief (33) 26 31 23 50 43 0 C96 — 2.08** 
Vietnam N: (8) (109) (50 (16) (13) (5) 
% Veterans Ill-Treated (43) 2 17 8 6 8 0 -2.58*** — 2.32** 
% Know Others in War (62) 12 24 32 19 38 40 Á210* -—230*  317*** 
% War Experience (33) 7 17 16 0 0 0 -0.66 = —3.16*** 
% Division and Distrust (78) 20 34 .34 038 15 0 036 2.16**  -2.73** 
% Lives Lost (69) 28 19 26 4 15 6 062 —1.99**  2]4** 
% No Meaning (69) 28 19 25 31 38 20 035 — a 
% Didn't Win (21) 1 11 10 0 8 2 139 — m 
% Didn't Try to Win (23) 7 10 10 0 8 0 -0.26 — —2,28** 
Space Exploration N: (36) (38) (28) (25) (17) (14) 
% Emotional Awe (49) ` 14 26 29 48 53 36  290** — 2.5]** 
% Intellectual 

Excitement (61) 56 34 39 28 35 29 -230* — 2.13** 
% Nationalism (31) 2 19 2) 20 12 7 -183* zs = 
% Practical Spin-offs (58) 33 21 601 44 35 29  Á208*  3.55*** —3.34* 

*** p.01. **p«.05. *p«.10. 


Results of Reasons Analysis 


The percentage giving each reason category 
for each event is presented in Table 3 for six 
age-groups. For example, of the 55 respon- 
dents 18 to 29 years old who mentioned 
World War II as important, 4 percent gave 
*war-time experience" as the reason for their 
mention and 96 percent did not. (The total 
number of people giving the reason across all 
ages is also shown in parentheses next to each 
reason, e.g., 99 respondents for wartime 
experience.) On the right side of the table, 
t-ratios are provided from logistic regressions 
in which each reason is treated as a 
dichotomous dependent variable (given or not 
given), and age, education, and gender are 
treated as predictors.? All the f-ratios are 


(62 percent) because of the omission of a large 
category of vague miscellaneous responses. 

? Education in this analysis also has six 
categories: 0-8, 9-11, 12, 13-15, 16, 17+ years 


provided for age, since it is our main focus, 
but only those that are statistically significant 
(at p<.10) are noted for education and gender 
in order to decrease the density of the table. 
These statistics are shown for initial guidance 
only, since in some cases nonlinearities will 
be our focus. It should also be noted that each 
response could be coded for up to three 
reasons, and thus the categories shown for an 
event in Table 3 are not mutually exclusive. 
(Analysis of first coded reasons only, in order 
to create mutual exclusivity, yields results 
similar to those in Table 3.) 

World War II. Important age differences 
occur for the most frequently given World 
War II reason categories, as shown in Table 


of schooling. We do not include the background 
variable of race here, since this would further 
reduce sample efficiency because of the need to 
exclude several small classifications other than 
white and black. Racial differences, however, are 
already partly captured by education. 


_ GENERATIONS AND COLLECTIVE MEMORIES 


3. The most clear-cut of these relationships is 
straightforward in interpretation: older people 
tend to explain their choice of the war as 
important in terms of personal War Experi- 
ences (row 1), with a sharp discontinuity in 
such reasons between those above and below 
the present age of 50 (essentially between 
those alive and those not yet born before the 
beginning of the war). Examples of these 
responses are: 


J had to go to North Africa. I don’t like North 
Africa. I have to wear a hearing aid now because 
of it. Lost part of my hearing there. (man, age 
70) : 


Because my husband was away from me for 
three and a half years and it changed my life a 
lot. I had a child when he was gone, and I had to 
go through that alone. (woman, age 72) 


The war is remembered as important by these 
people because they served in the armed forces, 
or their close relatives were away in the armed 
forces. Not all such experiential memories were 
negative —"I was drafted . . . trained in the 
navy on small boats; the experience was good" 
(man, age 63)—but almost all are obviously 
concrete and autobiographic, sometimes viv- 
idly so. These percentages of 45 percent, 44 
percent, and 54 percent (for those ages 50-59, 
60-69, and 70+, respectively) are the highest 
in the World War II set of reasons, indicating 
that the dominant reason that older people offer 
for mentioning World War II as important is 
personal experience. 

Wartime Shortages and Lives Lost are both 
small categories of response that show some 
effects of age—more clearly when the results 
are examined in detail. In the case of Wartime 
Shortages, the age effect is sharpest when the 
sample is divided by gender, prompted by the 
significant gender difference indicated in 
Table 3. As shown in Table 4, memories of 
shortages are more common among women, 
and especially among those women who were 
quite young near the end of World War II (see 
also Elder, Gimbel, and Sweat 1988). For 


Table 4. Wartime Shortages by Age and Gender* 
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example: “I just remember the lines you had 
to go through to get your sugar and coffee and 
gas stamps" (woman, age 50 now, age 10 in 
1945). The age effect is less obvious for men, 
but there are still clear references to early 
childhood experience: “I remember I couldn't 
get candy bars," said a middle-age man who 
was just three years old in 1945. 

Mentions of Lives Lost during the war 
extend over the entire age range, but the 
largest percentage giving this reason is for 
those 60 to 69, that is, for people who were 
teenagers or young adults during the World 
War II years. If the 60—69 cohort is contrasted 
with all other ages combined, the difference 
reaches conventional significance: L?= 4.48, 
df- 1, p=.03. Moreover, for those in the 60 
to 69 cohort, these responses often suggest 
direct emotional meaning: 


A lot of lives were lost. I came home but a lot 
didn't. (man, age 63) 


For younger people, the same type of coded 
response is more distant and removed from 
the scene: : 


It was a good thing to have it end and people 
weren't being killed any more. (man, age 23) 
Thus the Lives Lost reason probably contains 
age differences in personal meaning that are 
not fully captured by statistical testing of the 

gross code category itself. 

What do younger people, who were not 
even born in 1945, "remember" about World 
War II? The two most reliable relations in the 
direction of youth are the categories labeled 
Created World Structure and Large Impact. 
Both types of response refer to the importance 
of World War II from a broader perspective, 
and they differ from each other mainly in how 
elaborated the response is, as suggested by 
the fact that Created World Structure has a 
positive association with education in Table 
3, whereas Large Impact does not. For 
example, the two responses below were 
coded, respectively, as Created World Struc- 
ture and Large Impact: 


Mentioning 
Shortages 18-29 30-39 40-49 50-59 60-69 70+ 
Men 3% 0% 1% 3% 0% 5% 
i (31) (42) (30) (34) (40) (19) 
Women 0% 18% 14% % 
(23) (28) (28) (35) (33) (18) 


* Base Ns are in parentheses. 
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The way it changed world relations. Created 
alignments of countries. The cold war was 
caused by World War II, precipitated by it. 
(man, age 36, college graduate) 

Affected more people than any other war. (man, 
age 36, high school graduate) 


Both responses are of a kind that might be 
learned in courses on American history, or 
from television, or in the case of the second 
response might almost be inferred from the 
word “world” in “World War.” 

Much more interesting theoretically is the 
relation to younger ages of responses catego- 
rized as Winning the Good War. The 
responses refer to the fact that the United 
States triumphed in World War H and that it 
was a good war because it involved a struggle 
against evil: 


If we didn’t win the war, this would be a 
different kind of world—not as much freedom. 
(woman, age 31) 


That was a victorious war so it was exhilarating 
to bond together in the country. (woman, age 
36) 


These responses are not only related linearly 
to younger ages, but are given especially by 
the main Vietnam generation age-group 
(those who were in their teens and 20s at the 
height of the Vietnam War), not by those who 
either are younger now or were alive during 
World War II. When the Vietnam age 
category (30—39) is compared with all other 
age categories combined, controlling for 
education and gender, the difference is highly 
significant (t — 3.31). (We note also that such 
responses are given more often by males and 
by the least educated, but these other 
background factors do not interact with 
age.19) Thus, it is primarily the Vietnam 
generation that looks back on World War II as 





10 Winning the Good War includes some 
responses that emphasize only the "winning" 
aspect and some that emphasize mainly the good 
versus evil aspect. Although a single combined 
category is analyzed here, analysis of codes for the 
separate aspects yields the same conclusions for 
relations to age. However, mentions only of 
` “winning” are given disproportionately by males 
and by the less educated, whereas mentions of 
good versus evil are unrelated to sex and 
education. It is the former, therefore, that produces 
the sex and education relations in Table 3 for 
Winning the Good War. 
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the "good war" that we fought and won— not 
those who lived during the war itself! 

This finding calls for an important reversal 
to the more usual proposition that older 
people are the ones most likely to see recent 
events in terms of a world that is now gone. 
In the case of World War II, it is a younger 
generation who see an earlier event, World 
War II, in terms of their later experience with 
Vietnam. These Vietnam generation represen- 
tatives are nostalgic for a world they have 
never known directly, in contrast to the world 
of their own youth during the divisive late 
1960s and early 1970s. We might term this 
“vicarious nostalgia," a socially created 
version of Edwin Arlington Robinson's ““Min- 
iver Cheever.” 1! 

Younger people also tend to think of other 
positive features or consequences of the war. 
They look back on it as a time of Patriotism 
(row 5) and also a time of Economic 
Prosperity (row 4). The first of these, 
Patriotism (e.g., “the country came together 
as a whole during that time; everybody 
worked as a unit” [man, age 36]), is readily 
understood as being much the same as 
Winning the Good War, though it appears 
equally concentrated among the youngest 
Vietnam and post-Vietnam respondents. But 
the second, Economic Prosperity, is more 
surprising and is contrary to our initial 
expectation. It had seemed likely that the 
wartime and immediate postwar prosperity 
would be salient mainly to those who had 
experienced it, and that younger people 
would not associate “war” with economic 
gain. Yet the logistic result is in the opposite 
direction, and Economic Prosperity can be 
seen to be mentioned at a fairly uniform level 
from 18 to 59, then to drop sharply among the 
main World War II generation. However, 
many of these responses turn out on closer 
examination to be not about World War II 
itself, but rather about the postwar prosperity: 


!! [n agreement with Davis (1979, p. 8), true 
nostalgia refers to “a personally experienced past, 
rather than one drawn solely from chronicles, 
almanacs, history books.” The latter is nicely 
represented in Robinson’s poem: 


Miniver loved the days of old 
When swords were bright and steeds were 
prancing; 
The vision of a warrior bold 
Would set him dancing. 
Miniver sighed for what was not... . 
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Well, I was a war baby. I benefited from it like 
the others did. . . . It all came easy. Education, 
jobs, wealth. (man, age 42) 


Our financial outlook changed for the good. My 
dad came home from the war; he bought a 
house— which we had never had before— a car, 
and other things. Things just started looking up. 
(woman, age 51) 


Thus, some of the responses have fairly clear 
experiential content, though others are by 
people too young to have been part of the 
postwar prosperity period and refer vaguely to 
improvements in the economy. !? 

The Kennedy assdssination. Nominations 
of the 1963 assassination of John Kennedy as 
one of the important events of the past 50 
years are already clustered within a fairly 
narrow age range, but the reasons given for 
these mentions show some additional rela- 
tions to age. One set of “reasons” is simply a 
clear memory of the assassination, in terms of 
either a specific “flashbulb” image of hearing 
of the event itself (Brown and Kulik 1977; 
Neisser 1982), or a more general report of its 
being memorable. These slightly different 
references to memory are illustrated by the 
following two quotations, the first of the 
flashbulb variety and the second more general 
in nature: 

I remember it vividly. I was in my sixth grade 

class when the principal came in to announce it. 

(woman, age 33) ` 


I was very young and it made a big impression 
on me. (woman, age 30) 


The two types of remembering have been 


12 Reasons for mentioning the Depression paral- 
lel reasons for mentioning World War II in that 
older people explain their choice of the Depression 
in terms of remembered hardships, whereas young 
people's explanations tend to be in terms of New 
Deal legislation and other social outcomes. It may 
be that the positive side of a past event tends to be 
stressed more by later generations that did not 
experience it. We had also expected to find among 
the oldest Depression cohort reasons that referred 
to valuing frugality, but such responses occur 
mainly among those 50 years old now, which 
means they were born in 1935 and were the 
children of the Depression cohort, rather than from 
the Depression cohort itself. On reflection, it is 
plausible that frugality as a personal habit became 
part of the "natural" world of young children 
through socialization by the Depression cohort of 
parents, and thus differs from reasons that are 
connected with experience during adolescence. 
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combined in Table 3 because the number of 
cases in each is small (13 cases classified as 
flashbulb and 15 as more general memory) 
and because when viewed separately they 
show identical trends. The linear relation of 
this combined Memory category is for 
younger people to mention clear memories of 
the assassination more often than do older 
people. However, when the bivariate associa- 
tion is examined more closely, it is the second 
youngest cohort— those 30 to 39, who were 8 
to 17 in 1963—that speaks especially of 
memory. (If the 30 to 39 age cohort is 
contrasted with the other age categories 
combined, L?=4.04, df=1, p=.04.) The 
absence of specific memories by those very 
young in 1985 is hardly surprising—our 
18-year-olds were not born until 1967—but 
the fact that memories are not often men- 
tioned by those over 30 is a more meaningful 
finding. It suggests that even those older 
respondents who named the assassination as 
especially important were less likely to recall 
it in personal terms such as where they were 
and what they were doing at the time of the 
event. Flashbulb memories, whether com- 
pletely accurate or not (Neisser 1982), point 
directly to the way in which a traumatic event 
leaves an imprint by disrupting the natural 
world of childhood. 

Older people, on the other hand, tend to 
give as a reason for mentioning the assassina- 
tion a type of response that we have called “If 
he had lived. . . .” These are statements about 
how Kennedy's assassination led to negative 
changes in the United States, though the 
changes are seldom specified very clearly. 
For example: 


Things would be different today if he lived but I 
don't know how. He was a great man. (man, age 
64) 


Thus people who were middle-aged or older 
when the assassination occurred were not as 
dramatically affected by the stark report of the 
shooting as by a more general sense of 
political loss that it signified. 

The one other clear category that we were 
able to code involved strong expressions of 
grief and shock over the assassination: 


I was at school and we were all very devastated. 
One girl even started crying. (woman, age 28) 


Although some of these responses are part of 
a flashbulb memory, this is not always true 
and the correlates of the two categories are 
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not the same. Here the main correlate is 
gender: 26 of the 33 respondents coded as 
reporting personal grief are women. 

Vietnam. Mentions of Vietnam as impor- 
tant, like mentions of Kennedy’s assassina- 
tion, have a narrow age range, with nearly 70 
percent of such answers coming from respon- 
dents under 40 years of age. (Even where 
percentages look large among older respon- 
dents, the base N's are quite small in the case 
of Vietnam, e.g., only five cases among those 
70 and over, and hence the percentages in 
Table 3 are misleading unless this is taken 
into account.) Because of this age constric- 
tion, explanations for such mentions are not 
likely to show much association with age. 

Of the eight types of reasons in Table 3 for 
mentioning Vietnam, two show significant 
linear relations to age, two show significant 
curvilinear relations, and four show only 
trends or point to no relation. The clearest 
linear association is negative: young people 
speak in terms of veterans being ill-treated, 
which quite likely results from the fact that 
Vietnam veterans were ignored during and 
immediately after the war and their problems 
only recently became a salient issue: 


A lot of people went over there and fought and 
died and when they came back no one was 
happy with them even though the government 
sent them over there. (woman, age 20) 


Thus Vietnam Veterans is a young person's 
reason because it really refers to a more 
recent event than the Vietnam war itself. 

Know Others in War also shows a 
significant linear relation to age, but in this 
case positive in direction and produced 
mainly because of the disproportionately 
small number of post-Vietnam generation 
persons (ages 18—29) who give such a reason. 
Those offering the reason are usually the 
relative and friends of soldiers: 


I was involved in it. My husband and my brother 
served over there. (woman, age 35) 


The two significant curvilinear relations are 
for Personal War Experience (quadratic term: 
= —2.35) and Division and Distrust (qua- 
dratic term: t= —2.35). The first type of 
response is most frequent among those in 
their 30s and 40s and often involved direct 
military experience in Vietnam: 


Į was in the service at that time and I lost friends 
in the war. (man, age 40) 
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The second type of response also occurs 
especially among the 30 to 49 age cohorts, 
and it includes two slightly different types of 
response, here combined under the rubric 
Division and Distrust: 


It separated the country. The people who were 
antiwar and the people who were pro. It put the 
country in a turmoil. (man, age 31) 


I'm very cautious now. I don't trust people or 
governments. . . . I’m just wary about the 
government. (man, age 36) 


This is the largest of the Vietnam war 
categories, and it directly reflects the disillu- 
sionment felt by the Vietnam generation but 
extends as well to a still older cohort (50 to 
59) that was in its 30s when the war began. 

There are also nonsignificant curvilinear 
trends for the categories Didn't Win and 
Didn't Try to win. The former is generally 
offered as a matter of fact conclusion about 
the disappointing outcome of the war, while 
the latter is a much more ideological 
condemnation of the government for not 
pursuing victory with greater vigor. For both 
categories, it is the Vietnam generation and 
just beyond (ages 30—49) that produce the 
bulk of the responses. Finally, there are two 
categories that show no sign of a relation to 
age: mention of Lives Lost, which we note 
also failed to be related to age when offered 
as a World War Il reason, and vague 
statements to the effect that the Vietnam War 
had no meaning. These relatively frequent 
reasons for mentioning Vietnam as important 
tend to be given at all age levels. 

Space. Unlike the three events dealt with 
thus far, the sheer number of mentions of 
space exploration as an important event or 
change did not show any sign of an 
association with age. Even if after-the-fact 
speculations about the uniqueness of scientific 
events can account for the absence of an age 
relation, we might still expect the reasons 
given for such mentions to differ by age. The 
reactions of a 10-year-old and of a 50-year- 
old were unlikely to be quite the same as they 
watched Neil Armstrong step from Apollo 11 
onto the moon, and this should show up in 
1985 when each recalls the event some 16 
years later. The results in Table 3 bear out 
this expectation, for all four main reasons 
Biven for mentioning space exploration are 
significantly associated with age—two in 
each direction. 

Emotional Awe and wonder over space 
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exploits are expressed disproportionately by 
those approaching age 50 and beyond. A 
respondent of 50 today was 34 at the time of 
the moon landing, and it is these people who 
are most apt to use words like “fantastic” or 
“amazing” in explaining why space explora- 
tion was the most important event of the past 
50 years: 


I saw accomplishments I never thought I'd see 
and they happened so quickly. I'm just fasci- 
nated by it. (man, age 52) 

Sheer minds that are back of all that. I’m 
amazed at that much intelligence and courage. 
Sheer magnitude of it all. (woman, age 69) 


On the other hand, younger respondents, 
perhaps because they have lived with space 
exploration almost from the beginning and are 
future-oriented, are especially likely to spec- 
ulate about the Intellectual Excitement of new 
developments still to some: 


Our world will change in the next 50 years 
because of what's going on the space industry. 
We may make moves to live elsewhere. 
(woman, age 24) 


Well, we might even have space stations and so 
if we destroy our world, we will have a place to 
go. (woman, age 27) 


There is a futuristic science-fiction quality to 
the excitement of the young, as distinct from 
the simpler awe of older respondents at a 
world so different from their youth. The 
contrast provides a good example of how the 
two ends of the age spectrum recall—and 
presumably experience—the same objective 
event in different ways. Moreover, the results 
for Emotional Awe and Intellectual Excite- 
ment hold just as strongly if only reasons for 
mentioning the 1969 Moon Landing are 
tested, rather than for all mentions of space 
exploration. 

Younger and.middle-age respondents also 
show a tendency more than the oldest 
respondents to stress national goals and 
national pride (Nationalism) when talking 
about space exploration: 


Because we know what Russia has up there and 
we can keep pace with them on their level. 
(woman, age 24) 


Gives you a sense of pride of being American. 
(man, age 47) 
Older respondents, on the other hand, are 
likely to cite practical spin-offs from space 
exploration. Since more highly educated and 
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male respondents similarly see practical value 
to space explorations, it is the older, male, 
highly educated respondents who most often 
refer to such practical payoffs, though in 
analysis not shown here this combination of 
characteristics appears to be an additive set, 
not an interactive one. Examples of such 
responses are: 

They've come a long way in predicting the 

weather and that sort of thing. (man, age 53, 

high school graduate) 


In sum, older respondents see space 
exploits as utterly unexpected events and 
express surprise and pleasure about them. 
Older respondents are also likely to note the 
technological gains in other areas of life that 
have been stimulated by space developments. 
Younger respondents tend to take past space 
accomplishments as given, and to speculate 
more on future possibilities such as space 
colonies or travel to other parts of the 
universe. The young are also more apt to 
view space exploration as a race with the 
Soviet Union and to show pride in what they 
see as uniquely American achievements. 


CONCLUSIONS 


For the majority of 12 major national or world 
events and changes from the past half century 
that Americans recall as especially important, 
the memories refer back disproportionately to 
a time when the respondents were in their 
teens or early 20s. Thus the data fit well both 
the general hypothesis that memories of 
important political events and social changes 
are structured by age, and the more specific 
hypothesis that adolescence and early adult- 
hood is the primary period for generational 
imprinting in the sense of political memories.!? 
Furthermore, for memories of two other types 
of change, the group most directly affected— 
blacks for civil rights and women for 
women’s rights—show similar age structur- 
ing, and in a third case—answers referring to 
moral decline—there is evidence that a larger 
sample of responses might allow finer coding 
which would yield the hypothesized age 
relation. In the end, there are only two clear 
exceptions to the hypothesis about age 
structuring of memories, and both of these 


13 Henceforth we use either “event” or “change” 
to refer to both particular events and more general 
changes. i 
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involve scientific developments: the explora- 
tion of space and the invention of the 
computer. Quite possibly it is the nonpolitical 
nature of these two events that accounts for 
their lack of association with age, perhaps 
because there is less interference from earlier 
events than is the case for political issues. 
(See Brown, Shevell, and Rips 1986, for 
other evidence that political and nonpolitical 
public events are remembered differently.) 
Of course, not everyone names an event 
from his or her youth when asked to recall 
important events or changes over the past half 
century. However, we expected the meaning 
respondents give to an event to be heavily 
influenced by their own experience during 
youth, whether or not the event itself came 
from that period of their life. The evidence 
for this hypothesis cannot be evaluated in as 
precise a way as can the simple recall of 
events, but several findings are supportive. 
Those who chose an event that happened 
during their own adolescenee or early adult- 
hood show a strong tendency to explain their 
choice in terms of straightforward personal 
experience at that time, for example, service 
in the army during World War II, or a 
presumed flashbulb memory of the Kennedy 
assassination. However, those whose own 
youth occurred in a period different from that 
of the event they mention show a tendency to 
contrast the event implicitly with events from 
their own adolescence and young adulthood. 
The most striking example of this is the 
finding that characterizations of World War II 
as a “good war” and a “victorious war” come 
less from the World War II generation itself 
than from the later Vietnam generation now in 
its 30s and early 40s. Thus youthful experi- 
ence of an actual event or change often 
focuses memories on the direct personal 
meaning of the experience, whereas the 
attribution of some larger political meaning to 
the event is more likely to be made by those 
who did not experience it at all, or at least did 
not experience it during their adolescence or 
early adulthood. Not every explanation of 
remembered events fits this conclusion neatly: 
the Vietnam war itself provides a partial 
exception, since memories of that period's 
divisiveness and distrust, which are at once 
personal and more generalized, do come 
disproportionately from the Vietnam genera- 
tion; but each of the four events producing 
sufficient explanations for detailed examina- 
tion— World War II, Vietnam, space explora- 
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tion, and the Kennedy assassination — shows 
cohort differences in perception that seem to 
reflect differences in generational vantage 
points. 

Our earlier use of the term "generational 
imprinting" may suggest an almost machine- 
like stamping in process, as Rubin, Wetzler, 
and Nebes (1986) imply in their discussion of 
reminiscence, but this is not our interpreta- 
tion. The importance of adolescence and early 
adulthood can be seen instead to emerge out 
of the conjunction of several life course 
factors: first, the low salience for most people 
of events that occurred prior to their own 
lifetime, or even prior to their near- 
intellectual maturity in adolescence; second, 
the openness of adolescents and young adults 
to events and influences from outside the 
home and neighborhood; and third, the 
importance of the first political and social 
events that people encounter for shaping their 
later views of the political and social world, 
so that subsequent events seldom seem as 
significant as those encountered earlier. 
Generational imprinting can thus be regarded 
as a consequence of normal individual 
development, just as differences in genera- 
tional perspectives on the "same" event can 
be seen to be a consequence of varying 
locations in historical time. 

Our results also suggest a need to distin- 
guish between at least two meanings of the 
term "collective memory." On the one hand, 
when large parts of the population appear to 
remember a common object, this can be 
thought of as a form of collective memory. 
However, it may be a rather superficial form, 
especially when on closer examination the 
memories turn out to be quite personal and 
particular—less about “World War Il" as a 
collectively conceptualized event than about 
one's personal loss of hearing while on 
military assignment in North Africa, or the 
shortage of candy bars on the home front. On 
the other hand, when a large part of the 
Vietnam generation remembers the Vietnam 
period as one of distrust and division, this is a 
collective memory in the more general sense 
of being collectively created and collectively 
held, and it probably has more general import 
for future actions by members of that 
generation. However, using this second 
meaning, if members of the Vietnam genera- 
tion also "remember" World War II as a 
triumph of good over evil, even though they 
were not alive at the time, this is a kind of 
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collective memory too. Collective memories 
in this second sense of widely shared images 
of a past event need not be personally 
experienced and thus begin to be difficult to 
distinguish from Durkheim’s ([1901] 1938) 
conception of traditional beliefs as a form of 
"collective representation" (see also Bloch 
1925). The difference may lie less in the 
content of the memory than in the degree of 
personal feeling that is apt to accompany 
events lived through, as against events 
learned about second-hand. 

Whether the generational memories docu- 
mented in this paper influence future behavior 
is an important issue, but not a simple one. 
For one thing, some of the most deeply felt 
memories reported by the public have little 
clear implication for future behavior. The 
assassination of John F. Kennedy was the 
fourth most frequently remembered event in 
our study, yet the meaning that people took 
from the assassination seems to have been 
more personal and philosophic than political: 


Reality hit me. It made me see death and blood 
and after that.there were no surprises for me. 
(man, age 31 now, age 9 when Kennedy was 
shot) 

We all stopped being so innocent. We thought 
everything in the U.S. was good and pure, and 
we found out it wasn't so. (woman, age 51 now, 
29 when Kennedy was shot) 


Furthermore, even for events such as World 
War II, which might at first seem full of 
political meaning, most memories were about 
personal experiences that had no obvious 
implication for future political behavior. Even 
though we did not probe specifically for 
political interpretations, their extreme rarity 
of occurrence in spontaneous answers sug- 
gests that wider political generalization was 
not an important result of experience con- 
nected with that war, though of course it may 
have played a more significant role in shaping 
the views of future political leaders (Converse 
1964, 1987). 

Even for memories that have more apparent 
relevance to the future, for example, memo- 
ries of the divisiveness of the Vietnam era, 
there may be quite contradictory lessons 
drawn, some people focusing blame on the 
government or the military, some on the press 
or on liberal critics of the war. These two 
different lines of political interpretation corre- 
spond roughly to Mannheim’s (1952) distinc- 
tion between: the perspectives of different 
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“generational units,” and they indicate again 
the danger of moving too quickly from one’s 
own views of the lessons of the war to 
predictions of the views, let alone future 
behavior, of others. In addition, the general 
public was less concerned about the larger 
moral issues of the war than about the 
likelihood of winning or losing it (Schuman 
1972), so the most obvious implications for 
decisions about future American intervention 
are contingent on that aspect of a new 
situation, with enthusiastic backing of what 
seems like a painless intervention (e.g., 
Grenada), but reluctance to support what is 
thought to be riskier involvement (e.g., 
Central America). 

None of these considerations means that 
generational memories of the past are unre- 
lated to future actions, but rather that as 
Weber ([1956] 1968) insisted, it is important 
to understand what events mean to individuals 
and social groups, since subjective meaning is 
a crucial element in the translation of 
experience into future action. The findings in 
this paper help move us in that direction. 

There is one other important empirical 
problem that needs to be addressed in 
considering our results. A study like this at a 
single point in time inevitably confounds 
cohort effects with the “objective” impor- 
tance of events. One might argue that older 
people choose World War II as especially 
important because it was in a real sense the 
most important single event of the past half 
century. According to this argument, if our 
study had been carried out in 1950, we might 
have discovered no age effect at all, since 
virtually everyone would have selected World 
War II as most important—a type of “period 
effect." From this standpoint, only one 
additional factor needs to be added to explain 
the choices of events in 1985: the ignorance 
of today's youth about the truly important 
events of the past 50 years that occurred 
before their adolescence. However, while 
historical importance is no doubt a major 
factor in influencing events, the diverse series 
of age-related choices reported in this paper 
make such a purely historical argument 
untenable (see Schwartz 1982 for a balanced 
treatment of this issue). In addition, the quite 
personal reasons that people often give for 
their nominations of events show clearly that 
generational effects on memory are much 
more than simply a mental recording of 
external pointers to historical importance. 
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Some people certainly make judgments that 
reflect primarily the perspective of historians 
toward the past, but for most of us it is the 
intersection of personal and national history 
that provides the most vital and remembered 
connection to the times we have lived 
through. 
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LIFE STRESS AND HEALTH: 
STRESSORS AND RESOURCES* 


NAN LIN WALTER M. ENSEL 
State University of New York at Albany 


In the life stress process social, psychological, and physiological environments 
simultaneously impinge on well-being. In this paper we focus on physical health as 
it is affected by two environmental elements: stress(or)es and resources. Data from 
a three-wave panel survey of a representative upstate New York community are 
examined by lagged causal relationships among these variables. The results show 
that (1) prior physiological conditions contribute directly to current physical 
symptoms, (2) stress and resource components of the psychological environment 
directly affect current physical symptoms, (3) social resources buffer both social 
stressors and psychological stress, whereas psychological resources buffer only 
psychological stress, and (4) psychological stress mediates the effects of prior 
Social stressors and psychological resources. Clearly, stress is a complex process 
in which the three interacting environments affect well-being. Importantly, social 
resources play a buffering role relative to social and psychological stresses for 


physical well-being. 


THE THEORETICAL ISSUES 
AND FORMULATIONS 


The theoretical interest in social epidemiol- 
ogy, the study of effects of social conditions 
on the diffusion of distress and disease in the 
population, can be traced to Durkheim's 
study of suicide in 1897 (1951). Since then, 
theory and research have elaborated the 
associations between the various forms of 
Social integration and psychiatric disorder. 
Among the classic works are Faris and 
Dunham's study of the ecology of mental 
disorders in urban areas (1939), Hollingshead 
and Redlich's research on social class and 
mental illness in New Haven (1958), the 
Midtown Manhattan studies (Srole, Langner, 
and Michael 1962; Langner and Michael 
1963; Srole 1975), and the Sterling County 
studies by the Leightons and their colleagues 
(A. H. Leighton 1959; Hughes and Tremblay 
1960; D. Leighton, Harding, Macklin, Mac- 
Millan, and A. H. Leighton 1963). Each 
study illuminates the linkage between social 
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conditions and distress and advances theories, 
hypotheses, and empirical evidence in the 
specification of the relationships. 

The conceptual development of the life 
Stress process can be described in terms of 
three environments (social-psychological- 
physiological environments) and two types of 
forces in each environment (stress and 
resources). After we briefly discuss these 
concepts, we introduce a paradigm that 
incorporates all the forces in the three 
environments and subjects the elements of the 
paradigm to a systematic empirical examina- 
tion. 


Life Events—the Social Stressors 


Probably the most dominant theoretical formu- 
lation in the social etiology of distress has 
been the stress-distress model. Physiologist 
Hans Selye (1956) proposed the theory that 
diverse external stimuli might generate undif- 
ferentiated physiological and mental re- 
sponses. This stress-distress model repre- 
sented a direct convergence between the 
earlier sociological concerns with conse- 
quences of social integration and the physio- 
logical modeling of internal responses to the 
external environment. This theoretical formu- 
lation spurred research on personal life 
changes as a set of such social stressors. The 
hypothesis linking stressful life events and 
onset of distress and disorder had become the 


1989, Vol. 54 (June:382-399) 


LIFE STRESS AND HEALTH 


leading thrust in theoretical development and 
research programs in social epidemiology 


since the mid-1960s (e.g., Holmes and Rahe 


1967; Dohrenwend and Dohrenwend 1981; 
Myers, Lindenthal, Pepper, and Ostrander 
1972). 

In general, the research shows that life 
stress exerts a significant but moderate 
influence on mental and physical well-being. 
In a simple zero-order correlation, the rela- 
tionship between life stress and well-being 
(e.g., depressive symptoms) ranges between 
.25 and .40 (Rabkin and Struening 1976). 
This figure is somewhat less for physical 
health (House 1981; Wallston, Alagna, De- 
Vellis, and DeVellis 1987; Ensel 1986a). The 
magnitude of this relationship seems to hold 
up when other factors are taken into account 
(general socioeconomic status measures, age, 
psychological resources such as self-esteem, 
personal competence, and locus of control, 
physical health, and prior mental state; for an 
example of differential impact of undesirable 
life events for males and females, see Kessler 
and McLeod 1984) and improves marginally 
when undesirable life events are examined. It 
has also been shown that when items 
pertaining to psychological states (sleeping 
and eating problems) or illnesses are deleted, 
the magnitude of its effect is only marginally 
reduced (Ensel and Tausig 1982; Tausig 
1982, 1986). 


Social Support—the Social Resources 


A major refinement of the basic research 
paradigm has been the exploration of addi- 
tional social factors that may exert positive 
effects on well-being and, especially, those 
that may ameliorate the negative effect of life 
stress on well-being. If we consider life stress 
as the aspect of the social environment having 
a detrimental effect on well-being, then it is 
reasonable to expect that there are aspects in 
the social environment that enhance one’s 
ability either to improve well-being or to 
counter potential adverse effects of life stress. 
Social support, defined as the process (e.g., 
perception or reception) by which resources 
in the social structure are brought to bear to 
meet the functional needs (e.g., instrumental 
and expressive) in routine and crisis situa- 
tions, has gained recognition as one such 
potential factor. It has been argued that 
intimate relations with others whom one 
might confide in and receive feedback from 
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may significantly affect one’s mental and 
physical health. Some have argued that such 
an effect is direct and independent of life 
events, while others have suggested that 
social support mediates or buffers the effect 
of life events on illness. The mediating effect 
(sometimes referred to as the intervening 
effect) is said to occur when the incorporation 
of the social support factor reduces the direct 
effect of life events on physical health or 
mental health. The buffering (interacting) 
effect is said to occur only if the simultaneous 
presence of life events and absence of social 
support exert a detrimental effect on physical 
health or mental health. 

Since the early seventies, numerous studies 
have examined the effects, both independent 
and mediating/buffering, of social support on 
mental health. In general, the independent 
and mediating effects of social support have 
received strong confirmation (see reviews by 
Barrera and Ainlay 1983; Leavy 1983; 
Wallston and Wallston 1984; Berkman 1984, 
1986; House and Kahn 1985; Wethington and 
Kessler 1986; Cohen and Wills 1985; Lin 
1986b). Likewise, a number of studies have 
provided evidence for buffering effects (see 
reviews by Kessler and McLeod 1985; Cohen 
and Wills 1985). 

The underlying social basis of the theory 
not only specifies the significance of social 
experiences (life events and social support) in 
the production, exacerbation, and reduction 
of psychophysiological problems, but also 
paves the way for further identification of 
other factors in the stress-distress process. 
The two dominating contending factors are 
psychological resources and psychological 
stress/strain. 


Psychological Resources and Psychological 
Stress/Strain 


Coping resources have been used to explain 
differential vulnerability of individuals to 
illness, controlling for the level of stressors 
they experience. While social support can be 
characterized as an external form of coping 
resources, internal or psychological coping 
resources have come to play a major role in 
the etiology of both mental and physical 
health (Folkman and Lazarus 1980; Kobasa, 
Maddi, and Coddington 1981; Pearlin, Lieber- 
man, Menaghan, and Mullan 1981; Wheaton . 
1983; Gore 1985). Source 
hypothesized to affect the 
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individuals to recognize the stressful social 
stimuli (i.e., life event) and trigger a response 
reaction to the stimuli in an attempt to prevent 
or eliminate potential distress. Concepts such 
as sense of mastery, feelings of personal 
competence, self-esteem, and locus of control 
have come to be viewed as personality factors 
(resources) that influence an individual's 
reaction to life events or stressors and may 
buffer or reduce the impact of a stressor on 
subsequent illness (Gore 1985; Wheaton 
1983). 

Psychological stress/strain, in contrast to 
psychological resources, increases or exacer- 
bates health problems. It has commonly been 
referred to as psychological vulnerability 
which may lead to the onset of an actual 
physical illness. (Mellinger, Balter, and Man- 
heimer 1978). Various indicators tapping 
psychological stress (i.e., depression, anxi- 

. ety, decreased energy and interest, sleep and 
appetite disturbances, somatization in the 
form of symptom checklists), as well as 
strains (Pearlin et al., 1981) have been 
employed. Findings generally point to a 
relationship between social stressors (life 
events) and psychological stress/strain and 
between psychological stress/strain and subse- 
quent illness onset. For consistency, we will 
use the term psychological stress to represent 
the various components of psychological 
stress and strain. 

It should be noted that psychological stress 
has been used both as independent (causal) 
and dependent (effectual) variables in the 
stress literature. In fact, a substantial body of 
the literature focuses on psychological dis- 
tress as the ultimate endogenous (dependent) 
variable. Here, we emphasize psychological 
stress 'as potential independent variables 
directly affecting health outcomes. Therefore, 
it may constitute a stressing condition in a 
buffering model. Social stressors (i.e., life 
events) and psychological stress point to the 
parallel stressing conditions.in both the social 
and psychological environments. This parallel 
will become clearer when we later introduce 
the conceptual paradigm in Figure 2. 


Physiological Resources and Stress 


Further, such models also need to take into 
account the physiological environment. In the 
physiological environment, both elements of 
stress (i.e., physical symptoms, clinically 
diagnosed illnesses) and resources (i.e., 
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dieting, exercise, and health practices such as 
refraining from use of alcohol, drugs, and 
tobacco) are expected to exert effects on 
well-being. Thus, it is important that theoret- 
ical models be developed not only to examine 
the roles played by factors representing these 
social-psychological-physiological perspec- 
tives in the explanation of well-being, but 
also to provide the basis for empirical 
assessment of their relative merits. 


LIFE STRESS PARADIGMS 


While conceptual analysis and research atten- 
tion have been given to life stress, social 


_ support, and psychological stress for their 


potential effects on physical health and 
mental health, few attempts have been made 
to integrate these elements into a coherent 
theoretical framework. In an attempt to fill 
this void, Dohrenwend and Dohrenwend 
(1981) summarized various formulations of 
life stress processes and the psychological and 
social contexts in which they occurred. Each 
formulation (hypothesis) was shown to pro- 
vide viable conceptual linkages between life 
events and health outcomes and to have 
received some empirical support (see Figure 
1). 


The Dohrenwends’ Models: Physical Health 
as the Outcome Variable 


The hypotheses in these models share two 
common features: (1) the ultimate dependent 
variable is adverse health or adverse health 
change, rather than mental health problems or 
disorders, and (2) each hypothesis delineates 
and explains the possible empirical associa- 
tion between life events and health. Some of 
the hypotheses affirm the primary role of life 
events as causing health problems, while 
other incorporate mediating factors to explain 
health problems. 

The first and most basic model, the 
victimization hypothesis (Model A), postu- 
lates the direct effect of extreme social 
situations (natural disasters, prison camps, 
etc.) as well as a variety of everyday 
happenings. It posits, for example, that 
stressful life events directly affect adverse 
health changes. The hypothesis formed the 
basis for the development of the Social 
Readjustment Rating Scale (SRRS) (Holmes 
and Masuda 1974) and was examined in 
numerous studies (e.g., Zubin and Spring 
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Fig. 1. Six Models of the Life Stress Process as Proposed by Dohrenwend and Dohrenwend (1981; p. 20) 


1977; Brown and Harris 1978; Hamburg and 
Adams 1967; Gore 1978). 

An intervening variable, psychological 
strain, proposed by Langner and Michael 
(1963), is introduced in the stress-strain 
hypothesis (Model B). This model, which 
suggests that individual variations in psycho- 
logical strain or other nonpathological re- 
sponse mediate the impact of stressful life 
events, is exemplified in the work of Garrity, 
Marx, and Somes (1977). 

In a further attempt to explain individual 
differences in responses to stress, the vulner- 
ability hypothesis (Model C) proposes that 
preexisting personal dispositions and social 
conditions interact with life events to produce 
adverse health consequences. This model is 
generally known as the interaction of buffer- 
ing model. That is, in the absence of personal 
and/or social resources, experiencing life 
events increases the likelihood of health 
problems. Alternatively stated, the hypothesis 
postulates that when one encounters life 
events, the absence of personal and/or social 
resource will increase the likelihood of health 
problems. 

The additive burden hypothesis (Model D) 
also considers situation and personal charac- 
teristics but sees these as factors in addition to 
stressful life events in affecting health status. 
Andrews and his co-workers, for example, 


found that stressful life events, poor coping, 
and poor social support explained one-third of 
the variance in psychological distress in 
patients whose life events stress alone could 
account for only one-quarter of the variance 
(Andrews, Tennant, Hewson, and Vaillant 
1978). 

The next hypothesis, the chronic burden 
hypothesis (Model E), posits that it is stable 
personal dispositions and social conditions 
rather than transient stressful life events 
which bring about adverse health changes 
(Dohrenwend and Dohrenwend 1981, p. 22). 
Gersten, Langner, Eisenberg, and Simcha- 
Fagan (1977), Link (1978), and Pearlin et al. 
(1981), for example, have used this model in 
their research. 

The event proneness hypothesis (Model F) 
infers a reversal of the causal relationship 
between life events and health problems in 
that adverse health change is considered as 
the primary cause of subsequent experiences 
of stressful life events which, in turn, 
exacerbate such health changes. Fontana, 
Marcus, Noel, and Rakusin (1972) found 
support for this hypothesis in his study with 
hospitalized psychiatric patients and a control 
group of nonpatients. 

The Dohrenwends proposed that these 
hypotheses should be examined together for 
their relative merits. Later, Golden and 
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Dohrenwend (1981) outlined the analytic 
requirements for testing these causal hypoth- 
eses. 

To date, no study has simultaneously tested 
all six hypotheses. While the results of 
separate tests have generally been consistent, 
they cannot be used to refute alternative and 
sometimes competing hypotheses. In the case 
of social factors, for example, much contro- 
versy and debate have been generated from 
piecemeal evidence supporting either direct 
effects of life events and social support, or the 
mediating effect of social support, or the 
interaction between life events and social 
support. The tendency has been to conceive 
of them as distinct hypotheses. The summa- 
rizing formulations proposed by the Dohren- 
wends, while not exhaustive, laid the concep- 
tual frameworks for systematic examination 
of the effects of social, psychological, and 
physiological factors in the life stress process. 
Some of the hypotheses are indeed competi- 
tive. For example, the presence of the 
victimization hypothesis necessarily. negates 
the presence of the chronic burden hypothe- 
sis. Other hypotheses depict processes that 
may be distinctive but not conflicting with 
one another. For example, the presence of the 
victimization hypothesis does not necessarily 
negate the possible presence of the vulnerabil- 
ity hypothesis and the additive burden hypoth- 
esis. 

We have recently presented results pertain- 
ing to many of these variables and hypothe- 
ses, in a study on social support, life events, 
and depressive symptoms (Lin, Dean, and 
Ensel 1986, especially chapter 10). However, 
not all variables were present, and physical 
well-being measures were not used as the 
dependent variables. 

The lack of research in this area is not 
` accidental. The design requirements for such 
a systematic examination are stringent and 
difficult to meet. The present paper represents 
a preliminary effort at providing such an 
examination. We begin with a paradigm of 
the life stress process, expanding upon the 
Dohrenwends' work. 


The Proposed Paradigm 


Several improvements relative to the Dohren- 
wends' models are needed to fully realize this 
conceptual integration. In Dohrenwends' for- 
mulations each hypothesis was formulated to 
assess the relative contribution and manner of 
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contribution of life events in certain social 
and psychological contexts. A more compre- 
hensive model should give equal conceptual 
attention to every major element in the life 
stress process. For example, not considered in 
their formulations is the hypothesis that life 
events may serve as a mediating or buffering 
factor between social and psychological 
factors and well-being. Another hypothesis 
may postulate that social and/or psychological 
resources may mediate or buffer the impact of 
social or psychological stress on well-being. 

In giving equal attention to all elements in 
the model, we find it necessary to begin with 
a revised paradigm at the conceptual level. As 
discussed earlier, we have identified the 
social, psychological, and physiological envi- 
ronments as major contributing factors in the 
life stress process. We have further identified 
the enhancing (rescurces) and detrimental - 
(stressing) forces in each environment. A 
comprehensive paradigm of the life stress 
process, therefore, should incorporate these 
concepts. 

A paradigm specifying the variations in the 
relationships among the social, psychologi- 
cal, and physiological concepts is presented 
in Figure 2. 

In this paradigm, the three environments 
and their respective factors are identified as 
the exogenous concepts impinging on well- 
being outcomes. Their effects on the out- 
comes can be specified as (1) direct effects, 
(2) mediating effects, and (3) interacting or 
buffering effects. Direct effects are self- 
explanatory. For example, social resources 
will have a direct impact on well-being, even 
when other factors are taken into account. 

A mediating effect of a factor on well- 
being is said to have taken place when its 
presence reduces the direct impact of another 
exogenous factor on the outcome concept. A 
typical example of the mediating effect would 
be social resources' ability to reduce the 
direct effect social stressors exert on the 
outcome. 

' A buffering or interacting effect is said to 
have taken place when the joint presence of 
two exogenous factors impacts the well- 
being. A typical empirical model of this 
nature would be that, only when one 
encounters social stressors (i.e., life events), 
the absence of social resources (i.e., the lack 
of social support) does have a detrimental 
effect on the outcome concept (i.e., physical 
symptoms). 


LIFE STRESS AND HEALTH 
ENVIRONMENT 









PSYCHOLOGICAL 











PHYSIOLOGICAL 





Fig. 2. The Life Stress Paradigm 


The proposed paradigm identifies the major 
components in the life stress process. How- 
ever, in order to specify the causal relations 
among the major components, as usually 
stated in mediating and buffering hypotheses, 
it is necessary to further delineate the model 
in a temporal-causal perspective. One such 
conceptual model is presented in Figure 3, 
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where the model is specified for a three-wave 
time frame. 
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The conceptual model will now allow us to 
identify specific hypotheses proposed for 
testing. To illustrate this use, we will now 
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return to the original hypotheses in the 
Dohrenwends’ formulations and identify them 
in the conceptual model. 

The victimization hypothesis (A in Figure 
1) can be seen as positing a direct relationship 
from social stressors to the health outcome. 
This hypothesized relationship is represented 
by either the path P, or the path P» in Figure 
3. The stress-strain hypothesis (B in Figure 1) 
can be expressed as the mediating effect of 
psychological stress on the direct relationship 
' between social stressors and the health 
outcome, as represented by paths P4 and P4. 

The vulnerability hypothesis (C in Figure 
1) can be seen as the effects of both social and 
psychological resources interacting with so- 
cial stressors, as expressed by paths Ps and Pg 
respectively. The additive burden hypothesis 
(D in Figure 1) can be expressed as the 
independent direct effects of social stressors 
and social resources, along with psychologi- 
cal resources, as represented by the paths P 
(or Pa), P; (or Po) and Pg (or Pio). 

The chronic burden hypothesis (E in Figure 
1) argues for the direct effects of social and 
psychological resources on the health out- 
come. These are represented by the paths P7 
and Pg, or the paths Po and Pio. Finally, the 
event proneness hypothesis (E in Figure 1) 
posits the mediating effect of social stressors 
on the direct relationship between the physi- 
cal environment and health outcome, repre- 
sented by paths Pı; and P2. 

Thus, by specifying causal paths among the 
concepts in the proposed paradigm, we have 
demonstrated that it is possible to examine 
simultaneously all the hypotheses stated in the 
Dohrenwends' models. Further, the concep- 
tual model reveals other causal relationships 
not specified in the Dohrenwends' models but 
discussed in the literature. 

For example, the mediating effect of social 
resources (social support) on the relationship 
between social stressors (life events) and 
health outcome can be represented by the 
paths P,» and P;. The coping hypothesis, that 
psychological resources (coping) may medi- 
ate the direct effect of social stressors (life 
, events) on health outcome, can be represented 
by Pi, and Pg. The buffering hypothesis 
concerning social resources (support) and 
social stressors is reflected in the path Ps. 

There are many other potential hypotheses 
that can be revealed and formulated in view 
of the conceptual model and its operationali- 
zations. For example, the model allows a 
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hypothesis concerning the mediating and/or 
buffering roles of social/psychological re- 
sources relative to psychological stress. While 
this hypothesis has not surfaced in the 
literature, it is theoretically plausible. Spe- 
cific hypotheses, of course, must be guided 
by theoretical considerations. Nonetheless, it 
is clear the conceptual model as proposed 
here allows precise specification of hypothe- 
ses, causal sequencing of concepts, and 
simultaneous empirical examination among 
competing and complementary hypotheses. 

In this paradigm, therefore, the conven- 
tional stress model (life events-social support 
or coping-psychological distress) becomes 
one in a family of models. The conventional 
Stress model represents one particular model 
that emphasizes the social stressor condition 
and the mediation or buffering of social or 
psychological factors for a psychological 
dimension of well-being. In the general 
models, the initial stressing condition can 
come from the social, psychological, or 
physiological environment. The mediating or 
buffering forces can be plausibly explored in 
these environments, and well-being calls 
attention to the multidimensional nature of the 
ultimate endogenous variables to be ex- 
plained. 


Design Requirements 


To examine such a model, the research design 
must have the following characteristics: 

1. A panel design. Causal hypotheses 
require specification of temporal as well as 
conceptual sequence among the variables. For 
example, three waves of data are required to 
examine the causal chains posited in the 
mediating and buffering hypotheses concern- 
ing social resources and psychological re- 
sources in the model (Figure 3). 

2. Measures of all key variables. Since the 
hypotheses involve forces in the social, 
psychological, and physiological environ- 
ments, measures of these concepts must be 
simultaneously present to empirically exam- 
ine their relationships. 

3. A representative community sample. 
Most studies have used specialized popula- 
tions or patient groups, leaving the generali- 
zability of the findings in great doubt. To 
understand the life stress process, proper 
sampling of community populations is vital. 

4. Simultaneous testing. In the past, the 
hypotheses have been advanced either in 
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theoretical discussions or in empirical studies 
examining these hypotheses individually. 
There is insufficient evidence to demonstrate 
the relative merits among the hypotheses. The 
efficacy in confirming a hypothesis may be 
substantially improved when it is examined 
along with other hypotheses. 

The remainder of the paper provides an 
empirical examination of the proposed model, 
with a specific set of empirical measures. 
Each exogenous factor is operationalized with 
one variable. For social resources, we use 
social support as the variable; for social 
stressors, undesirable life events; for psycho- 
logical resources, self-esteem; and for psycho- 
logical stress, a depressive symptom scale. 
The physiological stress(or)s are lifetime 


diagnosed illnesses. However, no adequate . 


measures of physiological resources (i.e., 
dieting, exercise, and health practices) are 
available in the study. These exogenous 
variables will then be assessed, simulta- 
neous]y and temporally, for their additive and 
multiplicative effects on health outcomes. 

As far as we know, there has not been a 
study that fulfills all the requirements of such 
a design. The present study is an attempt to 
examine these hypotheses in design condi- 
tions that meet the requirements specified 
above. 


METHODS 
Nature of the Data 


The data in the current paper come from a 
three-wave health study conducted from 1979 
to 1982 in a SMSA area in upstate New York. 
In 1979, a multistage probability sample of 
1,091 individuals was drawn from census 
housing unit data. A comparison of the 
characteristics of the sample with those of the 
entire tricounty SMSA area indicated that the 
sample was representative of the total popula- 
tion of the area. The second wave was 
conducted one year later in 1980, with a 
response rate of 80 percent. Of those 
respondents not reinterviewed (V=220), 10 
percent were not locatable, 9 percent refused 
to be reinterviewed, and .5 percent died. The 
third wave of data was collected in the 
summer of 1982 with a response rate of 62 
percent (N — 677). More than 95 percent of 
those interviewed in T3 were also interviewed 
in T1 and T2, giving us three waves of data 
on 639 respondents. The data in this paper 
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comes from the three-wave data set.! The 
selection of variables from various waves was 
guided by the temporal sequence of the 
variables specified in the Conceptual Model 
(Figure 3). 


Measurement Model 


The measurement model derives from the 
conceptual model (Figure 3) and incorporates 
empirical measures of the various operational 
definitions of the concepts. The measures in 
this paper are grouped into five types, 
reflecting the following concepts and their 
interactions: (1) the health outcome, (2) the 
resources and stressors in the social en- 
vironment, (3) the resources and stress in 
the psychological environment, (4) the 
physiological (stress) environment, and (5) 
the interactions of the various forces in the 
social, psychological, and physiological 
environments.? The actual measures exam- 
ined in the study include: the independent 
variables (undesirable life events, social 
support, self-esteem, depressive symptoms, 
and prior state of physical health) as measured 
at Tl and T2; and the dependent variable, 
physical symptoms, measured at T3. Also to 
be measured are the interactions among the 
independent variables at T1 and T2. 
Physical Symptoms (PS3). There are two 
ways to measure physical health or illness: (1) 
number of diagnosed physical illnesses that 
have occurred over a period of time; (2) 
assessment of magnitude of physical symp- 
toms exhibited at a particular time. The 
former presents an interpretational problem, 
because visits to physicians may also reflect 
certain demographic and social/psychological 
influences other than actual physical condi- 
tions (Cohen 1980). Therefore, we decided to 
use the physical symptoms to measure 


l An attrition analysis shows that there was a 
greater tendency to lose young and unemployed 
respondents in the second and third waves. This is 
consistent with the trend in most community panel 
studies (e.g., the New Haven panel study, Meyers 
et al, 1984; the Los Angeles panel study, 
Aneshensel and Frerichs 1982). However, a 
comparison between the missing observations and 
panel observations shows no significant differences 
in the key study variables measured at T1. 

? [n the present study, we do not have measures 
of physical resources, such as diets, exercise, and 
health practices. 


390 


physical effects. While they may not directly 
correspond to physical diagnoses, they do 
reflect actual physical functioning of the 
individuals. 

This measure consists of the 40-item 
Cornell Medical Index of physical symptoms 
(Brodman, Erdmann, Lorge, and Wolfe 
1953). Respondents were asked to report the 
frequency of each symptom during the last 
month prior to the T3 interview. Each 
symptom was scored on a four-point scale: 
often, sometimes, hardly ever, and never. 
Therefore, the higher the total score, the 
lower the level of physical symptoms. 

Undesirable Life Events (ULE) is based on 
the schedule of Recent Events (SRE) devel- 
oped by Holmes and Rahe (1967) and 
includes additional and modified items from 
Myers, Lindenthal, and Pepper (1975), Doh- 
: renwend and Dohrenwend (1978), and Rahe 
(1975). The entire scale consists of 118 
events and taps the six-month period prior to 
the T1 and T2 interviews. We examined three 
ways to measure life events: (1) weighted 
versus simple summation of occurrences, (2) 
with or without health-related items, and (3) 
all items versus undesirable items only. 

The outcome of this analysis is that the 
total simple sum of undesirable -events could 
be used as the measure. Neither the weighting 
Scheme nor the exclusion of health-related 
items significantly affected tbe association 
between the life event measure and adverse 
health change. The summing of undesirable 
life events only, on the other hand, signifi- 
cantly increased the association between the 
life event measure and adverse health change. 
However, we finally decided to include only 
items that were not health-related (these 
included serious physical illness, serious 
injury or accident, pregnancy, menopause, 
miscarriage, frequent minor illness, major 
dental work, and mental illness), because 
there has been considerable concern about the 
conceptual overlap between these items and 
measures of health conditions. 

Social Support (SS). Dohrenwend and 
Dohrenwend used the term social situations 
to describe social support provided by family 
and a network of neighbors and acquaintan- 
ces. In our conceptualization (see Lin 19862), 
it is argued that social support be conceived 
as part of social resources which individuals 
access and use for either expressive or 
instrumental purposes. The measure used here 
is a two-item strong-tie support scale. The 


AMERICAN SOCIOLOGICAL REVIEW 


two-item scale is a result of a factor analysis 
of a 26-item instrumental-expressive support 
scale developed by Lin, Dean, and Ensel 
(1979). A complete discussion of the psycho- 
metric properties of the scale can be found in 
Ensel and Woelfel (1986). The scale consists 
of the following items: not having close 
companions and not having enough close 
friends. The measure taps the six-month time 
period prior to the T1 and T2 interviews. 
Responses were made on a four-point scale: 
most or all of the time, occasionally or a 
moderate amount of time, some or a little of 
the time, rarely or none of the time. It is used 
here as a summated scale. The higher the 
score, the higher the level of social support. 
Alternative measures of social support were 
also explored: (1) community-neighborhood 
support (four items), (2) family support 
(Medalie and Goldbourt 1986) scale (four 
items), (3) confidant support (Kaplan 1975) 
(seven items), (4) interaction with the most 
important confidant (one item), and (5) role 
relationship with the most important confidant 
(one item). These scales and items had been 


found to be significantly associated with 


mental health measures, such as the depres- 
sive symptoms (the CES-D) (see Lin et al. 
1986). 

These measures were found to have either 
lower or similar levels of association with the 
physical health measures, as compared to the 
strong-tie support measure. Therefore, it was 
decided that the strong-tie support be used as 
the indicator of social resources. 

Other indicators of social resources were 
also explored: sex, age, marital status, 
presence of children, and employment status. 
With the exception of age, none of the 
variables showed any significant association 
with physical health measures. Age had a 
zero-order correlation of .13 with both new 
occurrences of illnesses and total diagnosed 
illnesses. Since age is commonly used as an 
indicator of life course, the decision was to 
retain strong-tie support as the indicator. 

Self-Esteem (SE). The measure of psycho- 
logical resources used in this study is a 
10-item self-esteem scale developed by Ro- 
senberg (1965). It is consistent with the 
Dohrenwends' conceptualization of personal 
dispositions. This scale has been widely used 
in many current studies (Rosenberg and 
McCullough 1981; Pearlin et al. 1981). 
Responses to the 10 items are on a four-point 
scale: strongly agree, agree, disagree, or 
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strongly disagree. The scale is treated as a 
summated measure. The higher the total 
score, the higher the self-esteem. 

Depressive Symptoms (CESD). Psycholog- 
ical stress is measured here with the 20-item 
Center for Epidemiologic Studies Depression 
(CES-D) scale (Radloff 1975, 1977). The 
scale consists of 20 items and has four 
response categories for each question: most or 
all of the time (five to seven days a week), 
occasionally or a moderate amount of time 


(three to four days a week), some or a little of . 


the time (one to two days a week), and rarely 
or none of the time (less than once a week). 
The scale measures depressive symptoms in 
the week prior to the T2 interview. The 
measure is treated here as a simple summated 
score of the 20 items making up the scale, 
The higher the total score, the greater the 
depressed mood. The CES-D scale has been 
validated and its reliability replicated in our 
own work (Ensel 1986b). 

Prior State of Physical Health (PH). Prior 
health was measured by the total number of 
diagnosed illnesses occurring to a respondent 


in a specified time period prior to the 


interview at Times 1 and 2. A modified 
Cornell Medical Index offered a list of 
illnesses (Abramson 1966). Many of these 
illnesses have implications for functional 
impairment (e.g., asthma, coronary heart 
disease, cancer, vision loss, hearing loss). 
The Time 1 measure specified all the 
diagnosed illnesses a respondent had experi- 
enced in his or her lifetime. The Time 2 
measure specified the diagnosed illnesses a 
respondent had experienced during the six 
months prior to the interview. 

Interactions. Finally, interaction terms were 
constructed among the independent variables. 
These interaction terms consist of all the 
pairings among the independent variables: (1) 
undesirable life events, (2) social support, (3) 
self-esteem, (4) depressive symptoms, and (5) 
physical health. In order to minimize the 
multicolinearity these interactions might cause 
when they are used alongside their constituent 
variables as independent variables in the same 
equations, deviations from constants were 
‘used in constructing the interaction terms. 
The constants were calculated according to 
Smith and Sasaki (1979). Thus, a typical 
form of the interaction term would be 
(variable 1 - constant a)*(variable 2 - constant 
b). 

Interactions were computed between all 


391 


independent variables at Time 2 and between 
each independent variable at Time 1 with all 
other independent variables at Time 2. 

The means, standard deviations, and zero- 
order correlátions for all measures can be 
found in Appendix 1. i 


ANALYSIS AND RESULTS 


The examination of the model and related 
hypotheses will proceed in three phases. In 
the first phase, we examine the direct effects 
of the key variables. In the second phase, we 
add the interaction terms to evaluate potential 
buffering effects. Finally, we estimate the 
mediating effects. In this section, the analytic 
procedures and empirical findings are dis- 
cussed. 


Direct Effects 


The analytic strategy employed to examine 
the measurement model was the stepwise 
hierarchical multiple regression technique. 
We began by constructing estimation equa- 
tions for the dependent variable, physical 
symptoms (PS3), by (1) entering the indepen- 
dent variables at T2, (2) adding the indepen- 
dent variables at T1, and (3) adding the 
interaction terms. 

The first equation regressed PS4 on the 
independent variables at T2 (see Table 1). All 
first independent variables made statistically 
significant contributions, at the .001 level, in 
the explanation of PS3. The R? was .16. 

The second equation regressed PS3 on the 
independent variables at both T2 and T1. We 
first entered all the variables at T2 and then 
added the variables from T1 into the equation 
(see column 2 of Table 1). The coefficients 
from undesirable life effects (ULE,), self- 
esteem (SE,), and depressive symptoms 
(CESD;) remained significant at the .01 level, 
but the coefficients from SS; and PH; did not. 
Among the T1 variables, only PH, was 
significant at the ,01 level, but the R? 
increased to .25. 

Thus, in this initial analysis, when only the 
additive effects of the independent variables 
from two previous time periods (T2 and T1) 
were considered, the magnitude of physical 
symptoms (PS3) was significantly influenced 
by the prior state of physical health along 
with psychological stress (CESD2) and re- 
sources (SE2). Social stressors, ULE2, were 
also significant. However, the enhancing 
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Table 1. Regression Equations and Metric Coefficients 
for Illness Symptoms at T3 
(Standardized coefficients in parentheses) 





"redd ed Equations? 
Variables 1 2 
Time 2 Variables 

CESD2 — .26** —.21** 
(—.13) (-.12) 
PH2* —2.49** —.65 
(-.19) (-.05) 
SS2 1.78"* .87 
(.12) (.06) 
ULE2 —1.10"* —.71** 
(-.12) (—.08) 
SE2 .32** .30** 
(.08) (.08) 
Time 1 Variables 
PHI —2.50** 
(—.32) 
SS1 71 
(.05) 
SEI 17 
(.04) 
ULEI —.18 
—.02 
CESD1 (-.01) 
Intercept 124.30 123.81 
R .16 25 


* See Appendix 1 for variable names. 

^ Hierarchical and stepwise procedures used. 

* PH2: diagnosed illnesses for last six months, 
measured at T2. 

4 PHI: diagnosed illnesses ever, measured at T1. 

** p < 0l. 


force in the social environment, SS2, ceased 
to make a significant direct contribution, 
when other variables had been taken into 
account. 

The finding that social support does not 
exert a direct effect on physical symptomatol- 
ogy is not consistent with the general finding 
regarding its impact on mental health. In any 
event, we proceeded to incorporate the 
interaction terms into the equations. 


Interaction Effects 


There are two sets of interaction terms to be 
considered. One set deals with the interac- 
tions among the variables at T2; and the other 
set represents interactions between T1 and T2 
variables. Conceptually, the T2*T2 interac- 
tions represent contemporaneous interactions, 
the simultaneous presence of two conditions. 
This is difficult to interpret, since it is 
causally difficult to argue which of the two 
conditions is the buffering force. The T1*T2 
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interactions are much more consistent with 
the buffering formulations, since the T2 
variables occurred subsequent to the T1 
variables and are assumed to be the buffering 
forces. For analysis, we entered each set of 
the interactions into the equation separately. 
The procedures and results of these analy- 
ses are presented in Table 2. Equations 1 and 
2 in Table 2 convey the resulting R?'s and R? 
changes from the additive models presented 
in Table 1. As can be seen, the T2 variables 
explained .16 of the variance of current 
physical symptoms at T3 (PS3). The addition 
of T1 variables increased the explained 
variance from .16 to .25, a change of .09. 
In Equations 3a and 3b we show the 
consequences of adding separately each of the 
two sets of the interaction terms. We show 
that the T1*T2 interactions increased the 
explained variance to .35, an increment of .10 
over the R? contributed from the additive 
effects of the T1 and T2 variables (Equation 
2). On the other hand, the T2*T2 interactions 
increased the explained variance to .27, an 
increment of only .02 over the additive 
effects. Thus, these results support the 
conceptual preference for the lagged (T1*T2) 


Table 2. Increments in Variance Explained of Illness 
Symptoms at Time 3 


R? Change 

Variables in from Previous 
Equation* Equation 
1. T2 Variables 

(CESD2, PH2, SS2, ULE2, 

SE2) 16 
2. T2 and T1 Variables 

(CESD2, PH2, SS2, ULE2, 

SE2, CESDI, PHI, SSI, 

ULEI, SEI) .25 .09 
3a. T2, T1 Variables and 

TI*T2 Interactions? ! .35 .10 
3b. T2, T2 Variables and 

T2*T2 Interactions? 27 .02 


4. Reduced T2 and T1 Variables* 
(CESD2, PH2, ULE2, SE2, 
PH1) .24 
5. Reduced T2, T1 Variables 
and T1*T2 Interactions* 
(CESD2, PH2, SE2, PH1, 
CESD1*PH2, CESDI*SS2, 
ULEI*SS2, CESDI*SE2) 31 .07 


* Sec Appendix 1 for variable names. 

> Interactions between all T1 variables paired with all 
T2 variables, measured in deviations. 

© Interactions among all T2 variables, measured in 
deviations. i 

* All variables in equation are statistically significant 
at the .05 level. 
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interactions over the contemporaneous (T2*T2) 
interactions. From this point, we will focus 
only on the contributions of the lagged 
interactions. 

Since not all independent variables made 
significant contributions, we constructed an 
abbreviated equation only for the significant 
variables: Equation 4 which incorporates for 
T1 and T2, CESD2, PH2, ULE,, SE», and 
PH, (see Table 2, column 2).3 The R? is 
almost identical to that of the full equation 
(Equation 2) where all variables were entered, 
suggesting that this abbreviated equation 
recovers almost all the explanatory power of 
the T1 and T2 variables. Equation 5, then, 
adds the significant T1*T2 interactions from 
Equation 3a (CESD,*PH2, CESD,*SS2, 
ULE;*SS,, and CESD,*SE,). The resulting 
R?, .31, represents an increment of .07 over 
the additive equation (Equation 4). It also 
recovers most of the contributions from all the 
T1*T2 interactions in Equation 3a (30 terms 
in all). 

The coefficients, along with the F values 
for the final equation (Equation 5), appear in 
Table 3. 


Mediating Effects 


Finally, we examine potential mediating 
effects. A mediating effect is said to exist if a 
variable at T1 has a direct effect on a T2 
variable which, in turn, has a direct effect on 
the T3 variable, PS3. Recall, from Table 1 
(column 2), that only three T2 variables 
exerted direct effects on PS4: ULE?, SE; and 
CESD,. Thus, an equation was constructed 
for each of these variables and all TI 
variables were entered as. independent vari- 
ables. All autoregression (from a T1 variable 
to its corresponding variable at T2) coeffi- 
cients were significant, as expected. None of 
the other coefficients for the SE; equation 
was significant. Two variables affected ULE, 
and two affected CESD;. For ULE), there 
were significant direct effects from PH, and 


? Usually, the constitutive terms along with the 
interaction terms should be incorporated in an 
equation in order to estimate the main and 
interactive effects. Since the interaction terms here 
have been constructed as products of deviation 
(from the mean) terms, they are independent of the 
constitutive terms. Thus, when the constitutive 
terms are insignificant, they can be eliminated 
from the equation. 
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Iliness Symptoms at Time 3 (SYM3) 








Metric 

Predetermined Coefficient Standardized 

Variables (Standard Error) Coefficient F 

CESD2 —.18 —.10 6.33 
(.07) 

PH2 —]1.12 —.08 4.56 
(.53) 

SE2 .68 17 18.43 
(.16) 

PHI —2.45 —.31 65.16 
(.30) 

CESDI*PH2 .13 .09 6.27 
(.04) 

CESD1*SS2 —.24 — 22 33.35 
(.04) 

ULEI*SS2 1.42 20 32.25 
(25) 

CESDI*SE2 .06 21 26.15 
(.01) 

Intercept 126.76 

R? .31 





CESD,. For CESD;, significant direct effects 
from ULE, and SE, were observed. (See 
Appendix 2 for the regression equations 
involving Time 2 variables on Time 1 
variables, from which these mediating effects 
were identified.) 


DISCUSSION 


The significance of these findings can now be 
recast in the issues raised in the proposed 
conceptual model. The autoregressive effects 
of physical health conditions, as indicated by 
the prior state of health measures, reflect the 
extent to which current physical symptoms 
are affected by prior physical conditions. 
These effects do not enter into the causal life 
stress process per se, even though their 
presence needs to be accounted for when the 
causal effects can be meaningfully discussed. 

In Table 3, the strongest contributing 
effects are the buffering (interaction) rather 
than direct effects. Four interaction terms 
(CESD,*ILL,, CESD,*SS2, ULE,*SS2, and 
CESD,*SE,) all show stronger effects than 
the direct psychological effects, (i.e., SE, and 
CESD)). 

Further, the two strongest buffering terms 
(SS, and CESD,, SS; and ULE)) reflect the 
buffering role of social resources relative to 
both prior social and psychological stresses. 
That is, in the presence of social stressors or 
psychological stress, the lack of social 
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resources exerts a strong impact on health 
outcomes. In contrast,. psychological re- 
sources (SE?) only buffer psychological stress 
(CESD)). 

Thus, the life stress process leading to 
physical symptoms is more significantly 
conditioned by the general buffering capabil- 
ity of social resources relative to stresses that 
may be present in both the social and the 
psychological environments. Psychological 
resources, on the other hand, play a more 
restricted buffering role, only for stress in the 
psychological environment. 

The significance of the stress-buffering role 
played by social resources does not negate the 
importance of psychological factors in the life 
stress process. In addition to the psychologi- 
cal resources' ability to buffer psychological 
stress, both psychological stress and resources 
also show direct effects on physical symp- 
toms. In fact, we found that psychological 
stress (CESD;) mediates the prior states of 
social stressors (ULE;) as well as psycholog- 
ical resources (SE,). (The standard partial 
regression coefficients of CESD; on ULE, 
and SE; are .13 and —.12 respectively.) 

Nonetheless, it is quite clear that social 
resources play a dominating role in buffering 
social-psychological stress in the life stress 
process when physical health is the outcome 
variable of concern. 

Placing the present work in the larger 
context of research of the life stress process, 
we note that much of prior research has 
focused on life stress as it relates to mental or 
psychological well-being and the direct versus 
buffering role of social and psychological 
resources relative to life events stressors. The 
present finding that social resources serve as a 
buffering force for physical health fills an 
obvious gap in the literature, as previous 
research on social resources has focused 
primarily on mental health as the outcome 
measure. 

Further, we have found that social and 
psychological resources buffer both social and 
psychological stress(or)es. In previous concep- 
tual work, including our own, the attention 
has focused only on the buffering role of such 
resources on social stressors. Very little 
theoretical development or empirical evidence 
has been provided regarding the potential 
buffering role of resources relative to psycho- 
logical stress. This may have been an 
unintended outcome of the initial concern 
with social stressors (particularly life events) 
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and the further explication of the basic 
stress-distress model. For the last two dec- 
ades, the stress-distress model has almost 
exclusively been conceptualized as a process 
initiated with social stressors measured by life 
events and life experiences. Consequently, 
we have ignored a stress-distress model where 
other types of stress (i.e., psychological, 
physiological) are considered. By casting the 
life stress process in a general conceptual 
framework (e.g., Figure 2) where both 
stresses and resources in the social and 
psychological environment are taken into 
account, we have provided a perspective that 
allows a systematic theoretical development 
of the direct, indirect, and interactive roles 
played by a variety of factors important in the 
social, psychological, and physiological envi- 
ronment as they impinge on physical and 
mental well-being.^4 

Of special interest is the fact that social 
resources do not in and of themselves exert 
significant effects on physical symptoms. 
However, under the condition where either 
social or psychological stress is present, the 
absence of social resources will exacerbate its 
negative impact on physical symptoms. 

The present study suggests that the life 
stress process is a complex one where social, 
psychological, and physiological environ- 
ments interact with one another in affecting 


* One possible validity issue may be raised 
concerning the findings— namely, whether the 
findings about the buffering patterns are uniquely ' 
associated with the particular data set used. To 
examine this possibility, we conducted analyses 
parallel to those presented in Table 1—3, using 
CESD; as the dependent variable. If the findings 
were idiosyncratic because of the data, we would 
expect to find interaction effects for the mental 
health dependent variable as well. Instead, we 
found that social support exerted direct as well as 
mediating effects (relative to life events). There 
was no buffering effect (SS; * ULE;). Thus, these 
results are consistent with past literature findings 
regarding the mediating role of social support for 
mental health dependent variables. They also show 
that the data set provides consistent differential 
results. However, we noted that for mental health, 
social support buffers the effects of psychological 
stress (SS; * CESD)), psychological resources 
(SS; * SE), and physical conditions ((SS2 & ILL). 
These effects have not been conceptualized or 
systematically examined. Future research can now 
conceptually address these various patterns un- 
veiled in the proposed paradigm (Figure 1). ` 
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well-being. The specific patterns of relation- 
ships among the forces (i.e., resources and 
stressors) are contingent upon the specific 
contexts of well-being (psychological or 
physical) being examined. Most noteworthy 
are the differential roles played by social as 
well as psychological resources in various 
contexts. To understand how individuals 
socially, psychologically, and physiologically 
experience their lives and cope with such 
experiences requires an understanding and 
appreciation of how social and psychological 
resources are brought to bear in the specific 
context in which individuals find themselves. 
In the case of physical health, the presence of 
social and psychological stresses will have an 
inflated effect unless social resources are 
present to buffer the detrimental effect. 

These conclusions are contingent on the 
validity and reliability of the design and 
measurement. The measures used in this 
study are limited to the data available. Some 
measures may not be the ideal indicators of 
the variables called for in the hypotheses. In 
addition, we did not incorporate measures of 
physiological resources (i.e., dieting, exer- 
cise, and health practices). There is increas- 
ing interest regarding the potential intervening 
effect of physiological resources, such as 
smoking cessation and weight control, in the 
life stress process (Gottlieb 1988). We hope 
that other studies using varied and additional 
Measures can be examined for further verifi- 
cation of the hypotheses addressed here. 

In spite of these constraints which, in most 
likelihood, should have weakened the magni- 
tude of the estimated effects, we are encour- 
aged by the viability and meaningfulness of 
the results. They demonstrate the intimate 
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additive and joint roles of social and psycho- 
logical stresses, social and psychological 
resources, and physiological stress in the life 
stress process. Future improvements concern- 
ing the issues raised above should further 
clarify the intricate roles played by social 
factors in the life stress process. 

The logical next step in research would be 
an empirical investigation of the conceptual 
model in the context of both physical and 
mental well-being with the incorporation of 
more comprehensive measures for the various 
forces in the environments. We are currently 
engaged in such a program of investigation. 
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APPENDIX 1 ; 
Means, Standard Deviation, and Zero-Order Correlation Among Major Model Variables‘ 
Variables 
SS] ULEI SE! CESDI PHI  SS2 ULE2 SE2 CESD2 PH2 PS3 Mean Std Dev 
SS1 1.00 —.28** .,13** —.45** —.16** .42** —.12* .]lI* —.20** —.12*  .19** 7.53 1.07 
ULEI  —.28** 1.00 —.08 .34**  .20** — 2]**  40** —.09 .27**  .16** —.19** 1.06 1.71 
SEI .13** — 08 1.00 —.26** —.07 .233  —.09 47** — .22** —.03 .16** 32.19 4.04 
CESD1 —.45** .34** — .26** 1.00 .23** —,.39**  22** —.]9**  .44**  .]5*9 —.24** 811 9.20 
PHI —.16** 20** —.07 .23** 1.00 -—.16** .21** —.04 .17**  43** —.40** 1.98 2.11 
SS2 .42** —.21**  .23** —,39** — 16** 1.00  —.24**  29** —.45** —.06 .24** 7.45 1.14 
ULE2  —.12*  .40** —.09 .22** .21** —24** 1.00 —.17** .39** .17** —24** 1.08 1.77 
SE2 .11* —.09 .47** —.]9** — 04 .29** —.17** 1.00 —.37** —.06 .20** 32.47 44 
CESD2 —.20** .27** —.22**  44**  .]7** — A5**  .39** —,37** 1.00 .18** —.30** 8.38 9.32 
PH2 —.12*  .16** —.03 .15**  .43** —.06 .17** — .06 .18** 1.00 -—.24**  .67 124 
PS3 .19** —.]9**  .]6** — 24** —.40**  24** —.24**  20** —.30** —.24** 1.00 143.08 16.61 
!1SS1  -Social Support at TI ULE? --Undesirable Life Events at T2 
ULE] -Undesirable Life Events at T1 SEQ  =Self Esteem at T2 
SEl =Self Esteem at T1 CESD2 = Depressive Symptoms at T2 
CESD1 = Depressive Symptoms at T1 PH2 =Prior Physical Health at T2 
PH1  -Prior Physical Health at T1 PS3 =Current Physical Symptoms at T3 
SS2 = Social Support at T2 
* = significant 001. — ** =significant .01. 
APPENDIX 2 
Regression of Additive Lagged Models 
Pred ined Equation for? 
Variables" ULE2 SS2 SE2 CESD2 PH2 
ULEI .36** —.04 —.03 13** .06* 
(Life Events, T1) (.37) (-.03) (-.08) (.72) (.05) 
SS1 04 .30** .01 .03 —.03 
(Social Support, T1) (.06) (.32) (.05) (.24) (— .03) 
SEI —.03 l4" .45** — .d2** .01 
(Self-Esteem, T1) (—.01) (.04) (.48) (— .28) (.00) 
CESD1 .08** - .20** — .06* 068" .02 
(Depressed Mood, T1) (.01) (— .02) (—.03) (.37) (.00) 
PHI .12** — 05 .01 .06* Alt 
(Illnesses Ever, T1) (.10) (-.02) (.03) (.26) (.24) 
R? 18 25 23 23 20 
Intercept (.39) (4.09) (16.95) (11.29) (.24) 
* Variables measured at T2. 


> Entries are standardized partial regression coeficients; 


unstandardized coefficients are in parentheses. Probability 


of a null relationship, for approximate sample size of 630 and two tails, at or less than .05 level, is designated with an 


* and at or less than .O1 is designated with **. 
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Critics of labeling theory vigorously dispute Scheff's (1966) provocative 
etiological hypothesis and downplay the importance of factors such as stigma and 
stereotyping. We propose a modified labeling perspective which claims that even if 
labeling does not directly produce mental disorder, it can lead to negative 
outcomes. Our approach asserts that socialization leads individuals to develop a 
set of beliefs about how most people treat mental patients. When individuals enter 
treatment, these beliefs take on new meaning. The more patients believe that they 
will be devalued and discriminated against, the more they feel threatened by 
interacting with others. They may keep their treatment a secret, try to educate 
others about their situation, or withdraw from social contacts that they perceive as 
potentially rejecting. Such strategies can lead to negative consequences for social 
support networks, jobs, and self-esteem. We test this modified labeling perspective 
using samples of patients and untreated community residents, and find that both 
believe that "most people" will reject mental patients. Additionally, patients 
endorse strategies of secrecy, withdrawal, and education to cope with the threat 
they perceive. Finally, patients' social support networks are affected by the extent 
to which they fear rejection and by the coping responses they adopt to deal with 


their stigmatized status. 


In recent years, labeling theory propositions 
that directly link the emergence of mental 
illness to societal reaction (Scheff 1966) have 
received sustained and severe criticism (e.g., 
Gove 1970, 1980, 1982; Lehman, Joy, 
Kreisman, and Simmens 1976; Weinstein 
1983). More than simply refuting the extreme 
claim that "labeling causes most career 
deviance," critics also downplay the salience 
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Horwitz, Janice Husted, Mary Clare Lennon, 
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reviewers for comments. This research was 
supported by NIMH Grants MH13043, MH38773, 
and MH36208 and by the Rehabilitation Research 
and Training Center for Psychiatrically Disabled 
Individuals Grant Number G0085 C3504. 
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of social factors such as stigma and stereotyp- 
ing and assert that for the “vast majority of 
mental patients stigma appears to be transi- 
tory and does not appear to pose a severe 
problem" (Gove 1982, p. 280). We argue that 
such an assessment is overpessimistic and 
misguided in suggesting that stigma is 
unimportant. Our argument draws on Scheff’s 
(1966) labeling theory but qualifies and 
extends it to arrive at a “modified labeling 
approach.” In this paper we derive predic- 
tions from this approach and test them 
empirically. Specifically, we examine whether 
Stigmatization affects the social support net- 
works of patients officially labeled by contact 
with a mental health clinic or hospital. 

To begin, we note that critics rely on 
several types of data to dismiss the stigma of 
mental disorder as unimportant. Often they 
point to findings which seem to show that the 
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public does not harbor negative feelings 
toward the mentally ill. For example, Cro- 
cetti, Spiro, and Siassi (1971) found that a 
sample of automobile workers typically ex- 
pressed a willingness to work on the same job 
with, rent a room to, or even to fall in love 
with a former mental patient. Other critics 
argue that even if negative attitudes toward 
patients exist, they do not result in rejecting 
behavior. As noted by Huffine and Clausen 
(1979, p. 1057) and by Gove and Fain (1973, 
pp. 496-97), present and former patients 
rarely are able to report concrete instances of 
rejection. Similarly, Weinstein’s (1979, 1983) 
literature reviews prompted him to conclude 
that many patients do not feel stigmatized and 
view favorably their treatment by mental 
health professionals. Further, a growing body 
of research concludes that any rejection 
patients experience is far more likely to occur 
because of their deviant behavior than be- 
cause of the label “mental patient.” A 
seemingly convincing approach along these 
lines involves experimental manipulation of 
“labeling” and “behavior,” and shows that 
behavior determines peoples’ reactions more 
strongly than does labeling (for a review see 
Link, Cullen, Frank, and Wozniak 1987). 

Yet recent studies challenge these conclu- 
sions. First, with respect to the idea that public 
attitudes are benign, Link and Cullen (1983) 
used a vignette experiment to show that al- 
though respondents know that the “ideal” re- 
sponse to the mentally ill is one of acceptance, 
their perception of how “most people" re- 
spond is far less positive. Thus it may be that 
studies based on accepting responses to straight- 
forward social distance items, like Crocetti et 
al.'s (1971), measure socially desirable atti- 
tudes —the ideal response —but overlook more 
latent and unfavorable views. 

Second, studies that compare a wide 
variety of stigmatized conditions provide little 
evidence of benign attitudes toward the 
mentally ill. Albrecht, Walker, and Levy 
(1982; see also Tringo 1970) show that 
"mental illness" is one of the most highly 
rejected status conditions, clustering with 
drug addiction, prostitution, ex-convict sta- 
tus, and alcoholism rather than with cancer, 
diabetes, and heart disease.! 





! Gove (1980, p. 57) reports a study by Sim- 
monds (1969, p. 53) that yields different results. In 
this study ex-mental patients were rejected less than 
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Third, the notion that labeled persons are 
rejected only insofar as they behave inappro- 
priately has been rebutted both in experimen- 
tal work (Link et al. 1987) and in nonexperi- 
mental work (Link 1982, 1987; Palamara, 
Cullen, and Gersten 1986). Link et al.'s 
(1987) study is particularly relevant because it 
shows how tests of labeling need to be 
conducted in the context of a more fully 
elaborated model of labeling processes. They 
presented data illustrating that previous exper- 
imental studies may have reached erroneous 
conclusions on the salience of labeling and 
may have overestimated the importance of 
behavior, because the studies' designs failed 
to explore the meaning of a “mental patient" 
label for respondents. Thus Link et al. found 
that among community respondents who 
perceived the mentally ill as dangerous, the 
label of "former mental patient" elicited 
strong expressions of social distance. 

To summarize, we believe that it is 
premature to dismiss labeling and stigma as 
unimportant in the lives of mental patients. 
On close examination we find less than 
convincing much of the evidence that the 
critics of labeling theory offer to support their 
position. Though such studies rightly question 
some of the strongest claims of labeling 
theory, typically they do not address the more 
complex ways in which labeling and stigma 
may be important in the lives of psychiatric 
patients. Building on this observation, we 
offer a model for understanding how the 
stigmatization of the mental patient status can 
have harmful consequences. Since this frame- 
work extends Scheff's (1966) path-breaking 
theorizing, we refer to our model as a 
“modified labeling approach.” 


A MODIFIED LABELING APPROACH 
Statement of the Approach 


Scheff’s model. As a prelude to our approach, 
we specify central elements of Scheff’s 
labeling model (1966, 1984). Because we 


persons displaying 11 types of deviant behavior. 


- Yet by comparing ex-patients with people whose 


deviant behavior persists— atheists, gamblers, beat- 
niks, alcoholics, and adulterers—the study con- 
founds two factors: The deviant statuses and current 
incumbency in those statuses. Would ex-mental pa- 
tients by rejected less than former atheists, gam- 
blers, beatniks, or alcoholics? 
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focus on the consequences of labeling rather 
than on factors that lead to it (see Cullen and 
Cullen 1978; Thoits 1985), we do not 
consider Scheff’s entire approach but summa- 
rize his main points on the consequences of 
labeling (see Figure 1A). Once labeled, an 
individual is subjected to uniform responses 
from others. Behavior crystallizes in confor- 
mity to these expectations and is stabilized by 
a system of rewards and punishments that 
constrain the labeled individual to the role of 
a “mentally ill person.” When the individual 
internalizes this role, incorporating it as a 
central identity, the process is complete and 
chronic mental illness is the consequence 
(Scheff 1966, especially p. 82). 

Step 1: Beliefs about devaluation and 
discrimination. A modified labeling perspec- 
tive (Figure 1b), like Scheff’s (1966), relies 
heavily on the idea that individuals internalize 
societal conceptions of what it means to be 
labeled mentally ill. As Mead (1934) ob- 
served, during socialization individuals learn 
the attitude of the community toward many 
behaviors, objects, and attributes, and inter- 
nalize these in the form of what he called the 










Societal conceptions 
of the Mentally Ill 
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“generalized other.” Following Mead we 
note that the attitude of the community toward 
the mentally ill can be formed by a variety of 
mechanisms and thereby can function as part 
of the generalized other. Scheff (1966), for 
example, emphasizes how jokes, cartoons, 
and the media’s reporting of mental patient 
status can influence views of what it means to 
be mentally ill, Drawing on sources like 
these, all members of society—those who 
will become psychiatric patients as well as 
those who will not—form conceptions of 
what it means to acquire that status. 

These conceptions include two important 
components: the extent to which people 
believe that mental patients will be devalued 
and the extent to which they believe that 
patients will be discriminated against. “Deval- 
uation” comes from Cumming and Cum- 
ming’s (1965) notion of stigma as “loss of 
status” and from Goffman’s (1963) ideas 
about the “discrediting” nature of stigmatized 
statuses. “Discrimination” is suggested by 
the extensive “social distance” tradition as it 
applies to mental patients (Link et al. 1987). 
Our interest in devaluation-discrimination is 





Fig. 1. Diagramatic Representation of Scheff’s Labeling Model and the Modified Labeling Approach 
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in the extent to which individuals believe that 
“most people” (the community at large) will 
devalue and discriminate against a mental 
patient. Following Scheff (1966), we expect 
that community residents believe that most 


people view the status of mental patient: 


negatively. However, since numerous factors 
are likely to produce such beliefs, people will 
almost certainly vary in this regard. 

_ An important implication of this reasoning is 
that patients’ expectations of rejection are an 
outcome of socialization and the cultural con- 
text rather than a pathological state associated 
with their psychiatric condition (see Crocetti et 
al. 1974, p. 130, for an example of the latter 
view). Thus, patients and nonpatients should 
share the belief that most people devalue and 
discriminate against mental patients. 

Step 2: Official labeling through treatment 
contact. An official label is important because 
it brings personal relevance to a labeled 
person’s views about the attitude of the 
community toward mental patients (beliefs 
about devaluation-discrimination). A seem- 
ingly innocuous array of beliefs becomes 
applicable to oneself; it now matters whether 
one believes that people will devalue and 
discriminate against a person who is in 
treatment for mental illness. 

Step 3: Patients’ responses to their stigma- 
tizing status. We consider three possible re- 
sponses to labeling and argue that a tendency to 
endorse them indicates that patients see stigma- 
tization by others as a threat. In the first, se- 
crecy, patients may choose to conceal their treat- 
ment history from employers, relatives, or 
potential lovers to avoid rejection (Goffman, 
1963). Second is withdrawal, or limiting social 
interaction to those who know about and tend 
to accept one’s stigmatized condition. Goffman 
(1963) notes that this group tends to consist of 
persons he calls the “own” (those similarly stig- 
matized) and the “wise” (those who know about 
and accept the stigma). When patients adopt 
this response, they are protected from the re- 
jection that might ensue if they ventured out to 
seek friends, jobs, and the like in the wider 
social environment. Third is the attempt at ed- 
ucating others (“preventive telling”) in hopes 
of enlightening them so as to ward off negative 
attitudes (Schneider and Conrad 1980).? Edu- 





? The nature of the relationships between these 
adaptive strategies and devaluation-discrimination 
(Step 1) may be complex because patients can 
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cation, with its emphasis on changing others’ 
views, does not connote passive acceptance of 
others’ attitudes toward a label as secrecy and 
withdrawal might. Still, their need to educate 
suggests that patients consider stigmatization 
likely, a possibility that critics of labeling the- 
ory deny. Moreover, as Schneider and Conrad 
(1980) observe, educating implies disclosure and 
this risks direct discrimination. 

Step 4: Consequences of the stigma process 
on patients’ lives. Negative outcomes may 
arise directly from one’s beliefs about com- 
munity attitudes toward the status of mental 
patient (see Step 1), or they may follow from 
attempts to protect oneself by withdrawing 
(see Step 3). If people believe that others will 
discriminate against them or devalue them 
because of a status they possess, powerful and 
unfortunate consequences can ensue. They 
may feel shame (Scheff 1984) or believe that 
they are set off from others and thus are very 
different. In addition, beliefs about others’ 
views of a stigmatized status have been 
shown to negatively affect social interaction 
(Farina, Gliha, Boudreau, Allen, and Sher- 
man 1971) and self-esteem (Link 1987). 

The responses of secrecy, withdrawal, and 
education may also produce negative conse- 
quences. While adoption of these responses 
may protect patients from some negative 
aspects of labeling, they also may limit their 
life chances. For example, withdrawal may 
lead to more constricted social networks and 
fewer attempts at seeking more satisfying, 
higher-paying jobs. If present, this effect is 
consistent with the classical labeling theory 
idea of secondary deviance with its emphasis , 
on “defense, attack, or adaptation” to factors 
brought on by labeling (Lemert 1967, p. 17). 

Step 5: Vulnerability to future disorder. If 
the processes outlined in Steps 1 through 4 
operate, many patients will lack self-esteem, 
social network ties, and employment as a 
consequence of their own and others’ reac- 


adopt a mixture of responses. A patient who 
chooses withdrawal or secrecy, for example, may 
cope simultaneously by denying that mental 
patients face discrimination. Similarly, a patient 
may endorse the strategy of education even though 
he or she feels that most people are accepting of 
patients and that education is necessary for the 
minority who are likely to be rejecting. Therefore, 
we do not predict that beliefs about devaluation- 
discrimination will be related to these coping 
strategies in one-to-one, linear fashion. 
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tions to labeling. These deficits are regarded 
as major social and psychological risk factors 
for the development of psychopathology 
(Dohrenwend and Dohrenwend 1981; Turner 
1981). Thus, for some patients, labeling and 
stigma may induce a state of vulnerability that 
increases their likelihood of experiencing 
repeated episodes of disorder. 

Comparisons to Scheff’s model. The differ- 
ences between our approach and Scheff’s are 
a matter of emphasis. First, although we 
agree with Scheff that people will perceive 
community attitudes toward mental illness as 
strongly negative (Step 1), we emphasize 
more than Scheff the importance of variability 
in these beliefs. Some people believe that 
mental patients are vilified while others 
believe that society’s conception is far more 
temperate. Second, although Scheff observes 
that labeled persons have internalized the 
same cultural views as the public, he tends to 
emphasize the responses of others. In con- 
trast, we highlight the labeled individual’s 
response on the basis of his or her beliefs 
about how others will react.unless he or she 
does something to avoid their reactions. 
Finally, our approach does not assign to 
labeling the power to create mental illness 
directly. Instead, we view labeling and stigma 
as possible causes of negative outcomes that 
may place mental patients at risk for the 
recurrence or prolongation of disorders that 
resulted from other causes. 


Assessing the Modified Labeling Approach 


One previous study provided a partial assess- 
ment of the modified labeling approach. In 
keeping with Step 1, Link’s study (1987) 
showed that current patients, former patients, 
and community residents tended to share the 
belief that mental patients will be devalued 
and discriminated against by most people. 
Moreover, in keeping with Steps 2 and 4, 
official labeling made these beliefs personally 
relevant. Specifically, Link found that even 
when diagnosis and relevant demographic 
variables were constant, the more strongly the 
labeled cases feared rejection the more likely 
they were to (1) feel demoralized, (2) earn 
less income, and (3) be unemployed. This 
suggested one reason why untreated cases of 
disorder tended to function better than 
patients; they were unaffected by labeling- 
activated expectations of rejection. The present 
study extends Link’s work in two major 
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ways. First, it adds an analysis about the 
coping orientations outlines in Step 3 and 
examines their impact. Second, it extends the 
test of modified labeling to another set of 
outcome variables by studying. the social 
support networks of labeled and unlabeled 
respondents. i 

Unfortunately our cross-sectional study 
cannot test the fifth and final step of the 
approach concerning stabilization in a “career” 
of mental disorder, and therefore cannot 
provide definitive evidence on this provoca- 
tive labeling hypothesis. Even so, the find- 
ings relevant to the first four steps speak to 
the plausibility of the full five-step model. 
Inconsistent evidence would lead us to reject 
the modified labeling model and would 
indicate that a test of Step 5 would be 
unnecessary, but consistent evidence would 
heighten our interest in such a test. Moreover, 
the supporters and the detractors of labeling 
theory disagree sufficiently about the facts at 
issue in Steps 1 through 4 to make empirical 
tests of them informative. 

We test our hypotheses by comparing data 
on five groups: (1) psychiatric patients 
experiencing their first treatment contact, (2) 
current psychiatric patients with repeated 
treatment contacts, (3) community residents 
who report having been in treatment but who 
are not currently in treatment, (4) community 
residents who are classified as untreated cases 
on the basis of systematic evaluations of their 
symptomatology, and (5) “well” community 
residents who show no evidence of severe 
pathology and have no history of treatment. 

As Table 1 shows, the five groups vary in 
three major ways. First, they vary in levels of 
psychopathology. Untreated community cases, 
first-treatment contact patients, and repeat- 
treatment contact patients all experience high 
levels of psychopathology. Community resi- 
dents, who were once in treatment, constitute 
a mixed group: some are well, whereas others 
still qualify as cases. The remaining commu- 
nity residents show no evidence of psycho- 
pathology and have never been in treatment. 
Second, untreated cases and “well” respon- 
dents have not been labeled, whereas first- 
treatment contact patients, repeat-treatment 
contact patients, and community residents 
who were in treatment at one time have been 
labeled. Third, each treated group has had a 
different experience with labeling. First- 
treatment contact patients have been exposed 
only recently. As a result, their pretreatment 
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Table 1. Summary of Modified Labeling Theory Predictions in Five Groups Which Differ in Their Experience with 


Labeling and in Levels of Psychopathology 








First- Repeat- Formerly Community 
Treatment Treatment ` Treated Community -Respondents 
Contact Contact Community Untreated without 
Patients Patients Respondents Cases Pathology 

Distingiushing Features 

Cuttently in treatment? yes yes no no no 

Previous treatment? no yes yes no no 

Level of psychopathology? high high mixed high low 
Modified Labeling 
Theory Predictions 

Step 1 

Level of belief that mental High* high* high* high* high* 

patients will be devalued 

and discriminated against 

Step 3 

Patients’ level of endorsement high* high* no data not applicable not applicable 

of protective strategies 

Steps 2 and 4 

Effect of devaluation- no effect the more concern the more concern no effect no effect 

discrimination and coping about stigma, about stigma, 

orientations on social the more within- the more within- 

relationships within the household support household support ` 

household 

Effect of devaluation- 

discrimination and coping no effect the more concern the more concern no effect no effect 

orientations on social about stigma, about stigma, 

relationships with relatives the less outside- the less outside- 

and nonrelatives outside the-household the-household 

the household support support 


* By "high" we mean that a majority tend to agree that stigma is a problem and that item and scale means are higher 


than their midpoints. 


support networks cannot have been affected 
by labeling. Thus, we predict no association 
between stigma and network support for this 
group. Repeat-contact patients differ because 
they have had a history of being labeled, so 
their network ties may have been affected by 
stigma. In addition, because repeat-contact 
patients are currently in treatment, they differ 
from community respondents who were in 
treatment at one time but are not in treatment 
now. 

Table 1 summarizes the labeling predic- 
tions that we will test. Most of these are 
derived from Figure 1, but additional theoret- 
ical specification is required to test Step 4 and 
our predictions about social support networks. 
Specifically we asked our respondents whether 
each individual named in the network was a 
relative or a household member. With this 
information, we constructed network support 
measures of three kinds: (1) relatives outside 
the household, (2) nonrelatives outside the 
household, and (3) household members, 
including both relatives and nonrelatives. We 


make different predictions about the effects of 
stigma processes on these three types of 
supporters. 

Underlying our predictions is a theoretical 
perspective on stigmatizing conditions devel- 
oped by Jones, Farina, Hastorf, Markus, 
Miller, and Scott (1984). In their conception, 
the starting point for understanding interac- 
tions between a “marked” person (their term 
for a potentially stigmatizing social designa- 
tion) and an “unmarked” person is each 
individual’s expectations. Initial expectancies 
are influenced strongly by cultural concep- 
tions as conveyed through the press, televi- 
sion, jokes, and the attitudes of important 
others. These expectations are brought to 
initial interactions and then influence them. 
Expectancies then are revised; new expectan- 
cies are brought to the next interaction, 
revised again, and so on. | 

We are interested in the influence of 
patients’ initial, culturally derived expectan- 
cies of others’ responses to them on their 
social support networks. We predict that 
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patients’ fear of devaluation and discrimina- 
tion and their adoption of protective responses 
such as secrecy, withdrawal, and education 
are less relevant to interactions with house- 
hold members than to interactions with 
nonhousehold members. Since patients are in 
direct, frequent contact with fellow household 
members, they are likely to have experienced 
repeatedly the process described by Jones et 
al. Their initial fears of devaluation and 
discrimination by “most people” become less 
relevant in this context. So do two approaches 
for coping with stigma: secrecy with house- 
hold members is not possible and withdrawal 
leaves no place to go. This is not the case, 
however, for potential supporters outside the 
household. The Jones et al. process may not 
have occurred at all and is certainly less likely 
to have been repeated as frequently as for 
household members. Seeking supportive con- 
tact with such persons may mean engaging 
people who harbor the negative attitudes that 
patients fear. Furthermore, if patients do fear 
contact with people outside the household, 
they are more likely to rely on household 
members when supportive exchanges are 
required. With this reasoning, we predict that 
the expectation of devaluation-discrimination 
and reliance on the stigma coping approaches 
will increase the use of household supporters 
and decrease the use of nonhousehold support- 
ers. 
One further consideration is required with 
respect to the prediction concerning nonhouse- 
hold support. Nonrelative supporters can be 
chosen from a large pool of potential 
supporters, whereas relatives cannot. Patients 
can reconstitute their nonrelative support 
networks with people they have "tested" and 
found to be accepting or who are unlikely to 
reject them because they are current or former 
psychiatric patients themselves. Thus, it is 
possible for highly fearful patients to have a 
relatively robust network of nonrelatives. If 
patients withdraw, however, one would 
expect nonrelative support to be sparse 
because such people cope by withdrawing 
from interaction. Support networks consisting 
of relatives cannot be reconstituted in the 
same way as for nonrelatives because their 
numbers are predetermined and finite. Thus 
the greater the fear of rejection, the less likely 
that patients will have support from nonhouse- 
hold relatives. 
Critics of labeling theory make different 
predictions from those based on our perspec- 
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tive. because they claim that labeling and 
stigma are not a "severe problem" (Gove 
1982) and that former patients "enjoy nearly 
total acceptance in all but the most intimate 
relationships" (Crocetti et al. 1974), critics 
would expect beliefs that mental patients are 
rejected to be low to moderate (Step 1). In 
addition, patients should not feel the need to 
endorse secrecy, education, or withdrawal in 
the nearly stigma-free world that these critics 
envision (Step 3). Finally, there would be no 
reason to believe that official labeling should 
make beliefs about others’ responses to 
mental patients personally relevant (Step 2) in 
such a way as to shape the nature of patients' 
social support networks (Step 4). This study 
was designed to ground the key elements of 
our theory in empirical facts. Since our 
predictions about these facts differ from those 
of critics, our study allows us to evaluate 
these two very different views about the 
importance of labeling and stigma. 


METHOD 


Sample 


Samples of community residents (V = 429) 
and psychiatric patients (V = 164) from the 
Washington Heights section of new York City 
were administered two face-to-face interviews 
between 1980 and 1983 (Dohrenwend, Shrout, 
Link, and Skodol 1985). We recruited the 
community respondents initially to participate 
in a methodological study of symptom scales 
and interviewed them approximately six 
months later for this research. In the original 
community sample, we enumerated house- 
holds and contacted them to learn whether an 
eligible respondent between 19 and 59 years 
of age lived there. We also obtained informa- 
tion about ethnic background to permit us to 
sample roughly equal proportions of blacks, 
Hispanics, and non-Hispanic whites from this 
urban neighborhood in which the majority of 
residents is Hispanic. In 93 percent of the 
households, screening information was pro- 
vided; and 68 percent contained one or more 
potential respondents. Of the 943 eligible 
individuals, 57 percent (541) were inter- 
viewed successfully. 

The original methodological study involved 
random assignment of respondents to one- 
month versus one-year recall of symptoms 
from the Psychiatric Epidemiology Research 
Interview (PERD). Given the methodological 
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focus, we made no intensive efforts to pin 
down hard-to-schedule respondents and to 
convert refusals. When we conceived and 
implemented the substantive study near the 
end of the methodological study and when 
questions of generalizability became more 
compelling, we made special efforts to 
interview a subsample of hard-to-schedule 
subjects. This group compfised 48 of the 541 
subjects in our total sample. 

In addition to generating a sample of 
hard-to-schedule respondents, we attempted 
to insure an adequate reinterview rate in the 
second wave of data collection. As a result, 
79 percent of the initial sample (429) were 
located and reinterviewed.? When compared 
to census data for the same area, this group 
showed no difference in gender and age 
(within the 19-59 range). Differences emerge 
on educational level and ethnicity, but these 
can be attributed largely to our efforts to 
stratify. We purposely undersampled the 
Hispanic majority. Because Hispanics have 
less education than either blacks or non- 
Hispanic whites, our sampling decision con- 
tributed to the underrepresentation of lower- 
educated respondents. Furthermore, it was 
more difficult to-recruit lower-educated respon- 
dents; probably they are underrepresented in 
the community sample for this reason. 
Because of the stratification, we include 
controls on ethnicity in all analysis to follow. 

To further assess the representativeness of 
the community sample, we compared the 


? We examined whether the characteristics of 
those we were unable to reinterview (N — 112) 
differed from those we recontacted successfully 
(N — 429). Our reinterview rate for those with less 
than a high school education was 71 percent, 
compared to 81 percent for those who had 
completed high school and 85 percent for college 
graduates. Our reinterview rate among Hispanics 
was 70 percent, compared to 84 percent among 
whites and 83 percent among blacks. There were 
no significant differences in age, sex, marital 
status, or fathers’ occupational prestige, or on five 
of the six screening scales we employed to 
characterize our respondents' levels of psychopa- 
thology. Respondents whom we reinterviewed 
were slightly more symptomatic on confused 
thinking (point biserial r — .11). Thus, although 
significant differences exist on some variables, the 
magnitude of these differences is either very small, 
as with confused thinking, or else the lowest level 
of response is not devastatingly small, as with less 
educated persons and Hispanics. 
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hard-to-schedule respondents with the rest of 
the sample. We reasoned that those respon- 
dents would be more similar to the larger pool 
of nonrespondents than to the more easily 
obtained individuals, for without our exten- 
sive efforts they would have been nonrespon- 
dents. We found no difference between. them 
and other respondents on age, sex, education, 
or ethnicity. Moreover, in light of our 
emphasis on stigma, we examined whether 
the hard-to-schedule respondents differed in 
perceived devaluation-discrimination (de- ` 
scribed below) and found that they did not. 

The patient sample was selected from out- 
patient clinics and inpatient facilities in the 
same area of New York City. Since our goal 
was to select patients in two diagnostic cate- 
gories—major depression and schizo- 
phrenia/schizophrenia-like psychotic disor- 
ders—all patients thought to belong to one of 
these diagnostic groups were referred to our 
project.^ In addition, we made a considerable 
effort to locate cases in their first episode of 
each of these types of disorder. While it was 
extremely difficult to locate first episode cases, 
we interviewed 164 patients in four groups: 50 
first-episode major depression, 21 first- 
episode psychotic, 48 repeat major depres- 
sion, and 44 repeat psychotic.5 We cannot give 
a completion rate for the patient cases because 
the diagnosis was made only after the patient 
was recruited; thus we have no denominator 
within each diagnostic type. Very few patients 
refused outright to participate. 


The Study Groups 


To form the five groups, we required 
distinctions on psychiatric status and labeling 
exposure. We constructed the group of 


* Slightly less than half of the schizophrenialike 
group had diagnoses that fell within the “schizo- 
phrenia spectrum" (paranoid, schizoaffective, and 
Schizophreniform disorders and atypical psycho- 
Sis). The rest met the full criteria for DSM-III-R 
schizophrenia. 

5 One patient did not fit into any of these four 
groups. In addition, first-episode and first-treat- 
ment contact are different; a person experiencing a 
first episode of schizophrenia or schizophrenialike 
disorder may have been treated earlier for another 
problem. Similarly, a first-treatment contact patient 
may have had several untreated episodes and there- 
fore may be classified as a repeat-episode patient. 
Because this paper focuses on labeling, treatment 
contact is the major variable employed. 
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untreated community respondents with no 
evidence of psychopathology (N = 171) by 
subjecting all respondents to extensive screen- 
ing. First, their scores on six symptom scales 
taken from the PERI interview needed to be 
low enough that they could be classified 
correctly as community respondents in a 
multiple-discriminant analysis (Shrout, Doh- 
renwend, and Levav 1986). In addition, only 
individuals who had never undergone mental 
health treatment were eligible for this group. 
Finally, in the second interview subjects who 
scored high on a scale measuring psychologi- 
cal distress or demoralization were evaluated 
by a modified form of the Diagnostic 
Interview Schedule (DIS) (Robins, Helzer, 
Crougham, and Ratcliff 1981), a standardized 
interview that yields psychiatric diagnoses 
according to DSM-III criteria. No individual 
who was found in this interview to have a 
diagnosable psychiatric disorder is included in 
the community respondents without psycho- 
pathology. 

The untreated community case group can 
be formed in two different ways. One method 
uses multiple-discriminant analysis: A com- 
munity case group is formed by community 
respondents who are “misclassified” in the 
analysis because they appear “just like” 
psychiatric patients in regard to the symptoms 
they express. The second way incorporates 
diagnostic procedures and uses the results 
from the modified DIS to generate DSM-III 
diagnoses. Further, because we wanted to 
identify untreated cases, we used information 
on treatment history to remove community 
cases who had had contact with a mental- 
health professional (psychiatrist, psycholo- 
Bist, or social worker) or who had been 
hospitalized in a mental hospital. We in- 
cluded in the untreated group individuals 
(N = 142) who were identified via the PERI 
discriminant function or via the modified 
DIS. Yet because this group can be defined in 
a variety of ways (1.e., discriminate function, 
DIS, both, or either) we check our results for 
each of these ways of defining this group.$ 


6 For two reasons we chose to highlight results 
for the 142 cases defined by either the PERI 
discriminant function or the DIS. First, this 
approach allowed us to place each respondent into 
one of the five mutually exclusive groups, thereby 
avoiding the complexity that would have resulted if 
we formed new groups such as positive on PERI, 
negative on DIS. In addition, the DIS was 
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The group of recently labeled first- 
treatment contact patients was generated from 
the community (N — 11) and from the patient 
sample (N — 56). Community respondents 
were defined as a first-contact patient if they 
reported a first-treatment contact with a 
psychiatrist, a psychologist, or a social 
worker during the year before the interview. 
To check if patient sample cases were 
experiencing their first-treatment contact, we 
examined both their self-reports and their 
clinic and hospital records. 

The group of repeat-treatment contact 
patients (N = 117) also was generated from 
both the community (N — 9) and the patient 
(N = 108) samples. The small number of 
repeat-contact patients from the community 
consisted of individuals who reported their 
first contact to be more than a year ago and 
also reported current treatment. The group 
from the patient sample consisted of individ- 
uals who we determined had had a previous 
treatment contact. 

All the past-treatment contact patients (N 
— 96) were drawn from the community 
sample. They reported past contact with a 
mental-health professional, but were not 
currently in treatment. Because this group of 
"former patients" was derived from the 
community sample, some were less like our 
patient samples and more like the "friends 
and supporters of psychotherapy" described 
by Kadushin (1969). Such persons seek 
treatment for personal growth or training 
within a context that values treatment seeking 
as a positive step in one's development. Thus 
such treatment does not constitute a potent 
labeling experience and could dilute stigma 
effects in this group. 


Stigma Measures 


Devaluation-discrimination. This measure con- 
sists of 12 six-point "strongly agree — 1" to 
"strongly disagree — 6" Likert items de- 


administered to individuals who scored high on 
demoralization, the only PERI scale we gave 
during our second wave of interviewing. Thus, to 
require that untreated cases be positive on both the 
PERI and the DIS, or only on the DIS, would 
restrict this group to untreated cases who are high 
on demoralization. Because many (if not most) 
clinically significant cases are not demoralized 
(Link and Dohrenwend 1980), this requirement 
would lead to an overrestriction. 
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signed to assess the extent to which respon- 
dents believe that most people will devalue or 
discriminate against a person with a history of 
psychiatric treatment (see Table 3 below for 
wording of questions). All items (half are in 
the reverse direction) are scored so that a 
high score indicates a belief that mental 
patients will be devalued and discriminated 
against. The overall measure of devaluation- 
discrimination then is scored by adding the 
individual items and dividing by 12. This 
step allows us to locate the mean of the 
summative scale on the six-point agree- 
disagree scale. Because the midpoint of the 
scale is 3.5 and is located directly between 
the strongly agree—strongly disagree poles, a 
mean above 3.5 indicates that the average 
person tends to endorse an item or a pool of 
items (as with the summative scale). The 
devaluation-discrimination measure shows ad- 
equate internal consistency overall (alpha — 

.76) and among each of the five study groups 
(range .69 untreated cases to .82 former 
patients). 

. Secrecy, withdrawal, and education. 'These 
three multiple-item measures were written to 
tap coping orientations that mental patients 
might use to deal with stigmatization (see 
Table 4 for item content). Thus they are 
appropriate only for individuals who have 
been officially labeled by treatment contact. 
Ideally we would have asked these questions 
of all officially labeled persons, whether we 
obtained them from treatment settings or from 
the community. We decided, however, not to 
ask them of community respondents. We did 
not approach them in their role as patients and 
thought it would be awkward to focus on 
material that asked them to place themselves 
in this role. As a result our analyses of these 
measures are based only on the patient 
sample. 

The items in these scales were answered 
with the same six-point Likert format used for 
the devaluation-discrimination measure. Items 
were recoded so that a high score indicated 
endorsement of a coping orientation (i.e., 
secrecy, withdrawal, or education). Summa- 
tive scales were formed by adding the scores 
on individual items and dividing by the 
number of items in the scale so that each scale 
could vary from 1 to 6 with a midpoint of 3.5. 
The internal consistency reliability (Cron- 
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bach's alpha) of these measures are .71 for 
secrecy, .67 for withdrawal, and .71 for 
education.” 


Measures of Social Network 


We elicited measures of social network with 
Fisher’s (1982) questions. We asked respon- 
dents to name individuals with whom they 
had or could have had supportive exchanges 
during the past year in nine areas of activity 
such as care of children, watching the house, 
discussing personal problems,’ borrowing 
money, and social-recreation activities (see 
Appendix). Note that for first-treatment 
contact patients this period comes before 
official labeling occurred, which is why we 
predict no labeling effects for these measures 
for this group. We focus on two types of 
measures generated from these data: extensive- 
ness and availability of instrumental support 
(Dohrenwend, Shrout, Link, Martin, and 
Skodol 1986). To construct the measures, our 
theory required we use information obtained 
from probes that asked whether each named 
person was a household member or a relative. 
This allowed us to develop network measures 
for household members, nonhousehold nonre- 
latives, and nonhousehold relatives. 
Extensiveness. This variable represents the 
extent to which a network contains individu- 
als who fulfill the supportive "tasks" identi- 
fied in the nine questions. The simplest 
measure determines how many of the nine 
tasks are "covered" by at least one network 
supporter. The measure we employ deviates 
from this for two reasons. First, all respon- 
dents do not "need" all nine forms of 
supportive contact. Questions 2 and 5, which 
ask about work and child care, are not 
applicable to those who are not in the paid 
labor force or who are not parents. Second, 
the presence or absence of a supporter in each 


7 In a confirmatory factor analysis based on the 
full patient sample, a three-factor model with items 
from each domain (secrecy, withdrawal, and 
education) loading on a factor for that domain fit 
the data quite well. The ratio of chi-square to 
degrees of freedom was 1:56. One item (Number 7 
in the withdrawal scale in Table 4) loaded on both 
secrecy and withdrawal when we freed it to do so. 
We included it in the withdrawal scale because it 
loaded higher on this factor and because the 
internal consistency of withdrawal needed strength- 
ening. : 
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area does not take into account the added 
benefit of having a second person, a “backup 
supporter,” who might provide support in 
each area. . j 

Therefore, we constructed the measure of 
extensiveness in the following way: the 
number of areas applicable to the respondent 
for which there was no coverage was 
subtracted from nine, the total. To that 
difference we added the weighted value (.5) 
of the number of areas for which backup 
support was available; these areas contained 
two or more supporters: 


Extensiveness = (9 — number of applicable 
areas not covered) + (.5 X number of areas with 
two or more individuals named— backup sup- 
port). 


We subtract the number of noncovered areas 
from nine on the basis of our reasoning about 
“need,” noting that if someone does not need 
support in an area, he or she cannot miss it. 
This approach is preferable to one based on 
the proportion of applicable areas covered, 
because each uncovered area would represent 
more of a “penalty” for a respondent with 
fewer applicable areas than for a respondent 
with the full nine. The weighting of backup 
support by .5 ensures that less importance is 
assigned to it than to the presence or absence 
of at least one network member for each task. 
Thus, the major emphasis on task coverage is 
maintained without ignoring the fact that a 
second person in a task area contributes as 
well to an “extensive” network. In order to 
develop the measures for household mem- 
bers, nonhousehold relatives, and nonhouse- 
hold nonrelatives, we repeated these proce- 
dures but restricted attention to the relevant 
category of supporter. 

Instrumental support. We focused on 
instances in which one person performed a 
task for another person (see Questions 1, 3, 5, 
Appendix) or could be called on to lend 
money (Question 9). The support is concrete 
and direct; it can be helpful without influenc- 
ing the subject's state of mind, as required in 
Questions 2 and 8 about help in decision 
making. Unlike the extensiveness measure, 
instrumental support is not “need-based.” We 
are not subtracting from the number of areas 
in which instrumental support might be 
needed but not provided. Instead this measure 
is a simple count of the supporters mentioned 
in response to the four questions. Thus, we 
can handle the problem of having one 
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question that is not applicable to all subjects 
(help with child care) by simply controlling 
for the presence or absence of young children 
(under age 14). The household member, 
nonhousehold relative, and nonhousehold 
nonrelative measures are generated simply by 
adding the number of supporters specific to 
each of these categories.? 


Control Variables and Variables That 
Describe the Groups Used in Our Analysis 


In Table 2 we show significant differences 
among the groups on a number of important 
sociodemographic measures. Such differences 
suggest the importance of controlling these 
variables for hypotheses about differences 
between these groups. Recall, however, that 
our key hypotheses involve the effect of 
stigma variables on types of network support 
within these groups. It is important to keep in 
mind the large between-group differences, but 
the adequacy of their control is not as central 
to testing these within-group hypotheses. We 
use many of the variables in Table 2 as 
control variables because often they are 
significantly associated with network vari- 
ables. Therefore, they are potential confound- 
ers of the association between stigma and 
social support networks in our within-group 
analyses. 


RESULTS 


With one exception, we organize results by 
presenting evidence concerning each step of 
our modified labeling theory in turn. Evi- 
dence for Step 2 is presented after Steps 1, 3, 
and 4 because Step 2 is the critical juncture at 
which the fate of labeled and unlabeled 
persons diverges (see Figure 1). Thus, we 
examine Step 1 first because it involves 
labeled and unlabeled respondents. Then we 
follow labeled persons through Steps 3 and 4 
and finally return to test Step 2 by examining 


8 We could have defined the network variables 
slightly differently (e.g., including Questions 2 or 
8 in instrumental support). However, these deci- 
sions were made earlier, in the context of an 
entirely different theoretical question (see Dohren- 
wend et al. 1986) and are, therefore, agnostic with 
respect to the labeling hypotheses that we address. 
Also, we could have used more network measures 
or even different ones altogether, a matter we 
return to in a subsequent footnote. 
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Table 2. Distribution of Control Variables in Groups Which Differ in Labeling Status, Psychiatric Condition, and 
Experience with Labeling (Maximum N= 593)* (standard deviations in parentheses) 


First- Repeat- 
Treatment Treatment 
Contact - Contact 
Patients Patients 
Age 32.71 36.59 
(in years) (13.31) (11.56) 
Education 12.46 12.38 
(in years) (3.14) (3.67) 
Married 
(percent married) 25% 19% 
Hispanic 27% 20% 
Black 36% 39% 
White and other 37% 4196 
Sex (percent female) 58% 719 
Children under age 14 
(percent with children) 16% 17% 
Work status 
(percent employed) 48% 39% 
Demoralization 2.87 3.14 
(27-item PERI scale) (.73) (.83) 


Untreated Significance of 
Community F or Chi-Square 
Former Untreated Well Test of Group 
Patients Cases Respondents Differences 
37.68 35.77 40.29 Fg 5g7 = 5.53*** 
(11.84) (12.26) (12.47) . 
13.46 11.78 13.28 F4,587 = 5.25"** 
(3.59) (3.52) (3.13) 
49% 44% 56% X5, s88 = 48.69%"* 
1996 4096 24% 
28% 28% 30% X24 s84 25.52"* 
53% 32% 46% 
59% 64% 50% X^, 589 = 14.11** 
26% 37% 31% X24 53872 17.65** 
7995 63% 80% X2, 59g 7167 65*** 
2.48 2.55 1.84 F4,555581.97*9* 
(.69) (.62) (.34) 


* Different variables have different numbers of missing values. The degrees of freedom associated with the statistical 


tests indicate how many are missing for each variable. 
* p«.05. 
** p.01. 
*** p«. 001. 


whether the fate of unlabeled persons is 
different. 


Evidence Concerning Step 1: Beliefs about 
Devaluation and Discrimination in the 
Community and Patient Samples 


Beliefs of unlabeled community respondents. 
Our labeling approach posits widespread 
endorsement of the belief that mental patients 
are devalued and discriminated against. Con- 
sistent with this, for 11 of the 12 items in 
Table 3 the mean is higher than the 3.5 
midpoint of the six-point Likert scale both for 
untreated cases and for untreated well respon- 
dents. This trend across the 12 items is 
reflected in the summative scale of perceived 
devaluation-discrimination: the scale mean 
for both untreated cases and community well 
respondents is significantly higher than the 
3.5 midpoint (p«.001 in both instances). 
Clearly, any assessment of how negative an 
attitude is cannot be resolved by quantitative 
assessments of how much higher a mean is 
than a midpoint. The position of the mean in 
relation to the midpoint can be manipulated 
by item wording. Thus, it is important to 
examine the content of the items. If the 
questions focused on inconsequential forms of 
rejection or on mild slights, a mean higher 


than the midpoint would not be substantively 
significant, but we show in Table 3 that 
respondents tend to believe that a former 
patient will be excluded from close friend- 
ships, jobs, and intimate relationships. 

To highlight this point, we translated the 
community sample's responses into percent- 
ages by grouping together all responses that 
fell into one of the three “agree” categories 
versus all responses falling into one of the 
“disagree” categories. Notably, the results 
show that 75 percent of the community 
sample agreed that employers will discrimi- 
nate against former mental patients; 80 
percent and 66 percent subscribe to similar 
expectations with regard to dating relation- 
ships and close friendships respectively; 71 
percent agreed that former patients will be 
seen as less trustworthy, 62 percent that they 
will be seen as less intelligent, and 70 percent 
that their opinions will be taken less seri- 
ously. Finally, results for the full 12-item 
scale show that only 19 percent of the 
untreated community respondents score be- 
low the 3.5 midpoint and a mere 2 percent 
below 2.5, a score which would indicate a 
strong refutation of the notion that mental 
patients face devaluation-discrimination. In 
contrast, 28 percent of the respondents score 
above 4.5, indicating a strong belief that 
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Table 3. Means (Standard Deviations) and One-Way Analysis of Variance Showing the Extent to Which Community 
Respondents, Former Patients, and Current Patients Believe That Mental Patients Will Be Devalued and 
Discriminated Against (six-point strongly agree = 1 to strongly disagree= 6; “R” indicates reverse scoring.) 











Community 
First- Repeat. Formerly Respondents P-value 
Treatment Treatment Treated Community without One-Way 
Contact Contact Community Untreated Psycho- Analysis 
Patients Patients Respondents Cases pathology of 
(N=61)* (N—-115* (N=95) (N-—142*  (N=169)" Variance 





1, Most people would willingly accept 
a former mental patient as a close 4.11* 3.94* 4.43* 4.23* 4.31* 0.8. 


friend. (1.57) (1.64) (1.44) (1.57) (1.56) 
2. Most people believe that a person 
who has been in a mental hospital 


is just as intelligent as the 3.93 3.66 4.03* 4.05* 3.95* nS. 
average person. (1.82) (1.62) (1.55) (1.51) (1.61) 

3. Most people believe that a former 
mental patient is just as trust- 3.97* 4.09* 4.27* 4.46* 4.19* n.8. 
worthy as the average citizen. (1.56) (1.44) (1.58) (1.42) (1.67) 


4. Most people would accept a fully 
recovered former mental patient 


as a teacher of young children 4.31* 4.40* 4.91* 4.76* 4.76* p «.05 
in a public school. (1.39) (1.38) (1.63) (1.26) (1.50) 

5. Most people feel that entering a 
mental hospital is a sign of 3.62 4.14* 3.83 3.51 3.41 p «.01 
personal failure (R). (1.65) (1.45) (1.80) (1.68) (1.72) 


6. Most people would not hire a former 
mental patient to take care of 


their children, even if he or she 4.38* 4.47* 4.61* 4.44* 4.45* n.8. 
had been well for some time (R). (1.65) (1.52) (1.55) (1.57) (1.69) 

7. Most people think less of a 
person who has.been in a mental 4.13* 4.48* 4.20* 4.07* 4.02* n.s. 
hospital (R). (1.57) (1.32) (1.51) (1.42) (1.50) 

8. Most employers will hire a former 
mental patient if he or she is 3.23 3.57 3.99* 3.32 3.53 p «.05 
qualified for the job. (1.52 (1.61) (1.48) (1.47) (1.50) 


9. Most employers will pass over the 
application of a former mental 


patient in favor or another 4.43* 4.71* 4.65* 4.34* 4.38* 0.8. 
applicant (R). (1.37) (1.30) (1.50) (1.49) (1.69) 

10. Most people in my community would 
treat a former mental patient just 3.53 3.65 3.86* 3.74 3.57 n.5, 
as they would treat anyone. (1.64) (1.68) (1.71) (1.55) (1.65) 


11. Most young women would be reluctant 
to date a man who has been 
hospitalized for a serious mental 4.64* 4.41* 4.44* 4.48* 4.51* n.$. 
disorder (R). (1.50) (1.55) (1.55) (1.40) (1.58) 
12. Once they know a person was in a 
mental hospital, most people will 


take his or her opinions less 4.07* 4.23* 4.17* 4.11* 4.15* n.8. 
seriously (R). (1.54) (1.45) (1.42) (1.37) (1.53) 

13. Devaluation-discrimination (sum 
of items 1 through 12 4.03* 4.14* 4.33* 4.13* 4.08* n.8. 
divided by 12). (.86) (.89) (.92) (.73) (.80) 





* The Ne given are for the number of valid cases for the summed devaluation-discrimination measure. Eleven 
respondents with fewer than eight valid reponses were assigned missing values. For respondents with one to four 
missing values, the valid responses were prorated. Reports for individual items include all valid responses (maximum 
N=584, minimum N= 577). 

* The mean is significantly higher than the midpoint at p<.05. 
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patients can expect devaluation-discrimina- 
tion. Therefore, it is clear that most people 
expect patients to be devalued and discrimi- 
nated against at least in some ways, and that 
more than one-fourth of the respondents 
(those scoring above 4.5) believe that this 
reaction occurs consistently across many 
important aspects of a patient’s life. 

Beliefs of labeled respondents: first- 
contact, repeat-contact, and former patients. 
Our theory specified that patients and former 
patients will share the nonpatient public’s 
views. We show in Table 3 that with only one 
exception, all means for these patient groups 
exceed the 3.5 midpoint, indicating that the 
average patient believes that mental patients 
may face severe devaluation and discrimina- 
tion. The mean of the summative scale is 
significantly higher than the midpoint for all 
three patient groups (p<.001). 

We also show in Table 3 that differences in 
perceptions among the five groups are 
minimal and inconsistent. One-way analysis 
of variance shows that the group means are 
significantly different for only three items and 
that the amounts of variance explained are 
small (a maximum of 3 percent for Item 5). 
Moreover, with the exception of a slight trend 
indicating that former patients expect more 
rejection, there is little to suggest a consistent 
ordering of the groups. 

The nonsignificant findings across the 12 
items are reflected in the nonsignificant result 
for the summative scale. Only 1 percent of the 
variance in the summative scale is explained 
by the groups in the one-way analysis of vari- 
ance. Nor does a significant difference emerge 
when we control for the variables in Table 2 
that were distributed so differently across the 
groups. Furthermore, the absence of a signif- 
icant group effect probably is not due to a lack 
of power. The power to detect a medium ef- 
fect size (6 percent of the variance) at alpha = 
.05 is better than .99 in a sample of this size 
(Cohen 1977). Contrary to the claims of some 
labeling critics, we find that current patients, 
former patients, and nonpatients agree that men- 
tal patients will be rejected by most people. 


Evidence Concerning Step 3: 
Responses to Labeling in First- and 
Repeat-Contact Patients 


According to our approach, one determinant 
of patients’ outcomes is the protective maneu- 
vering they use in response to labeling. 
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Therefore, it is important to determine the 
extent to which coping orientations such as 
secrecy, withdrawal, and education are deemed 
advisable by patients. Moreover, their endorse- 
ment of such approaches is further evidence 
that they feel stigmatized by their patient 
status. Otherwise, why would they report that 
these approaches are necessary? 

Recall that we could ask questions about 
stigma coping orientations only of our patient 
sample (N = 164). Therefore, the results in 
Table 4 are restricted to these respondents. 
Most items measuring secrecy, withdrawal, 
and education are endorsed strongly by 
patients; mean scores generally are above the 
3.5 midpoint of the items. Only three 
exceptions occur. Two of the secrecy items 
(items 2 and 5) and one of the withdrawal 
items (Item 3) have means below the 
midpoint, but even with these questions, over 
40 percent reported that they employed or 
would employ the approach. Moreover, the 
means of summative scales measuring with- 
drawal and education are significantly higher 
than the midpoint (p<.001), whereas the 
means of secrecy is above the midpoint but 
not significantly so. Finally, first- and 
repeat-contact patients do not differ signifi- 
cantly in their endorsement of the coping 
orientations (not shown); this finding suggests 
a widespread belief that such protective 
responses are required.? 

It is also important to note the wording of 
some of the items because these orientations 
to coping may show how negative labeling 
effects emerge. Items 2 and 5 of the 
withdrawal scale, for example, indicate that a 
patient would avoid a person or would decline 
to apply for a job if he or she believed that 
rejection might be the result. Because many 
patients believe that “employers” will discrim- 
inate against them and that "most people" 
will value them less (see Items 7 and 9 of 
Table 3), the strong endorsement of these 
“withdrawal” items suggests the plausibility 
of Step 4 of our model, which specifies that 
efforts designed to cope with stigma may 
have negative consequences for a patient's 
life chances. 


? Nonsignificant trends show that first-treatment 
contact patients are less likely to withdraw or to 
keep their history of treatment a secret and are 
more likely to educate. In short, they are slightly 
more optimistic or more naive. 


414 AMERICAN SOCIOLOGICAL REVIEW 


Table 4. Means and Standard Deviations of Items Measuring Secrecy, Education, and Withdrawal in the Patient 
Sample (Maximum N= 158)* (six-point strongly agree=1 to strongly disagree —6; “R” indicates reverse 





scoring.) 
Percent 
Mean Endorsing 

Item (standard deviation) Strategy 
Secrecy 
1. In order to get a job, a former mental patient will have 

to hide his or her history of hospitalization (R). 4.26* = (1.65) 70.5 
2. There is no reason for a person to hide the fact that he or she 

was mental patient at one time. 3.25 (1.84) 40.1 
3. If you have been treated for a serious mental illness, 

the best thing to do is to keep it a secret (R). 3.85* (1.79) 61.1 
4. If I had a close relative who had been treated for a serious mental 

illness, I would advise him or her not to tell anyone about it (R). 3.70 0.70 58.6 
5. I rarely feel the need to hide the fact that I have been 

in psychiatric treatment. 3.25 (1.68) 41.8 
Education 
1. I’ve found that it’s best to help the people close to me understand 

what psychiatric treatment is like (R). 5.07* (1.28) 91.1 
2. If I thought a friend was uncomfortable with me because I had 

been in psychiatric treatment, I would take it upon myself 

to educate him or her about my treatment (R). 4.55* (1.50) 78.8 
3. If I thought an employer felt uneasy hiring a person who had been 

in psychiatric treatment, I would try to make him or her 
. understand that most ex-patients are good workers (R). 4.35* (1.65) 78.2 
4. After I entered psychiatric treatment, I often found myself educating 

others about what it means to be a psychiatric patient (R). 3.80* (1.77) 62.0 
5. I would participate in an organized effort to teach the public more 

: about psychiatric treatment and the problems of people who 

seek the help of psychiatrists (R). 4.87* (1.47) 85.4 
Withdrawal 
l. It is easier for me to be friendly with people who have 

been psychiatric patients (R). 3.80* (1.65) 61.3 
2. If I thought that someone I knew held negative opinions about 

psychiatric patients, I would try to avoid him or her (R). 3.62 (1.69) 51.6 
3. After being in psychiatric treatment, it's a good idea to keep 

what you are thinking to yourself (R). 3.14 (1.78) 42.3 
4. If I was looking for a job and received an application which asked 

about a history of psychiatric treatment, I wouldn't fill it out (R). 3.87* (1.84) 58.0 
5. If I thought an employer was reluctant to hire a person with a history 

of psychiatric treatment, I wouldn't apply for the job (R). 3.85* (1.84) 57.3 
6. If I believed that a person I knew thought less of me because I had 

been in psychiatric treatment, I would try to avoid him or her (R). 4.15* (1.69) 67.7 
7. When I meet people for the first time, I make a special effort to keep 

the fact that I have been in psychiatric treatment to myself (R). 4.33* (1.76) 71.3 





* Questions in this table were asked only of respondents from tbe patient sample. Thus 19 individuals from the 
community sample who were classified originally as first- or repeat-contact patients are excluded. The minimum 
number of cases on any item was 156. 

* 'The mean is significantly higher than the midpoint at p«.05. 


Evidence Concerning Step 4: 
Repeat-Treatment Contact Patients 


We now test the effects of stigma among 
repeat-treatment contact patients, using two 
measures of network-based social support for 
three types of supporters: relatives outside the 
household, nonrelatives outside the house- 
hold, and household members. Before we 
report on these six measures, we note that 
overall (irrespective of the type of supporter), 
repeat-treatment contact patients have fewer 


instrumental supporters and less extensive 
task coverage, with sociodemographics from 
Table 2 controlled, than do either well 
community respondents (F226 = 9.36; 
p<.001) or untreated cases (F2,234 = 8.45; 
p<.001). The comparison of the repeat- 
treatment contact patients to the untreated 
case group supports the labeling perspective 
because the label seems to be the most salient 
factor distinguishing the two groups. Critics 
of labeling theory, however, are correct in 
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countering that untreated cases are less 
severely impaired, on average, and that 
attempts to control such differences statisti- 
cally are subject to regression artifacts that 
bias toward finding support for labeling (see 
Link 1982). 

- Our analysis goes beyond such group 
comparisons by asking whether stigma vari- 
ables affect specific types of social network 
ties within groups of patients who are 
relatively homogeneous in the nature and 
severity of their disorder. Most of the 
difficulty in interpreting labeling effects arises 
when one attempts to equate labeled and 
unlabeled groups statistically on psychiatric 
condition (Link 1982). 

Multivariate hypothesis tests. The most 
global test of our hypotheses asks whether the 
four stigma measures (devaluation-dis- 
crimination, secrecy, withdrawal, and educa- 
tion), when considered together, have an 
effect on the six social support network 
variables. To test this hypothesis we used 
multivariate multiple regression analysis, a 
procedure that tests the significance of the 
relationships between the -two sets of vari- 
ables using a multivariate F-test. We found a 
highly significant association between the 
four stigma measures and the six support 
network variables (F434 = 2.47, p<.001; 
this and all multivariate tests use Pillai’s 
criterion).!° To determine whether this asso- 
ciation held when potential confounding 
variables were added, we controlled for 
education, age, marital status (married versus 
others), employment status (working versus 
nonworking), and the presence or absence of 
children under 14 years of age in the 
household because these variables showed 
significant associations with the six support 
network measures. We did not control for 
sex, religion, Type A behavior, internal 
versus external locus of control, or social 
desirability because these variables were not 
significantly related to our social network 
variables and as such cannot be potent 
confounders (see Dohrenwend et al. 1986 for 





10 From this point on, all analyses of repeat- 
treatment contact patients involve 103 cases. Nine 
cases drawn from the community are excluded 
because they were not asked all questions about 
secrecy, withdrawal, and education. Five addi- 
tional patients are excluded because they had 
missing values on one or more of the stigma 
variables. 
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the measurement of these variables). We 
enter ethnicity because we stratified on this 
variable and diagnosis because of its impor- 
tance as a control for psychiatric condition. 
When the stigma variables are entered after 
controls, the multivariate hypothesis test 
remains significant (F24 352 = 1.82, p=.011). 
In addition, we tested for possible interactions 
between diagnosis and the four stigma 
variables, but found none. This suggests 
relatively constant effects of stigma across 
major depression and schizophrenia, two very 
different types of mental disorder. We 
conclude that there are relationships between 
the stigma dimensions and the social network 
variables that cannot be explained by a range 
of possible confounding factors. We do not 
yet know, however, which of the six outcome 
measures are affected by the stigma variables 
or whether the effects are in the direction 
predicted in Table 1 (i.e., more stigma 
concern, more household support— more 
stigma concern, less nonhousebold support). 
We address these more refined questions 
by examining the six univariate (single 
dependent variable) multiple-regression equa- 
tions that formed the basis of the omnibus 
multivariate test reported above. In presenting 
these equations, we faced an issue raised by 
the degree of overlap between the stigma 
variables. Although the four stigma variables 
predicted substantial proportions of variance 
as a group, they often failed to show 
significant unique effects when considered 
singly, with the other three held constant. To 
conclude from this that no effects exist 
constitutes what Gordon (1968) called the 
“partialling fallacy.” When significant empir- 
ical overlap exists among variables, much of 
the predictive power can reside in the shared 
variance that is partialled from consideration. 
In our study, for example, secrecy was 
correlated with both devaluation-discrim- 
ination (r=.575) and withdrawal (r= .405), 
such that a full 43 percent of the variance in 
secrecy —more than half its reliable variance 
(alpha — .71)— was accounted for by these 
two variables. To avoid the partialling fallacy 
we chose to retain devaluation-discrimination 
and withdrawal and to drop the other two 
measures. Aside from its centrality in our 
theory, devaluation-discrimination is the one 
measure that was asked of all groups, and 
withdrawal has clear theoretical significance 
for the study of social connectedness. More- 
over, even with the other stigma variables 
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Table 5. Multiple Regression Analyses Showing the Effect of Stigma Dimensions on the Number of Instrumental 
Supporters and the Extensivenesss of Task Coverage among Household Members (repeat-treatment contact 











patients N= 103) 
Number of Extensiveness of 
Instrumental Supporters Task Coverage 
Regression Standardized Regression Standardized 
coefficient coefficient coefficient coefficient 

Control Variables 
Children under 14 at home 

(1=yes, 0— no) .226 .056 — .732 ~ 1 
Currently employed (1 yes, 0 — no) — .095 — .030 ~ 1.510** ~ 278 
Diagnosis (1 = depression, 0 = schizophrenia) 408 134 492 .093 
Marital status (1 = married, 0 = other) —.172 — 043 2.502** .355 
Age (years) —.041** — .334 — .070*** ~~ 321 
Education (years) —.041 — .093 .076 .100 
Black — .678* — .220 —.834 —.155 
Hispanic — .068 —.017 .003 .001 
Stigma Variables 
Perceived devaluation-discrimination .236 .143 732** .255 
Withdrawal .013 .008 —.223 — .080 
Increment to variance explained 

due to stigma variables 1.995 5.8% 
Variance explained by entire equation 18.2% 29.3% 

* p<.05. 
** p«.01. 
*** p« 001. 


partialled, these two variables showed unique 
effects for at least one of the six outcome 
variables, whereas secrecy and education 
never did. 

Effects of stigma measures on household- 
based support networks. According to our 
theory, we should not expect that the stigma 
dimensions will be associated with a reduc- 
tion in the availability of household members 
for socially supportive contacts. In fact, we 
expect a greater reliance on these individuals 
in the presence of a strong fear that most 
people will be rejecting. As shown in Table 
5, insofar as there are any significant 
associations between household support and 
stigma, more stigma concern results in greater 
reliance on household support. Specifically, if 
patients believe that they will be devalued and 
discriminated against, they are more likely to 
have important tasks covered by household 
members. 

Effects of stigma measures on nonhouse- 
hold network members. In Table 6 we show 
significant relationships between social net- 
work ties outside the household and both 
perceived devaluation-discrimination and with- 
drawal when we control for relevant sociode- 
mographic characteristics. The more one fears 
devaluation and discrimination, the less likely 
one is to have instrumental supporters and 


extensive task coverage among nonhousehold 
relatives. Withdrawal is related inversely to 
support from nonrelatives outside the house- 
hold are available to provide instrumental 
support.!! Thus, as predicted, nonhousehold 
network ties are related negatively to stigma 
dimensions.!? 

Yet can these negative effects be explained 
by the severity of the patient's psychopathol- 
ogy? We used six reliable symptom scales 
taken from the Psychiatric Epidemiology 
Research Interview (PERI): confused thinking 
(5 items), false beliefs and perceptions (13 
items), demoralization (27 items), insomnia 
(3 items), suicide ideation (1 item), and 
somatic problems (9 items) (Dohrenwend, 
Shrout, Egri, and Mendelsohn 1980). Of the 


!! The effects of withdrawal and devaluation- 
discrimination shown in Tables 5 and 6 hold each 
other constant, but the pattern of results holds if 
each is entered separately. 

12 Recall the issue of our choice of network 
measures. At the suggestion of an anonymous 
reviewer, we ran the preceding analyses using a 
variable we called confident support (items 2, 7, 8 
of Appendix). Overall the stigma variables were 
significantly associated with this measure in a 
manner generally consistent with our predictions 
about inside and outside the household support. 
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Table 6. Multiple Regression Analyses Showing the Effect of Stigma Dimensions on the Number of Instrumental 
Social Supporters and the Extensiveness of Task Coverage of Relatives and Nonrelatives Living outside the 











Household (repeat-treatment contact patients N= 103) 
Number of Instrumental Supporters Extensiveness of Task Coverage 
Nonhousehold Nonhousehold Nonhousehold Nonhousehold 
relatives nonrelatives relatives nonrelatives 
Regres- Regres- Regres- 
sion Standard- sion Standard- sion Standard- sion Standard- 
coeffi- ized coeffi- ized coeffi- ized coeffi- ized 
cient coefficient cient coefficient cient coefficient cient coefficient 
Control Variables 
Children under 14 

at home 

(1=yes, 0=n0) —.311 — .078 .579 .120 —1.774**  —.286 -.091 —.0l4 
Currently employed 

(1=yes, 0=n0) .192 .062 497 134 — 473 — .099 .416 .081 
Diagnosis 

(1 — depression, 

O-— schizophrenia) .232 .077 .467 .129 .809 .174 .517 .104 
Marital status 

(12 married, 

0 other) 1.023* .256 —.941* | —.195 2.585*** .417 —1.775*  — 264 
Age (years) .001 .001  —.003 — .020 .038* 191  —-.027  -—.134 
Education (years) .018 .042 .114* 222 .003 .005 .106 .149 
Black 114 .037  —.652 —.178 .405 086 —.351 — .069 

i ic 157 041 —.324 — .070 — 139 —.023 -1.064  —.168 
Stigma Variables 
Perceived 

devaluation- 

discrimination —,549**  —,336 .134 .068 — .621* — .246 .062 .023 
Withdrawal — .030 —.019 -—.481**  —.253 .121 .044 —.295 4 —.112 
Increment to variance 

explained due to 

stigma variables 10.7% 5.5% 5.3% 1.1% 

Variance explained 
by entire equation 17.1% 32.4% 26.5% 23.0% 
"* p<.05, 
** p<.0l. 
*** p« 001. 


36 possible associations (six symptom scales 
by six network support measures), in only one 
case (suicide ideation and extensiveness of 
nonhousehold relatives) does a symptom scale 
add significantly to the explained variance of 
a network measure above and beyond con- 
trols. Controlling for this one significant 
effect has no influence on our conclusions 
about the relationship between stigma and 
social network support. 

For the measures that involve relatives, the 
availability of network supporters could 
depend on the number of such persons who 
are living. Therefore, we controlled for the 
number of living relatives and separately for 
the number of living parents, siblings, and 
children. In each case the pattern and 
significance of the relationships between the 
stigma variables and the network measures 
remained unchanged. 


We indicated above that patients can 
reconstitute their networks of nonrelatives 
outside the household by replacing individu- 
als who may reject them with other patients or 
with people they screen carefully for accept- 
ing attitudes. In this way, a patient who 
expects,to be devalued and discriminated 
against can have a social network that is as 
extensive and has as many instrumental 
supporters as that of a patient who fears 
rejection less. This reasoning is consistent 
with the near-zero association between per- 
ceived devaluation-discrimination and the 
network variables for nonrelatives outside the 
household (see Table 6). Social networks of 
nonrelatives outside the household are dimin- 
ished only when a patient endorses strongly 
the coping orientation of withdrawal. 

With respect to networks of nonhousehold 
relatives, patients cannot replace such persons 
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with new kin whom they may perceive as less 
rejecting. As a result, a strong expectation of 
devaluation-discrimination could lead to an 
erosion of supportive kin ties if patients avoid 
nonhousehold relatives who they fear may 
reject them. The significant negative coeffi- 
cients for perceived devaluation-discrim- 
ination in predicting extensiveness of task 
coverage and number of instrumental ties are 
consistent with this possibility. Withdrawal, 
in contrast, is not associated independently 
with network variables involving nonhouse- 
hold relatives. The major reason, we believe, 
is that withdrawal is likely to have its most 
potent effect in the formation of new ties, in 
which the patient may have to be assertive in 
initiating contact. Because contacts with 
relatives involve relationships which typically 
have a considerable history, there is less 
reason to expect that our measure of with- 
drawal should have an effect in this group. 


Evidence Concerning Step 4: Former 
Patients from the Community Sample 


Data on ex-patients in the community provide 
further assessment of Step 4 in our approach. 
Contrary to our expectations, former patients 
are similar to the group with little psychopath- 
ology in number of instrumental supporters 
and in the extensiveness of their task coverage 
(F2,260 = .09; p = .912). However, the 
former patient group may include people 
whom Kadushin (1969) calls "the friends and 
supporters of psychotherapy” and who conse- 
quently have not been exposed to a potent 
labeling experience. When we examined 
those who reported having been hospitalized 
(N = 19)—a substantial labeling experi- 
ence—we found that they had significantly 


13 Evidence from other studies suggests that the 
proportion of “friends and supporters” in our 
sample may be sizable. The recent large-scale 
Epidemiological Catchment Area (ECA) Study, for 
example, found that approximately one-third of 
those currently in mental health treatment had no 
diagnosable disorder (Shapiro et al. 1984). The 
comparable figure in the Midtown Manhattan 
Study was 47.5 percent (Srole et al. 1978, pp. 
208-9), Also, seeking treatment for personal 
growth is more likely among the more highly 
educated; our former patients (X = 13.56 years) 
are more highly educated than either current 
patients (X = 12.41) or untreated community 
respondents (X = 12.68). 
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fewer instrumental supporters and less exten- 
sive task coverage than other former patients 
(F292 = 5.33; p = .007). In fact, their 
support networks closely resemble those of 
repeat-treatment contact patients (F5;;9 = 
.43; p = .60) and differ significantly from 
the group with little evidence of psychopath- 
ology (F2,134 = 3.32; p = .044). 

Our test of the effects of stigma within the 
former patient group is restricted to the 
devaluation-discrimination measure because 
this was the only such measure applied to 
members of the community sample from 
which the former patient group is constructed. 
In this group we find no significant multivar- 
iate effect of devaluation-discrimination on 
the six network measures above and beyond 
the controls. Yet when we enter a cross- 
product term to reflect interaction between 
hospitalization and’ the devaluation-discrimi- 
nation measure, the effect is marginally 
significant (F&7g = 2.23; p = .049). The 
regression coefficients show that the effects 
for hospitalized cases are in the same 
direction and of roughly the same magnitude 
as the significant effects for devaluation- 
discrimination among the repeat-treatment 
contact patients. 

Therefore, the evidence for former hospi- 
talized patients is consistent with the results 
for repeat-treatment contact patients and 
supportive of the modified labeling approach. 
This is not the case, however, for former 
patients who have not undergone the substan- 
tially stigma-inducing experience of hospital- 
ization. There are two possible explanations. 
First, perhaps "friends and supporters of 
psychotherapy" who have not encountered a 
harsh labeling experience dilute the effect of 
stigma in this group. Alternatively, the 
"psychiatric patient" label may become less 
personally relevant for this group over time. 


Evidence Concerning Step 2: The 
Importance of Official Labeling 


According to Step 2, official labeling makes 
patients' beliefs about devaluation-discrimina- 
tion personally relevant and activates coping 
strategies such as secrecy, withdrawal, and 
education. Below we present two tests of Step 
2: one involves first-treatment contact pa- 
tients; the other, nonpatient community respon- 
dents. 

First-treatment contact patients. Recall 
that there was no reason to expect that the 
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networks of first-treatment contact patients 
would be affected by the stigma measures we 
developed. Their exposure to labeling is new 
and thus cannot have affected the network 
questions that were asked about the period 
before they entered treatment. Because we 
found effects among repeat-contact patients, 
we wanted to determine whether there were, 
in fact, no significant effects among first 
treatment contact patients. Therefore, we 
performed the same multivariate tests of 
stigma factors on social network measures 
among the 48 first-treatment contact patients 
from the patient sample.!^ Applying a model 
both with and without the sociodemographic 
controls that we employed in our analysis of 
repeat-treatment contact patients, we found 
no significant multivariate association be- 
tween stigma and network support. More- 
over, when we examined the individual 
regression coefficients of the stigma vari- 
ables, we found no evidence to suggest 
effects similar to those for repeat-contact 
patients.!5 

Nonpatient community respondents. A sec- 
ond test of Step 2, conceptually similar to the 
test with first-treatment contact patients, can 
be achieved by focusing on our measure of 
perceived devaluation-discrimination and by 
directing attention to the association between 
this variable and social network ties among 
two groups: untreated community cases and 
community well respondents. Recall that the 
perceived devaluation-discrimination measure 
can be answered by patients and nonpatients 
alike. For labeled persons these beliefs about 
others' opinions are potentially applicable to 
themselves, whereas for unlabeled persons 
they are an innocuous array of beliefs about 
what others think of mental patients. 


^ Because the full battery of stigma questions 
was asked only of patient sample cases, 11 
first-treatment contact patients from the commu- 
nity sample are excluded from this analysis. Eight 
other patients had missing data on one or more of 
the stigma variables; 48 cases were left. 

15 Analyses for first-treatment patients that are 
identical to those conducted in Tables 5 and 6 are 
available by writing to the first author. These 
analyses show that in 10 out of the 12 instances (4 
out of 4 when the effects were significant in Tables 
5 and 6), the regression coefficients for the 
repeat-contact patients are larger than the coeffi- 
cients for first-treatment contact patients in the 
direction of negative outside household effects an 
positive inside household effects. $ 
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Controlling for sociodemographics, we 
assessed the effect of perceived devaluation- 
discrimination on the six network variables. 
We found no significant overall multivariate 
association in untreated cases, community 
well respondents, or the two groups com- 
bined. Nor did we find a significant effect 
when we used any of the other ways of 
defining untreated cases: positive on PERI 
symptom scales alone, positive on DIS alone, 
or positive on both. This finding of no 
association is particularly noteworthy because 
the power to detect a significant effect is 
substantial: .90 to detect a 5 percent incre- 
ment to explained variance. !6 


. DISCUSSION 


Our results are consistent with the following 
explanation of the effects of labels, based on 
modified labeling theory. In the course of 
being socialized, individuals develop negative 
conceptions of what it means to be a mental 
patient and thus form beliefs about bow others 
will view and then treat someone in that 
status. Typically this array of beliefs is fully 
in place before an individual enters treatment. 
As a result, when patients enter treatment for 
the first time, they are likely to confront the 
effects of stigma immediately because often 
they have internalized a generally negative 
view about what it means to be a mental 
patient. Moreover, they tend to endorse 
coping orientations such as secrecy, with- 
drawal, and education. With time, their 
beliefs about the implications of the label they 
carry and their way of dealing with it shape 
the nature of their social connectedness. 
Those patients who are most concerned with 
stigma are likely to have insular support 
networks consisting of safe and trusted 
persons on whom they rely extensively. At 
the same time, such patients have consider- 
ably less support available from individuals 
outside their immediate household. Our 
results are generally inconsistent with posi- 
tions taken by critics of labeling theory and 
are not easily “explained away” by those 
investigators’ major alternative explanations. 
This is true both for our attitudinal data (Steps 


16 Supplementary tables showing analyses for 
community well respondents and untreated case are 
available by writing to the first author. 
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1 and 3) and our social support network data 
(Step 4). 

With respect to attitudinal data, some 
researchers argue that the attitudes of the 
public (Crocetti et al. 1974) and of patients 
(Weinstein 1983) are too positive to make any 
form of labeling theory believable. These 
investigators studied many, but certainly not 
all, potentially relevant attitudes. Because 
their findings usually were not intended as 
tests of labeling theory, the attitudes mea- 
sured often were not the key theoretical 
concepts required to form an adequate test. In 
contrast, we identified a set of theoretically 
relevant attitudes—beliefs about how most 
people will treat mental patients—that turned 
out to be consistently but not uniformly 
negative. Thus, although our results do not 
directly contradict the attitudinal findings 
marshaled by critics of labeling theory, they 
do lead us to question strenuously the 
conclusion that labeling processes are unim- 
portant. 

In general, the findings concerning social 
networks—behavioral outcomes—support a 
modified labeling approach over an antilabel- 
ing position. The only result that might 
support the critics’ view is the finding that 
stigma variables have no detectable effect on 
the support networks of former patients. This 
finding could indicate that the effect of 
labeling dissipates with time, as Gove’s 
(1982) ideas about the “transitory” nature of 
stigma would suggest. Still, even if this is 
true, this and other research (Link 1987) 
suggests that the short-term consequences can 
be powerful and unfortunate—a possibility 
most critics deny. Moreover, this finding 
might indicate that some of these former 
patients were “friends and supporters of 
psychotherapy” who never experienced a 
potent label to begin with. Consistent with 
this latter view is the finding that former 
hospital patients, who clearly did encounter a 
potent labeling experience, showed evidence 
of enduring stigma effects. The other predic- 
tions of the modified labeling approach were 
supported: significant effects of stigma when 
labeling preceded the outcome variables 
(repeat contact patients), and no effects either 
when labeling followed the period covered by 
the outcome variables (first-treatment contact 
patients) or when no labeling occurred 
(community untreated cases, community well 
respondents). 

Moreover, these results cannot be ex- 
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plained away by invoking the leading alterna- 
tive explanation offered by critics of labeling. 
According to critics, any untoward effects 
experienced by mental patients are more 
likely to be caused by psychopathology than 
by labeling. This plausible alternative, how- 
ever, cannot explain our findings concerning 
social support networks. Our modified label- 
ing perspective specifies variables that should 
have effects within groups of patients with 
severe disorders (major depression and schizo- 
phrenia). Behavior is held relatively constant 
in this way, and yet the variables specified as 
important by a modified labeling perspective 
remain important. Moreover, we used. six 
PERI symptom scales to control for variabil- 
ity on severity within these patient groups, 
and still the stigma variables remained 
significant predictors of social support net- 
works. Finally, we showed that the stigma 
variables had consistent effects across the two 
diagnostic groups (schizophrenia and major 
depression). This result suggests that even 
when psychopathology varies, the stigma 
variables have relatively constant effects. 
These findings make it difficult to explain the 
effects of the stigma variables by asserting 
that they are caused by psychopathology. 

It is also important to consider the 
robustness of our findings in relation to 
alternative explanations that do not emerge 
from positions developed by critics of label- 
ing theory. In particular it is possible to 
explain our network findings by positing 
reverse causation. People fear rejection and 
endorse withdrawal because they have net- 
works with few nonhousehold supporters. Yet 
our findings concerning first-treatment con- 
tact patients lead us to question this possibil- 
ity. The questions about support networks 
concerned a period before these patients were 
labeled officially by treatment contact; thus 
we predicted no association between stigma 
and network support in this group. If network 
support affects stigma (reverse causation), 
however, we should find an effect among 
these patients. The fact that we do not is 
consistent with the modified labeling perspec- 
tive and inconsistent with reverse causation. 
Still, this test is based on a relatively small 
sample of first-treatment contact patients (N 
= 48); a stronger test of reverse causation 
could be achieved if longitudinal data on 
first-treatment contact patients were avail- 
able. 

Our study strongly challenges the notion 
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that labeling and stigma are inconsequential 
in the lives of psychiatric patients. Even so, 
many facets of the labeling process remain 
unexplored and untested. Most important, we 
know little about whether variations in the 
circumstances under which official labels are 
applied make a difference. Does it matter how 
labelers present the meaning of the new 
status? Does the patient’s response to the 
labeling experience vary according to the 
location and the social atmosphere of the 
treatment setting? Are patients’ reactions to 
the labeling experience influenced by the 
application of a specific diagnosis and by 
their own beliefs about what different diag- 
noses connote? These more refined questions 
and many others are left unexplored by our. 
inquiry and need to be addressed in future 
work. Note, however, that in the severe 
antilabeling atmosphere that prevailed in the 
1970s and early 1980s, it would have made 
little sense to pursue questions such as these. 
Why study such elaborations of labeling 
theory when critics could assert forcefully 
that according to the available evidence, 
labels hardly mattered? By confronting this 
evaluation of the labeling approach and by 
showing that labeling and stigma are associ- 
ated with important outcome variables, our 
study can contribute to a revitalization of 
efforts to address more refined questions 
about labeling. 

Perhaps the most important work that can 
be motivated by this and related studies is a 
test of the still-unexplored fifth step of the 
modified labeling perspective. If labeling and 
stigma are connected to outcome variables, 
such as self-esteem, employment status (Link 
1987), and support networks (this paper), it 
will become more and more plausible to 
conceive of these factors as playing a role in 
producing a chronic course for some people; 
environmentally oriented investigators point 
to these variables as major risk factors for the 
onset of episodes of mental disorder. There- 
fore, it is possible that labeling and stigma 
can leave patients and former patients vulner- 
able to the likelihood of experiencing another 
episode of disorder. A modified labeling 
position offers the possibility of deepening 
our understanding of chronic mental disorder 
as a process within an influential social 
context. 
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APPENDIX 
Questions Used to Derive Social Network Measures 


1. Who has or would have taken care of your 
apartment (house) if you were out of town in 
the last year? Anyone else? 

2. Who did you talk to about decisions at work in 
the last year? Anyone else? 

3. In the past year, who has helped you with 
household tasks? Anyone else? 

4. In the past year, who has spent time with you in 
social activities such as having dinner together, 
going to the movies, hanging out, or talking on 
the telephone? Anyone else? 

5. Who have you asked to help you look after your 
children if you could not in the past year? 
Anyone else? 

6. In the past year, have you had a fiancee, a 
mate, or a steady boyfriend/girlfriend? Anyone 
else?* 

7. In the past year, with whom have you discussed 
personal worries? For example, worries about 
family, job or yourself? Anyone else? 

8. In the past year, whose advice have you 
considered in making important decisions? 
Anyone else? 

9. In the past year, from whom could you have 
borrowed a large sum of money? Anyone else? 
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Studies of group discussions reveal that 


INTERRUPTIONS IN GROUP DISCUSSIONS: 
THE EFFECTS OF GENDER AND GROUP COMPOSITION* 


LYNN SmiTH-LOVIN 
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CHARLES BRODY 
Tulane University 


Conversations both reflect and maintain social inequalities. They import 
hierarchical structures from larger society and help perpetuate them by creating 
inequalities in the ability to accomplish interactional goals. In this study of speaker 
transitions in six-person, task-oriented experimental groups, we explore the 
well-known finding that men interrupt women more frequently than women 
interrupt men. We ask three questions about the structure of interruptions. Who 
attempts to interrupt whom and under what conditions? How does the affective 
character of interruptions vary across speakers and groups? What determines 
whether an interruption succeeds? We find that gender inequality in these 
task-oriented discussions is created by a mixture of attempts to use power and of 
differential success. In their interruptions, men discriminate by sex in attempts and 
in yielding to interruptions by others. Women interrupt and yield the floor to males 
and females equally. The sex composition of the group affects interruption patterns 
in complex ways. Men interrupt men with supportive comments in all-male groups, 
but these supportive interruptions drop as the number of women in the group 
increases. Supportive interruptions also succeed in gaining the floor more often in 
single-sex groups. Taken together, the results suggest a mixture of status and 
conflict models and reaffirm the importance of group composition in interaction. 


High-status actors talk more, 


are more 


status! has many effects on participation. 
High-status people are asked their opinions 
more often, talk more, receive more positive 
comments, are chosen as leader more fre- 
quently, are more likely to influence the 
groups’ decisions, and in general dominate 
the conversation (see review in Ridgeway 
1983, pp. 160—204). Status differentiation not 
only creates conversational dominance, but 
often legitimates it (Ridgeway and Berger 
1986). Studies of conversations in dyads or 
families also frequently find status effects. 


* 'The authors thank Paul Benson, Ron Breiger, 
Fred Koenig, Bruce Mayhew, James McRae, 
Miller McPherson, Victor Nee, Patrick Nolan, 
Mark Plunkett, Cecilia Ridgeway, Joseph Sheley, 
John Skvoretz, David Weakliem, and two anony- 
mous reviewers for helpful comments on the 
paper. They also thank Charlotte G. Hudson for 
allowing them to use the data, which were 
collected for another purpose in her master's thesis 
research. Brody's work was supported partially by 
NIMH grant ROI-MH40666. The order of the 
authors' names was determined by a coin toss. 

! Following Ridgeway (1983, p. 160), we 
define a member's status as "the degree of 
deference, esteem and power to influence others" 
he or she acquires. 
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successful at introducing topics, interrupt 
more, and receive more positive feedback 
from their listeners. 

Conversation is an important domain for 
studying these status effects. Language use 
can tell us a great deal about the nature and 
extent of social inequality. It reflects the 
hierarchical social structure outside the group, 
while simultaneously providing the means 
through which that inequality is maintained. 
Differences by status in participation, conver- 
sational dominance, and adherence to turn- 
taking norms create differences in our ability 
to get our ideas across to others. In a family 
this may mean control over decision making; 
in a work setting it may determine perfor- 
mance and promotion. 

In this study, we are interested in the status 
differences between males and females. There 
is extensive evidence that sex operates as a 
status characteristic in small groups research 
(e.g., Strodbeck, James, and Hawkins 1957), 
in the expectations states paradigm (e.g., 
Wagner, Ford, and Ford 1986; Lockheed 
1985), and in studies of conversation (e.g., 
Kollock, Blumstein, and Schwartz 1985; 
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West and Zimmerman 1977).2 In group 
discussions, men talk more, more often 
assume a leadership position, receive more 
positive statements and fewer negative state- 
ments, are more likely to show nonverbal task 
and dominance cues, and so on. 

The aspect of discussion in which we are 
interested is interruption.? An interruption 
occurs when one speaker disrupts the turn of 
another with a new utterance. Normally, 
conversation is organized so that one speaker 
talks at a time; speakers alternate in turns to 
prevent the conservation from becoming a 
monologue (Sacks, Schegloff, and Jefferson 
1974). Speakers indicate when their turn is 
coming to an end with a variety of nonverbal 
and tonal cues. Typically, turn taking in 
conversations is achieved without substantial 
gaps or overlaps in speech (Dabbs, Ruback, 
and Evans 1985). As Kollock et al. (1985) 
note, a speaker's turn is his or her opportunity 
to accomplish interactional goals or to block 
others from accomplishing theirs. To interrupt 
a speaker —that is, to begin talking before the 
speaker's turn is finished and perhaps prevent 
him or her from completing the thought—is to 
prevent, at least temporarily, that speaker 
from accomplishing these interactional goals. 

Since interruptions represent a clear viola- 
tion of turn-taking norms that give one 
conversant greater access to others' attention, 
we are not surprised that their occurrence is 
linked to dominance, power, and status (see, 
e.g., Drass 1986; Eakins and Eakins 1978; 
Zimmerman and West 1975; West 1984). 
Earlier studies have found that men interrupt 
women, adults interrupt children, doctors 
interrupt patients (except when the doctor is a 
“lady”), the more powerful spouse interrupts 
the less powerful one, and those with 
masculine identities interrupt those with more 
feminine self-images. These violations of 
turn-taking norms clearly allow the powerful, 


? Some studies raise questions about the univer- 
sality of gender as a status characteristic in our 
culture. However, earlier analyses (Smith-Lovin et 
al. 1986) of University of South Carolina student 
data indicated that sex is a status characteristic in 
this population. 

3 We focus on only half of the conversational 


process, ignoring the important “listener” role and ` 


the interactional work done by the nonspeaker. In 


our six-person discussion groups, speaking acts 
could be coded much more reliably than the subtler 
listener responses. 
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high-status speakers more access to important 
interpersonal resources. (the “floor”) at the 
expense of their lower-status partners. They 
may also serve to disorganize the speech and 
ideas of the interrupted (West 1979). As 
Kollock et al. (1985, p. 40) argued, attempt- 
ing to interrupt is an excellent indicator of 
attempted conversational control; successful 
interruptions are a sensitive measure of actual 
dominance. Power is a human accomplish- 
ment, situated in everyday interaction; inter- 
ruptions are one of the mechanisms that 
accomplish power in discussions. 

Unfortunately, there are numerous prob- 
lems in studying interruptions. First, they are 
relatively rare events. Most conversation is 
very well coordinated. In their classic study 
of naturally occurring conversations, Zimmer- 
man and West (1975) found only 7 interrup- 
tions in 20 same-sex two-party conversations 
and 48 interruptions in 11 cross-sex conversa- 
tions. Drass (1986) found 61 interruptions in 
28 single-sex conversations— only slightly 
more than 2 interruptions per conversation. In 
a more recent study of couples, Kollock et al. 
(1985) found an average of 18.6 attempted 
interruptions per quarter hour of conversation, 
less than half of which were successful. Since 
interruptions are infrequent, we need long 
specimens of conversation or large samples of 
interactions to get stable estimates. The 
skewed distribution of transitions into normal 
versus interrupted categories also causes 
problems for statistical analyses. 

A second problem arises from the fact that 
one must speak before one can be interrupted. 
Even in a two-person conversation, the two 
interactants are not equally at risk of being 
interrupted. The person who talks more is at 
risk more of the time. When more than two 
people participate in a discussion, the prob- 
lem will be exaggerated since participation is 
known to be strongly skewed (Horvath 1965). 
The risk factor is especially important since 
we know that participation is status and power 
dependent (Strodbeck et al. 1957; Berger, 
Rosenholtz, and. Zelditch 1980). An addi- 
tional risk factor involves the success of 
interruption attempts. An interruption that 
succeeds in breaking off a speaker's turn may 
be more disruptive than one that the speaker 
manages to fend off. But an interruption must 
be attempted before it can succeed. There- 
fore, the issue of interruption attempts is 
somewhat different from that of interruption 
success. 
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A third problem is that interruptions may 
differ in their impact depending on their 
content. While an interruption that comments 
positively on an idea, repeats a phrase, or 
finishes a thought for a speaker is clearly a 
violation of turn-taking norms, it is support- 
ive of the speaker and may be the basis for 
highly collaborative talk (Tannen 1987). 
Interruptions that interject negative comments 
or put-downs or completely ignore the 
speaker by introducing an unrelated topic are 
more intrusive and disruptive; they are much 


more explicitly an attempt at power. More 


neutral interruptions may be related to issues 
of power or may result from lack of a shared 
speech style. Lumping types of interruptions 
into a single analysis may mask important 
differences in conversational dynamics. 

A final concern is the importance of social 
context in determining conversational interac- 
tion. As Thorne, Kramarae, and Henley 
(1983, p. 13) noted, the study of isolated 
conversational variables almost universally 
leads to further questions about the effects of 
setting, topic, roles, and other social factors 
that may interact with gender. Three factors 
that might affect interruptions are considered 
here: the intimacy of the conversants, the size 
of the group, and the sex composition of the 


group. 


THE CONTEXT OF CONVERSATION 


The research on interruptions has concen- 
trated heavily on conversations in intimate 
dyads. While these intimate settings are 
important domains for the exercise of interper- 
sonal status and power, these studies may not 
generalize to a wide variety of more imper- 
sonal, task-oriented situations. Several re- 
searchers have found that intimacy influences 
conversational dynamics. Intimates have less 
sex-differentiated conversations than noninti- 
mates (Heiss 1962). Among friends, overlaps 
in speech are frequently instances of highly 
collaborative talk: cooperative sentence build- 
ing, requesting and giving verification through 
backchannels, and “savoring repetitions” 
(Tannen 1983). But friends also interrupt less 
and have more silences (Shaw and Sadler 
1965). Among stable couples, power-and 
dominance are likely to be more important 
than sex in determining interruptions, ques- 
tion asking, and topic control (Kollock et al. 
1985; Vuchinich 1984, p. 231). Therefore, 
the typical study of -interruptions among 
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intimate dyads is likely to underestimate 
interruption rates and their sex differentiation. 

In studies of larger, less intimate groups, 
group composition variables become impor- 
tant. Both researchers and popular writers 
have noted that groups of different sex 
composition have different patterns of talk. 
All male groups seem to be more competitive, 
to form dominance hierarchies more quickly, 
and to maintain stable hierarchies during 
continued interaction (Aries 1976). Some 
evidence suggests that both men and women 
express more negative and less positive 
socioemotional behavior in mixed-sex than in 
same-sex groups (Anderson and Blanchard 
1982). 

The best developed theoretical work on the 
group composition question is Kanter's (1977) 
theory of proportional representation. Kanter 
suggests that variations in sex composition 
within the mixed-sex category are important. 
In particular, minority group members are 
under special interaction pressures in groups 
with a skewed distribution. Such token 
members may adopt conservative behavior 
patterns, avoid conflict, and become a 
relatively passive audience for the boundary- 
heightening behavior of the majority. Empir- 
ical studies generally have supported Kanter's 
ideas, showing that group dynamics and 
minority achievement differ in skewed as 
opposed to more representative groups (e.g., 
Spangler, Gordon, and Pipkin 1978; Alex- 
ander and Thoits 1985). South, Markham, 
Bonjean, and Corder (1987) and Wharton and 
Baron (1987) suggest that social support and 
interaction quality differ in work groups with 
varying sex composition. 

We have two goals for the current analysis 
regarding the context effects on interactional 
process. First, we widen the context of the 
earlier research on interruptions by studying 
interactions among nonintimates in a task- 
oriented setting. Second, by using larger 
groups and systematically varying sex compo- 
sition, we Study the effects of this complex 
context variable on the interaction. 


INTERRUPTIONS IN GROUP 
DISCUSSIONS 


In this study, the task-oriented nature of the 
groups serves two purposes. Ít creates an 
instrumental framework similar to that of 
many work groups, allowing us to study a 
domain in which the ability to promote ideas 
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and gain recognition is likely to be particu- 
larly important. It also produces conditions 
under which status effects are most likely to 
be found (Berger et al. 1980). 

We use six-person groups both to create the 
differentiation that is typical of larger groups 
and to have groups in which Kanter’s 
definition of a skewed group could be met 
(with a 1:5 ratio). The sex composition of the 
groups systematically varies, while variation 
in other status characteristics is minimized.‘ 
Group members’ prior knowledge of each 
other was minimal‘; our assumption was that 
this lack of previous intimacy will heighten 
the effects of the observable status character- 
istic (gender) relative to other status or power 
differences that might develop as the result of 
long-term interaction. 

We address three questions about interrup- 
tions and conversational dynamics. First, who 
attempts to interrupt whom? Attempts to 
interrupt represent disregard for the speaker 
and an attempt to block his or her interac- 
tional goals. We analyze who makes these 
attempts at conversational dominance, who 
are the recipients of these power plays, and 
how the group context of the discussion 
affects power attempts. 

While any interruption attempt violates 
turn-taking norms, some interruptions may be 
more intrusive than others. We ask how the 
character of interruptions varies across speak- 
ers and groups. For example, are men or 
women more likely to make supportive 
interruptions? Do certain group compositions 
create more negative interruptions? 

Finally, what determines whether an inter- 
ruption succeeds? Are some interrupters more 
likely to wrest the floor from another? Does 
this success rate vary depending on the 
affective character of the interruption? Do 


* Students who wanted to participate filled out 
an index card with their name, social security 
number, age, and major area of study. The sex and 
race of students were noted surreptitiously as the 
cards were collected. The cards were used to 
manipulate sex composition, to hold constant other 
status characteristics (age, major, race), and to 
assign students randomly to groups within the 
limits imposed by sex composition and class times. 

5 Students from eight different classes (most 
with more than 50 students enrolled) constituted 
the subject pool. Postexperimental questionnaires 
confirmed that the randomized selection of stu- 
dents from large classes effectively eliminated 
prior personal knowledge in most cases. 
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different types of interruptions work for men 
and women (or against men and women)? 
Such questions are important, for these 
successful interruptions are the most serious 
intrusions into speakers’ talk. They represent 
conversational dominance—the successful use 
of power in these group discussions. 

To control for the frequency with which 
males and females are at risk of being 
interrupted and to allow more detailed 
Statements about the pattern of interruption 
attempts and their success, we analyze the log 
odds that an interruption .was attempted 
(relative to normal, uninterrupted transitions), 
as well as the log odds of an attempted 
interruption succeeding. Such an analysis 
controls for risk factors; if males participated 
more frequently than females (and were at 
greater risk of being interrupted), there would 
simply be more conversational transitions 
involving males. The relative size of this 
marginal would not affect the estimates of the 
degree to which sex and group composition 
influence interruptions. This form of analysis 
also compensates for the fact that different 
group compositions offer differing opportu- 
nity structures for male/female interaction 
(e.g., in a group with two females and four 
males, the females would be more likely to 
interrupt a male than a female simply by 
change—a given female would have only one 
other female to interrupt as opposed to four 
males). 


© A logit model was used in preference to other 
statistical techniques because interrupted transi- 
tions are rare relative to normal transitions 
(including overlaps, simultaneous speech, unsuc- 
cessful interruptions). Since the transitions are the 
unit of analysis here, the statistical model requires 
the unlikely assumption that each transition is an ' 
independent event. Kollock et al. (1985, p. 39) 
note that the behavior of an individual within a 
couple is not independent of his or her partner; the 
same is undoubtedly true for group members' 
behavior within a group—it will be influenced by 
other members and by previous features of the 
conversations. Note, however, that Vuchinich 
(1984) finds in family discussions that sequencing 
in oppositional moves can be described as a 
first-order Markov process: one move affects the 
next, but not a long chain of consequences. Since 
the independence assumptions of the statistical 
model are not perfectly met, we regard all 
significance tests to be suggestive only. In effect, 
we used such tests to help locate patterns in the 
data rather than in a hypothesis-testing framework. 
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To address the important questions about 
the affective character of interruptions, we 
analyse the odds that an attempted interrup- 
tion is supportive or negative (as contrasted 
with neutral comments and elaborations). At 
a later stage of the analysis, we also look at 
the odds of success separately for supportive, 
negative, and neutral interruptions. 


DATA COLLECTION? 


The subjects were undergraduates enrolled in 
introductory Sociology. The experiment used 
only white students between the ages of 17 
and 25 to minimize variation in important 
statuses other than gender. We systematically 
varied the sex composition of the six-person 
groups, with approximately five groups repre- 
senting each of seven experimental conditions 
(see Table 1).® 

A sex-neutral collectively oriented task (a 
variation of the task developed by Fisek 1974) 
stimulated an active group discussion. The 
task is described more completely in Smith- 
Lovin, Skvoretz, and Hudson (1986). To 
motivate the groups, we awarded a $30.00 
prize ($5.00 per group member) at the end of 
the semester to the group who arrived at the 
best solution. To minimize confounding 
effects of seating, we placed the six group 
members around a circular table with ran- 
domly assigned seating. 

Two video cameras recorded interaction 
through one-way mirrors. We later tran- 
scribed all group discussions, using the best 
videotape as a primary source, while relying 
on the second to resolve problematic ex- 
changes. We coded an interruption attempt 
when one speaker's utterance was broken by 
an intrusion into the internal structure of the 
` speech act, not corresponding to a possible 
transition place (following the definition by 
West and Zimmerman 1977). We coded an 
interruption attempt as successful when one 
speaker's utterance prevented another speaker 
from completing a speech turn. If the speaker 
completed the utterance to a normal transition 


7 A more complete description of data collection 
procedures is available in Smith-Lovin et al. 1986. 

8 Data on 31 groups in all were collected. Five 
groups were studied in all seven conditions except 
all female (four groups), five females (three 
groups), and four females (four groups). Schedul- 
ing problems prevented these conditions from 
having five groups. 
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place despite the intrusion, we considered the 
interruption attempt to be unsuccessful. For 
purposes of the analysis here, a noninter- 
rupted transition constituted all other occa- 
sions when one speaker followed another 
(including minor overlaps of speech and 
normal transitions). Simultaneous starts were 
rare; although the continuation of speech after 
a simultaneous start may be an indicator of 
dominance, it is not a violation of turn-taking 
norms. Therefore, we did not code such 
simultaneous starts as an interrupted transition 
for this analysis. Similarly, we did not code 
backchannel utterances that produced simulta- 
neous speech as attempted interruptions; 
Tannen (1983) argues convincingly that such 
overlaps are not norm violations and that they 
support rather than disrupt the speaker's 
utterance in most cases. 

After six months, we recoded one group's 
videotapes to check coder reliability. Of 186 
transitions, the coder credited 8 different 
speakers in the two codings, resulting in an 
accuracy of 96 percent. In none of these cases 
was an interaction originally credited to one 
sex shifted to another; nor was the interrupted 
versus noninterrupted status of a transition 
changed. 

We coded the affective character of inter- 
ruptions into three categories using typed 
transcripts of the conversations. We consid- 
ered an interruption supportive if it expressed 
agreement with the speaker, if it made an 
affectively positive request for elaboration 
("that's a great idea; how do you think we 
should do it"), or if it completed the 
speaker's thought (often repeating several of 
the speaker's words, then continuing to a 
normal transition place). We coded an 
interruption as negative if it expressed 
disagreement with the speaker, raised an 
objection to the speaker's idea, or introduced 
a complete change of topic (disregarding the 
speaker's utterance entirely). We coded all 
other interruptions as neutral. This category 
included interruption attempts that were so 
short that their content could not be deter- 
mined, interruptions that elaborated on the 
topic of the interrupted speaker without 
evaluative content, and requests for clarifica- 
tion where it was unclear from context 
whether the request was a challenge or a 
request for more talk. We do not imply by our 
labels that supportive interruptions are not 
disruptive to a speaker’s talk and his or her 
ability to accomplish interactional goals. In 
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fact, one reader has suggested that in 
established power structures such supportive 
interruptions can be used to signal power; the 
high-status actors may use positive comments 
to reward task-related behaviors from subor- 
dinates or to take over the discussion of 
promising ideas suggested by others. How- 
ever, since such supportive disruptions may 
spring from different conditions and lead to 
differing consequences than the more conflict- 
ual negative interruptions, we distinguish 
them for further analysis. 


ANALYSIS AND RESULTS 


We analyze speech transitions in the form of 
an incomplete 7 X 2 X 2 X 7 contingency 
table (see Table 1). This table includes eight 
rows with structural zeroes (indicated by *) 
which represent combinations of speakers and 
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followers that are logically impossible. Thus, 
six rows are completely blanked since, for 
example, it is impossible for a male to follow 
a female speaker in an all-male or all-female 
group. Rows 8 and 21 are only partially 
blanked. Male-male transitions are possible in 
groups with only one male member, as are 
female-female transitions in groups with only 
one female member. Such transitions were 
coded whenever a pause of greater than 1.5 
seconds occurred between two utterances 
made by the token member of the group 
(following Fisek's [1974] definition of an 
action opportunity). Such transitions, how- 
ever, cannot be interruptions. 

Several observations about group interac- 
tion are immediately apparent. First, interrup- 
tions are very rare: 540 interruptions com- 
pared to 4,576 noninterrupted transitions in 
the 31 group discussions. Group discussions 
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Table 1. Outcome of Speech Act by Sex of Speaker, Sex of Follower, and Number of Females in Six-Person Groups 
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* Denotes a structural zero (a combination that is logically impossible). 

Note that groups with only one female can have a female speech act followed by another female speech act and 
groups with only one male can fave a male-male transition. Such occurrences are the result of a pause greater than 1.5 
seconds between two utterances made by the "token" member of the group. 
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typically lasted 15 minutes, with approxi- 
mately 17.4 interruption attempts per quarter 
hour. This rate is similar to the 18.6 attempts 
per quarter hour observed by Kollock et al. 
(1985). Explicitly negative interruptions are 
especially rare in our data, marring only 1.79 
percent of the observed speaker turns. 

We employed a variant of the continuation- 
odds strategy suggested by Fienberg (1980) to 
analyze Table 1. We first considered the odds 
that an interruption attempt is made. We also 
analyzed the relative odds of supportive, 
negative, and neutral attempts. In a later stage 
of the analysis, we focused only on the 
attempted interruptions, analyzing the odds of 
success. Thus, the analyses of Table 1 
proceeded by focusing on several partitions of 
the seven columns, which correspond closely 
to the questions raised above. 

At each stage of the analysis, we estimated 
a set of logit models. Since the odds that we 
described above are asymptotically indepen- 
dent (Feinberg 1977, p. 87), we estimated the 
models and assessed their fit separately at 
each stage of the analysis. The individual 
chi-squares were then added to get an overall 
goodness-of-fit measure for the set of models 
(see Table 2). ; 

The total baseline chi-square represents the 
model of independence of the row and 
column classifications of Table 1. In Table 2, 
this model yields L?=219.31, df=114. The 
overall fit for our set of preferred? models is 
given by L?=126.78, df= 104. Since all our 
models are logit-specification models which 
include the highest-order interaction among 
the independent variables, the eight rows of 
the table including structural zeroes are 
effectively eliminated from the computations. 
Each partition of Table 1 differs only in the 
numbers of columns (i.e., the aspect of the 
interruption variable that is analyzed). Hence, 
degrees of freedom for each baseline model 
can be calculated as (20 — 1)(#columns 
— 1). We began the analysis of each partition 
by estimating the standard logit models for a 
four-way table, testing for various combina- 
tions of main and interactive effects of sex of 
speaker, sex of follower, and number of 
females in the group on the interruption 


? We use the term "preferred model" below to 
indicate a model that cannot be simplified without 
a significant increase in the likelihood-ration 
chi-square and cannot be significantly improved on 
by including additional effects. 
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variable. We also considered similar series of 
models in which the effect of number of 
females in the group was constrained to 
linearity. 


Interruption Attempts 


To address the first question of who attempts 
to interrupt whom and under what conditions, 
we combine the first six columns of Table 1, 
contrasting all six types of attempted interrup- 
tions with uninterrupted transitions. Our 
preferred model includes an interactive effect 
of the sex of speaker and the sex of follower. 
The odds of a male attempting to interrupt 
another male (.078) are less than one-half the 
odds of a male attempting to interrupt a 
female (.163). Females attempt to interrupt 
male and female speakers at essentially 
identical levels (odds of .146 and .141 
respectively). In other words, men discrimi- 
nate in their interruption attempts, interrupt- 
ing women much more often than men. 
Women, on the other hand, do not discrimi- 
nate; they attempt to interrupt men and 
women equally. Group composition does not 
appear to influence interruption attempts. Net 
of the sex of speaker/sex of follower 
interaction, neither the linear nor uncon- 
strained effects of number of females are 
significant. In particular, we do not see any 
impact of token status on interruption at- 
tempts in the (1:5) and (5:1) groups. Group 
composition does not interact with other 
independent variables in affecting attempted 
interruption. 


The Affective Character of 
Interruption Attempts 


How does the affective character of interrup- 
tion attempts vary across speakers, interrupt- 
ers, and groups? To address this question, we 
combine successful and unsuccessful interrup- 
tion attempts, retaining the distinction be- 
tween positive, negative, and neutral charac- 
ter. Thus, the dependent variable has three 
categories: supportive, negative, and neutral 


, interruption attempts. The results of this 


analysis indicate a significant four-way inter- 
action among sex of speaker, sex of follower, 
number of females, and the character of the 
interruption. Inspection of the observed odds 
suggests a concise representation for this 


ot 
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Table 2. Fit of Bascline and Preferred Model for Each Partition of Table 1 





Partition 
A. Interruption attempts: (attempted 
interruption vs. normal transition) 
B. Content of interruption attempts: 
(supportive vs. negative vs. neutral) 
C. Success of interruptions: 
i. positive 
ii. negative 
iii. neutral 
TOTAL 


interaction. This preferred model!? includes a 
linear effect of group sex composition (nuniber 


of females) only for male attempts to interrupt: 


other males with supportive (as opposed to 
neutral or negative) comments. This effect is 
negative, indicating that males direct more 
supportive interruption attempts toward other 
males in all-male groups (a fitted odds of 


. 1.82). The odds of a male-male interruption 


being supportive decline steeply as the 
proportion of women in the group increases, 
reaching a low of .26 in groups with just two 
males. For all other sex combinations (male- 
female, female-male, and female-female), the 
odds that an interruption attempt is positive 
are equal (.47) and do not vary with group 
composition. Under this model, the odds that 
an interruption. attempt is negative (.30) also 
are not affected by group composition or 
gender. 

Our analyses revealed no token effect in 


support of Kanter's ideas.!! The (1:5) and 


(5:1) groups did not deviate significantly from 


10 Examination of residuals indicates that the 
relative lack of fit for this portion of our model is 
attributable to two cells in the cross-classification— 
those pertaining to same-sex (male-male and 
female-female) supportive interruption attempts in 
groups containing three males and three females. 
Our preferred model underestimates the frequen- 
cies in these two cells; reestimating the model 
while “blanking” these cells results in Z2 —32.61, 
df — 34. However, since the combined frequency in 
these two cells is just 14 and since we have no 
compelling theoretical reason to expect such a 
pattern, we prefer the model described here. 

! While our ideas have been directed by 
Kanter's discussion, this analysis cannot be 
considered a test of her theory of proportional 
representation. The link between the outcomes that 
she discusses and the conversational processes of 
transition and interruption is too loose to be more 
than exploratory. 


Baseline Model Preferred Model 
df B df Dp 

19 60.42 16 15.97 
38 74.42 36 51.87 

19 26.29 18 19.91 

19 30.08 18 21.24 

19 28.10 16 17.79 
114 219.31 104 126.78 


the linear relationship described above. Simi- 
larly, the groups containing tokens did not 
influence the odds of an interruption being 
supportive or neutral (as opposed to negative) 
in affective character. 


Success of Interruptions 


What factors determine whether an interrup- 
tion succeeds? To address this question, we 
originally analyzed the odds of an interruption 
attempt succeeding by affective character of 
interruption, by sex of speaker, by sex of 
follower, and by group composition. That 
analysis showed that the answer to the 
question of who is more likely to succeed 
varies by the affective character of the 
attempt. Thus, in this section we model the 
odds of success separately for positive, 
negative, and neutral interruption attempts. 

For positive interruption attempts, we find 
a significant nonlinear effect of group compo- 
sition on the odds of success. In mixed sex 
groups, positive interruption attempts have 
about a 50-50 chance of success (an odds of 
.94). Positive attempts are more than twice as 
likely to succeed in all-male or all-female 
groups (odds of 2.42). Men and women who 
are faced with a supportive intrusion in such 
groups yield the floor much more frequently 
than their counterparts with an opposite-sex 
audience. Groups with a token composition 
(1:5 and 5:1) did not differ significantly from 
other. mixed-sex groups (with 2:4, 3:3, and 
4:2 distributions of males and females). 

For the relatively rare negative interruption 
attempts, the predicted odds of success 
depend only on the sex of the person being 
interrupted. The odds of a man yielding the 
floor to a negative interruption are .59; for 
women, these odds are over three times as 
great (2.24). There is no significant effect of 
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the sex of the interrupter—both men and 


women are more likely to succeed in 
interrupting the speech of women with 
negative comments. 

Finally, for neutral interruptions (the most 
frequent category) we find significant main 
effects of both sex of speaker and sex of 
follower and a marginally significant (p — .075) 
interactive effect of these two variables on the 
odds of success. Under the simpler model, 
women succeed less frequently than men 
when they attempt to interrupt with a neutral 
comment, while they are more likely to lose 
the floor when someone attempts to interrupt 
them. The interactive effect clarifies this 
result. When men are speaking, women are 
less than half as likely as men to successfully 
interrupt them with a neutral, elaborating 
comment (.45 compared with 1.17). When 
women are speaking, the odds of success are 
roughly the same for women and men (1.19 
and 1.10). Therefore, the exceptional ability 
of men to hold.the floor against a female's 
neutral interruption is the major feature 
creating the two main effects observed in the 
simpler model. 

The success of negative and neutral inter- 
ruption attempts does not depend on group 
composition. Most importantly, we find no 
evidence that token members of a group are 
more or less successful in their interruption 
‘attempts; nor are they more or less likely to 
yield the floor. 


DISCUSSION 


This analysis of interruptions in large, 
task-oriented discussion groups supports pre- 
vious research and adds new information 
about the form of the relationships. We find 
support for the typical finding that there is 
considerable gender inequality in interrup- 
tions. But we see that this inequality is not the 
result of a sex difference in frequency of 
interruption, or even a simple dominance 
effect. Instead, it is produced because men 
discriminate in their interruption attempts, 
disrupting the speech of women far more 
frequently than that of men, while women do 
not discriminate, interrupting women and 
men equally often. This pattern carries over 
into the pattern of successful interruptions, 
since interruptions are more likely to succeed 
against women than against men (especially 
when interruptions are disruptive and negative 
in character). Men, on the other hand, are 
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more able to fend off potential disruptions, 
especially when directed at them by women 
(this pattern is especially clear with neutral 
interruptions). ; 

In effect, men are acting as if sex is a status 
characteristic (within the expectation states 
framework) (Berger et al. 1980) but women 
are not. Men interrupt women more than 
other men, a pattern we would expect if they 
had higher task expectations for males than 
females. 

By attempting to interrupt men at the same 
rate as other women, and by yielding to 
intrusions at the same rate whether they come 
from men or women, women are behaving as 
though sex were not a status characteristic for 
them. While it is possible that the statuses of 
male and female are linked to performance 
expectations for males and not for females, it 
seems unlikely that the two sexes could 
maintain such divergent performance expecta- 
tions during long-term interactions. Perhaps 
this is why Kollock et al. (1985) found that ` 
power in the relationship is more important 
than gender among established couples. 

Still, it is difficult to believe that men and 
women are bringing radically different ideas 
about male and female competence to our 
small-groups setting. Many expectation states 
experiments have shown that sex is a status 
characteristic for both men and women and in 
our society (see review in Wagner et al. 
1986). Furthermore, in other analyses of 
these data we observe the participation 
inequalities by sex that typically indicate 
status ordering (Smith-Lovin et al. 1986). Tt is 
unlikely that men &nd women have substan- 
tially different views of the societal status 
structure. l 

Another view that could explain the pattern 
would be a mixture of status effects and of 
conflict between males and females within the 
groups. In this combination, both status 
differences and adversarial relations between 
males and females would lead males to 
interrupt women more than other men. But 
for females, the two effects would produce 
opposite predictions: status differences would 
suggest deferring to men while interrupting 
other women, while adversarial conflict 
would suggest interrupting men while respect- 
ing or supporting the speech of other women. 
If the two effects were roughly equal in 
strength, they could cancel one another, 
producing no sex difference in interruptions 
for females. 


INTERRUPTIONS IN GROUP DISCUSSIONS 


Several recent studies yield some indirect 
support for conflict dynamics in mixed-sex 
groups. Ridgeway's (1981, 1982) experi- 
ments show that females in mixed-sex groups 
lose favor quickly if they do not express a 
group orientation (through group-oriented 
cues and conformity). Taps and Martin 
(1988) find that females who are tokens in 
all-male groups need external accounts for 
their ideas to gain influence; they also find 
that women are less liked in sex-balanced 
groups than in single-sex or token groups. 
South et al. (1987) show that both men and 
women get less social support for achieve- 
ment from their opposite-sex coworkers when 
the work group becomes evenly mixed. 
Similarly, Wharton and Baron (1987) find 
that men have lower job satisfaction, lower 
self-esteem, and higher job-related depression 
when they are in evenly sex-mixed work 
settings (as opposed to either male-dominated 


or female-dominated settings). They interpret . 


these findings to indicate a decline in the 
quality and quantity of intergroup relations in 
such numerically balanced settings. 

While all of the above studies could be 
interpreted in different ways, they are all 
consistent with cross-sex conflict in groups. If 
we add this conflict imagery to the status 
differentiation between males and females 
that is so widely demonstrated in the early 
small groups literature and in the expectations 
states tradition, we may explain the sex 
differentiation in males' interruption patterns 
and the apparent lack of discrimination in 
females' interruption attempting and yielding 
patterns (as the status and conflict effects 
cancel one another out). 

The complex effects of group composition 
and the affective character of interruptions on 
conversational dynamics in our task-oriented 
groups reinforce the picture of simultaneous 
status differentiation and conflict. We find 
that supportive interruptions show different 
patterns from the negative and neutral inter- 
ruptions where conflict dynamics are most 
likely to be played out. These supportive 
interruptions are the most influenced by group 
sex composition. In all-male groups we find 
that male-male interruptions are most likely to 
be supportive. Males interrupting other males 
in such groups often clarify and even 
encourage the ideas suggested by the inter- 
rupted speaker. Although the speech is 
disrupted, the group discussion of the topic is 
not. Such supportive, topic-continuing inter- 
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ruptions are also more likely to succeed in a 
single-sex context. 

These indications that single-sex groups 
have a more supportive, lively, non-status- 
dominated flow of ideas is supported by some 
work in the Bales tradition (Anderson and 
Blanchard 1982). This research indicates both 
men and women express somewhat less 
positive socioemotional behavior in mixed- 


„sex than in single-sex groups. Kanter (1977) 


also points to some maladaptive consequences 
of interaction in mixed groups, especially in 
minority-majority interactions within skewed 
groups. (However, in our data, skewed 
groups do not differ from more numerically 
balanced compositions.) 

Perhaps positive, spontaneous, and pleas- 
antly disorganized speech occurs most fre- 
quently between status equals. When: status 
differentiation and/or opposing group member- 
ships are activated in more heterogeneous 
groups, much of the supportive interruption 
drops out (or succeeds less frequently). 
Perhaps the presence of a common speech 
style in a homogeneous group leads to more 
relaxed, less conflictual group interaction. 
Tannen (1980) suggested that speakers may 
be characterized by stylistic patterns (high- 
involvement versus high-considerateness) that 
include features such as frequency of overlap, 
length of pause between utterances, prefer- 
ence for shifts in amplitude and pitch, and 
tendency to talk through overlaps. When 
these speech styles are shared, they provide 
the basis for well-coordinated talk. However, 
when opposing styles are mixed in conversa- 
tional groups, negative interpretations of 
others’ actions are frequent. (High-involve- 
ment speakers think slower-paced talk indi- 
cates lack of rapport and interest; high- 
considerateness speakers are more likely to 
call the overlap style competitive and domi- 
nant, although stimulating.) If such speech 
styles are correlated with sex (and participa- 
tion inequalities suggest that they might be), 
members of mixed-sex groups may find 
themselves misinterpreting and reacting against 
the speech of others. 

In summary, we find considerable gender 
inequality in the pattern of interruptions, but 
it is not easily described by a straightforward 
status/dominance process as implied by the 
expectations states tradition and by most 
conversational analysts. Men’s actions could 
be explained by such theoretical traditions, 
but we find women’s interruption attempts 
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and patterns of yielding the floor do not 
support an image of females simply submit- 
ting to the higher-status (in expectation states’ 
terms, higher task expectation) males. Women 
direct and accept interruptions in a way that 
does not differentiate systematically between 
males and females. In addition, we find 
interesting effects of interruptive content and 
group sex composition on conversational 


process. These factors suggest that issues, 


involving conversational style, coordination 
of speech, and salience of gender identity 
within groups of varying composition are 
fruitful areas for further research. 
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RACE, EMPLOYMENT HARDSHIP, AND INEQUALITY IN THE 


AMERICAN NONMETROPOLITAN SOUTH* 


DANIEL T. LICHTER 
The Pennsylvania State University 


The present study (1) examines changing levels of employment-related hardship 
among southern nonmetropolitan blacks for 1970-85, (2) compares the 
employment circumstances of nonmetropolitan black men and women with those of 
whites and blacks in the metropolitan South and in the North, and (3) assesses the 
contribution of black deficits in human capital and job characteristics to persistent 
racial inequality in rural employment adequacy. Data from the March annual 
demographic files of the Current Population Survey indicate that black employment 
hardship continues to be felt in nonmetropolitan areas, where two out of every five 
blacks are without jobs, cannot find a full-time job, or cannot earn enough to raise 
themselves significantly above poverty thresholds. Employment hardship among 
nonmetropolitan blacks continues to exceed that of blacks elsewhere, despite the 
comparatively rapid deterioration in employment conditions for northern and 
southern metropolitan blacks in the past decade. Finally, racial inequality in 
nonmetropolitan employment adequacy remains substantial in the 1980s and 
cannot be explained away by black deficits in human capital or by the 
disproportionate concentration of blacks in declining industrial sectors or in 


blue-collar occupations. 


INTRODUCTION 


Recent research in racial stratification has 
become increasingly sensitive to the substan- 
tial—and apparently growing —economic het- 
erogeneity within the black population. No 
longer is the black population treated as a 
monolithic whole or in stereotypical terms. 
Studies of class polarization within the black 
community (Farley 1984), of the growing 
black middle class (Landry 1987), and of the 
chronic problems of the black “underclass” 
(Wilson 1980, 1987) give ample testimony to 
this fact. Nevertheless, spatial heterogeneity 
in the black population continues to be largely 
ignored, despite apparent evidence of rather 
striking differences in black socioeconomic 
status from city to city (Farley 1988). 
Current debates have focused largely on the 
changing status of urban blacks, while 


* An earlier version of this paper was presented 
at the 1988 annual meetings of the Population 
Association of America. This research was sup- 
ported in part by a grant from the Rural Economic 
Development Program, sponsored jointly by the 
Aspen Institute and the Ford Foundation. The 
author benefited from the helpful comments and 
assistance of Lionel Beaulieu, Clifford Clogg, 
Scott Eliason, Gilbert Ko, Wade Smith, and an 
anonymous reviewer. 
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excluding nonmetropolitan blacks (see Wil- 
son 1987; Farley 1988). Unfortunately, rural 
blacks today are more spatially dispersed than 
urban blacks, less visible, and apparently 
easier to ignore. The main objective of this 
study is to examine patterns of black 
employment-related hardship and racial in- 
equality in the nonmetropolitan South. 


BACKGROUND AND 
SPECIFIC OBJECTIVES 


Few areas provide a better context than tbe 
nonmetropolitan South for reevaluating recent 
economic and employment gains of American 
blacks. For much of its history, the black 
population was decidedly rural, located over- 
whelmingly in the southern United States (Sly 
1981), and postbellum southern rural blacks 
have long represented one of the most 
economically disadvantaged segments in Amer- 
ican society (Durant and Knowlton 1978). 
Moreover, post-1970 social and economic 
changes are cause for both optimism and 
continuing discouragement about the integra- 
tion of nonmetropolitan blacks into the 
economic mainstream of American society. 
For example, substantial change in the 
racial climate of the South was set in motion 
during the 1950s and 1960s by the Civil 
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Rights Movement. The abolishment of Jim 
Crow laws, the impact of voting rights and 
fair housing legislation, rulings regarding 
school desegregation, and, later, affirmative 
action and other ameliorative equal opportu- 
nity policies, have greatly affected race 
relations in the South. Indeed, Firebaugh and 
Davis (1988) report that antiblack prejudice 
declined faster in the South than in the 
non-South over the past decade. This pro- 
vides a significant departure from the virulent 
racial prejudice that has historically supported 
black employment discrimination in the 
South. s 

The past decade or so has also ushered in a 
*New South," with substantial employment 
growth in the nonagricultural sector (Bloom- 
quist 1987). With increasing urbanization and 
industrialization, hiring decisions based on 
particularistic criteria (e.g., race) cannot be 
easily sustained, given the demands of a 
modern economy for highly skilled labor 
(Hogan and Featherman 1977). However, 
while large metropolitan areas have experi- 
enced an unprecedented demographic and 
economic revival, rural areas have lagged 
behind (Beaulieu 1988; Falk and Lyson 
1988). Southern blacks, particularly those in 
rural areas, continue to face high rates of 
labor force nonparticipation and unemploy- 
ment, both absolutely and relative to whites 
(Marshall 1978; Rungeling, Smith, Briggs, 
and Adams 1977). 

Some optimism also rests with the return of 
northern blacks to the South, beginning in the 
1970s (Johnson and Campbell 1981). Not 
surprisingly, much of the in-migration oc- 
curred in metropolitan and urban areas. Less 
well appreciated is that the interregional 
exchange of black population also favored the 
nonmetropolitan South at the expense of the 
non-South (Dahmann 1986). If we accept the 
traditional economic assumption that the flow 
of labor responds largely to spatial differences 
in economic opportunity, then recent migra- 
tion patterns of blacks suggest relative 
improvements in employment opportunities in 
the South, including its nonmetropolitan 
parts. On the other hand, metropolitan areas 
may no longer provide much economic 
incentive for black in-migration. Indeed, over 
50 percent of young central city black men in 
the 1980s were jobless, working at part-time 
jobs involuntarily, or earning poverty-level 
wages (Lichter 1988). The southern resur- 
gence may thus have less to do with the pull 
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of economic opportunities than with the push 
caused by deteriorating employment condi- 
tions for blacks in large northern cities. 

As described in the next section, the Labor 
Utilization Framework (LUF) of Clogg and 
Sullivan (Clogg 1979; Clogg and Sullivan 
1983; Sullivan 1978) provides a useful 
analytical device for addressing several objec- 
tives on changing black employment-related 
hardship. This paper (1) documents changes 
in various forms of employment hardship 
during the 1970-85 period in the nonmetro- 
politan South, (2) compares levels of employ- 
ment hardship among southern nonmetropoli- 
tan blacks with those of blacks living 
elsewhere, such as in the North and the 
metropolitan South, and (3) identifies sources 
of racial variation in employment hardship 
among males and females in the southern 
nonmetropolitan labor force. 


DATA AND MEASURES 


The data for this analysis are from the 1970, 
1975, 1980, and 1985 annual demographic 
files of the March Current Population Survey 
(U.S. Bureau of the Census 1970, 1975, 
1980, 1985). Each yearly sample contains 
approximately 60,000 households and repre- 
sents the noninstitutionalized population of 
the United States. The analysis is restricted to 
workers in the adult labor force, aged 18 to 
64. 

Employment-related hardship includes job- 
lessness, as well as marginal employment, 
such as involuntarily part-time employment 
and poverty-level earnings. My focus on 
racial differences in employment hardship is 
appropriate for at least two reasons. First, the 
unemployment rate has been criticized as a 
measure of labor force performance, particu- 
larly for the nonmetropolitan South (Lichter 
and Costanzo 1987; Moland 1981). For 
instance, in a study of four southern rural 
counties, Rungeling et al. (1977) found that 
unemployment rates were consistently very 
low (about 3 percent), while “subem- 
ployment" affected over 40 percent of the 
work force.! For the same period, Levitan and 





! The subemployment index is one of the first 
widely-recognized measures of employment hard- 
ship or underemployment. This measure (SJ) is 
calculated as 


SI = (U + DW + PT + PE + U + DW) 
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Taggart (1973) reported national levels of 
subemployment of about 11.5 percent. Clearly, 
the unemployment rate minimizes both the 
absolute and relative extent of all employment- 
related hardship in the nonmetropolitan South. 

Second, this study avoids the potentially 
serious biases of “sample selection” (Clogg, 
Eliason, and Wahl 1987). Past studies of 
racial differences in earning or occupational 
attainment typically restrict their analyses to 
the currently employed or to full-time 
full-year workers or their equivalents. Such 
sample restrictions ignore long-standing racial 
differences in employment status (Kasarda 
1985). To be sure, racial convergence in 
earnings may occur at the same time that 
unemployment or other forms of employment 
hardship are diverging. Workers with lower 
earnings also face greater prospects of 
unemployment or part-time employment, 
which means that they are often excluded 
from analyses of racial differences in earn- 
ings. The potential bias seems clear: Marginal 
workers are disproportionately “selected out” 
of the sample, leaving behind a relatively 
more advantaged black subpopulation to 
compare with whites. 

These problems are avoided here by 
including blacks and whites across a variety 
of labor force categories, including the 
jobless.? The LUF is a comprehensive 
measurement scheme that includes several 
forms of employment hardship. First, the 
sub-unemployed or “discouraged worker” 
category includes those no longer looking for 
work because they believe none is available. 
Second, the unemployed are identified follow- 
ing traditional practices of the Bureau of 
Labor Statistics. It includes those currently 
out of work but who have actively looked for 
work during the last four weeks. The third 
hardship category is low hours, or the 
involuntary part-time employed. This in- 
cludes individuals who are working part-time, 
as defined by the Bureau of Labor Statistics 
(i.e., less than 35 hours a week), but who 
desire a full-time job. Fourth, the “working 


where, U is the unemployed, DW is “discouraged 
workers,” PT is the involuntary part-time em- 
ployed, Y is the working poor, and E is total 
employed family heads. 

Here I refer to nonblacks as whites. Not only 
does this aid the discussion, but the vast majority 
of nonblacks in the nonmetropolitan South are 
white. 
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poor” or low income category includes those 
whose labor market earnings in the past year, 
adjusted for weeks worked, are less than 1.25 
times the individual poverty threshold, as 
defined by the Social Security Administration 
(see Clogg and Sullivan 1983, p. 144 for 
details). This measure of earnings inadequacy 
identifies full-time workers whose jobs fail to- 
provide earnings significantly above poverty 
levels. Other sources of income (e.g., public 
assistance) are not included in this definition. 

The sum of these categories provides a 
useful composite measure of employment- 
related hardship (Clogg 1979).5 This additive 
feature is possible because the LUF is a 
hierarchical measurement scheme, with hard- 
ship categories ordered from most (i.e., 
sub-unemployment) to least (ie., low in- 
come) severe. Thus, workers either are 
adequately employed or are included in a 
LUF category that defines the most severe 
employment hardship they experience. For 
more detailed discussion of measurement 
procedures, see Clogg (1979) or Clogg and 
Sullivan (1983). 


FINDINGS 


Nonmetropolitan Black Employment 
Hardship, 1970-85 


Perhaps the most striking result is that 
southern nonmetropolitan blacks had remark- 
ably high levels of employment hardship from 
1970 to 1985.4 Regardless of year, about 


3 Clogg (1979) and others (Lichter 1988; Lichter 
and Costanzo 1987) have used this composite as a 
summary measure of “economic underemploy- 
ment.”. 

* A common problem with analyzing CPS data 
over several years is the inability to adjust for 
changing designations of metropolitan and nonmet- 
ropolitan over time. Any evidence here of spatial 
convergence in employment hardship, however, is 
not likely to be an artifact of changing definitions. 
Instead, changes from nonmetropolitan to metro- 
politan are likely to have the effect of exacerbating 
metro-nonmetro differences in employment hard- 
ship, if it is assumed that areas “growing up” to 
metropolitan status are likely to be more economi- 
cally vital than those left behind. Changing 
definitions are likely to accentuate rather than 
reduce metro-nonmetro differences. As I show 
later, the employment circumstances of blacks in 
southern nonmetropolitan areas appear to have 
improved relative to recent changes in metropolitan 


\ 
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Table 1. Black Employment Hardship in the Nonmetro- 
politan South, 1970—1985 
Categories . 

Yea — SU UN LH LI Total 

1970 2.99% 6.81% 11.37% 23.19% 44.3596 








1975 3.64. 16.36 12.18 16.18 48.36 
1980 3.42 12.51 7.79 16.40 40.12 
1985 4.52 16.32 10.48 15.19 46.52 


Note: SU=Sub-unemployed; UN= Unemployed; 
LH=low hours; LI=low income; Total=sum of SU, 
UN, LH, and LI. 


two-fifths to nearly one-half of nonmetropoli- 
tan blacks experienced some form of employ- 
ment hardship (see Total column, Table 1). 
These data also reveal that unemployment 
represents only a minority share of overall 
hardship. For instance, black unemployment 
in 1985 was very high—over 16 percent—yet 
it represented only about one-third (i.e., 
16.32/46.52) of all black employment-related 
hardship. Other forms of “hidden unemploy- 
ment,” such as worker discouragement and 
part-time employment, contributed over one- 
half of the total labor market hardship 
experienced by nonmetropolitan blacks. 

Although black hardship was substantial 
during 1970-85, there is little evidence that 
levels changed systematically over this pe- 
riod. Rather, black hardship fluctuated largely 
in response to national or cyclical shifts in the 
economy. Unemployment especially was great- 
est in the 1975 recession and in 1985, when 
the farm crisis and declines in resource-based 
industries were most felt in nonmetropolitan 
areas. 


Spatial Convergence in Black 
Employment Hardship 


To what extent have the employment circum- 
stances of southern nonmetropolitan blacks 
converged with those of blacks living in the 
metropolitan South and in the North? As 
shown in Table 2, southern metropolitan 
blacks generally experienced lower levels of 
employment hardship than their nonmetropo- 
litan counterparts. In 1985, for example, total 
black hardship in the nonmetropolitan South 
exceeded that in the metropolitan South by 


areas. Despite this reassuring finding, some 
caution nevertheless should be exercised in inter- 
preting changes in employment hardship since 
1970. 


about 13 percentage points (column 5, panel 
2), and the nonmetro-metro ratio of southern 
black hardship was 1.39 (column 5, panel 3). 
Moreover, in nearly every comparison across 
years and employment hardship categories, 
nonmetropolitan percentages were. substan- 
tially higher than in southern metropolitan 
areas. . ; E. 

The data in Table 2 also provide some 
evidence of metro-nonmetro convergence in 
southern black employment-related hardship. 
The ratios of southern nonmetropolitan to 
southern metropolitan black hardship declined 
modestly during this period. Whereas south- 
ern nonmetropolitan black hardship was about 
1.7 times greater than that of southern 
metropolitan blacks in 1970, it was only 1.4 
times greater in 1985. Much of this conver- 
gence was due to declining spatial differences 
in the low income category. In 1970, 
nonmetropolitan black employment hardship 
exceeded metropolitan black hardship by 
about 9 percentage points, declining to under 
4 percent by 1985. The nonmetro-metro 
ratio declined from 1.66 to 1.34. 

Evidence of spatial convergence in black 
employment hardship is further reinforced in 
the analysis of northern blacks. In Table 3, 
the absolute difference between nonmetropo- 
litan black and northern black employment 


Table 2. Black Employment Hardship in the Metro 
South, 1970-1985 


Categories 
SU UN LH LI Total 
Percentage: i 
1970 1.88 6.23 452 14.01 26.64 
1975 2.59 10.93 6.62 10.60 30.74 
1980 2.84 9.76 5.67 10.66 28.93 
1985 2.79 12.12 7.33 11.31 33.55 
Absolute 
Difference:* 
1970 1.11 58 6.85 9.18 17.71 
1975 1.05 5.43 5,56 5.58 17.62 
1980 58 2.75 2.12 5.74 11.19 
1985 1.73 4.0 3.15 3.88 12.97 
Ratio» 
1970 1.59 1.0 2.52 1.66 1.66 
1975 1.41 1.50 1.84 1.53 1.57 
1980 1.20 1.28 1.37 1.54 1.39 
1985 1.62 1.85 1.43 1.34 1.39 





Note: SU-Sub-unemployed; UN = Unemployed; 
LH-]ow hours; Li-1ow income; Total=sum of SU, 
UN, LH, and LI. 

* Absolute difference in black employment hardship 
between the nonmetro and metro Soutn. 

* Ratio of black employment hardship in the nonmetro 
South to black hardship in the metro South. 
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Table 3. Black Employment Hardship in the North, 








1970-1985 
Categories 
SU UN LH ul Total 
Percentage: 
1970 2.17 7.48 3.21 7.28 20.14 
1975 4.00 16.67 5.11 5.71 31.68 
1980 2.92 13.03 4.36 5.94 26.25 
1985 5.07 16.40 7.23 10.83 39.53 
Absolute 
Difference 
1970 82 —.67 8.16 15.91 24.21 
1975 —.56 -—.3 7.07 10.47 16.68 
1980 50 -—.52 3.43 10.46 13.87 
1985 —.55  —.08 3.25 4.36 6.99 
Ratio? 
1970 1.38 .91 3.54 3.19 2.20 
1975 .87 .98 2,38 2.83 1.53 
1980 1.17 96 1.79 2.76 1.53 
1985 .89 1.00 1.45 1.40 1.18 


Note: SU=Sub-unemployed; UN -Unemployed; 
LH=low hours; LI—low income; Total- sum of SU, 
UN, LH, and LI. 

* Absolute difference in black employment hardship 
between the nonmetro South and the North. 

> Ratio of black employment hardship in the nonmetro 
South to black hardship in the North. 


hardship declined from over 24 percentage 
points in 1970 to a comparatively modest 7 in 
1985 (column 5, Table 3). Likewise, the ratio 
of southern nonmetropolitan black to northern 
black hardship was nearly halved over this 
period. By 1985, nonmetropolitan blacks in 
the South experienced a rate of employment- 
related hardship that was about 18 percent 
higher than that for blacks in the North. In 
1970, nonmetropolitan black hardship was 
120 percent higher! Thus, the period since 
1970 was marked by considerable conver- 
gence between. southern nonmetropolitan 
blacks and blacks living elsewhere, especially 
in the North. 

This pattern of nonmetropolitan conver- 
gence with northern blacks was not uniform 
over all hardship categories. In fact, nonme- 
tropolitan black joblessness (i.e., the SU and 
UN components) over the 1970-85 period 
was often lower than that experienced by 
blacks in the North. Most of the overall 
convergence can be attributed to convergence 
in the low hours and low income categories 
(see panels 2 and 3, Table 2). The absolute 
difference in low income declined from about 
16 to 4 percent during 1970—85, and the ratio 
declined from 3.2 to 1.4. For low hours, the 
absolute difference declined from about 8 to 3 
percent, and the ratio from 3.5 to 1.4. 
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Declines of such magnitude represent substan- 
tial convergence between employed nonmetro- 
politan blacks in the south and northern 
blacks. 

The spatial convergence in black hardship 
levels is due to the fact that the employment 
circumstances of blacks in metropolitan areas 
deteriorated over the 15-year period. This was 
especially so in the North, where black 
hardship rates nearly doubled between 1970 
and 1985. Indeed, employment-related hard- 
ship among northern blacks in 1985 was 
greater than that of southern metropolitan 
blacks, a reversal of the pattern observed in 
1970 when northern blacks were the least 
disadvantaged of the spatial groups consid- 
ered here. Clearly, the quality of black 
employment in southern nonmetropolitan ar- 
eas has not been elevated to the historically 
more favorable circumstances found else- 
where. Rather, black employment elsewhere 
has become increasingly like the depressed 
conditions that have long characterized the 
rural South (see Lichter 1988). 


Racial Inequality in the Nonmetropolitan 
South, 1970-85 


Data in Table 4 provide striking evidence of 
continuing racial differences in nonmetropoli- 


Table 4. White Employment Hardship in the Nonmetro 
South, 1970—1985 














Categories 
SU UN LH LI Total 
Percentage: 
1970 109 3.65 3.45 10.19 18.38 
1975 1.52 8.11 5.47 8.85 23.95 
1980 78 5.14 3.44 9.45 18.82 
1985 . 1.04 6.73 5.25 10.85 23.87 
Absolute 
Difference? 
1970 1.90 3.16 7.92 13.00 25.97 
1975 2.12 825 6.71 7.33 24.41 
1980 2.64 737 435 6.95 21.30 
1985 3.48 9.59 5.23 4.34 22.65 
Ratio” 
1970 2.74 1.87 3.30 2.28 2.41 
1975 2.39 2.02 2.23 1.83 2.02 
1980 4.38 2.43 265 1.74 2.13 
1985 4.35 2.42 2.00 1.40 1.95 
Note: SU=Sub-unemployed; UN=Unemployed; 


LH=low hours; LI—low income; Total=sum of SU, 
UN, LH, and LI. 

* Absolute difference in employment hardship between 
blacks and whites in the nonmetro South. 

* Black/white hardship ratio in the nonmetropolitan 
South. 
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Table 5. Logit Models of Employment Hardship in the Nonmetro South, 1985 








1 2 3 4 
B SE B SE B SE B SE 

Constant —.649 .028 —.640 .028 —.761 .032 —1.105 .042 
Race (Black — 1) .S11 .028 .500 .028 .421 .030 .353 .030 
Sex (Female — 1) — — .301 .023 .353 .024 444 .026 
Education: 

LT 12 — — — — .712 .036 .487 .039 

12 — — — — —.038 .033 —.083 .034 
Age: 

18-29 — — — — .465 .034 .41 .035 

30—49 — — — — —.227 .033 —.202 .034 
Industry." 

Periphery — — — — — — .228 .036 

Trades — — — — — — .131 .035 
Occupation: 

Managerial and professional — — — — — — —.638 .064 

Tech., sales, admin. support — — — — — — —.034 .048 
Likelihood ratio Chi-squared 1629.1 1457.3 851.5 508.7 
df 268 267 263 259 





* See Hodson (1983) for detailed description of occupational categories. 


tan hardship throughout the 1970-85 period. 
Regardless of period, overall employment- 
related hardship among nonmetropolitan blacks 
was substantially higher (i.e., roughly 20—25 
percentage points) than among their white 
counterparts (column 5, panel 2). Moreover, 
each form of hardship was higher among 
nonmetropolitan blacks than whites during 
this 15-year period. Black-white ratios, for 
example, ranged between 1.4 (ie., LI in 
1985) and 4.4 (i.e., SU in 1985). Compared 
with nonmetropolitan whites, southern nonmet- 
ropolitan blacks today continue to face the 
prospect of substantial employment-related 
hardship. 

Some evidence points to racial convergence 
in the nonmetropolitan South. The total 
employment hardship ratio declined from 2.4 
in 1970 to 2.0 in 1985.5 Convergence in the 
low hours and low income components was 


5 In previous work on central city men aged 
18—29 (Lichter 1988), black-white ratios increased 
during the 1970-82 period. Racial divergence for 
this age-group is less apparent in the nonmetropo- 
litan South. Black-white employment hardship 
ratios were 2.22, 1.89, 1.82, and 1.98 for 1970, 
1975, 1980, and 1985, respectively. Absolute 
black-white differences, however, were greatest in 
1985. Levels of hardship were substantially higher 
among black young adults than among their white 
counterparts (59.94 percent vs. 30.26 percent in 
1985). And the high labor market distress of black 
young adults was significantly greater than that of 
the total southern nonmetropolitan black popula- 
tion (i.e., over 46 percent in Table 1). 


most responsible for this pattern. The black- 
white low income ratio declined over this 
period from 2.3 to 1.4. On the other hand, 
racial differences in joblessness (i.e., SU and 
UN) were greater in the 1980s than in the 
1970s. As for the total U.S. black population 
(Farley 1984), such results imply a growing 
economic polarization among nonmet- 
ropolitan blacks. That is, employed nonmet- 
ropolitan blacks have become more similar to 
whites in the likelihood of holding an 
"adequate" job (i.e., jobs not marked by low 
hours or low pay), while at the same time 
racial differences in joblessness have gener- 
ally increased. 

Sources of nonmetropolitan racial inequal- 
ity. Black-white inequality may be attributed 
in part to racial differences in human capital or 
other job circumstances, such as employment 
inlow-wage, low-skill industrial sectors or oc- 
cupational categories (e.g., see Falk and Ly- 
son 1988; Parcel 1979). Table 5 provides sev- 
eral logit models that evaluate in a multivariate 
framework the effects of race on employment- 
related hardship in 1985. The dependent vari- 
able in this analysis is dichotomous; it distin- 
guishes whether or not workers experience any 
of the hardship forms in the LUF in 1985. 
Independent variables aré the following: race 
(black = 1) sex (female = 1); age (18-29; 


$ Effect coding is used throughout this analysis. 
This coding scheme is the default option in SPSS* 
and it means that the logit coefficients for 


442 


30—49; and 50-64 as the reference category); 
education in completed years (less than 12 
years; 12 years; and 13 or more years as the 
reference category); industry (periphery; trades; 
and other industries as the reference category); 
and occupation (managerial and professional 
specialities; technical, sales, and administra- 
tive support; and other occupations represent- 
ing the reference category).® Results from this 
analysis provide a basis for drawing inferences 
about the effects of race vis-à-vis human cap- 
ital and job characteristics on employment- 
related hardship. 

In model 1 of Table 5, the logit coefficient 
for race suggests that the odds of employment- 
related hardship among nonmetropolitan blacks 
are about 1.67 (i.e. , e?!) times those of rural 
whites.? The estimated probability of black 
employment hardship (ie., Pj) can be 
calculated as (Hanushek and Jackson 1977): 


P, = exp(Z)/(1 + exp(Z,)) 
= exp(—.138)/(1 + 

exp( — .138)) 

.466 


where Z; is the predicted value of 


categories (including the reference category) of 
each independent variable sum to zero. 

7 The industrial categories refer to industry of 
current or longest employment (if unemployed). The 
industrial categories identified here are taken from 
Hodson's (1983) six-category industrial sector clas- 
sification scheme. Periphery industries represents 
one of these sectors and includes those cJassified as 
periphery, small shop, and real estate from his 16- 
category scheme. Trades, as defined by Hodson, 
includes local monopoly, education and nonprofit, 
and agriculture industries. As shown by Hodson 
(1983), workers in periphery and trades industry 
sectors have much lower mean earnings than those 
in other categories. For specific detailed industry 
codes describing these categories, see Hodson (1983, 
Table 4.4, pp. 73-77). 

8 These categories are based on the Standard 
Occupation Classification (SOC) codes provided 
by the U.S. Bureau of the Census. Managerial and 
Professional Specialty occupations comprise SOC 
codes 3 through 199. Technical, Sales, and 
Administrative Support occupations are defined by 
SOC codes 203 through 389. 

? To aid interpretation, multiplicative effects on 
the odds of employment hardship can be calculated 
by taking the natural antilogarithms of the logit 
coefficients (i.e., exponentiating the logit coeffi- 
cients). Values exceeding 1.0 indicate an increased 
odds of employment-related hardship, while 
values Jess than 1.0 indicate a decreased odds. 
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In|PJ(1—Pj)] ie. —.649 + 511 = 
—.138 from model 1. Not surprisingly, the 
estimated probability of .466 is very similar 
to the hardship percentage reported earlier in 
Table 1. 

The race effect from model '1 could be due 
in part to racial differences in.the sex 
composition of the labor force. Nonmetropol- 
itan females typically experience more em- 
ployment-related hardship than do males, and 
the female percentage is greater in the black 
than in the white labor force (Brown and 
O’Leary 1979). Yet, when sex is included in 
model 2, the race effect is reduced only 
slightly to .500 and its magnitude exceeds 
that of the sex effect. Net of racial differences 
in sex composition, the odds of employment 
hardship among southern nonmetropolitan 
blacks remains considerably higher than those 
among whites. 

In model 3 (Table 5), education and age 
have substantial effects on the probability of 
employment-related hardship in nonmetropol- 
itan labor markets of the South. Being poorly 
educated increases the odds by a factor of 
2.04 (e7!2), and young adults experience 
substantial levels of employment hardship 
(e*6> = 1.59). More significantly, model 3 
indicates that the race effect is not substan- 
tially reduced when the human capital vari- 
ables of education and age (as a proxy for 
experience) are included. The odds of black 
employment-related bardship are still 1.52 
(e^?) times those of whites, net of racial 
differences in age and education. 

Finally, the inclusion of occupation and 
industry in model 4 has only a modest effect 
on black-white differences in overall hard- 
ship, with the race coefficient reduced from 
.421 (in model 3) to .353. Workers in the 
periphery sector, however, are significantly 
more likely to face employment hardship than 
are those in other sectors. Not surprisingly, 
those in managerial and professional occupa- 
tions are least likely to experience any of the 
forms of hardship in the LUF, while the odds 
of employment hardship in largely blue-collar 
positions (i.e., the reference category) are the 
greatest. 

Such results do little to assuage claims of 
continuing black employment discrimination 
in the nonmetropolitan South. Hardship rates 
in the 1970s and 1980s were about twice as 
large for nonmetropolitan blacks as for 
whites—differences that cannot be explained 
away by black deficits in human capital or by 
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the disproportionate concentration of blacks 
in declining industrial sectors or in blue-collar 
occupations. There is little indication here of 
a declining significance of race in the 
nonmetropolitan South. 

Race-sex differences in nonmetropolitan 
employment hardship. To further illuminate 
the sizeable magnitude of race effects in the 
nonmetropolitan South, Table 6 (line 1) 
provides the employment hardship probabili- 
ties for each race-sex group. These probabili- 
ties are calculated from results of model 2 in 
Table 5 and range from .191 for white males 
to .540 for black females. Moreover, the 
differences in employment hardship reported 
here are not substantially changed when 
race-sex differences in education, age, occu- 
pation, and industry are purged (line 2). 
Clearly, problems of employment marginality 
are especially acute among black females, a 
contrast to Farley and Allen’s (1987) observa- 
tion of minimal earnings differences between 
U.S. black and white women in the 1980s. 

Table 6 (line 3) also provides conditional 
probabilities for workers located in education, 
age, industry, and occupation categories with 
the least hardship (based on results of model 
4, Table 5).!° These employment hardship 
probabilities are very small for each race-sex 
group, especially when compared to the 
overall probabilities reported on line 1. Not 
surprisingly, employment-related hardship was 
highly improbable (P = .029) for white 
males. 

The conditional probabilities for the high 
employment hardship profile tell a decidedly 
different story. The race-sex differentials 
obviously remain, but the levels of employ- 
ment hardship are extremely high for all 
race-sex groups. Probabilities ranged from 
.474 for white males to a disconcertingly high 


10 The logit coefficients also can be used to 
estimate the probability of employment hardship 
for any given labor force profile. Consider the 
following logit equation: Zn[P/(1— P,)] = LB X, 
= Z. Any probability, P;, can be computed as 
P, = exp(Z)/(1+exp(Z)). As an illustration, the 
probability of hardship among nonmetropolitan 
black females with a “high employment hardship 
profile" is computed as exp(—1.105 + .353 
+ 444 + 487 + 411 + .228 — (—.638 
—.034)/(1 + exp(—1.105 + .353 + .444 
+ 487 + 411 + .228 — (—.638 —.034), 
based on the results of model 4, Table 5. As 
reported in Table 6 (line 4), this conditional 
probability is .816. 
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Table 6. Probabilities of Employment Hardship Among 
Race-Sex Groups in the Nonmetro South, 1985 


Black White Black White 
Male Male Female Female 


1. Probability* 392 .191 .540 .302 

2. Probability: 

Purged of effects of 

independent variables" 

3. Conditional probability: 
Low employment 
hardship profile? 

4. Conditional probility: 
High employment 
hardship profiled“ .646 .474 .816 
* Calculated from results of model 2, Table 5. 
> Calculated from results' of model 4, Table 5. The 

effects of age, education, industry, and occupation are 

held constant by assigning zero effects to each of these 
variables. 

* Calculated from results of model 4, Table 5. The 
effects of age, education, industry, and occupation are 
held constant by assigning the effects of labor force 
categories with the lowest hardship. 

4 Calculated from results of model 4, Table 5. The 
effects of age, education, industry, and occupation are 
held constant by assigning the effects of labor force 
categories with the highest hardship. 


.2332 .131 .424 266 


.057 .029 .128  .068 


.687 


value of .816 for black females. Simply, 
these results mean that poorly educated young 
adults, working as nonprofessionals in the 
periphery industrial sector, face an extraordi- 
narily high likelihood of labor force margin- 
ality in the nonmetropolitan South. 

The above results highlight race-sex differ- 
ences in employment-related hardship. Yet 
previous studies of southern rural labor 
markets have also shown that the effects of 
human capital and job characteristics (e.g., 
industry of employment) on labor market 
outcomes may vary considerably across 
race-sex groups (Horan and Tolbert 1984). 
Table 7 provides separate logit models for 
each race-sex group in the nonmetropolitan 
South. 

First, it is clear that education is least 
strongly related to employment-related hard- 
ship among black females and most important 
among white males. Thus, among the most 
highly educated (i.e., completed schooling 
exceeds 12 years), the odds of hardship 
among black females are reduced by a factor 
of .74 (i.e., e (?18 + 989 or 714 is multiplied 
by the average odds of hardship). Among 
white males, the odds are reduced by a factor 
of .66. Thus, “returns” to education in the 
form of reduced employment hardship (or 
adequate employment) are apparently less in 
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Table 7. Logit Models of Black and White Employment Hardship in the Nonmetro South, 1985 








Black Male White Male Black Female White Female 
B ‘SE B SE B SE B "SE 
Constant —1.256 .204 —1.729 .061 —.499 ..136 —].053 .053 
Education: 
LT 12 .581 .120 .606 .059 .218  .135 .367  .067 
12 —.291 .117 —.190 .055 086  .110 .018 ..053 
Age: 
18-29 .681 .114 .402 .056 .936 — .125 .237  .056 
30-49 —.152 .112 —.224 .054 -—-.4:6 .108 —.121 .053 
Industry* 
Periphery «172 115 .457 .055 086  .116 .108 .061 
Trades —.019 .112 .022 .057 257  .127 .235 054 
Occupation: 
Managerial and professional —.558 .328 —.532 .103 —.665  .233 ~.744 .093 
Tech., sales, admin. support —.166 .314 .133 .090 —.072 .170 -—.107 .063 
Likelihood ratio Chi-squared 62.6 84.1 66.7 171.0 
df^ 46 71 47 70 





* See Hodson (1983) for detailed description of occupational categories. 
> Calculated as number of nonzero fitted cells minus number of parameters estimated. See Clogg and Eliason 


(1987). 


evidence for rural blacks than they are for 
whites—especially men. 

The results of Table 7 also suggest that 
being young imposes a special hardship on 
today's nonmetropolitan black men and 
women. Indeed, the odds of employment 
hardship are increased by a factor of 1.98 for 
black men and 2.55 for black females in the 
young adult age categories. These effects are 
only 1.49 and 1.27 for white men and white 
women, respectively. Such racial differences 
portend a potentially ominous future for 
blacks in the nonmetropolitan South, particu- 
larly if the high rate of employment-related 
hardship is a cohort phenomenon that persists 
as young blacks make the transition to middle 
age. 

Finally, periphery industry and occupation 
effects appear much weaker for black men 
than for white men. For black men, experi- 
encing employment problems apparently has 
less to do with their location across occupa- 
tional and industrial sectors than with their 
human capital. Among black women, on the 
other hand, industry and occupation of 
employment matter greatly. The odds of 
employment hardship among black women 
are increased by a factor of 1.29 if employed 
in the trades sector, and by a factor of 2.15 if 
employed in occupations other than manage- 
rial/professional and technical, sales, and 
administrative support. 

These results provide little optimism regard- 


ing the effectiveness of black educational 


upgrading in reducing racial inequality in the 


nonmetropolitan South (Durant and Knowlton 
1979). Racial differentials in employment 
hardship would persist even if blacks acquired 
greater skills and education or found “better” 
jobs (as measured by occupation and indus- 


try). 


CONCLUSION 


The legacy of black employment hardship 
continues to be felt in nonmetropolitan areas 
of the South, where two out of every five 
blacks are without jobs, cannot find a 
full-time job, or cannot earn enough to raise 
themselves much above poverty thresholds. 
Despite the recent achievements of the Civil 
Rights Movement, changing racial attitudes, 
continuing urbanization, and the resurgence 
of black population growth in the South, 
overall employment conditions of nonmetro- 
politan blacks have not changed appreciably 
since 1970. 

Clearly, the growing concern with the 
black urban “underclass” (e.g., Wilson 1987) 
should not deflect attention from nonmetropo- 
litan blacks. Rural blacks, especially black 
females, remain amcng the most economi- 
cally disadvantaged groups in the United 
States. Farley and Allen (1987) have sug- 
gested that the urban black underclass today 
resembles the rural sauthern peasantry of the 
late 19th and early 20th century. The results 
of my study also show that the labor market 
experiences of metropolitan blacks increas- 
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ingly resemble those of nonmetropolitan 
blacks of the late 20th century. 

This apparent convergence in the employ- 
ment experiences of nonmetropolitan blacks 
with those of blacks living elsewhere is cause 
for national concern. The employment circum- 
stances of metropolitan blacks, especially 
those in the North, have deteriorated rapidly 
over the past decade or so (see Lichter 1988). 
For nonmetropolitan blacks, migration to 
urban centers has historically provided an 
escape from the restricted employment oppor- 
tunities in rural areas. But this adaptive 
mechanism may not apply today. The narrow- 
ing of employment conditions between the 
nonmetropolitan South and large cities in the 
North and elsewhere may anchor disadvan- 
taged blacks to particular locales. 

Finally, nonmetropolitan blacks have not 
shared equitably in the substantial employ- 
ment and population growth of the South 
during the 1970s and 1980s. The emergence 
of the so-called “New South” has apparently 
bypassed many nonmetropolitan and rural 
blacks, who, like generations before them, 
continue to face high unemployment or fill 
mostly menial and low-paying jobs. Indeed, 
concerns about the economically disadvan- 
taged rural black population, which some 
regard as a continuing legacy of the southern 
slave and plantation economy (Durant and 
Knowlton 1978), remain as salient today as 
ever. 


DANIEL T. LICHTER is Associate Professor 
of Sociology at The Pennsylvania State 
University. The present report on the chang- 
ing employment circumstances of rural blacks 
is a companion of “Racial Differences in 
Underemployment in American Cities” (AJS, 
1988). His recent research focuses on labor 
force transitions between “good” jobs and 
“bad” jobs, and on the link between local 
marriage market conditions and black-white 
differences in marriage patterns. 
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THE REAL AND UNREALIZED CONTRIBUTIONS OF- 
QUANTITATIVE SOCIOLOGY 


HUBERT M. BLALOCK, JR. 
University of Washington 


There are real trends in the quality of data analysis, owing to the computer 
revolution, interdisciplinary borrowing, and the availability of large data sets. 
Less obvious are improvements in the quality of data collection, conceptualization 
and measurement, or the ignoring of data gaps. Nor has the quality of our training 
or our students improved. The result is an increasing communication gap and 
failure of practices to keep pace with the technical literature. A case for 
quantification is made. Ideally, hidden assumptions are made explicit, common 
sense can be clarified and refined, systematic séarch procedures developed, . 
intractable problems located, and new theoretical insights obtained. But there are’ 
disciplinary obstacles to improving training, increasing the quality of graduate 
students, improving editorial policies, and promoting on the basis of quality rather 


than quantity of publications. We need to face this quality problem head on. 


Any attempt to evaluate the contributions of 
quantitative approaches to the development of 
our discipline must necessarily be interpretive 
and somewhat speculative in many respects. 
Certain selected trends and their causes are 
obvious, others may be more apparent than 
real, and still others may be nonexistent. 
Assessing causal impacts is a hazardous 
enterprise at best. Yet I believe that it is 
essential to make such appraisals even where 
they may be subject to dispute. I have been 
inspired in this respect by the recent critique 
of macrosociological theory by Gerhard 
Lenski (1988) and believe that my own 
remarks will be complementary to his. Lenski 
argued that macrosociologists have failed to 
specify their theories in testable form and 
have paid inadequate attention to conceptual- 
ization, the consistent use of precise terminol- 
ogy, and the explicit statement of rejectable 
propositions. I shall argue that most quantita- 
tive empirical analyses have also failed to 
grapple with required assumptions, conceptu- 


alization and measurement, implications of. 
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substantial data gaps, 
generalizability. 

Certain trends in quantitative sociology are 
obvious and rather easily explained. The 
computer revolution has made it possible to 
carry out many kinds of tedious calculations 
that were simply out of the question even 
several decades ago. Whereas most quantita- 
tive analyses that appeared in the sociology 
journals in the 1940s and 1950s involved 
rather straightforward tabular analyses of 
frequency distributions, the kinds of complex 
multivariate analyses that now regularly 
appear in our journal literature were ex- 
tremely rare. Without the aid of standard 
computer packages it seems doubtful that 
empirically inclined sociologists would ever 
have attempted to employ the modes of 
analysis that are now recommended in our 
methodological literature. ` 

There has also been a substantial increase 
in our ability to borrow quantitative ap- 
proaches developed in other disciplines. The 
cohorts of methodologists that preceded my 
own were simply too small to train graduate 
students, advise their colleagues, do their 
own research, and simultaneously keep abreast 
of developments in statistics, econometrics, 
biostatistics, psychometrics, and other ap- 
plied fields. As the number of sociologists 
expanded during the 1950s and 1960s, so did 
the absolute numbers of methodological 
specialists, who were then in a position to 
specialize and call our attention to important 
developments in these other disciplines. 
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and problems of 
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Indeed, it is difficult to overemphasize the 
critical importance of borrowing in account- 
ing for the very rapid diffusion of methodo- 
logical sophistication that has occurred in our 
discipline over the past three decades. 

Large data sets in selected fields of 
sociology are now much more available than 
was the case even 20 years ago. Most of these 
data sets have involved major surveys, but 
increasingly valuable historical and demo- 
graphic data from around the world have also 
come into common use. Much of the credit 
for this development should be given to the 
efficient archiving work of the Inter- 
University Consortium for Political and So- 
cial Research (ICPSR). Credit should also be 
given to some of the major funding agencies, 
such as the NSF, for encouraging and even 
requiring principal investigators to make their 
data available to the general social science 
user. 

Some other trends are not quite as obvious. 
Certainly, there has been a rather steady 
increase in both the percentage and sophisti- 
cation of quantitative journal articles in our 
most important general journals: the Ameri- 
can Sociological Review, American Journal 
of Sociology, and Social Forces. The picture 
with respect to books is far less obvious, and 
I am unaware of anyone so foolish as to 
attempt a frequency count of that huge and 
still growing type of publication outlet. It is 
my impression, however, that any trends that 
might exist are rather minor except, possibly, 
for selected monograph series. Quantitative 
data analyses, being compact, have a compet- 
itive advantage over qualitative research when 
it comes to the journals. The very opposite 
may actually be the case with respect to the 
book literature, which is controlled by 
publishers’ judgments as to the interests and 
training of a book’s potential audience, much 
of which is assumed to consist of laypersons. 

We must also keep in mind that there has 
been a proliferation of specialized journals in 
virtually all of our substantive fields. In some 
of these areas, such as population-ecology 
and the sociology of education, the trend has 
undoubtedly been in the quantitative direc- 
tion. There are other journals, however, that 
have actually been established to appeal to a 
nonquantitative audience, so that it becomes 
exceedingly difficult to assess the overall 
trends involved. l 

It is possible that there has been a slight 
trend favoring closer ties between quantitative 
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research, on the one hand, and sociological 
theory, on the other. For many years, in 
sociology, we seem to have had the notion 
that “theory” involves the study of what dead 
sociologists (primarily European ones at that) 
have had to say about major historical 
processes, rather than, say, what a theoretical 
physicist such as an Einstein or a Hawking 
actually does, namely thinking in terms of 
mathematical equations. In this respect we are 
much closer to historians than to economists. 
I believe there is nowadays a recognition that 
mathematical modeling is a form of theoriz- 
ing, but the dent that such modeling has made 
on the overall theoretical enterprise in sociol- 
ogy is indeed small, if not minuscule. 

Another critical kind of nontrend is in the 
amount of quantitative training that we 
regularly provide our graduate students. A 
handful of our most mathematically inclined 
students do get better training than members 
of my own cohort, and there are a somewhat 
larger number of them owing to the growth of 
our discipline, if nothing else. At the 
University of Washington, our more or less 
typical M.A. or doctoral student receives 
much the same training as did students at 
Michigan, when I began teaching in 1954. If 
anything, our incoming graduate students 
come to us with poorer math backgrounds 
than those I encountered in the 1950s and 
early 1960s, and the required methodology 
courses they must take—as well as the 
numbers they actually take—are fewer than 
my North Carolina students took during the 
heyday of the protesting 1960s! 

I am deeply concerned about an increasing 
communication gap as a result, with today’s 
graduate students being far less close to the 
cutting edges of methodological develop- 
ments than was the case several decades ago. 
There is simply too much technical material 
to be absorbed during the very limited time 
that our average graduate student devotes to 
statistics and methods courses. And virtually 
none of them take mathematical modeling 
courses that presuppose strong backgrounds 
in probability theory, differential equations, 
or linear or matrix algebra, Both of these 
facts, I believe, contribute to a tendency to 
substitute training in techniques and the 
uncritical use of computer programs rather 
than more in-depth knowledge that places a 
higher premium on the understanding of 
underlying assumptions and their implications 
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for theoretical interpretations of research 
findings. 

Although there have been substantial in- 
creases in the number and quality of sophisti- 
cated data analysis, trends in the quality of 
data collection are much less evident. Perhaps 
data gaps are not as evident as they used to 
be, but they are still very substantial. As 
noted, much large-scale quantitative research 
is heavily dependent on costly surveys or the 
happenstance of data collection policies that 
are beyond our control. In the field of race 
relations, for example, we have decent trend 
data that enable us to document changes in 
various kinds of racial inequalities, or our 
dependent variables, but there is a correspond- 
ing lack of trend data on independent 
variables, a fact which badly handicapped the 
work of the NAS Committee on the Status of 
Black Americans, of which I was a member. 

We still must rely very heavily on 
self-reports to get at actual behaviors, with 
the result that behaviors that occurred even in 
the not-too-distant past are seldom recorded 
or considered as based on reliable data 
sources. Event-history analysis is heavily 
“loaded” in favor of easily recalled, dramatic 
events such as job changes, births and deaths 
of children or other relatives, major illnesses, 
voting behaviors, and the like. The more or 
less mundane, regularly occurring behaviors 
and experiences that occurred, say, on the job 
or in husband-wife interactions do not lend 
themselves readily to such analyses and thus, 
in effect, enter into the error terms of our 
equations. So do memories of past events, 
value modifications, and a host of “lesser” 
interaction patterns that may have important 
cumulative effects. 

The general point is, of course, that data 
gaps are patterned and vary substantially by 
field and subfield. High quality quantitative 
data are far scarcer in the fields of compara- 
tive race relations, the sociology of religion, 
and family, for example, than they are in 
demography, the sociology of education, and 
stratification. Our ignoring or failing to 
recognize such data gaps may produce 
important biases in our interpretive accounts. 
What is of crucial importance is our ability 
and motivation to correct for such data gaps 
and to find ways of hitching together 
data-collecting techniques that complement 
one another in terms of the information each 
is capable of providing. Too often, we tend to 
ignore such gaps and to presume that 
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variables we have been unable to measure on 
grounds of expediency can safely be ne- 
glected for theoretical reasons as well. 
Undoubtedly, the lack of explicitly formu- 
lated theories that contain omitted as well as 
measured variables also reinforces such a 
tendency. It is but a short step from omitting 
variables in: particular studies to neglecting 
them altogether in the larger scientific enter- 
prise. This is one path that leads to 
disciplinary blind spots, and I am not all that 
certain that we have made genuine progress in 
this respect over, say, the past 30 years or so. 

The existence of readily available, though 
highly selective, secondary data, when com- 
bined with pressures that encourage sociolo- 
gists to “produce” large numbers of publica- 
tions rather than a smaller number of 
high-quality ones, has undoubtedly resulted in 
a tendency to push theoretical concerns into 
the background and to accept uncritically 
whatever measures are readily available. 
Where surveys are designed to provide trend 
data, there are also pressures to continue to 
employ inadequate measures used in prior 
studies so as to preserve superficial compara- 
bility; without paying sufficient attention to 
issues of dimensionality and validity. 

Related to this last observation is my 
concern that there has been little or no 
favorable change in our level of interest in 
problems of conceptualization and measure- 
ment. Certainly, there is presently far less 
interest in scaling procedures and the careful 
assessment of dimensionality than there was 
in the days of Paul Lazarsfeld, Louis Gutman, 
and Clyde Coombs. The tendency to rely 
heavily on standardized rather than unstan- 
dardized measures may, indeed, be a reflec- 
tion of this inattention to the difficult problem 
of arriving at a much higher degree of 
consensus on both theoretical and operational 
definitions of our basic concepts. 

Path analysis and causal or structural- 
equation modeling received rapid acceptance 
in our discipline during the 1960s because 
they fulfilled an important need and did not 
involve new or especially difficult statistical 
calculations. But one of the spin-offs of work 
in this area was increased knowledge about 
the implications of random and nonrandom 
measurement errors, as well as problems 
created by slippage between measured indica- 
tors and the theoretical constructs they were 
supposedly representing. 

This measurement-error literature stressed 
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the importance of obtaining multiple indica- 
tors and ran counter to our vested interests, 
however. It warned of substantial complica- 
tions that cannot be handled without consid- 
erably more attention being given to measure- 
ment and conceptualization problems, the 
careful construction of auxiliary measurement 
theories, and the much greater effort needed 
in connection with the expensive and time- 
consuming data-collection enterprise. Today, 
we see in the literature many path analyses, 
only a handful of which admit to the presence 
of measurement errors. Perhaps more accu- 
rately, one finds a large number of journal 
articles that briefly discuss the measurement 
of selected variables, that also admit to the 
possibility of errors, but that then effectively 
announce to the reader that the subsequent 
empirical analysis and related interpretations 
will proceed as though there were absolutely 
no measurement errors whatsoever! 

This is but one illustration of the more 
general point that methodological ideas are 
adopted when it is relatively easy and costless 
to do so, but that they are resisted or totally 
ignored when it is to the investigator’s vested 
interest to do so. There is also the related 
tendency to attempt to substitute sophisticated 
data analysis techniques for inadequate data 
collection procedures, which are of course far 
more costly and time consuming. 

Whenever students who lack even rudimen- 
tary backgrounds in mathematics are given 
only a superficial coverage of any particular 
methodological topic, together with a 
“cookbook” orientation to accompanying 
computer programs, such a tendency seems 
all the more likely. There are, in fact, 
numerous warnings and cautionary qualifica- 
tions existent in the general methodological 
literature, but our students may never learn 
about them or may soon suppress them in 
their eagerness to follow the most recent 
methodological fad. “Digging in” and facing 
up to the implications of complicating factors 
and unmet assumptions may actually be 
dysfunctional from the standpoint of grinding 
out publications, and when such practices are 
aggregated, patterned neglect may result. 
Analyses become routinized, and assumptions 
treated in a perfunctory manner. 


. THE CASE FOR QUANTIFICATION AND 
MATHEMATICAL MODELS 


In the abstract, the case for quantification and 


AMERICAN SOCIOLOGICAL REVIEW 


the use of mathematical modeling is a strong 
one. Sociology is a. very diffuse field, 
containing a bewildering number of ap- 
proaches, orientations, quasi-deductive argu- 
ments, tenuous connections between theory 
and empirically-based research findings, and 
very loose terminology. These facts are, I 
believe, obvious. How seriously one takes 
them and how.one believes they ought to be 
addressed are, however, matters of serious 
disagreement. 

One of the most important features of any 
form of mathematical modeling is that the 
critical role of assumptions is laid bare. In the 
ideal, all such assumptions must be made 
explicit and then carefully examined from the 
viewpoint of their realism in terms of “real 
world" phenomena the model is intended to 
represent. Typically, one begins with very 
simple assumptions, extracts their implica- 
tions deductively, compares them with empir- 
ical results using some sort of goodness-of-fit 
criterion, and then introduces complications a 
few at a time by lifting the least realistic 
assumptions and replacing them with some- 
what less restrictive ones that are still 
circumscribed enough to yield verifiable (i.e., 
rejectable) statements about the real world. 
The deductive reasoning process is confined 
entirely to the abstract model and its assump- 
tions, but the utility of the model is evaluated 
in terms of its ability to produce a set of 
predictions in the form of logically derived 
deductions that also successfully resist elimi- 
nation as a result of unsatisfactory "fit" to the 
data. 

This deductive process thus places a 
premium on our ability to make explicit 
assumptions that are both plausible in view of 
real-world processes and also restrictive 
enough to yield rejectable conclusions. This 
all sounds simple and reasonable enough in 
principle, but there are of course many 
complications and pitfalls to be avoided in 
practice. In particular, they depend rather 
heavily on the mathematical and statistical 
sophistication of both the user and the 
audience, the creation of a series of scholarly 
norms that result in a serious rather than 
perfunctory examination of whatever assump- 
tions are required to carry out the deductive 
process, and the avoidance of intellectual 
"blind spots" as well as the steady reduction 
in important data gaps. 
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Refining Common Sense 


One of the remarkable features of deductive 
reasoning is that it not only serves as a check 
on common sense rules of thumb, but also 
helps to refine our common sense in the 
process and to generate a new common sense 
based on a series of explicit assumptions. 
Before I began thinking seriously about 
causal modeling, for example, I encountered 
all sorts of folk wisdom about when one 
should and should not introduce control 
variables, a folk wisdom that was not helped 
very much by the assertion that “of course 
one should control for all ‘relevant’ vari- 
ables.” 

What amazed me, on examination of the 
very simple formulas for partial correlations 


and regression coefficients, was that these - 


formulas —based as they are on ordinary least 
squares or a simple curve-fitting procedure— 
yielded what were, for me, some very 
important implications that sometimes con- 
firmed and sometimes refuted existing rules 
of thumb about when controls should be 
introduced, when they are unnecessary, and 
when they are improper. It is unnecessary to 
control for antecedent causes, for example, 
provided that one is also willing to assume 
that such variables directly affect only one of 
the variables in one’s causal system. This is 
indeed fortunate, since it avoids the necessity 
of going back to Adam and Eve every time 
one wants io trace out the causal patterning 
among a finite set of variables. It does imply, 
however, a very restrictive set of assumptions 
about any recursive causal system, a fact that 
many empirical investigators tend to ignore. 

Similarly, these simple formulas derived 
from the algebra of ordinary least squares 
show that unstandardized regression coeffi- 
cients behave properly and will be invariant 
(except for sampling errors) under a variety of 
circumstances, whereas standardized mea- 
sures, such as correlation coefficients and 
path coefficients, will not. These include 
situations where there are design shifts, 
comparisons across samples, or. certain re- 
stricted types of cross-level analysis situations 
where one may wish to compare results of 
microanalyses with those using aggregated 
data 


What I have found remarkable in all of this 
is that a procedure that was designed for a 
very straightforward and basically simple set 
of analysis problems has yielded insights 


about much more philosophical matters con- 
cerning the logic of the scientific method and 
the conditions under which causal inferences 
can be legitimated. It has also yielded 
considerably more insights, however, in 
terms of a specification of the nature of the 
conditions under which such inferences are 
not legitimate. These, in turn, have resulted 
in a number of methodological works oriented 
to finding ways of correcting for, or at least 
assessing the biases in, existing modes of 
analysis. 

As cases in point, we now have a very 
extensive literature on complications pro- 
duced by random and nonrandom measure- 
ment errors, as well as a series of computer 
programs (e.g., LISREL) designed to assist 
investigators who attempt to build measurement- 
error assumptions into their theory-testing 
procedures. Another illustration is the grow- 
ing literature on cross-level analyses, includ- 
ing that on contextual effects and the 
aggregation-disaggregation problem (Blalock 
1984; Entwisle and Mason 1985; Erbring and 
Young 1979; Farkas 1974; Firebaugh 1978; 
Hannan 1971; Hannan and Burstein 1974; 
Hauser 1974; Hermalin and Mason 1980; 
Langbein and Lichtman 1978). Common- 
sense ideas about what are appropriate and 
inappropriate forms of data analysis in both 
these areas have thus been "tested" and 
considerably improved as a result of mathe- 
matical and statistical modeling approaches. 


Locating Intractable Problems 


Another important contribution of deductive 
modeling of social processes is that of helping 
us locate and analyze nearly intractable 
methodological problems that, because of 
their intriguing puzzle aspects, can often 
serve to sidetrack scholars for very long 
periods of time or lead to theoretical disputes 
that are inherently unresolvable without 
considerable refinemerits and an agreement on 
certain critical assumptions. For example, 
what appears to be a simple question as to 
whether X causes Y or Y causes X can be 
shown to be empirically unanswerable unless 
we can agree on a set of auxiliary assump- 
tions needed to resolve an "identification 
problem" that can be shown, mathematically, 
to involve too many unknowns relative to 
pieces of empirical information. 

For instance, if two disputants can agree on 
appropriate lag periods and collect data at 
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precisely those intervals, it may become 
possible to reformulate alternative theories 
involving those lagged variables, such that 
the parameters can be estimated and one or 
the other theory reformulated. If lags cannot 
be pinned down, however, or if alternative 
assumptions about predetermined variables 
cannot be specified, the dispute may remain 
empirically intractable. Similarly, in panel 
designs if there are simultaneously both 
(random) measurement errors and unspecified 
lag periods, it can be shown that the 
supposedly advantageous qualities of longitu- 
dinal data will be illusory (Duncan 1969). 

Where one independent variable is taken as 
an exact (or even nearly exact) linear function 
of two or more other variables, attempts to 
introduce all such variables as independent 
variables in an equation for some dependent 
variable will be futile unless the theories 
concerned are sufficiently well formulated to 
justify specific forms of nonlinearity or other 
complications capable of disentangling the 
effects of all such variables. Empirical studies 
of mobility effects, status inconsistency, 
age-period-cohort effects, strains due to gaps 
between aspirations and achievements, or 
those that attempt to sort out the effects of, 
say, husband-wife differences from those of 
the separate partners have all foundered, at 
one time or another, on this particular 
methodological difficulty. Unfortunately, the 
various approaches that have been developed 
to "resolve" such problems all require 
restrictive assumptions that basically require 
one to make strong theoretical assumptions, a 
fact that is often ignored by users who believe 
the most recent, popularly used "resolution" 
is now free of previously noted difficulties 
(Blalock 1967; Glenn 1976; Hope 1975; K. 
Mason, W. Mason, Winsborough, and Poole 
1973; Taylor 1973). 

The main point to emphasize is that 
carefully formulated deductive statistical or 
mathematical models can often show up 
intractable problems before they have en- 
grossed us for too long. They can also point 
to resolutions of these problems, provided 
that users are willing to make an additional 
set of restrictive assumptions based on a priori 
theoretical (rather than strictly empirical) 
arguments. 


Developing Systematic Search Procedures 


The use of statistical models and their 
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resulting data-analysis procedures can also 
provide evenhanded or "objective" search 
procedures and methods of evaluating the 
relative contributions of whatever variables 
have been included in the analysis. Thus the 
notion of "explained variance," if properly 
used and understood, involves a criterion for 
evaluating the contributions of different sets 
of variables no matter what they are or whose 
theoretical predictions are being put to the 
test. Provided that the investigator has been 
successful in obtaining nearly accurate mea- 
surement on all included variables and has 
seen to it that design features yield adequate 
variation in all such variables, each variable 
in a sense has an equal chance to explain 
whatever variance it can in the dependent 
variable(s) of interest. There are all kinds of 
"catches" in this general observation, includ- 
ing the complications produced by multicol- 
linearity (Gordon 1958), differential measure- 
ment errors, nonlinearities and interactions 
that have not been detected, and possible 
feedback effects on one or another of the 
supposedly independent variables. 

The essential point, however, is that the 
underlying statistical theory is "content free." 
The principles apply to whatever variables 
have been inserted into the roles of X, Y, or Z. 
In the ideal, they afford social science users 
with objectifiable criteria for evaluating 
alternative theories, and they also by implica- 
tion suggest systematic search procedures. 
Unlike a necessarily limited theory that 
suggests a small subset of statistical interac- 
tions, for example, they permit not only tests 
for these specific interaction terms but a mode 
of analysis that also includes a more explor- 
atory search for all possible two-factor 
interactions, as well as higher-order interac- 
tions in all possible combinations. Similarly, 
searches for nonlinearities can be routinely 
made, even where they are not anticipated in 
advance. Justifiable rules for eliminating 
variables can also be invoked by the investi- 
gator, letting empirical results rather than 
personal biases dictate which variables are 
retained and which eliminated after a prelim- 
inary empirical search. 

In my judgment, one of the most valuable 
features of statistical approaches to data 
analysis is in the systematic search for and 
possible elimination of complexities of differ- 
ent types. In causal modeling, for example, 
one can add and subtract variables in a 
straightforward way, introduce nonlinearities 
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and nonadditive joint effects, and allow for 
nonrecursive features in a portion of the 
model: If someone objects to a particular 
formulation, he or she can readily alter the 
model and, where adequate data can also be 
coliected, determine whether or not the 
revised model improves the fit to the data. 
Thus elaborations can readily be introduced, 
making it possible to join or synthesize 
several theories, rather than treating them as 
“alternatives,” only one of which can be 
accepted. One may strive for parsimony, 
while at the same time admitting of complex- 
ities that may result in the model’s improve- 
ment. General theories may be formulated, 
with special cases involving model simplifica- 
tions or elaborations that are explicitly 
incorporated. 


Providing New Theoretical Insights 


Finally, both statistical and mathematical 
models can provide important theoretical 
insights simply by striving to achieve precise 
formulations that help to generate new ways 
of looking at familiar and complex processes. 
Lewis Richardson’s (1960) armaments race 
models, for example, involved sets of simul- 
taneous differential equations designed to 
help explain interactive processes among 
multiple parties, each of which is assumed to 
respond to the actions of the others. The 
result may be either a continually escalating 
arms race or stabilization, with one or another 
party being in a dominant position at 
equilibrium, depending on the relative magni- 
tudes of different parameter values. This 
particular formulation of an important kind of 
dynamic process has stimulated a variety of 
alternative formulations, as well as a critical 
body of literature oriented to modifying the 
original formulation. 

Harrison White’s (1970) work on vacancy 
chains involved a mathematical formulation 
that provided a fresh orientation to mobility in 
reasonably closed organizations. Rather than 
taking the “driving mechanism” of occupa- 
tional mobility as residing in individual 
actors, White stressed that alterations in the 
structure of positions may create vacancies 
that permit persons to move upward (or 
possibly downward) in the organization, with 
the length of such chains being predictable by 
means of mathematical models. Thus the 
focus is placed on the combination of persons 
and positions, rather than either alone. This 
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orientation has been carried over into network 
analysis, which has been heavily influenced 
by White’s work. Taken together, the line of 
reasoning involved has introduced important 
new ideas about how to “bring structure back 
in” to sociological analyses. Unfortunately, it 
is my distinct impression that—perhaps be- 
cause of sociologists’ lack of training in 
mathematics—network analysts are forming 
an almost totally closed and relatively small 
network among themselves, with very few 
entry points into the larger body of sociolog- 
ical literature. A similar phenomenon may 
have occurred among the handful of sociolo-: 
gists interested in the stochastic modeling of 
diffusion processes. 

A striking illustration of progress in 
quantitative sociology can also be invoked to 


' stress the limitations of highly simplistic 


mathematical modeling. Early efforts to apply 
simple Markov Chain models to mobility 
tables resulted in rather poor fits to empirical 
data. Subsequent attempts to modify such 
models by dichotomizing between "movers" 
and "stayers" introduced additional parame- 
ters that provided slightly better empirical fits 
between data and models but also strained the 
credulity of the reader (Pullum 1975). It was 
not until Blau and Duncan's pioneering work, 
The American Occupational Structure (1967), 
that the highly restrictive mold of traditional 
mobility research was broken. 

The result has been a highly cumulative 
body of research, centered in the so-called 
"Wisconsin school" of status-attainment tra- 
dition, that has gradually been extended to 
include structural as well as individual-level 
analyses involving sophisticated structural- 
equation modeling (see, for example, Sewell 
and Hauser 1975; Featherman and Hauser 
1978; Wright 1979; Baron and Bielby 1980; 
Kalleberg, Wallace, and Althauser 1981; and 
Hodson 1983). In my judgment, work in this 
area provides perhaps the best illustration of 
genuinely cumulative research findings in 
sociology. Problems have been formulated 
with sufficient clarity to emphasize underly- 
ing assumptions that have subsequently been 
subject to challenge and modification. 

Sociologists have long been interested in 
phenomena that involve both individual- and 
group-level processes, but our theories have 
often left it very unclear just how one passes 
back and forth between micro- and macro- 
levels of analysis. Ever since W.S. Robin- 
son's (1950) article on the so-called *eco- 
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logical fallacy,” sociologists have recognized 
that one can encounter serious analysis and 
interpretive problems whenever one’s data 
and one’s theory are at different levels, but it 
has been difficult to formulate the exact 
nature of what these problems may be. 

As previously noted, there have developed 
twin bodies of literature dealing with, on the 
one hand, what have been referred to as 
“contextual” or “structural” effects and, on 
the other hand, problems of aggregation and 
disaggregation and the “consistency” of 
models formulated at two (or more) levels. It 
has been my experience that practicing 
sociologists are generally uninformed about 
these bodies of literature, and that cross-level 
analysis problems addressed in one field (say, 
educational sociology) may be nearly identi- 
cal in structure to those in another (say, 
demography), with virtually no overlap be- 
tween the two substantive bodies of literature. 
Once more, I believe that the technical 
literature is “out there” waiting to be read and 
the implications incorporated into the respec- 
tive bodies of empirical literature. The 
methodological literature in the field of 
sociology of education has been of especially 
high quality in this regard. f 

The rather recent body of literature on 
event-history analysis has also introduced 
some important theoretical ideas, or at least 
should encourage us to examine existing data, 
or collect new data, so as to take the timing 
and sequencing of events into consideration. 
Does it make a difference whether a woman 
takes a job first and has a child at a later time, 
or vice versa? Since we all experience major 
events at different points in time, with some 
of us due to experience them only after the 
time we are actually interviewed, how can our 
data analyses—and our theories—take these 
sequencing differences into account? As far 
as I am aware, questions such as these were 
not seriously addressed prior to the introduc- 
tion of a body of biostatistical techniques 
referred to as “survival analysis” (Allison 
1984; Tuma 1982; Tuma, Hannan, and 
Groeneveld 1979). 

As a final example, the extensive literature 
in economics dealing with (nearly) simulta- 
neous causal interrelationships has at least the 
potential of reshaping our thinking about 
feedback processes, how distributed lags can 
be handled, and the conditions under which 
dynamic processes can be expected to stabi- 
lize. The simple recursive models in common 
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use in sociology can be seen as restricted 
special cases that, more realistically, ought to 
be reformulated at least into block-recursive 
models of greater complexity. Yet we rarely 
see empirical studies that are designed to test 
such more complex nonrecursive models. The 
work of Kohn, Schooler, and their associates 
(1978, 1983) on the reciprocal relationships 
between the structure of work settings and 
personality variables constitutes an important 
exception to this generalization, and their 
careful efforts to superimpose measurement- 
error models onto their structural models need 
to be emulated. 

What I am saying, then, is that both the 
mathematical modeling tools and the neces- 
sary statistical knowledge are available to 
make possible the development and more 
effective use of quantitative approaches that 
have the potential of improving both the 
quality of our research and also our theoreti- 
cal thinking. There are further obstacles, 
however. Some of these appear to stem from 
the complex nature of the reality we are 
attempting to study, combined with important 
features of the sociological enterprise itself. 
To these I now turn. 


DISCIPLINARY BLINDERS AND 
OBSTACLES TO IMPROVEMENT 


I happen to believe that the methodological 
literature in sociology is of generally high 
quality and that it contains a variety of 
important cautions and recommendations that 
have been ignored or neglected in the conduct 
of day-to-day research and in journal report- 
ing practices and policies. Some of these 
warnings have been repeated in a variety of 
ways in our general journals and textbooks, 
but others have appeared in specialized 
outlets that are read by only a handful among 
us. Part of the problem is that of information 
overload; virtually none of us, even method- 
ological specialists, can keep track of or think 
deeply about the implications of this rapidly 
accumulating and rather technical body of 
literature. The excellent Sage methodology 
series, for example, now has over 65 
monographs, nearly all of which deal with 
different topics! l 

The “hidden” nature of cutting-edge mate- 
rials of a technical nature is reinforced in 
many instances by our vested interests in 
looking the other way. Almost a decade ago, 
Susan Long (1980) developed the argument 
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(in the specialized annual, Sociological Meth- 
odology) that so-called ratio variables are not 
inherently problematic, as some have claimed, 
but that whenever measurement-errors are 
present in any variable that appears in the 
numerators or denominators of several vari- 
ables, serious biases can easily result in 
highly misleading interpretations. For exam- 
ple, population undercounts affecting per- 
capita measures or, say, the underreporting of 
crimes can invalidate the findings of entire 
traditions of research. To me, her arguments 
should have been taken very seriously by both 
demographers and adherents of deterrence 
theories in criminology. Yet I have not seen 
any efforts in the empirical literature to 
address the problems she has raised, to say 
nothing of the paucity of citations to her 


paper. 

There have been similar warnings about 
aggregation biases that are especially likely to 
occur whenever the unit of aggregation 
(usually determined spatially or organization- 
ally) has been selected in such a way that 
self-selection is problematic, so that the 
aggregation criterion is at least partly a 
dependent variable. Twenty-five years ago I 
cautioned about sampling designs that involve 
either grouping by dependent variables or the 
oversampling of extremes on a dependent 
variable (Blalock 1964). Yet one commonly 
finds research reports in which design fea- 
tures are more or less taken for granted, in 
which selective migration into or out of the 
sample is ignored, or in which cross-level 
inferences are made in connection with 
schools, classrooms, or work units where 
self-selection is an obvious possibility. My 
inference, indeed, is that a very high 
percentage of quantitatively inclined empiri- 
cal researchers are not even aware of the 
problems involved in such instances. Even 
less so are those who employ qualitative 
research and who may, for example, select 
for detailed analysis six nation states, three of 
which have experienced a phenomenon and 
three have not. That they have sampled on a 
dependent variable may never have crossed 
their minds. This is of course but one 
illustration of the fact that methodological 
pitfalls cannot be overcome by the simple 
device of relying on qualitative approaches 
that merely hide them from view. 

Nearly all sociologists are aware of the 
possibility that omitted variables need to be 
taken into consideration in some way and that 
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their effects are obscured in the error or 
disturbance terms in our equations.. Clearly, 
one must always omit numerous causal 
factors on practical grounds, and admittedly 
one never has a really adequate theory 
capable of specifying what they may be or 
which will have sufficiently minor effects that 
they can safely be ignored. In my own 
readings of the nonexperimental literature, 
however, I have rarely found serious discus- 
sions of where the omitted variables are 
assumed to belong in whatever explicit or 
implicit causal theory is being employed. In 
many instances the absence of carefully 
constructed theories makes it difficult if not 
impossible to specify the nature of such 
omitted variables or where they are assumed 
to be located in the causal hierarchy of 


- included variables. 


The common practice is to construct a 
causal or path diagram that contains only the 
variables that have been measured (usually 
imperfectly) in the study at hand. In some 
instances, measurement-error models are also 
tested, but variables that are beyond the reach 
of the data-collection instrument being used 
are virtually never brought explicitly into the 
model, nor their implications seriously dis- 
cussed. It is as though the omission of 
variables for understandable practical reasons 
thereby also justifies their omission on 
theoretical grounds as well. Our vested 
interests in the specific piece of research 
being reported seem to inhibit a more careful 
and thorough analysis of specification errors. 
Once more, the warnings in the methodolog- 
ical literature are plentiful, but our practice 
rarely takes these warnings seriously. Journal 
editors, especially, need to be increasingly 
sensitized to such practices, as they are in 
large part responsible for their perpetuation. 

In this connection, Norval Glenn (1988) 
has examined the journal literature in three of 
our most important general journals, as well 
as three comparable journals in political 
Science. He reports a serious lack of caution 
with respect to causal interpretations of 
research findings. Investigators are continu- 
ally "finding" causal relationships, rather 
than merely inferring them on the basis of a 
combination of findings of covariance relation- 
ships and temporal sequences supplemented 
by a series of (usually implicit) assumptions 
required by causal modeling. My own reading 
of the literature is consistent with that of 
Glenn. l 
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Too often there is a failure to emphasize 
that any interpretive statements, whether one 
uses causal language or not, must be partly 
based on theoretical assumptions and partly 
on factual data. What is more, Glenn notes 
that when research findings are summarized 
in textbooks or popular accounts, the tenta- 
tiveness of causal “findings” is even more 
likely to be lost from view. It may be to our 
short-term vested interests to present our 
research findings as more definitive than they 
actually are, but, as Glenn notes, this often 
backfires in terms of our disciplinary reputa- 
tion among outsiders. The recent fiasco 
involving the rejection by natural scientists of 
a prominent political scientist who had been 
nominated for membership in the National 
Academy of Sciences is an obvious case in 
point. 

I have just finished reading a decade of 
articles in the ASAs Social Psychology 
Quarterly. As expected, a very high percent- 
age of the empirical studies reported involved 
experimental designs using almost exclu- 
sively American, middle-class student volun- 
teers who had selected themselves into 
introductory sociology and social psychology 
classes. Such populations are of course 
convenient and pliable and thus highly 
appropriate for exploratory research. Investi- 
gators routinely simply announced, however, 
that the “subjects” were selected from among 
student volunteers. As the data were reported, 
references were often made to “subjects” 
(rather than “student subjects”), but both in 
the prior theoretical discussions and conclud- 
ing remarks references were almost automati- 
cally generalized to “persons.” 

No doubt considerations of generalizability 
and “external validity” are sensitive points to 
many experimental social psychologists, and 
it is indeed true that their research is 
underfunded. I was psychologically unpre- 
pared to discover, however, that these 
particular journal issues contained no studies 
self-consciously designed to replicate prior 
experimental research on very different popu- 
lations. A handful of studies, conducted by 
foreigners, reported on findings among stu- 
dents in other (Western) countries, but again I 
did not locate any systematic efforts to deal 
explicitly with the generalizability problem. 
Such a neglect of replication research is, of 
course, characteristic of sociology as a whole 
and is partly due to policies set by editors of 
our major journals. 
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What also surprised me, perhaps because I 
remain overly naive about our general level of 
sophistication, was that in virtually all of 
these experimental studies straightforward 
ANOVA procedures were used, with no 
attention being given to possibilities of 
nonnormality of disturbances, the lack of 
interval-scale measurement, any examination 
of outliers or otherwise unusual cases, or the 
use of nonparametric alternatives. Measures 
of association such as E? or intraclass 
correlations were also virtually nonexistent, 
so that statistical significance was implicitly 
taken to imply substantive significance as 
well. 

Nor were there serious attempts in these 
articles to link the specific experimental 
manipulations used by the investigator to the 
underlying constructs being studied or to state 
explicitly the assumptions being made in 
doing so. Alternative manipulations, if actu- 
ally used, were practically never reported. 
The idea of using multiple manipulations or 
multiple indicators as a device for discovering 
flaws in experiments was also not explored, 
in spite of several papers in the causal 
modeling literature (written about two dec- 
ades ago) calling attention to the potential: 
fruitfulness of this possibility. (Alwin and 
Tessler 1974; Blalock 1971; Costner 1971; 
Land 1970). It almost appeared to me as 
though there is an unwritten norm to the 
effect that such messy matters are best left 
undiscussed. Perhaps this is an overreaction, 
but I believe that it is common practice in the 
social psychological literature to leave them 
alone. Once more, our journal editors need to 
be held accountable for not discouraging such 
practices. 

Simplifications will always be required, in 
terms of both tbe variables we omit from our 
studies and the assumptions we impose on the 
data to justify our interpretive statements. So 
will the presentations provided in textbooks, 
assertions made in policy recommendations, 
and interpretations that find their way into the 
popular literature. What seems most critical, 
however, is that (1) our simplifications not be 
premature or based on ad hoc grounds of 
expediency, and (2) we be as explicit as we 
can about the assumptions these require, the 
restrictions imposed upon us by data limita- 
tions and measurement errors, and how 
variables we have omitted are presumed to 
operate. Like Glenn, I find that our actual 
practices in these regards fall far short of what 


CONTRIBUTIONS OF QUANTIT. ATIVE SOCIOLOGY 


would be possible if we took more seriously 
our methodological literature, with its numer- 
- ous cautionary warnings. 

The reality we are attempting to study is 
admittedly bighly complex. Many of the 
mathematical models that have been proposed 
from time to time in our literature —some- 
times, unfortunately, in overly pretentious 
ways—are basically inadequate for dealing 
with such multiple complexities. One impor- 
tant function of such relatively simple mathe- 
matical models is basically heuristic: they 
help us clarify our thinking and enable us to 
extract implications from admittedly oversim- 
plified versions of reality. If so, one should 
then not expect them to provide close fits to 
empirical data. Therefore, if we lower our 
expectations about their capabilities, we may 
use them to gain valuable theoretical insights 
about very simple social processes. We must 
be prepared to admit, however, that much of 
what passes for sociological theory is at 
present far too diffuse and unclear to lend 
itself to mathematical modeling. 

In the physical sciences, as well as the 
philosophy of science literature, considerable 
weight is given to the criterion of parsimony. 
Ibelieve such an emphasis, when carried over 
uncritically to the social sciences, tends to 
foster the notion that one must choose among 
alternative theoretical formulations that are 
basically very simple. This in turn means that 
the decision process we face is .that of 
rejecting some theories outright while we 
retain others, rather than seeking combina- 
tions or syntheses of several theories, each of 
which may be partly correct or may account 
for a modest fraction of the variance in 
whatever dependent variables one is attempt- 
ing to explain. 

What we very much need are systematic 
ways of modeling far more complex pro- 
cesses in such a way that additional compli- 
cations can be introduced without destroying 
the practical utility of models. To me, 
structural-equation models, when combined 
with allowances for measurement errors and 
specifiable lag processes, offer us this oppor- 
tunity to combine careful data analyses with 
fairly rigorous theoretical thinking and explic- 
itly formulated assumptions. Without such 
complex theoretical formulations, I do not see 
any way to justify one's making the numerous 
assumptions about omitted variables that will 
be necessary in any given piece of research. 
Rather than simply omitting variables on the 
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grounds of expediency, and then merely 
hoping that the resulting disturbance terms 
behave properly, it seems far preferable to 
attempt to specify a more complete theory 
that contains such omitted variables as well as 
those that have actually been included. 

The more I study the implications of this 
form of theory construction and theory 
testing, however, the more convinced I 
become that it is absolutely essential that we 
take methodological cautions extremely seri- 
ously and that we reorganize our data- 
collection enterprise so that large numbers of 
variables, carefully measured and evaluated, 
can be combined in the same data sets. 
Individual investigators are simply in no 
position to collect such data. If they are 
forced to rely on their own means and are 
subjected to normal publication pressures, the 
corners they will be forced to cut will be 
numerous indeed. 


FACING UP TO THE QUALITY ISSUE 


Sociology is not a high-quality discipline. 
Over at least the past three decades our 
undergraduate majors and graduate applicants 
have consistently scored near the very bottom 
on standardized tests, not only with respect to 
quantitative reasoning scores but verbal rea- 
soning as well. Indeed, the most recent report 
Iread showed sociology below all of the other 
social sciences and only slightly above 
students in speech and education. Sociology 
verbal GRE scores were lower than those of 
engineering students as well as all science 
fields, and of course our quantitative GRE 
scores were considerably beneath them. 

About 20 years ago when I served on a 
national panel charged with the task of 
assessing the status of our discipline, we 
asked several of the giants of our field to offer 
their suggestions. One of them, admittedly a 
very blunt member of our profession, wrote a 
single paragraph ending with a sentence that I 
recall very vividly. “I hope," he said, “that 
you will have the guts to say that the real 
problem is that sociologists just aren't very 
bright!" Perhaps because we were offended, 
or perhaps because we did not dare do so, we 
merely hinted at the matter by indicating the 
need to improve quality. Perhaps it is now 
time to address the issue more squarely. 

The problem exists at all levels. Our under- 


graduate curricula are horizontal rather than . 


vertical. Virtually none of ouj/courses build 
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upon others or require strings of prerequisites. 
Our undergraduate statistics and methods 
courses are extremely low level, as compared, 
say, with the typical freshman calculus se- 
quence or introductory chemistry. How de- 
manding are even our honors programs, ‘and 
how many graduate departments require can- 
didates to have received honors training? 

These issues go far beyond the question of 
quantitative training, and in no sense do I 
wish to equate quality with quantitative 
research. Indeed, I would prefer to see a 
much larger number of “deep thinkers” in our 
field, regardless of their quantitative skills. 
But, confining myself simply to quantitative 
training, I now strongly believe that at least 
our major departments need to do several 
things, perhaps in a coordinated or concerted 
way. First, we need to insist on a minimum 
mathematics background of a year of calculus 
or equivalent courses in probability, finite 
math, or linear algebra. Second, we need to 
beef up our M.A. and doctoral requirements 
to include at least a year of statistics beyond 
the first course, and some specialized work in 
measurement or scaling that connects well 
with at least one theory course. Third, we 
need to be far more courageous in upping our 
overall standards for the PhD and in being 
much more selective in admitting doctoral 
students from the pool of master’s degree 
holders. Perhaps it is unrealistic to expect 
individual departments to take the lead in this 
respect. Therefore, we may need a coordi- 
nated effort among major departments— well 
advertised and allowing for a reasonable lag 
period prior to implementation, so that 
corresponding changes can be introduced in 
undergraduate honors programs. 

Departments need to assess very carefully 
their promotion policies so as to reward 
members for doing a few pieces of high- 
quality research, rather than grinding out 
much more numerous publications and play- 
ing a kind of numbers game. Getting 
something published in a “refereed” journal 
is presently a very simple matter, there being 
such a huge number of journal outlets from 
which to choose. Books addressed to popular 
audiences are likely to be selected by 
publishers with dollar profits in mind, so that 
a mere counting of books and even mono- 
graphs is also a poor substitute for a careful 
quality assessment—assuming, of course, 
that we are capable of recognizing and 
agreeing upon genuine quality. 
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As already noted, journal editors must be 
held accountable for insisting on certain kinds 
of things designed to increase quality: careful 
discussions of basic concepts and their 
linkages to the measures being used; a refusal 
to accept "measurement by fiat” by which an 
investigator merely announces that certain 
measures or experimental manipulations have 
been employed; careful assessments of reli- 
ability; and, above all, an honest elaboration 
of assumptions being made, including those 
that are most in doubt. Certain statistical 
checks, such as those for multicollinearity, 
nonlinearity, nonadditivity, homoscedasticity, 
or possible distorting effects of outliers should 
also be required and reported, if only in foot- 
notes. And, of course, theories being tested 
should be clearly formulated so that linkages 
between theoretically derived propositions and 
testable hypotheses are well specified. 

Finally, our professional associations, and 
especially the ASA, also need to face the 
quality question head on. In the early 1970s, 
when I first served on ASA Council, 
“quality” was a dirty word suggesting elitism 
and an attempt to impose orthodoxy. I even 
encountered instances where potential journal 
editors were passed over because it was 
argued that their standards would be too 
demanding! I sensed that this situation had 
slightly improved toward the end of the 
1970s. I hope it is different today. In the end, 
ASA policies are influenced rather heavily by 
those whom we elect to office, particularly 
those elected to Council and the Publications 
Committee. I am not too optimistic that 
“politics” within the ASA will change 
drastically over the coming years. If not, it 
will remain for our leading departments to 
take our quality problem much more seriously 
than we have in the past. 


HUBERT M. BLALOCK, JR., is Professor 
of Sociology at the University of Washington. 
His areas of interest include applied statistics, 
measurement and conceptualization, causal 
modeling, sociological theory, and race 
relations. His most recent book is entitled 
Power and Conflict Processes: Toward a 
General Theory (forthcoming). 
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DO SUBURBAN GROWTH CONTROLS CONTROL GROWTH? 


JOHN R. LOGAN 


MIN ZHOU 


SUNY-Albany 


This study attempts to replicate results of previous research on the consequences of 
growth controls in suburban municipalities. It combines longitudinal data for 
1970-1980 from census materials with a 1973 questionnaire survey of city 
planning officials concerning local growth-control policies. Growth controls have 
only modest effects on subsequent changes in local population, median family 
income, median rent, and black percentage. We argue that formal policy tools or 
legislation cannot be accepted as indicating that their stated objectives will be 
realized. Data on such tools are potentially misleading; research on local growth 
politics requires more fine-grained information on the balance of forces in t 


community. 


In the last decade, research on urban politics 
has increasingly focused on issues of land 
development (see especially Molotch 1976; 
Mollenkopf 1976; P. Peterson 1981). Con- 
flicts over the built environment were not 
previously ignored but were most often 
treated as one of many potential areas of 
political activity (e.g., Dahl 1961; Banfield 
1970). The innovation of recent work is its 
elevation of growth politics to center stage— 
treating it, in the tradition of Hunter (1953), 
as the “big issue” at the core of local political 
life. And, again following Hunter, recent 
researchers find that the most powerful voices 
in city politics are the proponents of growth 
and urban redevelopment and, in this sense, 
that the city is a “growth machine.” 

In this study we address what appears to be 
the opposite phenomenon—the marshalling of 
municipal authority to restrict and control 
growth. Municipalities, particularly in subur- 
ban areas, have long used zoning to influence 
the location and composition of land develop- 
ment. The new element is that, since 
environmentalism emerged as a formidable 
political movement in the early 1970s, it has 
become commonplace to hear of localities 
that exercise their powers to preserve open 
space and historic sites, that impose restraints 
or even moratoria on new development, or 
that require formal evaluation of the environ- 
mental impacts of growth projects. In public 
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discourse, it is no longer taken for granted 
that growth is desirable. Thus, as early as 
1975, growth controls of various sorts were in 
effect in over 300 jurisdictions (Cohn 1979, 
p. 18; see also Burrows 1978). 

But what is the practical importance of 
growth controls? Do growth controls control 
growth? Does their increasing popularity 
require a reappraisal of assumptions in the 
growth machine model? Certainly planners, 
politicians, and protagonists in battles over 
land development take growth controls seri- 
ously (Plotkin 1987). The many legal battles 
over exclusionary zoning, growth limits, and 
environmental impact regulations are a con- 
crete indicator of these people’s belief that 
local development restrictions make a differ- 
ence. Baldassare (1986, chap. 3) shows that 
65 percent of suburban Orange County 
residents support growth controls, particularly 
on environmental grounds. Dowall’s (1980) 
study of a sample of over 400 communities 
that had instituted growth controls of various 
sorts in the early 1970s, finds clearly that the 
controls were expected and intended to avoid 
the adverse fiscal and environmental impacts 
of growth, and to maintain the existing 
socioeconomic character of the community 
(see also Garkovich 1982). 

In the academic literature, also, the effects 
of local policy are sometimes taken for 
granted. White (1978, p. 203), for example, 
concludes that the “no-growth movement” is 
a direct extension of earlier exclusionary 
zoning policies, which “has strengthened 
local communities’ monopoly power over 
land use within their borders.” Similarly, 
James and Windsor (1976, p. 131) assert that 
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“restrictive controls have often promoted 
inefficient patterns of development in the 
suburbs by encouraging even more sprawl 
and making transportation even more a 
headache." Ellickson (1977) sees growth 
controls as a form of “homeowner cartel,” 
designed to raise home and property values 
for current residents, at the expense of future 
buyers— what Frieden (1979) decries as “the 
defense of privilege.” 

These arguments are connected theoreti- 
cally to the general claim that political 
processes significantly affect land develop- 
ment patterns in the contemporary metropolis, 
particularly in suburbia (Danielson 1976; 
Coke and Liebman 1961; Logan 1976). 
Planning and government intervention, in this 
view, make a difference, whether positive 
(e.g., protecting the environment) or negative 
(e.g., limiting housing choice for minorities). 
Humphrey and Buttel (1980) describe this 
position as the “liberal paradigm” in the 
*growth/no growth debate." More broadly, 
attention to political processes is an important 
departure from the tradition of ecological 
theory. Recently Hawley (1986, p. 128) has 
restated his skepticism about the significance 
of politics: “one would like to know under 
what conditions policies appear," he says, 
*and how effective they may be in overriding 
ecological principles." On this point, there is 
little solid evidence. 

In the most relevant empirical study to 
date, Shlay and Rossi (1981) have reported 
that the zoning designations of Chicago area 
census tracts were useful in predicting actual 
patterns of development over the 1960-1970 
decade. In the suburban portion of the 
metropolitan area, tracts with more restrictive 
zoning classifications had significantly slower 
growth in housing density, less multifamily 
housing construction, and greater increases in 
median family income than other tracts. They 
also experienced less growth in black popula- 
tion, but this effect was not statistically 
significant. 

There is further empirical evidence that 
municipal restrictions on land development 
raise housing prices. One such study evalu- 
ated the impact of the decision by Petaluma, 
California, to limit the number of new 
housing permits issued per year (Schwartz et 
al. 1979). The conclusion was that builders 
constructed larger homes with prices about 10 
percent higher than would have been found 
without the growth restrictions. This finding 
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was echoed by two federal agency studies of 
housing costs (GAO 1978; HUD 1978). 

Other researchers have provided evidence 
to rebut these findings. In a 1977 survey of 
small communities in Pennsylvania, Humph- 
rey and Krannich found that places which 
tried harder to attract growth actually grew 
less, controlling for their location and past 
growth rates (Krannich and Humphrey 1983; 
Humphrey and Krannich 1980). And con- 
versely, a 1978 study of northern California 
cities found that communities with more 
restrictive planning controls actually had 
higher xates of population growth during the 
1970s (Baldassare and Protash 1982). 

In short, growth controls are taken seri- 
ously in municipal politics and in the 
academic literature, but the evidence on their 
effects is mixed. Our purpose is to provide a 
systematic test, using information on a variety 
of types of local regulation from a national 
sample of communities. We will ask whether 
these measures limit population growth, 
whether they affect a community's average 
income or rent levels, and whether they 
restrict black population growth. These are 
the main consequences, intended or not, 
attributed to growth controls in the theoretical 
literature. 


RESEARCH STRATEGY 


We have chosen to study growth controls in 
suburban communities as a critical case. 
Suburbs are theoretically interesting because 
so much of the community power literature 
presumes that there will not be conflict over 
basic goals in such places. Suburbs—as 
contrasted to cities—enjoy an unusual level of 
community identity and have the means to 
enhance and perpetuate that particular sense 
of community through public policy (Wood 
1958). Residents strongly committed to their 
locality through investments in homes and 
family life are expected to make grassroots 
democracy a reality in the suburbs (Greer 
1962, p. 109). The continuing "close con- 
tact" of people to government in high-status 
suburbs makes the public's response to 
political decisions predictable and forces (or 
enables) local officials to adhere to public 
opinion (Crain and Rosenthal 1967, p. 977). 

Thus suburban mayors were less likely than 
central city or rural mayors to favor industrial 
and commercial growth (Maurer and Christen- 
son 1982), and suburban communities in the 
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1970s were 50.percent .more likely than 
central cities to enact growth controls or 
require environmental impact reviews of 
development projects (Carter, Sumek, and 
Frost 1974). If growth controls are responsive 
to public pressure anywhere, and if they are 
implemented seriously anywhere, from this 
point of view, it will be in suburbia. 

Of course, there is an opposing view. 
Suburbs have proven fertile territory for land 
development, offering propitious conditions 
for corruption of development controls (Gott- 
diener 1977). Indeed, Newton (1976) asserts 
that the main consequence of suburban 
governmental autonomy is not so much the 
realization of place-based democracy, but 
rather the capturing of weak and ineffective 
governments by self-interested business elites 
(see also Mansbridge 1980). Logan (1976) 
suggests that in fact most suburbs seek more 
intensive industrial, commercial, or residen- 
tial development, a fact that would be 
obscured by excessive concentration on the 
case of the most affluent exclusionary towns. 
Therefore, our results will bear not only on 
the question of growth controls but also on 
the interpretation of suburban governance., 

This study is based on a survey of all 
municipalities over 10,000 population con- 
ducted in 1973 by the International City 
Management Association (Carter et al. 1974). 
The ICMA survey includes numerous ques- 
tions on suburbs’ environmental and growth 
policies.! Moreover, its timing early in the 
decade allows analysis of the impact of 
growth policy on subsequent development 
using census data from 1970 and 1980. 
Information on other community characteris- 
tics is drawn from a data file on all suburbs in 
55 of the largest U.S. metropolitan areas 
assembled by Logan and Schneider (1981). 

We use measures of population size and 
growth rate, median family income, median 
rent, percentage with college education, 
percentage black, percentage homeowners, 
percentage of current residents who have 
lived in the suburb for over five years, 
percentage of the housing stock constructed 
before 1939, and the year that the central city 


! Data for these variables are available upon 
request from the authors. It would be particularly 
interesting to compare results for suburban munic- 
ipalities with central cities and nonmetropolitan 
cities. The suburban data base will be deposited 
with ICPSR. 
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reached 50,000 population. These data were 
gathered originally from the U.S. Census of 
Population and Housing, 1970 and 1980. 
Distance in miles from the central city 
boundary was measured directly from census 
maps. 

The ICMA study involved mailed question- 
naires addressed to municipal planning offic- 
ers. The response rate for suburbs in the: 
ICMA survey was 44 percent, yielding a final 
sample of 416 suburbs.? Because of missing 
information on other variables, our working 
sample for Table 3 is 387 cases. The growth 
controls on which the ICMA collected 
information are listed in Table 1, along with 
the proportion of suburbs that have adopted 
each control measure.? The wording of the 


? Such bias could distort both our estimate of the 
proportion of suburbs that instituted particular 
growth controls and our causal models presented 
below (Berk 1983). In order to evaluate this latter 
possibility, we have applied logistic regression 
analysis to predict inclusion in the final sample, 
using information on all incorporated suburbs over 
10,000 population in the Logan and Schneider data 
file. Important predictors included education level, 
age of the central city, and distance of the suburb 
from the central city. Thus our sample is weighted 
in favor of outer-ring suburbs in newer metropoli- 
tan areas and with a higher-educated population, 
with central city age being the strongest distorting 
factor. 

To assess the effects of sample selection bias on 
our modeling results, we followed the procedure 
outlined by Heckman (1980). The logistic equation 
was used to calculate a predicted probability of 
sample inclusion (first, the log of the odds was 
transformed to the odds themselves; then the odds 
converted into a probability). A “hazard rate" 
equal to one minus this predicted probability was 
then included in all of the equations described in 
the following sections of the paper. This hazard 
rate is correlated highly with the variables from 
which it is composed, particularly central city age 
(r = —.74). Its primary effect when introduced 
into the models in Table 3 below was to displace 
the central city age variable and, because of 
multicollinearity, greatly increase the standard 
error of the intercept in most equations. It did not 
affect our principal findings on the effects of 
growth controls. Therefore, we do not report 
equations including the hazard rate here. 

? The most significant gap in this list is a 
measure of the restrictiveness of current zoning. 
For example, some suburbs include no provision 
for multifamily housing or for single-family 
housing at densities of more than one or two units 
per acre. Such restrictions might not necessarily 
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Table 1. Indicators of Growth Control 


Control % using control 
Moratorium 19.5 
Growth limitations 26.9 
Environmental impact statement 33.2 
Open space zoning $1.9 
Zoning to protect environment 37.3 
Public facilities requirement 79.8 
Public land dedication 49.0 


questionnaire items, including some not 
analyzed here, is provided in Appendix I. 

The most extreme form of control over 
development listed here is the growth morato- 
rium. Close to 20 percent of all responding 
suburbs had imposed growth moratoria during 
the previous two years. Most commonly, 
these moratoria suspended the issuance of 
building permits, but sometimes they also 
affected proposals for rezonings or for sewer 
and water connections. This is the only 
measure for which the ICMA survey provides 
information on duration and perceived effec- 
tiveness. On average, they were in effect for 
six months. Respondents were asked "how 
effective" the strategy had been, or would be 
if it were used. On a five-point scale ranging 
from “very effective" to “very ineffective,” 
no respondent rated à moratorium as "very 
ineffective." A total of 76 percent rated it as 
*very effective" or "effective," a figure that 
did not differ between suburbs that had or had 
not used a moratorium. Once again, these 
figures reinforce our sense that planners 
believe such growth controls do make a 
difference. 





slow the rate of growth (depending on the 
availability of undeveloped land), but they cer- 
tainly could place an upper limit on population size 
and affect the character and price of new housing, 
as found by Shlay and Rossi. In analyses not 
reported here, we experimented with a proxy for 
restrictive zoning policy based on the proportion of 
multifamily housing in the 1970 housing stock. 
Those with less than 11 percent multifamily 
housing (more than one standard deviation below 
the mean) were classified as “exclusionary.” 
Curiously, we found that suburbs that had 
effectively excluded multifamily housing were 
slightly less likely to use other forms of growth 
control. And, more to the point, while exclusion- 
ary zoning by this measure was positively related 
to family income level, it had no relation to 
population growth or to change in income, rent, or 
black population. 
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Another controversial mechanism is growth 
limitations, which made Petaluma famous. 
Generally, these are understood to include 
either targets or upper limits on the number of 
new housing units that will be allowed in a 
given period. Unfortunately, the ICMA ques- 
tionnaire provides no further information on 
this category of control, and we do not know 
precisely what methods were employed by the 
27 percent of suburbs that reported using 
* growth limitations." 

Three questions specifically address envi- 
ronmental protection, which was a relatively 
recent issue in 1973. These ask whether the 
municipality had adopted a requirement for 
environmental impact statements for public 
and private development projects (33 percent 
had), whether zoning was used to preserve 
open space (52 percent), and whether zoning 
was used generally to protect the environment 
(37 percent). 

A final type of regulation deals with the 
public costs of land development. Among 
suburbs in our sample, 80 percent required 
developers to provide some form of public 
facilities (roads, sewers, school buildings, 
etc.), while 49 percent required the dedication 
of land to public use. 

We have combined some of these growth- 
control indicators based on the correlations 
among them. The correlation matrix is 
reproduced in Table 2. Note that all correla- 
tions are positive, but that most are under .25. 
One exception is the correlation between open 
space zoning and zoning to protect the 
environment (.39). Another exception is 
between public facilities requirements and 
public land dedication (.37). In the following 
analysis, we sum the first two of these 
controls as *environmental zoning," and the 
second pair as "public facilities." The range 
of possible values for these indices is 


Table 2. Correlations among Growth Control Indicators 


I 2 3 4 5 6 7 

1. Moratorium — 
2. Growth limitations .11 — 
3. Environmental 

impact statement .21 .06 — 
4. Open space zoning .10 .19 .15 — 
5. Zoning to protect 

environment .11 .18 .14 .39 — 
6. Public facilities 

requirement 10 .08 .15 .22 224 — 
7. Public land 

dedication .09 .10 .15 .22 2244 .37 — 
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therefore from a minimum of zero to a 
maximum of two. 


RESULTS: EFFECTS OF 
GROWTH CONTROLS 


What are the effects of formal planning 
controls on subsequent patterns? Table 3 
presents results of a series of four multiple 
regression models, predicting 1980 values of 
population, median family income, median 
rent, and black population. These variables 
represent the major dimensions of size and 
character of the community which the exclu- 
sionary zoning literature argues are the 
objects of local policy. Note that we do not 
presume that the purpose or possible conse- 
quence of controls is simply to limit popula- 
tion growth. 

In these models, we include the 1970 
values of these same variables, so that change 
is represented at time (t) deviations from the 


score predicted from time (r-1). These are- 


familiar lagged endogenous variable equa- 
tions, which may overestimate the effect of 
the lagged variable and underestimate other 
effects (Hibbs 1974). Other predictors used as 
control variables, drawn from past studies of 
suburban change, are the educational level, 
age of the central city (number of years since 
the city reached 50,000 population) and the 
suburb’s distance from the central city, the 
proportion of long-term (more than five 
years) residents, and population density in 
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1970. Population growth has been shown 
previously to be positively associated with 
income, education level, and distance from 
the central city. It is negatively associated 
with central city age (largely distinguishing 
regions of the country). Changes in median 
income and median rent, as indicators of 
socioeconomic status, are expected to be 
negatively associated with black proportion, 
positively with distance from the central city 
(representing the location of newer develop- 
ments), and, for the same reason, negatively 
with population density. Changes in median 
income and median rent are expected to be 
positively associated with education level 
(through persistence of status differences) and 
positively with residential stability (which 
provides defense against community “tran- 
sition”). Increasing black proportion is ex- 
pected to be affected by these same variables, 
but in the opposite direction. 

Two of these control variables can be 
understood as measures of the growth poten- 
tial of a suburb. Suburbs with low population 
density, located at a large distance from the 
central city, are likely to have the most 
available undeveloped land and—based on 
standard thinking in ecological life-cycle 
models—are most susceptible to growth and 
change. Another major source of variation in 
growth is likely to be the regional economy. 
In broad strokes, for example, we might 
expect suburbs in rapidly developing Sunbelt 
states to grow more rapidly than those in 


Table 3. Multiple Regression Predicting Change in Population Size, Median Income, Median Rent, and Black 


Percent (Standardized regression coefficients, t-test, and multiple R} 
Dependent Variables 
Population Median Income Median Rent Black % 
Predictors 1980 1980 1980 1980 
Population 1970 .87 (42.43) —.04 (—2.26) — 06 (2.59) 
Income 1970 _ .84 (24.70) 29 (4.87) — 
Rent 1970 — — .50 (12.12) — 
Black 96 1970 — —.05 (-2.33) — .90 (41.76) 
Distance from center — — — —.04 (—2.05) 
Central city age 1970 —.09 (—3.85) — —.08 (—2.44) — 
Education 1970 — .08 (2.48) 1 (2.21) — 
Residential stability —.15 (-- 6.58) —.05 (—2.50) — — 
Density 1970 —.07 (—3.54) —.07 (—3.72) — — 
Moratorium — — — — 
Growth limitations -— — -— — 
EIS requirement _ — .14 (4.61) — 
Environmental zoning — — — —.04 (—1.89) 
Public facilities — — — — 
R squared .85 .87 .67 .83 
Number of cases 387 387 387 387 





466 


older metropolitan areas. This factor is partly 
accounted for by inclusion of “age of central 
city” in the regression models. In addition, 
we experimented with dummy variables to 
represent differences among the nine census 
divisions of the country. However, these 
dummy’ variables did not affect our main 
findings, and we do not report them here. 
Because of the large number of indepen- 
dent variables, several of which are highly 
correlated with one another, the analyses 


were conducted in stepwise fashion. We. 


report coefficients only for those variables 
that entered the prediction equation with a 
significant effect. Several growth-control vari- 
ables have significant “zero-order” effects — 
that is, if they were introduced after entering 
only the 1970 value of the dependent 
variable, they would have a significant 
coefficient. The environmental impact state- 
ment (EIS) and public facilities requirements 
have positive zero-order associations with 
population growth (with betas of .05 and .08 
respectively). The public facilities require- 
. ment is positively associated. with increasing 
family income level (beta=.05). The EIS 
requirement is positively associated with 
increasing rent level (beta—.14). Finally, 
environmental zoning is negatively associated 
with increasing black proportion (beta— 
— .04). 

As shown in Table 3, most of these effects 
are not significant when other possible causes 
of community changes are taken into account. 
There was greater population growth in 
younger and lower-density suburbs and in 
those with smaller proportions of long-term 
residents. Income increased more in smaller 
suburbs with higher educational level, lower 
density, smaller black population, and smaller 
proportions of long-term residents. Rent 
levels increased more in suburbs with higher 
income and education and in younger metro- 
politan areas. Black proportion increased 
more in larger inner suburbs. These findings 
are consistent with previous research. 

In only two equations do growth-control 
variables show a significant effect. The 
environmental impact statement requirement 
has a significant positive effect on median 
rent, that is, raising housing costs, as argued 
by some economists. Environmental zoning 
has a- negative effect on change in black 
population (with t= — .1.89, the coefficient is 
significant using a one-tailed test). This latter 
result is consistent with arguments made by 
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opponents of exclusionary zoning who impute 
racial motivations and effects to the zoning 
mechanism (see Danielson 1976). No other 
growth control affects change in any popula- 
tion characteristic. We also experimented 
with various methods of combining growth- 
control indicators into scales, with no stron- 
ger evidence of their effectiveness. 

Faced with these results, some readers may 
focus on the two positive findings. We do not 
wish to underplay their importance. Indeed, 
to our knowledge, Table 3 provides the first 
systematic evidence that black suburbaniza- 
tion is negatively affected by zoning poli- 
cies—after controlling for other potential 
predictors. And it tends to confirm for a broad 
national sample of suburbs the belief that 
environmental impact statements, perhaps by 
increasing the time required for project 
review, raise housing costs. Making use of 
the unstandardized coefficients, we have 
calculated the effects in absolute terms of the 
EIS requirement and environmental zoning. 
Over the 1970—1980 period, the effect of both 
“open space zoning” and “zoning to protect 
the environment" was to decrease black 
population by 1.2 percent (against an average 
5.6 percent for all suburbs in 1980). This 
small absolute change is large proportion- 
ately. The effect of the EIS requirement was 
to raise median rents by $16.40 (against an 
average of $295.50 in 1980). This 5.5 percent 
increase in housing costs due to the EIS is 
very modest, given that the average median 
rent more than doubled during the period. 

Other readers, by contrast, will note that 
two of the most widely publicized mea- 
sures— growth limitations and moratoria— 
were found to have no effect. (Nor, as 
reported above, did our proxy measure of 
exclusionary zoning.) We conducted further 
analysis of the moratorium variable, using 
respondents’ reports of the moratorium’s 
duration and effectiveness. Possibly morato- 
ria in general have little impact, but their 
effect could be found in places where they 
were imposed for a longer period or where 
planners believed they had been effective. 
However, we found that neither duration nor 
perceived effectiveness was related to change 
in any-of the dependent variables. 


CASE STUDIES OF GROWTH CONTROL 


To help interpret these findings, we believe it 
wil be instructive to look more closely at 
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some specific cases of growth control. We 
discuss Petaluma (California) and Ramapo 
(New York), whose growth policies were the 
object of well-known lawsuits in the 1970s, 
and several more recent California cases. 


Petaluma 


Following the principle that "growth is 
good," Petaluma allowed uncontrolled devel- 
opment through the 1960s. Its population rose 
from 14,000 in 1960 to 25,000 in 1970. What 
had been called the "Chicken Capital of the 
World" had become a bedroom suburb for 
San Francisco (Schwartz et al. 1979). In June 
1971, under some public pressure, the City 
Council drew up a new growth policy, 
pledging that the city would retain a "small 
town atmosphere" and protect open spaces. 
The teeth in this policy were provided by a 
growth-control ordinance passed in June 1972 
and upheld in a public referendum by a 
four-to-one majority. It limited growth out- 
side the current city limits to 500 housing 
units per year for the next five years (allowing 
for a population increase of about 1500 
annually). The city proposed a 200-foot 
greenbelt around Petaluma to serve as a 
boundary for urban expansion. Housing 
permits were to be approved on a competitive 
basis, with an emphasis on balanced geo- 
graphic distribution and an even allocation 
between single-family and multi-family resi- 
dences. From 8 percent to 12 percent of 
housing units were to be targeted for low- and 
moderate-income persons. Rather than an 
anticipated population of 77,000 in 1985 


according to then-current projections, the. 


Petaluma Plan aimed at a population of only 
about 55,000 by that time. 

Petaluma's growth ordinance attracted enor- 
mous national publicity and was viewed as a 
test case of municipal authority. It was 
immediately attacked in court by the Construc- 
tion Industry Association of Sonoma County. 
But in August 1975, on appeal by the City, 
the Petaluma Plan was upheld by the District 
Court of Appeals, which found that the plan 
was rationally related to the environmental 
welfare of the city. 

What, then, was the effect of the Petaluma 
Plan on Petaluma’s future? By 1980, the 
city's population stood at nearly 34,000 (a 
growth of 36 percent over 1970, compared to 
the county's growth of 46 percent) The 
number of occupied housing units increased 
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by about 4400, less than the 500 per year 
specified by the Plan. At the same time, it 
nearly tripled its manufacturing employment 
and more than doubled its retail employment. 
Its median income, which had been slightly 
above the county average, remained slightly 
above the county average. 

Did the Plan limit growth? Appelbaum 
(1985) stresses that the allocation limit was 
not reached in any year, and concludes that 
the Plan therefore was not actually as 
constraining as it appeared. Schwartz et al. 
(1979, p. 56-57) argue to the contrary, 
claiming that it was not market conditions but 


„at least partly the Plan itself that kept the 


number of new housing units below the 
allocation limit. Our view is that the City 
Council's dramatic reversal of its previous 
progrowth policy resulted in (at most) modest 
reductions in residential development and an 
acceleration of nonresidential growth. Partic- 
ularly coming on the heels of very rapid 
expansion over one or two decades, and in a 
limited geographic space, the phase of rapid 
growth was probably already nearing its end. 


Ramapo 


The policy of the Town of Ramapo, in 
Rockland County, New York, is another early 
example of growth control. A 1969 amend- 
ment to the Town's zoning ordinance required 
residential developers (except for individual 
homes) to apply for a special permit. This 
permit would be granted only to those 
projects that included satisfactory provisions 
for sewers, drainage facilities, parks and 
schools, public roads, and firehouses. The 
amendment did not rezone any land or limit 
the number of units that could be built, nor 
did it preclude development for projects 
where public facilities would be provided by 
the builder. But it did effectively impose a 
moratorium on development in areas not 
designated for public facilities in the Town's 
capital spending program. This amendment 
was challenged in two lawsuits, but the Town 
was upheld in 1972 by the State Court of 
Appeals. 

How did this policy affect the growth of 
Ramapo? Between 1970 and 1980, the Town 
grew by 16.1 percent, much less than in the 
previous decade but somewhat faster than 
Rockland County as a whole (12.9 percent). 
Its black population increased by about two 
percentage points, as did the County's. And 
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its median age increased by about four years, 
also the same as the County’s. These 
outcomes are consistent with the Appeal 
Court’s finding that the Town intended “not 
to freeze population at present levels but to 
maximize growth by the efficient use of land” 
within the limits of existing public resources 
(New York Court of Appeals 1972, p. 152). 
In fact, in the 1980s four new villages 
seceded from the Town of Ramapo in order to 
implement tougher zoning rules (I. Peterson 
1988). Thus, both the Petaluma and Ramapo 
cases illustrate the phenomenon of notorious 
“growth-control” policies that were adopted 
after a decade or two of rapid growth. But 
despite the sound and fury of legal battles, 
they had little demonstrable effect on subse- 
quent development. These two cases reinforce 
the findings of our quantitative analyses. 


San Diego Initiatives 


We suspect that growth-control measures are 
often not intended to control growth. In three 


recent instances in suburbs of San Diego,. 


citizens' groups managed to bring tough 
growth-control initiatives to a public vote 
(Bailey 1987). In Oceanside, Vista, and 
Carlsbad, the citizens' measures would re- 
strict growth to limits ranging between 500 
and 1000 units per year. In response, the City 
Councils in all three cities sponsored their 
own “growth-control” initiatives, to be voted 
on in the same elections. The Councils’ 
measures focused on requirements for public 
services to keep pace with population growth, 
but were relatively lenient on actual growth. 
The Oceanside City Council, for example, 
would allow growth up to 225,000 in a city of 
about 100,000 current residents. 

The Council-sponsored measures were 
clearly intended as a defensive response to the 
challenge represented by the citizens’ initia- 
tives. They all contained "killer clauses" —if 
supported by a larger majority of voters than 
their competitors, they would nullify the 
citizen-sponsored initiatives even if these 
latter also received a majority vote. In fact, in 
all three cities, the citizen-sponsored initia- 
tives received a majority vote, but they 
survived the killer clause only in Oceanside. 


CONCLUSION 


There is abundant evidence that conflicts over 
the use of land and location are at the heart of 
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local politics in both cities and suburbs. The 
advocates and opponents of growth in gen- 
eral, and of particular development projects, 
recognize that they have a stake in the future 
of the locality and act collectively and 
politically to support their vision. Sometimes 
formal planning tools to control growth 
become the focal point of conflict, and their 
adoption or nonadoption sometimes reflects 
the balance of forces between the "growth 
machine" and its adversaries. Sometimes, 
those planning tools are used to constrain 
future development. 

Our main conclusion here.is that these 
presumptions cannot be taken for granted. 
Growth controls of the variety studied here — 
moratoria, growth limits, environmental reg- 
ulations, public facility requirements— only 
modestly affect rent levels and racial compo- 
sition, and they do not seem to affect 
population growth or socioeconomic status. 
This finding contradicts much of the literature 
cited at the beginning of this article. The low 
response rate to the ICMA survey means that 
higher-status, outlying suburbs in newer 
metropolitan areas have disproportionately 
contributed to these results, but there is no 


‘indication that selection bias has had distort- 


ing effects. 

We conclude that the planning mechanisms 
afforded by governmental autonomy should 
be evaluated cautiously. The formal ability of 
a community to control its own growth, and 
even the formal adoption of growth controls 
and environmental limitations, do not guaran- 
tee their implementation. This point has been 
made in more general terms with respect to 
government regulation, illustrated by well- 
known cases of the "capturing" of agencies 
by the regulated industry (Bernstein 1955). 
Likewise, intervention in land use in the 
United States does not necessarily imply a 
desire to replace the operation of the property 
marketplace. Entrepreneurs have shown grow- 
ing appreciation that selective intervention 
can yield, in the end, greater benefits for the 
market as a whole (Wilson 1980; Shonfield 
1965; Schultze 1977). Logan and Molotch 
(1987) have argued this position in detail in 
their analysis of three important planning 
tools: zoning, growth limitations, and environ- 
mental review. 

Policies may be adopted for symbolic as 
well as for practical reasons, and to appease 
diffuse public opinion as well as to respond to 
a well-organized political force. What appears 
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to be a restrictive policy may on close 
inspection turn out to be a license for more 
intensive development. Even the most care- 
fully drawn legislation is dependent upon 
subsequent enforcement. And often legisla- 
tion is reactive to a pattern of events that have 
already run their course, too late to make a 
difference. 

From the perspective of traditional human 
ecology, these results may seem to indicate 
that political conflict over growth is inconse- 
quential: sociologically interesting, perhaps, 
but not important to urban structure. That is 
not our interpretation. We regard them rather 
as a challenge to study growth politics with a 
finer grain of detail. Information on formal 
policies is inadequate without data on the 
balance of forces behind those policies, on 
their significance within a local context. And 
this message for researchers has a practical 
importance for American communities. The 
formal autonomy of suburban governments, 
the means, that is, for grassroots democracy, 
is not a very good predictor of its realization. 
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APPENDIX 
Wording of growth-control questions 
Has your municipality adopted a formal require- 
ment for environmental impact statements: (1) on 
public and private projects, (2) on public projects 
only, (3) on private projects only? 

Which of the following land use controls have 
been enacted by your municipality? (Check all 
those applicable.) 


a. Architecture 

b. Flood plain zoning 
c. Growth limitations 
d. Historic preservation 
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. Marshland controls 

Open space zoning 

Required installation of public facilities 

_(e.g., sewers) by developers 

Required dedication of land for public 

purposes (e.g., schools, parks) by developers 

i. Zoning for protection of natural resources or 
ecological systems 

j. Other (specify). 


Has your municipality imposed a moratorium 
based on environmental considerations in the 
last two years? If “Yes,” 


A. What was delayed? (Check all those applica- 
ble.) 
a. Issuance of building permits 
b. Request for rezoning 
c. Water or sewer connections 
d. Other (specify). 

B. How many months was the moratorium in 
effect? (If still in effect, how long has it been 
in effect?) 


w nmo 
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O 


STATE AUTONOMY AND CLASS 
CONFLICT IN THE REFORMATION 


(Comment on Wuthnow, ASR, December 1985)* 


MANSOOR MOADDEL 
Eastern Michigan University 


Robert Wuthnow’s “State Structures and Ideologi- 
cal Outcomes” employs a social structural ap- 
proach to explain why the Reformation movement 
succeeded in England but it failed in France, using 
a comparative historical method. Wuthnow at- 
tempts to provide an alternative to Swanson’s 
Durkheimian approach to religion, exemplified in 
his Religion and Regime. In this work, Swanson 
argues that there is a strong correspondence 
between social conditions and ideology. Using this 
perspective, he then attempts to establish a direct 
relationship between, on one hand, the differing 
Protestant and Roman Catholic conceptions of 
God’s immanence, and, on the other hand, the 
manner in which people experienced political 
sovereignty in the 16th century. His comparative 
historical analysis supports the argument that 
Protestantism was adopted in areas where political 
authority was decentralized, while Catholicism, in 
contrast, remained strongest under centralized 
regimes. 

Wuthnow convincingly criticizes Swanson’s 
argument on historical grounds. His more effective 
critique, however, is theoretical: “Swanson does 
not indicate explicitly who it is that makes the 
connection between regime structures and religious 
ideas, nor exactly how this connection is made” 
(p. 803). In Wuthnow’s formulation, the mediat- 
ing link between social structure and ideology is 
“the embeddedness of ideology in social institu- 
tions” (p. 805). Given this embeddedness, Wuth- 
now argues, the success of a major ideological 
reorientation is contingent upon the mobilization of 
social resources. Since the Reformation, where it 
succeeded, was imposed “from above” (p. 811), 
“the relative autonomy of the central state 
bureaucracy from the landowning elite which was 
closely tied to the established church was a 
decisive factor in determining the success ‘of the 
Reformation" (pp. 805-6). The success of the 
Reformation in England is thus related to the 
development of a strong regime autonomous from 
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landed nobility, and its failure in France to the 
crown’s weakness and its dependence on landed 
elite. 

Wuthnow's critique of Swanson is thorough and 
convincing. His alternative perspective is highly 
persuasive. Nevertheless, it provokes several 
comments. 

1. Wuthnow provides only a weak version of a 
state-centered explanation and thus exaggerates the 
role of the state in affecting the success/failure of 
the Reformation. The analysis is based on the 
assumption that there is a rivalry between the state 
and landed nobility. Such an assumption leads 
Wuthnow to attribute the success of the Reforma- 
tion in England to the autonomy of the crown from 
landed interests, and its failure in France to the 
lack of such an autonomy. Nowhere in the paper 
does Wuthnow deal with the question of why the 
state should be interested in the Reformation after 
all. And even if one takes the analytical position of 
the state-centered theorists and argues that the 
state, by imposing the Reformation from above, 
would have expanded its own power and interests, 
it is important to show that both the English and 
French states were interested in and attempted to 
impose the Reformation. In Wuthnow's historical 
presentation, there is no evidence that the French 
state had any interest ir or attempted to impose the 
Reformation. Wuthnow has convincingly argued 
that the English state was considerably stronger 
than the French state and that the former gained 
considerable autonomy from landed nobility as a 
result of commercial expansion. But many of the 
indicators of the autonomy of the English state— 
that is, the sources of revenues, and the class 
composition of the pariiament and of high offices 
in the state administration— were also proximate 
measures of the growing power of English 
merchants. Given the historical conflict between 
merchants and landed nobility during the course of 
capitalist development in England (Dobb 1978), 
commercial expansion in England was naturally 
increasing the power of merchants vis-a-vis landed 
nobility, providing a favorable situation for the 
State to gain its autonomy from landed interests. A 
strong state-centered analysis, however, requires 
demonstrating that the state was equally autono- 
mous from merchants and commercial interests. 
Wuthnow only mentions that "Merchants were 
personal beholden to the crown for trading 
privileges for the crown's promotion of favorable 
foreign relations" (p. 812). He, however, does not 
substantiate his claims with historical evidence. 

2. A more serious problem with Wuthnow's 
analysis, however, is that his historical narrative 
does not quite fit within his sociological frame- 
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work. While the level of state autonomy is 
considered the principal explanatory variable, 
Wuthnow’s presentation of history supports a class 
analytic framework of the case. For one thing, as 
Wuthnow indicates, “the new doctrines in each 
case were carried by merchants with commercial 
connections in central Europe and by students and 
clergy from German universities” (p. 806). That 
is, both English and French merchants constituted 
the major class basis of the Reformation move- 
ment. For another, in both countries landed 
nobility had strong ties with the Catholic church. 
In the absence of any substantive discussion of the 
state’s interests in the Reformation in both 
countries, one may venture to say that class 
conflict was the principal factor that decisively 
determined the success of the Reformation in 
England and its failure in France. 

3. The superiority of a class conflict model over 
Wuthnow’s state-centered model would become 
evident if the question of a major ideological 
reorientation is posed within a wider historical 
setting and a broader picture of societal totality. 
One thing is clear: prior to the emergence of the 
Reformation, continental Europe was undergoing 
large-scale economic transformation—that is, the 
decline of feudalism and the rise of capitalism. 
Such a transformation involved two sets of class 
conflict: landlord versus serfs, and landlords versus 
merchants (i.e., the burgesses). The overlap of 
these two sets of conflict greatly facilitated the 
decline of feudalism. Tied to the Catholic church, 
the feudalist class enjoyed the support of the 
church against its enemy. During the course of the 
13th and 14th centuries in England, for example, 
the church was involved in landlord-burgess 
struggle: l 

At Dunstable at one time the burgesses, in face 

of the threat of excommunication, declared that 

they would “descend into hell all together” 
rather than submit to the arbitrary impositions of 

the prior. In 1327 at Bury the townsmen made a 

forcible entry into the monastery and carried off 

the Abbot and monks to imprisonment until they 
should allow the grant of a gild merchant; while 
in the same year at Abingdon a crowd, swollen 
- by allies from Oxford, laid siege to the abbey 
and burned down its gates. At St. Albans there 
was a ten-day siege of the monastery, because 
the Abbot refused the citizens the right to erect 
fulling mills of their own; at Norwich there was 
open war between town and cathedral and 
rioting in 1272 in course of which the cathedral 
church was set on fire; while urban disaffection 

“formed a considerable element in the Peasants’ 

Revolt” of 1981. (Dobb 1978, pp. 81-82; see 

also Lispon 1966, pp. 206-7) 


Thus the Reformation, as a protest movement 
against the domination of Catholicism, was 
historically associated with the merchants’ protest 
against feudalist landowners. 
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Given this context, one can see the contrasting 
English and French cases quite differently from 
Wuthnow’s observation. Evidently, when Wuth- 
now claims that “the development of a strong 
regime in England . . . was greatly facilitated by 
the economic development of the period” (Wuth- 
now 1985, p. 808), he is not engaged in a trivial 
commentary. Nevertheless, far from being the 
crown, the prime beneficiaries of England’s 
commercial fortune were merchants. It is logical 
therefore to expect that merchants had considerable 
resources to lobby successfully the crown against 
their rivals, landed nobility. 

4. Although in Wuthnow's formulation it is 
unclear how the success of the Reformation could 
be connected to state autonomy, posing the 
question of a major ideological reorientation within 
the context of class struggle and changing class 
relations, one can provide some support for a 
state-centered argument. Kings received Luther's 
and Calvin's ideas in the context of their capacities 
to utilize and appropriate the resources of the 
Catholic church. Those capacities, in turn, were 
determined by the pre-Reformation structure of 
relations among the crown, aristocracy, merchants, 
and clergy in England and France. Thus the state 
was not so much interested in the Reformation per 
se as it was in seizing the power and properties of 
the Catholic church. (After all the autonomous 
Henry VIII remained Catholic in theology). The 
state capacity in England was enhanced by the 
strength of the merchant-based Reformation move- 
ment, by the weakness of landed nobility, and 
ironically, as Wuthnow also agrees, by the English 
church's structural autonomy from county-based 
landlords, which created a small target, of a few 
dozen clerics at the top of the hierarchy, for royal 
control. On the other hand, French nobles' 
domination of, and financial control over, provin- 
cial clerics and church institutions gave them an 
interest in preserving the formal autonomy of the 
Catholic church and in preventing the French 
crown from controlling the church hierarchy and 
appropriating clerical resources. 
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STATE STRUCTURES AND CLASS 
CONFLICT IN THE REFORMATION 
(A REPLY TO MOADDEL) 


ROBERT WUTHNOW 
Princeton University 


Mansoor Moaddel has raised several critical 
objections to my article on the Reformation which 
question the adequacy of its theoretical framework. 
The core of his argument is that a class conflict 
model might provide a better explanation of the 
evidence than the model I outlined which focused 
on state structures. Moaddel's comments warrant 
serious consideration because they reintroduce 
Marxist ts about the relations between 
class relations and ideology—in this case, the new 
ideology of the Protestant reformers. His concerns 
also parallel debates about the relative role of states 
and classes in early modern Europe that have 
arisen in conjunction with the work of Immanuel 
Wallerstein, Perry Anderson, Robert Brenner! and 
others. None of these other debates has been 
particularly concerned with the Reformation. 
Moaddel’s comments, therefore, provide an occa- 
sion to consider again, but with reference, as it 
were, to a different dependent variable, whether 
we can dispense with class conflict in this period or 
whether Marxist accounts are essentially correct. 
My article represented a small part of a 
comparative-historical project on which I have 
been working for a number of years. This project is 
an attempt to recast theories of the relations 
between social structure and ideology by examin- 
ing variations in the social conditions under which 
the Reformation, the Enlightenment, and socialism 
in the 19th century became institutionalized. | Por 
this purpose, I have developed extensive compari 
sons of some 25 cases in all. My thoughts 
the adequacy of class conflict models inevitably 
reflect this larger project. In that context, I have 
confronted the literature on class conflict land 
ideology repeatedly and have tried to provide some 
summary observations about the ways in which 
capitalist development has contributed to l the 
shaping of cultura! change. Needless to say, Iam 
somewhat reluctant to reply in what must be a 
cursory and superficial manner to the points 
Professor Moaddel has raised. By the time this 
exchange appears, the book resulting from |my 
larger project will likely have been publi 
(Communities of Discourse: Ideology and Sokial 
i 


AMERICAN SOCIOLOGICAL REVIEW 


Structure in the Reformation, the Enlightenment, 
and European Socialism (Cambridge: Harvard 
University Press, 1989]) and it, rather than the 
present remarks, must stand as my best statement 
about the complex relations between state struc- 
tures, class conflicts, and ideology. Nevertheless, 
a few brief observations are perhaps worth making 
here. 

First, I wish to underscore that my article was 
directed primarily at the leading effort by a 
sociologist to date to provide a systematic account 
of the social factors conditioning the Reformation: 
Guy E. Swanson’s Religion and Regime. Swan- 
son’s book presented a near-perfect correlation 
between a measure of authority structure (guber- 
naculum) in European societies at the end of the 
15th century and the outcomes of Reformation 
struggles during the 16th century. In Swanson’s 
view, a near-perfect correlation of this kind should 
be thought of in the same way one might about a 
similar correlation between age and language 
development among young children: it is such a 
powerful regularity in human behavior that, despite 
individual variations in the process by which 
language is learned, any theory that accounts for it 
is powerful indeed. My argument, without too 
much distortion, was that (a) a near-perfect 
correlation of this kind, given the vastly complex 
conditions of 16th-century Europe, is itself suffi- 
cient to raise some skeptical concerns, especially 
when (b) fewer than a sixth of the political entities 
at the time that might have been chosen were 
actually included, (c) the operative concepts in the 
model were sufficiently abstract to allow a great 
deal of slippage between concept and measure- 
ment, (d) historians of the period have routinely 
questioned Swanson's codings, and (e) no plausi- 
ble mechanism was given in the model to specify 
who constituted the critical link between authority 
conditions and Reformation outcomes or why they 
should have constituted this link. I then extended 
these criticisms by comparing Reformation En- 
gland with Reformation France, (f) drawing on 
more than 250 historical studies, which, I argued, 
demonstrated convincingly that (g) Swanson could 
have coded these two cases exactly opposite from 
the way he did and, in fact, would have been more 
in keeping with the mainstream of historical 
interpretation if he had, (A) the orientation of 
neither king was compatible with Swanson's 
account, (i) the orientation of neither ruling 
household or of either ruling parliamentary body 
was consistent with Swanson’s account, (j) local 
and much-studied episodes such as the placards 
affair in France fail to support his arguments, (k) 
the reformers themselves saw things differently 
than Swanson's account would suggest, (D the 
actual distribution of power and representation of 
interests in each society pointed to a different 
theoretical explanation, and (m) this explanation 
could be most easily summarized by looking at 
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differences in the degree to which the central 
regime in each case was fiscally and administra- 
tively autonomous from or dependent on the 
landowning elite. The terms of the discussion were 
dictated to a large extent by my disagreements with 
Swanson and, for this reason, I did not attempt to 
address systematically the question of class 
conflict. 

Second, any attempt to examine the role of class 
conflict in the Reformation would, of course, have 
to pay attention to Dobb, as Professor Moaddel 
does. But this would be only a starting point. It 
would not be even an adequate starting point for 
two reasons. (a) Dobb's account of the political 
economy of the early modern period has been 
roundly and effectively criticized, both directly and 
indirectly, by sociologists and historians, and these 
criticisms suggest a number of modifications of his 
view of class conflict in the early development of 
capitalism: that the breakup of feudalism had 
already been accomplished in many parts of 
Europe well before the 16th century, that the 
relations between capitalist agriculture and capital- 
ist trading were relatively complex, that the pattern 
Dobb saw in England cannot be generalized 
accurately to other parts of Europe, that conditions 
in the broader world-economy need to be empha- 
sized more, that states played a more active role in 
economic affairs, and that other factors need to be 
emphasized in accounting for shorter (e.g., 
50-year) cycles during this period than the long 
view presented by Dobb (which pertains more to 
general tendencies that became pronounced in the 
17th and 18th centuries). (b) Marxist historiogra- 
phy of the Reformation itself has increasingly 
departed from the simple class conflict model 
presented in Dobb and has raised relatively more 
complicated questions about the nature of the 
bourgeoisie in this period, conflicts between 
landowners and peasants, variations within the 
Reformation's teachings, the accuracy of early 
accounts on which Marxist theories were based 
(such as Engels's discussion of the Peasant War), 
and relations between the Reformation and later 
periods of bourgeois expansion. This is clearly not 
the place to discuss the relative merits of these 
arguments. But there is a lesson to be learned: if 
you're going to advance arguments about the 
Reformation (or any other major episode in 
European history), you'd better know the historical 
literature, rather than arguing deductively from an 
outdated theoretical treatise that happens to be 
familiar to sociologists. 

Third, I happen to agree with Professor Moaddel 
that some features of the Reformation struggles 
suggest the relevance of class considerations. 
These include the merchants' propensity to support 
the reformers and the landowners' tendency to 
support the established church. That, however, is 
about all that can be said for a class conflict model. 
A class conflict model does not provide good 
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answers to the following: Why did the leading 
patricians in many European towns—elites who 
were in the forefront of regional and international 
trade—either not support the Reformation at all or 
temporize and take a wait-and-see attitude toward 
it? Why did towns with a very active commercial 
sector and a strong bourgeoisie, such as Lyons or 
Gdansk, fail to institutionalize the Reformation? 
Why did the lower strata of many European 
towns—the artisans, vagabonds, and unskilled ' 
laborers, that is, the very strata that Marxist theory 
says were being most exploited by the rising 
bourgeoisie—take the lead in advancing the 
Reformation? Why did smallholders in the coun- 
tryside who were at last free of feudal obligations 
cling to the established church rather than siding 
with the reformers? Why did rulers, such as those 
in England, Denmark, Saxony, and Hesse, all of 
whom continued to derive large earnings from their 
own lands, take the lead in advancing the 
Reformation? Why did other landowners who were 
capitalizing their holdings, shifting to contract and 
wage systems, and funneling profits into com- 
merce remain such strong supporters of the 
church? Why, in all of this, is there so little 
evidence of clear divisions between class fractions 
or of class formation as such? And why, if the 
triumph of the bourgeoisie accounts for the 
Reformation, does it become necessary for theo- 
rists to make arguments about the triumph of the 
bourgeoisie again to account for nearly everything 
from the civil wars of the 17th century to the 
Enlightenment and revolutions of the 18th century 
(how many times did the bourgeoisie need to 
triumph?)? i 

Finally, questions such as these (and similar 
ones for the 18th and 19th centuries) have led me 
to a model that emphasizes historical conjuncture 
rather than any straightforward model of class 
conflict. State structures play an important role in 
my model; so do elites, even classes, but I have 
found it more useful to examine the relations 
between ruling elites and the state and the relations 
among class fractions. These relations often 
include conflict, but uncertainty, transition, shift- 
ing cleavages, and fluid alliances are more the 
operative conditions. In ali of this, capitalist 
development plays a highly significant role. It does 
so, though, in ways that deviate from many 
standard theoretical accounts: capitalism resulted 
in new ideologies becoming institytionalized in the 
three periods I have examined not primarily 
because it broke up traditional forms of community 
or moral order at the grassroots level, nor because 
an extension of markets gave people new opportu- 
nities and expanded intellectual horizons, nor even 
because it brought to power a new class that had to 
legitimate its interests against rival claimants to 
power. Capitalist development influenced ideology 
more indirectly than directly. It was characterized 
by some periods of relatively exceptional and rapid 
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expansion. These periods of expansion coincided 
with the Reformation, the Enlightenment, and the 
rise of European socialism. Material expansion in 
each period was also strongly facilitated by central 
regimes. It provided new resources—resources that 
states in turn used to provision the production of 
culture, and resources that shifted the relations 
among dominant class fractions. These shifting 
relations also opened up opportunities for culture 
producers to innovate. The new ideas did not 
simply reflect their conditions of origin, but also 
textualized, transformed, and disarticulated the 
relations between social conditions and visions 
of the future. Formulations about classes were 
as much the creation of culture producers in 
each period as they were the conditions of this 
creation. 


RELATIVE COHORT SIZE AND 
POLITICAL ALIENATION: 
THREE METHODOLOGICAL 
ISSUES AND A REPLICATION 
SUPPORTING THE 
EASTERLIN HYPOTHESIS 


(Comment on Kahn and Mason, 
ASR, April 1987)* 


Ropert M. O'BRIEN 
PATRICIA À. GWARTNEY-GIBBS 
University of Oregon 


Richard Easterlin has hypothesized that the relative 
size of cohorts affects their well-being throughout 
their life span. Relatively large cohorts purportedly 
overload social institutions that are not equipped to 
handle their numbers. Members of these cohorts 
have fewer parents per child, fewer teachers per 
student, and when they leave school, they create an 
oversupply of workers. Their relative abundance 
subjects them to higher rates of unemployment and 
depressed salaries. Easterlin also suggests that 
relatively large cohorts are more politically alien- 
ated than relatively small cohorts. 

Kahn and Mason (1987) test Easterlin's hypoth- 
esis concerning political alienation. To do so, they 
modify the Age-Period-Cohort model of Mason, 
Mason, Winsborough, and Poole (1973), by 
including not only dummy variables for age-groups 
and for periods, but also interaction terms for each 
of the three youngest age-groups by their relative 
cohort sizes. They note several problems with 
Easterlin’s (1978, 1980) previous tests of the 
effects of relative cohort size (RCS) on levels of 


* We thank Fred Kerlinger and David Jacobs for 
comments on an earlier draft of this paper. Our 
.interest in this area was motivated by a talk a: the 
University of Oregon by William Mason in 1986. 
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alienation, and they correct these shortcomings by 
(1) using age-groups that are the same number of 
years as the gap between periods, (2) creating an 
index of alienation that contains two items from the 
same domain of content, and (3) avoiding the 
"half-table fallacy" by examining data on alien- 
ation for both younger and older age-groups at 
each period. 

Kahn and Mason's analysis focuses on the 
relationship between alienation and the age- 
group-by-RCS interactions for each of the three 
youngest age-groups, controlling for the relation- 
ship of alienation to age, period, and RCS. These 
controls act as proxy variables for factors associ- 
ated with them: for example, changes in the 
economy or politics should be associated with 
period; changes due to maturation or the shifting 
social roles of individuals should be associated 
with age. 

Development of a modified Age-Period-Cohort 
model to test Easterlin's hypothesis is an important 
innovation. Kahn and Mason present many varia- 
tions on their basic model which offer a fascinating 
window into the complexities of age, period, and 
cohort analysis. Their analysis, however, is flawed 
in three respects, and these flaws lead them to 
inappropriately reject Easterlin's hypothesis. The 
first flaw is their focus on the interaction effects of 
RCS with the three youngest age-groups; Easter- 
lin’s theory focuses on the main effect of RCS. 
The second problem centers on their interpretation 
of the size of effects. The third problem involves 
their failure to use significance tests, We address 
each of these points below. Our replication of 
Kahn and Mason's analysis avoids these three 
methodological problems and finds nontrivial 
support for Easterlin's hypothesis. 


AGE-GROUP BY RELATIVE-COHORT-SIZE 
(RCS) INTERACTION TERMS 


The overwhelming theme permeating Easterlin’s 
work is that cohorts are affected by their size 
throughout their life span. Concerning earnings, 
for example, "[t]he affect of generation size, good 
or bad, persists throughout the life cycle. Every 
generation follows the pattern of below-average 
earnings in early working life and above-average 
earnings later. But the earnings pattern of a small 
generation is more favorable throughout its career 
than that of a large generation” (Easterlin 1987, 
pp. 29-30). It is possible to find reference in 
Easterlin’s work indicating that generations may be 
more affected by a particular variable while they 
are young (e.g., Easterlin 1978, p. 406), but the 
thrust of his argument is that the effects of RCS 
persist throughout the cohort’s entire life span. 
The most reasonable way to test Easterlin’s 
hypothesis is to use an Age-Period-Relative- 
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Table 1. Reanalysis of Kahn and Mason’s Data, Including Standard Errors of the Regression Coefficients and 
Standardized Regression Coefficients R 
Model 3 
. Modi] Model Model 2a AGE-PERIOD-RCS- 
AGE-PERIOD AGE-PERIOD-RCS APRCS AGE*RCS 
p sep? B* seB* (standardized) pt se Bh 
Constant 358 (.031) 278 (.048) .028 (.073) 
Age .492* 
21-24 —.117 (.035) —,227 (.061) —.235 (.209) 
25-28 — .086 (.033) —.178 (.054) -.221 (.240) 
29-32 —.084 (.034) —.165 (.050) — .094 (.287) 
33-36 — .076 (.034) —.149 (.048) —.147 (.067) 
37-40 ~ 101 (.033) —.166 (.045) —.165 (.062) 
41-44 — 127 (.034) —.190 (.044) —.188 (.060) 
45-48 — 109 (.034) — 168 (.043) — .166 (.057) 
49-52 —.113 (.034) —.163 (.041) —.162 (.053) 
53-56 —.104 (.035) —.144 (.039) —.143 (.047) 
57-60 — 040 (.036) — .069 (.038) — .068 (.043) 
61-64 ~ 053 (.036) — .067 (.036) — .067 (.038) 
65-68 .000* 000* .000* 
Period .820* 
1952 .000* 000* .000* 
1956 —.052 (023)  —.053 (.023) —.053 (.023) 
1960 ~ 053 (.026) — .059 (.026) — .060 (.027) 
1964 039 (.024) 040 (.024) .040 (.025) 
1968 116 (.026) 117 (.025) .116 (.026) 
1972 141 (.022) 142 (.021) .141 (.026) 
1976 177 (.024) 178 (.024) .177 (.026) 
1980 170 (.026) 168 (.025) .168 (.028) 
Relative 
cohort size .0161 (.0076) 283? .0157 (.0130) 
Age (21-24)*RCS .0010 (.0221) 
Age (25-28)*RCS .0044 (.0263) 
Age (29-32)*RCS — .0069 (.0313) 
R .7519 .7660 .7664 
Adjusted R? .6939 .7075 .6960 
d.f. . nn 76 73 


Sources: Abenation mean scores are from Kahn and Mason's Table 1. Relative cohort size measures are taken from 
the Current Population Surveys used by Kahn and Mason. Cf. footnote 1. 
Note: Adjusted R? = 1 — {(N-1)/(N-K-1)}(1-R*), where K is the number of independent variables. 


* Metric regression coefficients. 
> Standard error of metric regression coefficient. 


* Sheaf coefficient for a series of dummy-coded variables (see text for discussion). 


3 Standardized regression coefficient. 


* Dummy variable omitted to avoid linear dependency. 


Cohort-Size (APRCS) model to test for the main 
effects of age, period, and RCS. If the main effect 
of RCS is statistically significant once the effects 
of age and period are controlled, then there is 
support for Easterlin's hypothesis (see O'Brien 
1989). After that, interaction terms can be added to 
the model to see if size has an especially potent 
effect for younger age-groups. Standard practice 
suggests examining the main effects first, and so 
does Easterlin's hypothesis. None of the 13 models 
reported in Kahn and Mason's article, however, 
tests the effects of RCS controlling for age and 
period alone; rather, when RCS is included in a 
model, it is always included in equations with 
age-group-by-RCS interaction terms. To test the 
main effects of RCS, we replicate Kahn and 
Mason's analysis below. 


A REPLICATION OF KAHN AND MASON'S 
ANALYSIS USING THE APRCS MODEL 


The analyses in Table 1 are based on Kahn and 
Mason's data. The RCS measure was compiled 
from the Current Population Surveys cited by Kahn 
and Mason, and the alienation scores were taken 
from their Table 1. Our model 1 (Table 1) with 
only age and period in the model reproduces nearly 
identically Kahn and Mason's analysis (their 
model 6).! As they noted, age and period are 


! The slight discrepancies between our regres- 
sion coefficients in model 1 and Kahn and Mason's 
model 6 are probably due to our use of the 
alienation means reported in their article to two 
decimal places. 
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associated with much of the variance in the mean 
alienation scores (75.19 percent). 

Adding RCS to the equation (our model 2) 
directly tests Easterlin’s hypothesis that RCS is 
related to alienation once the effects of age and 
period are controlled. This model was not reported 
in Kahn and Mason's article, but is, as we have 
argued above, the most appropriate model for 
testing Easterlin’s hypothesis. Although adding 
RCS to the age-period model increases the 
explained variance by less than 2 percent, the 
positive direction of the partial regression coeffi- 
cient associated with RCS (.0161) is consistent 
with Easterlin’s hypothesis. In the next section we 
discuss the size of this RCS coefficient. 

Adding the age-group-by-RCS interaction terms 
to the equation has little effect on the size of the 
RCS coefficient (our model 3—again, nearly 
identical to Kahn and Mason’s model 2), although 
it increases the standard error of RCS by over 70 
percent. The introduction of these interaction terms 
transforms the question being asked, from one 
concerning the effects of RCS on alienation 
controlling for age and period, into a question 
about the effects of RCS controlling for age, 
period, and the special effects of RCS for the three 
youngest age-groups. This is a form of the 
partialling fallacy described by Gordon (1968), 
that is, the inappropriate introduction of controls 
that eliminates the effects of the variable(s) of 
theoretical concern. For the Easterlin hypothesis, 
the variable of theoretical concern is RCS; the 
controls used by Kahn and Mason test the special 
effects of RCS for particular age-groups. Even 
when one stretches the Easterlin hypothesis to 
focus upon young age-groups, however, little is 
gained by adding these particular interaction terms 
to the model, since they are so highly multicollin- 
ear with other variables in the equation that any 
results regarding them are essentially meaningless. 

The extent of multicollinearity in an equation 
can be judged by examining the R? between one of 
the independent variables and the others. In our 
model 2 the R? between RCS and the other 
independent variables is .83. When the three 
age-group-by-RCS interaction terms are added to 
the equation in model 3, the same R? jumps to .94. 
The R* for each of the age-group-by-RCS 
interactions with the other independent variables in 
the equation is in excess of .993. Needless to say, 
estimates of these coefficients are extremely 
unstable, and thus Kahn and Mason's test for an 
“extra strong" effect of RCS for young age-groups 
is not a powerful one. 


HOW LARGE IS THE RCS EFFECT? 


Judging the importance or size of an effect in 
regression analysis is a perennial problem. In a 
recent text, Pedhazur (1982) devotes an entire 
chapter to the topic (chapter 8). Kahn and Mason 
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ignore the RCS effect present in their own data 
presumably because of their focus on the age- 
group-by-RCS interaction terms and on "the size 
and direction of patterns observed in the data." 
They argue that the "conditional effects" (interac- 
tions) of age and RCS are minimal (see their 
Tables 4 and 5 and associated discussion). The net 
effect of RCS also eppears to be small in our 
reanalysis of their data (model 2); that is, the 
coefficient associated with RCS is only .0161. 

The regression coefficients presented by Kahn 
and Mason, and by us thus far, are all unstandard- 
ized. Unstandardized regression coefficients, how- 
ever, are the least appropriate for comparing effect 
sizes in this context, since their size arbitrarily 
depends upon the units of measure chosen. For 
example, if the dummy variables were coded as 
100 and O instead of 1 and 0, the unstandardized 
regression coefficients associated with age and 
period would decrease by a factor of 100 and 
would all be smaller than the coefficient associated 
with RCS. Similarly, if RCS were measured as a 
proportion rather than a percent of the population 
of white males aged 21-68 in particular age 
categories, then the RCS coefficient would in- 
crease by a factor of 100. Clearly, the size of the 
unstandardized regression coefficients for the age 
and period dummies and for RCS is, to a great 
extent, an artifact of how they were coded. 
Further, the dependent variable (average alien- 
ation) ranges from .14 to .59 (with a standard 
deviation of .11), so that any changes in it 
associated with other variables will appear small. 

To ameliorate such problems, standardized 
regression coefficients are often used to transform 
variables into a standard metric. The standardized 
regression coefficient for RCS is .283 (model 22). 
This means that for each standard deviation 
increase in RCS, we can expect an increase of .283 
standard deviations in the mean level of alienation 
(with the age and period relationships controlled). 
This would generally not be considered a trivial 
relationship. 

In an attempt to compare this effect to those of 
age and period, we have employed a measure that 
combines the effects of the separate dummy 
variables into a single coefficient. This “sheaf 
coefficient" can be interpreted as a standardized 
regression coefficient, and it was derived by Heise 
(1972) specifically for this type of situation. In 
Model 2a the sheaf coefficients for both period 
(.820) and age (.492) exceed the standardized 
regression coefficient for RCS (.283). But it is 
hardly surprising to find that political alienation is 
highly related to both the period in which the 
questions were asked and the age of the individuals 
who were asked the questions. Given the power of 
period and age in explaining political alienation, it 
is a wonder that RCS has any effect at all. 
Easterlin’s hypothesis certainly does not require 
that RCS be more important than these factors. 


COMMENTS 


Thus far, our analysis of Kahn and Mason's data 
produces an effect for RCS.that is in the direction 
predicted by Easterlin’s hypothesis and that is 
moderately large. In the next section we discuss 
the appropriateness of testing this coefficient for 
statistical significance. 


THE USE OF SIGNIFICANCE TESTS 


Kahn and Mason reject the use of tests of statistical 
significance and therefore do not present standard 
errors for their regression coefficients. They state 
that “one consequence of the use of cell means for 
the dependent variable is that tests of significance 
based on the usual (weighted) least squares 
formulae are incorrect even under the assumption 
of simple random sampling, which is not met in 
this instance” (p. 159). This is simply not the case. 
Haitovsky (1973, p. 9), whom Kahn and Mason 
cite to justify their use of R^ as a measure of 
goodness of fit, provides the standard formula for 
testing the statistical significance of WLS regres- 
sion coefficients; he uses it to test the statistical 
significance of regression coefficients from a WLS 
analysis of sample means generated from two 
cross-classified variables containing 56 cells 
(means). Moreover, Johnston (1984, p. 293), 
whom Kahn and Mason also cite, provides the 
Standard approach for testing the significance of 
the regression coefficients for WLS analyses; his 
example analysis involves inferences about the 
means of aggregates. We have found no reference 
suggesting that the standard errors produced by 
weighted least squares are inappropriate or that 
they are not to be used in significance testing. 

Kahn and Mason correctly note that the National 
Election Survey (NES) samples on which their 
alienation means are based are not simple random 
samples; NES samples are, however, probability 
samples based on multistage sampling. These 
multistage probability samples almost always 
underestimate the standard errors of sample 
Statistics when formulas derived from the assump- 
tion of simple random sampling are applied to 
them, because data generated by multistage 
probability samples are usually not as precise per 
case as simple random samples. 

In Kahn and Mason’s case, however, the stan- 
dard errors of the regression coefficients are not 
directly based on the thousands of cases in the NES 
sample. They are based on only 96 cases—that is, 
the 96 sample means for the 12 age-groups at each 
of 8 periods. These unbiased sample means are the 
units of analysis.? Levels of statistical significance 


? Of course, in a WLS analysis the estimates of 
the regression coefficients are based on the number 
of cases used in the computation of each mean. 
The WLS analysis is used because it is assumed 
that sample means based on smaller size samples 
will be relatively less accurate than those based on 
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in this situation indicate the likelihood of finding an 
effect as strong as the one observed in these 96 
cases, assuming that they were drawn from a pop- 
ulation in which there is no systematic association 
(Winch and Campbell 1969). This use of signifi- 
cance tests examines the alternative hypothesis that 
the results are not due to random fluctuations. 

In footnote 13 Kahn and Mason cite three 
additional difficulties’ with statistical inference 
encountered in this situation: “the aggregation to 
cell means loses the ability to estimate directly the 
individual level error variance; the dependent 
variable is nononormal by construction; and the 
substance of [the] problem has a strong historical 
component.” 

These appear to be pseudoproblems. First, 
Easterlin's theory addresses group-level phenom- 
ena such as unemployment rates, crime rates, or 
average alienation. Thus, estimates of the individual- 
level error variance are not relevant, while the 
standard errors generated from the WLS analysis 
of mean levels of alienation are appropriate. 
Second, it is not clear how the use of age/ 
period-specific means as a dependent variable 
produces a variable that is "nonnormal by 
construction."3 Third, it is not clear what they 
mean by "the substance of our problem has a 
strong historical component." If they fear autocor- 
relation, they could have addressed that problem 
before conducting their own descriptive analyses in 
order to make their estimates more efficient. 

Thus, Kahn and Mason's claim that significance 
testing should not be used appears to us to be in- 
valid. The one-tailed level of statistical significance 
for the RCS coefficient in our model 2 is .018. 


CONCLUSION 


In contrast to Kahn and Mason, we find that their 
data support Basterlin's prediction concerning the 
independent relationship between relative cohort 
size and alienation. The effect is not large as 
measured by its increment to R? (when added to an 
equation already containing age and period dum- 


larger samples. Interestingly, the results of an OLS 
analysis of the data (not using weights) do not 
change the substantive conclusions of our analysis. 

? It is the distribution of the residuals that is 
emphasized by both Haitovsky (1973) and Johnston 
(1984) in their discussion of significance test for 
regression coefficients in WLS. 'The residuals 
generated from our model 2 closely approximate a 
normal distribution. They are slightly, though not 
statistically significantly, skewed negatively (skew- 
ness = —.170; standard error of skewness = 
.246). They are slightly, though not statistically 
significantly, leptokurtic (kurtosis = .024; stan- 
dard error of kurtosis = .488). (The measures of 
skew and kurtosis are based on Fisher’s g statistics 
[Bliss 1967, pp. 142-44].) 


x 


480 


mies), but it is in the predicted direction, its 
standardized regression coefficient is substantial 
(.283), and it is statistically significant (p= .018). 
Even if we allow for some dispute about the use of 
significance tests in sociology (Morrison and 
Henkel 1970), the pattern of results (i.e., the size 
of the RCS standardized regression coefficient and 
its direction) supports Easterlin's prediction. 
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REPLY TO O'BRIEN AND 
GWARTNEY-GIBBS 


WiLLIAM M. MASON 
The University of Michigan 


JoAN R. KAHN 
The University of Maryland 


O’Brien and Gwartney-Gibbs reject our choice of 


an interactive specification in favor of an additive 
one for testing Easterlin’s hypothesis about the 
impact of relative cohort size on trends in political 
alienation. They argue that relative cohort size has 
a nontrivial additive effect on alienation, as 
evidenced by a standardized coefficient of .283 
and a significance level of .018. The authors also 
defend the use of significance tests for this 
problem. 

We contend that the interactive specification 
accurately reflects Easterlin’s theoretical reason- 
ing. It is consistent not only with Easterlin's 
framework (1973, 1978, 1980), but also with the 
analytic approaches taken by other researchers 
(e.g., Welch 1979; Berger 1985), including 
Easterlin’s associates (e.g. Ahlburg and Schapiro 
1984). The additive specification is too simple, 
and exclusive empirical reliance on it is mislead- 
ing. 

In the following paragraphs, we discuss (1) the 
theoretical justification for the interactive specifi- 
cation, (2) further empirical results, and (3) 
collinearity, choice of weights, and tests of 
significance. 


INTERACTIVE VERSUS 
ADDITIVE SPECIFICATION 


At issue is the nature of the effect of relative cohort 
size on political alienation. O’Brien and Gwartney- 
Gibbs read Easterlin to mean that the effect of 
cohort size on alienation is positive and constant 
with respect to age (i.e., additive with respect to 
age). In our article, we argued that Easterlin’s 
theory implies a positive cohort size effect on 
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alienation that is especially strong for young adults 
(i.e., conditional with respect to age, therefore 
requiring an interactive specification). 

Why, within Easterlin’s framework, should the 
effect of cohort size on political alienation be most 
pronounced for young adults? Easterlin first 
developed a relative economic status argument to 
account for “generation”-length periodicity in 
fertility rates (1973). His notion of relative status 
involves a comparison and extrapolation process: 
Young adults make comparisons between what 
they want and what they are able to get in the short 
run, based on expectations fueled by their parents' 
relative economic success when these young adults 
were growing up. This is Easterlin’s original 
mechanism for explaining periodicity in human 
fertility: All other things equal, smaller cohorts 
produce larger cohorts because they are more 
prosperous. Their greater prosperity is due to a 
relative lack of crowding, which leads to increased 
demand for their labor relative to that of their 
parents. 

The offspring of smaller cohorts grow up with 
high economic aspirations based on their parents' 
financial success. When these offspring attempt to 
enter the labor force, they discover that, because 
of crowding, they cannot readily satisfy their 
aspirations. Labor force entry is a shock. There- 
fore, Easterlin argues, larger cohorts during 
young adulthood experience higher unemployment 
rates, longer-lasting job searches, lower wages, 
lower fertility, and higher levels of political 
alienation. 

The initial confrontation between aspirations and 
reality produces political alienation. What happens 
next? Easterlin asserts that the alienation will 
persist—because the crowding will persist. But 
what is the mechanism that would maintain 
political alienation at its initial level? It is hardly 
counterfactual to suppose that aspirations adjust 
with time—as individuals age and mature, and 
current events and situations supplant and modify 
the experiences and hopes of childhood and 
adolescence. Thus, as the shock of the unexpected 
harshness of young adulthood for those in large 
cohorts wears off, the political alienation that has 
its basis in failed aspirations should decline. 
In other words, according to the Easterlin frame- 
work, we would expect the impact of cohort 
crowding on alienation to be greatest during young 
adulthood when individuals with high aspirations 
face the harsh reality of an overcrowded job 
market. 

This interpretation does not contradict O'Brien 
and Gwartney-Gibbs' (or Easterlin's) assertion that 
the effect of cohort size on alienation should be 
positive at all ages. By emphasizing the impact 
during young adulthood, we are focusing on the 
ages when cohort crowding should have its 
strongest effect on alienation. To say simply that 
cohort size has a positive effect at all ages ignores 


481 


the subtlety of Easterlin's argument about how 
cohort crowding affects behavior.! 

Easterlin's theoretical focus is on long-term 
trends (Kuznets cycles). He asks: What accounts 
for the modern rise in homicide, political alien- 
ation, and divorce, among other dimensions of 
social and personal well-being? Thus, when he 
inspects data, Easterlin is concerned with periodic- 
ity. That is presumably why he examines trends 
over time in a dependent variable conditional on 
age (e.g., 1978). This is an indirect form of 
analysis, with age performing as a proxy for 
relative cohort size in an unspecified model. From 
his plots, we are meant to conclude that the rise 
and fall of alienation correspond to the rise and fall 
of relative cohort size. And specifically, we are to 
draw the conclusion that the rise in political 
alienation that began in the 1960s is due to the 
baby boom cohorts. This is the reason for his focus 
on the younger age-groups (and his neglect of 
alienation at older ages). 

Easterlin's theory and the. perspective that 
underlies his plots of political alienation over time 
(1978, p. 411) translate into a model specification 
in which political alienation is a function of 
relative cohort size, conditional on age. The 
primary hypothesis is that large cohort size will 
produce especially high levels of alienation among 
young adults. We should emphasize that to allow 
for an age-relative cohort size interaction in an 
empirical specification does not rule out the 
possibility that the size effect is positive over the 
entire age range (à la O'Brien and Gwartney- 
Gibbs). If the size effect is larger for the younger 
age-groups, then the data are consistent with 
Easterlin’s framework. If the size effect is positive 
and essentially constant over age-groups (i.e., 
additive with respect to age), then the data provide 
less support for the Easterlin framework, since the 
implied heightened reaction of young adults is not 
found. 

Easterlin’s theory of trends in political alienation 
does not refer to any other determinants. For this 
reason, our initial specification (model 1) simply 
included age, cohort size, and age-size interactions 
for the three youngest age-groups. We showed that 
the SIZE effect indeed varied by age-group, with 
positive signs for the three youngest age-groups 
and negative signs for the remaining ages (these 
are the conditional effects in Table 4). -Since the 
youngest ages are the most alienated, we originally 
interpreted these results as supporting Easterlin. In 
fact, they are mixed if not outright countertheo- 


` retical, because the negative effect of the older 


‘It is, of course, possible that other factors 
associated with relative cohort size can have 
persistent, monotonic effects on political alien- 
ation, but this remains an area for theoretical and 
empirical exploration — Easterlin's framework 
not extend this far. ` 


482 
ages is inconsistent with the Easterlin framework.” 


Nonetheless, the significant interaction terms and - 


the sign reversal at the older ages support an 
interactive rather than an additive specification. 


FURTHER EMPIRICAL RESULTS 


Had O’Brien and Gwartney-Gibbs tested the 
additive version of this baseline model, they would 
have found a small and statistically insignificant 
cohort size effect. Furthermore, had they examined 
the zero-order relationship between cohort size and 
alienation, they would have also found a negative 
and insignificant cohort size effect. Instead, they 
looked only at the additive version of our model 2, 
which also included a set of period dummy 
variables. That they found a positive additive 
cohort size effect is not surprising, given our 
results in Table 5 (column 2), which showed very 
similar positive effects of cohort size across 
age-groups. However, the statistical significance 
of the effect is puzzling given the lack of 
“significant” additive cohort size effects when 
either age or period (or both) is absent from the 
model. Why should cohort size have a significant 
additive effect only when both age and period are 
in the model? i 

In sum, a specification in which the effect of 
relative cohort size is additive with respect to age 
leads to a weak test of Easterlin’s theory. Further, 
if the additive effect is present, there is nothing in 
Easterlin’s theory to suggest that it should appear 
only when age and period are controlled. More- 
over, an appropriate, simple interactive specifica- 
tion (model 1) is only partly consistent (and 
crucially inconsistent) with Easterlin’s framework. 
We have shown through numerous specifications 
that cohort membership (variously measured) does 
not provide a basis for understanding trends in 
political alienation. The trend in alienation is not 
fundamentally altered when age, size, the age-size 
interaction, or other combinations, such as age, 
cohort membership, and their interactions, are 
controlled. We conclude, therefore, that levels of 
alienation fluctuate over time for the population as 
a whole and are not a cohort-based phenomenon. 


STATISTICAL ISSUES 
Collinearity 
Collinearity in our analysis is design-based and 


? This age pattern of effects also does not appear 
to be consistent with the pattern hypothesized by 
Estes and Wilensky (1978) and Wilensky (1981) 
for “morale,” which is in any case conceptually 
distinct from political alienation. We apologize to 
Wilensky for our earlier misunderstanding of his 
age-related hypothesis, which asserts a connection 
between feelings of well-being and the “life-cycle 
squeeze.” 
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stems from the high correlation between age and 
the measure of relative cohort size, and from the 
interaction terms between age and size. The 
inclusion of period (model 2) further increases the 
collinearity. Our model 2 is equivalent to modeling 
alienation at each of the three youngest ages as a 
function not only of relative cohort size, but also of 
period, where the period effect is constrained to be 
the same over all ages. In general, these 
design-based interactions increase collinearity, but 
do not necessarily invalidate significance tests. 

The component of collinearity in model 2 due to 
the addition of period can be reduced somewhat by 
an equality restriction on the age-size interaction 
coefficients for the three youngest age-groups. The 
result is that neither the main SIZE effect nor the 
interaction coefficient is larger than its standard 
error. The same results are found when the 
interaction is restricted to the two youngest 
age-groups.? 

There is another way to get at the period versus 
size problem in the presence of collinearity. This 
requires partitioning the data into two subsets—for 
those ages 21-32, and those ages 33-68. For the 
older age-group we have fit the additive model 
(NSDC on age, pericd, and SIZE). The SIZE 
effect in this regression is positive but small, and 
less than its standard error. In itself, this 
contradicts Easterlin's theory. Next, we have 
“borrowed” the period contrasts estimated on the 
older age-groups for application to alienation at the 
younger ages. In particular, we have adjusted the 
NSDC means of the young with the period effects 
found for older people. The process involved is 
nothing more than residualization of the dependent 
variable for the young, using the period contrasts 
estimated on older people. Upon regressing these 
residuals on age, SIZE, and the interactions 
between them implied by model 2, we find that the 
SIZE effect is positive but is not larger than its 
standard error, and the interaction coefficients are 
only fractions of their standard errors. In contrast, 
when the unadjusted NSDC cell mean is the 
dependent variable for the younger ages, there is a 
significant main effect of SIZE. This is evidence 
consistent with the argument that the trend in 
alienation among older people is the same as the 
trend among younger people, and that this trend 
dominates any SIZE effect. 


Choice of Weights 
In general, and contrary to the assertion on this 


? [n another approach to the problem of reducing 
collinearity, Professor Charles Proctor has reported 
to us an analysis using orthogonal polynomials for 
age, period, and cohort size (which he operation- 
alizes as a cohort-constant measure). His specifica- 
tion reduces collinearity greatly, and the results 
suggest an age-size interaction, but the size effect 
appears to be negative over the entire age range. 


p" 
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point in our article, properly chosen weights do 
permit statistical inferences about the distributions 
of regression coefficients in weighted regression. 
Weighting by the cell n's, as we did, assumes 
homogeneity of the error variances at the individ- 
ual level. That assumption is, however, invalid 
since there is a substantial, positive monotonic 
relationship between the cell mean and the 
standard deviation (7? = .66). Indeed, it is 
stronger than the association between the NSDC 
index and cohort size. We have rerun the 
regressions in our article, as well as those 
discussed above that are not in our article, taking 
this heteroscedasticity into account (i.e., weighting 
by the cell variances), and have found that the 
coefficients are generally the same. Furthermore, 
the ratios of coefficients to standard errors do not 
lead to conclusions that differ from those reported 
above or in our article. 

The use of weighted least squares when the 
dependent variable is a vector of cell means 
presupposes an underlying individual-level model. 
So does Easterlin's framework — individuals and 
not cohorts are unemployed, underpaid, or alien- 
ated. The use of means or rates is a computational 
convenience permitted by our particular specifica- 
tion, which exploits the existence of replicate 
observations for all observed combinations of 
explanatory variables. Taking the conditional 
variances of the observations into account, as 
described in the preceding paragraph, is a way of 
recognizing the heteroscedasticity inherent in the 
summed index we used as our dependent variable. 
We would have obtained the same results had we 
done an appropriately weighted least squares using 
the individual observations. Using those observa- 
tions, however, would result in violating the 
normality assumption, -because the dependent 
variable is restricted to three values. This is 
perhaps not critical, since the number of observa- 
tions is large. Nevertheless, the question of 
validity of statistical inference using this approach 
is raised. Had we chosen to base our computations 
on two parallel, correlated-response logit regres- 
sions, with cross-equation coefficient restrictions, 
questions about the error distribution would not 
have arisen. 


Tests of Significance 


Our reticence in reporting coefficient standard 
errors in our article was fueled not by rejection of 
the classical model for statistical inference, but by 
our uncertainty as to its appropriate application. 
We sought to make a case about the nature of 
variability in political alienation using the patterns 
and directionality of coefficients, results obtained 
with different but related specifications, and 
information brought to bear from outside the 
confines of the primary data set. We have adopted 
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the same stance in the data analysis reported in this 
reply. This strategy seems to be consistent with 
views advanced by Leamer (1983) and McCloskey 
(1985, pp. 154-73), among others. Although we 
began our analysis with certain ideas and hypothe- 
ses, our thinking developed as we continued to 
examine the data. How should we have taken 
account of this sequential process? Should we have 
considered statistical power? If so, how should that 
have been done in the context of our exploration of 
the data? : 

Our difficulty in defining the universe from which 
the data were sampled was also a cause of our un- 
certainty about the application of statistical infer- 
ence. This arose at two points in our article. In 
footnote 12 (p. 160) we referred to the “strong 
historical component” of our problem. The ques- 
tion here is the specification of the “urn model" — 
what is the universe from which the sample is drawn? 
Are we talking about a historical problem, the rise 
of political alienation in the post-World War II era? 
In part we are, and so too, in part, is Easterlin. Or 
so we surmise from his references to the experience 
of the Baby Boom cohorts. Given an emphasis on a 
particular historical period, what is the appropriate 
urn model? The question of the definition of the urn 
model was also alluded to in our concluding para- 
graph, when we referred to “the shortness of the 
time series” (p. 168). Here, the point is that the 
series spans approximately 30 years, the length of a 
“generation” (itself a problematic concept in hu- 
man populations), which means that we can ob- 
serve at most one trough and one peak of the kind 
of cycle that Easterlin has focused on. From the 
perspective of long swings, we have, then, a sam- 
ple of one. 
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—Robert N. Bellah, sociologist and co- 
author of Habits of tbe Heart 


Capital in Urban Change 
SHARON ZUKN 

Foreword by David Harvey 

247 pp. 8illus. Paper, $14.00. 


Community and Organization 
in the New Left, 1962-1968: 
The Great Refusal 

WINI BREINES 

With a new preface 

203 pp. Cloth, $32.00; Paper, $13.00. 


Daughters of Independence: 
Gender, Caste, and Class in 
India l 
JOANNA LIDDLE & RAMA JOSHI 


` 264 pp. Cloth $35.00; Paper, $15.00. —— 
: My Life Story:The Autobiogra- 


phy of a Berber Woman 
FADHMA AMROUCHE 

Translated and with an introduction by 
Dorothy Blair 


197 PP. Cloth, $35.00; Paper, $12. 00. 
Structural Criminology 


JOHN HAGAN 
294 pp. Cloth, $40.00. 





. Gender/Body/Knowledge: 


Feminist Reconstructions of 
Being and Knowing 


ALISON M. JAGGAR & SUSAN R. 
BORDO, eds. 
384 pp. 6 illus. Cloth, $42.00; Paper; $15.00. 


Conflict and Change in the 
Catholic Church `. 

JOHN SEIDLER & KATHERINE MEYER 
224 pp. Cloth, $35.00; Paper, $13.00. ; 


Self-Destruction in the Prom- 
ised Land: A Psychocultural 
Biology of American Suicide 
HOWARD I. KUSHNER 

280 pp. Cloth, $24.95. 

Doing Daily Battle: 
Interviews with Moroccan 
Women 

FATIMA MERNISSI 


Translated by Jo Lakeland 
224 pp. Cloth, $35.00; Paper, $12.00. 


Blacks, Science, and American 
Education 

WILLIE PEARSON, Jr., & H. KENNETH 
'BECHTEL, EDS. 

355 pp. Cloth, $35.00. 


Rutgers University Press 


.109 Church Street 


New Brunswick, NJ 08901 





AUCTIONS 


€6 Auctions is a of institutional 

me E opio obici des aod 

i Wie nly She beck T ee 
Goffman woyld have liked it 

~Lewis A. Coser, Boston 

1989 256 Pages 0-02-929530-0 $24.95 

LIVES ON THE 


Brings to life the feelings and thinking of 
at-risk students. It impresses the reader with 
= iee while Saar 
creative int jety's 
cile judgments abadi Her abilities turk 
difference into deficit. The book is a 






More than a decade oL observation and 
interviews with t of adolescents 











teens and the 
which shape their world, he ME the 
pace Hn desi that can provide 
a ine po 
dre dee ditione n to parents 
mir es 0-02-915360-3 $22.95 


Sociology from The Source. 


` a true democracy as 
- founded fear of ape inhibits our full 


 [fp| THE FREE PRESS 


A Division of Macmillan, Inc. 
866 Third Ávenue, New York, NY 10022 


















THE FEMÁLE FEAR 

Margaret T. Gordon and Stephanie Riger 

Demonstrace that se Comnot fune tian es 
long as women’s well- 


partici in society. Gordon and Ri 

Hoe areas cal social science wo 
mscr ep ary ires eden a 
women Without compromising the canons 
of science.99 

—Pauline B. Bart, University of Illinois at Chicago 
1988 272 Pages 0-02-912490-3 $19.95 





PRIVATE ACTS 

SOCIAL CONSEQUENCES 
AIDS and the Politics of 

Public Health RR 

Ronald Bayer 

€6A much-needed, and thought- 


promina Dock of) AIDS wid ds Hae cal, 
cultural, and social-political context. Should 
be read not only by public health officials 
abd politicians bur. obViusiy by everyctie d 
—Mathilde Krim, Founding Co-Chair, 
American Foundation for AIDS Research 
1988 280 Pages 0-02-901961-3 $22.95 


A DEATH IN THE DELTA 


change and continuity in the modem South. 


Evocatively written and intellectually 
this book will a peal my SERE MERE ins 


roots of our contin 
understanding the of our n 
—Dan T. Carter, author of Scottsboro: 
A Tragedy of the American 


South 
1988 208 Pages 0-02-935121-9 $19.95 


For VISA, MasterCard or 
American Express orders, 
call 1-800-323-7445 
between SOO UD EOS Time Eastern Time. 






Defining Rape 

ss Linda Brookover Bourque 
1989. 440 pages. ISBN.0-8223-0901-7 $59. 95 
How available In paper 


Power and Protest in the Countryside . 

c Rural Unrest in Asia, Europe, and Latin America 
Robert P. Weller and Scott E. Guggenheim, editors 
1989 (1982). viii, 212 pages. LC 82-12991 


ISBN 0-8223-0483-X, cloth $34.95 
ISBN 0-8223-0895-9, paper $10.95 


Duke University Press 6697 College Station Durham, NC 27708 


1989 Guide to Graduate 
Departments of Sociology 


This edition of the Guide to Graduate Departments of Sociology includes information 
on 244 graduate departments of sociology. The Guide is a valuable resource for 
students, faculty members, advisors, department chairs, and researchers. 
Information on each department includes: 


O Department address and telephone 

D Chair and Graduate Advisor/Director 

O Degrees offered 

O Special programs and specialties offered 

O Tuition, financial aid, and application deadlines 

D Statistics on new admissions and degrees granted 

D Teacher training 

DO Roster of full-time, part-time, and jointly appointed faculty 
U PhDs awarded in 1987-88 


Send your prepaid order ($5 to ASA members and students, $10 to non-members 
and institutions) to: American Sociological Association, 1722 N Street NW, 
Washington, DC 20036; (202) 833-3410. 





STUDIES ON NORTH AMERICA 


Polities, Government, Society, Economy, and History 





A NEW BOOK SERIES FROM DE GRUYTER 


Volume 1 
The Reagan Administration 
A Reconstruction of American Strength? 

Helga Haftendorn (Freie Universitat of Berlin, West Germany) and 
Jakob Schissler (Freie Universitat of Berlin, West Germany), editors 
Translated by Dennis Mercer 
1988. xii -- 306 pages. ISBN 0-89925-409-8. Cloth $34.95 


To what degree does the Reagan Administration’s security and foreign policy differ from 
prior administrations? Are such changes primarily a reflection of shifts in domestic politics, 
or do they point to an expanded international role for the U.S. based on a "Policy of Strength"? 
This volume examines U.S. policy in the context of these questions through case studies 
which highlight some of the dilemmas that confronted the Reagan Administration in its 
attempt to reconstruct American military and economic strength during the 1980's. 


Volume 2 


Values in the Marketplace 
'The American Stock Market Under Federal Securities Law 
James Burk (Texas A & M University) 
1988. x + 207 pages. ISBN 0-89925-487-X. Cloth $34.95 


This volume traces the effects of federal securities law on the ethical and structural 
development of the American stock market. It assesses the consequences of market 
regulation and the changes it has brought for the market as a social institution. 
The book provides an institutional analysis of the stock market's strategic ability 
to adapt to its new regulatory environment. 


Volume 3 


Economic and Strategic Issues in U.S. Foreign Policy 
Carl-Ludwig Holtfrerich (Freie Universitat of Berlin, West Germany), editor 


1988. xvi + 297 pages. ISBN 0-89925-535-3. Cloth $39.95 

While the U.S. pursued a policy of military build-up and thus asserted its leadership position 
in the strategic field, its international position in the economic field deteriorated dramatically. 

Will the international commitments and claims of the U.S. to a leadership role in the 
Western world falter under pressure from economic constraints? Has the American economy 

become the Achilles heel of U.S. foreign policy? Such key economic and strategic issues 
in U.S. foreign policy are discussed in this book by a distinguished group of 
European and American scholars. 
Prices subject to change. 


C Walter de Gruyter, Inc. 


200 Saw Mill River Road * Hawthorne, NY 10532 








IV 


The Assemblies of God 
at the Crossroads 
Charisma and Institutional 
Dilemmas 

Margaret M. Poloma 

368 pages. 

Cloth Library Edition, 

ISBN 0-87049-604-2, $35.95 
Paperback Edition, 

. ISBN 0-87049-607-7, $17.95 


The Southern Baptists 
A Subculture in Transition 

Ellen M. Rosenberg 

256 pages, illustrations, 

ISBN 0-87049-598-4, $24.50 


Addicts Who Survived 
An Oral History of Narcotic Use in 
America, 1923-1965 


David Courtwright, Herman Joseph, 
and Don Des Jarlais 


With a Foreword by Claude Brown 


416 pages, illustrations, 
ISBN 0-87049-587-9, $24.95 


The University 
of Tennessee Press 


Knoxville 37996-0325 











Returning to | 

Tradition (| 

The Contemporary Revival of 
Orthodox Judaism 


M. Herbert Danzger 


A sociologist who is also a part of the 
world of Orthodox Judaism looks at the 
large numbers of young Americans and 
Israelis raised in non-observant Jewish 
families who have chosen to become 
Orthodox Jews. 


* An outstanding book, original, 
well written, and incisive.” 
—William B. Helmreich $29.95 


Edging Women Out 
Victorian Novelists, Publishers, and 
Social Change 
Gaye Tuchman with Nina E. Fortin 


Before about 1840, most English novelists 
were women; by the turn of the twentieth 
century, however, most successful novelists 
were men. In this book, sociologist Gaye 
-Tuchman examines how men gradually 
supplanted women as novelists once 
writing fiction was perceived 
as prestigious. 

“A delightful synthesis that cuts across 
women’s studies, sociology and literary 
history.” —Publishers Weekly $30.00 


Now available in paperback 


The Nonprofit 


Sector 
A Research Handbook 


edited by Walter W. Powell 


“[This] fine book . . . successfully 
bridge[s] gaps between academic research 
and application. . . . It touches on virtually 
every domain of inquiry within the field." 
— Judith R. Blau, Contemporary Sociology 

$19.95 
Yale University Press 
Dept. oo2 
* 92A Yale Station 
New Haven, CT 06520 ' 








AMERICAN ANABAPTISTS 


The Riddle of Amish Culture 


DONALD B. KRAYBILL 


Talking with the Amish on their farms, in their shops, and around their 
kitchen tables, Donald Kraybill has learried how they have “struck a 
bargain” with modern times—and how many of the rules that seem 
quaint or silly to outsiders have been essential to keeping Amish culture 
alive. l $35.00 hardcover; $8.95 paperback 


Mennonite Society 
CALVIN REDEKOP . 
The first comprehensive introduction to the North American Mennonite 
world. From a sociological and historical perspective, Calvin Redekop 
explores the Mennonites’ beliefs, the organization of their institutions, 
their family life, economics, and aesthetics. 

$40.00 hardcover, $14.95 paperback 


Amish Society 
Third Edition, completely revised 
JOHN A. HOSTETLER | 
An inside view of Amish history, customs, beliefs, and interactions with 
the outside world. Long the definitive work on the subject, Amish Society 
effectively captures the depth and diversity of Amish culture. 

$9.95 paperback 
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BY SOCIAL ISSUES 


( The Politics of Numbers 

C. WILLIAM ALONSO and PAUL STARR, eds. 

. Now in paperback, the first major study of the social and political 

* forces behind the nation's statistics. “The ‘sociology of statistics’ is 

a relatively new, exciting, and important field of study that we may 
neglect at our political peril. This elegantly-written, well-researched, 
and handsomely-designed volume is sure to become the standard in 
the field." —Choice. 480 pages/Paper $18.50 


Big Structures, Large Processes, 
Huge Comparisons 


CHARLES TILLY 

Now in paperback. "In this short, brilliant book Tilly suggests a way 

to think about theories of historical social change... This book 

should find attentive readers both in undergraduate courses and in 
. graduate seminars. It should also find appreciative readers, for Tilly 

is a writer as well as a scholar." — Choice. 167 pages/Paper $8.95 


From Patrician 
to Professional Elite 


The Transformation of the New York City Bar Association 

MICHAEL J. POWELL 

Offering a rare view of the internal dynamics of the exclusive Associ- 
` ation of the Bar of the City of New York as it adapted to postwar 
pressures for inclusiveness and democratization, Powell raises 
important questions about the class basis of modern professionalism 
and the role of elites in a pluralistic society. 256 pages/Cloth $29.95 


Power, Culture, and Place 


Essays on New York City 
JOHN HULL MOLLENKOPE, ed. 352 pages/Cloth $40.00 


Advice and Consent 
The Development of the Policy Sciences 
PETER deLEON. 144 pages/Cloth $17.50 


4 Women in Academe 
Progress and Prospects 
MARIAM K. CHAMBERLAIN, ed. 448 pages/Cloth $29.95 


RUSSELL SAGE FOUNDATION 


c/o CUP Services, P.O. Box 6525, Ithaca, NY 14851 
or call 1-800-666-2211 (for orders only) 





VH 


Westview 


These essays from one of our most influential 
feminist theorists trace the development of 
socialist-feminist theory from its roots in the 
politics of the New Left to its present post- 
Marxist forms, from a unified theory of social 
domination to a more open, multi-system per- 
spective. Of special interest are Ferguson's 
analyses of racism and the issues raised by a 
“divided sisterhood.” 

Apr. 1990 • ca. 256 pp. * $38 (hc) * 

$15.95 (pb) 


Ain’t No Makin’ It 


Leveled Aspirations in a Low-Income Neighborhood 


Jay MacLeod 
“In these pages sociology gains a wonderfully 


heart, mind, and soul.” 
1987 * 198 pp. * $32 (hc) - $13.85 (pb) 


5500 Central A 





Harvard University 
Westview Press 


vigorous narrative spirit, 
which dares connect the knowledge of social science to the first hand 
observations of an idealistic young man who spent a long time getting 
Se ee lea o C Eua A 


State and Society 

in the Soviet Union . 
Exploring the changing character of state- 
society relations in the USSR, the author 
examines the long-term economic, social and 
ideological trends that have been eroding key 
features of the state system, and traces the 
efforts of the Soviet leadership to create a 
new conception of Soviet society and new 
instruments for governing it. : 

Nov. 1989 » ca. 256 pp. * $35 (hc) * 

$16.50 (pb) 


Risking Old Age 
in America 


the elderly's resolute struggle to forestall pov- 
erty and dependency. The author charts their 
course through the benefit maze, proposes 
steps the nation can take to help ensure 
independence for elderly citizens, and offers 
a hopeful based on the belief that 
America has the will and ability to build a 
humane social 

Dec. 1989 » ca. 200 pp. * $35 (hc) « 

$14.95 (pb) 


World System Theory 
An Introduction 


Thomas R. Shannon 
Jan. 1990 « ca. 256 pp. * $35 (hc) « 
$19.95 (pb) 


Robert Coles, 





* Boulder, Colorado 8030 
Phone (303) 444-3541 * FAX (303) 449- 3356. 
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Emerging Forms and Historic Foundations 


THESTRUCTURAL 
TRANSFORMATION OF 


THE PUBLIC SPHERE 

An Inquiry into a Category 

of Bourgeois Society 

Jürgen Habermas 

translated by Thomas Burger 

This lucid study of the origins, nature, 
and evolution of public opinion in 
democratic societies is Habermas' 
most concrete historical-sociological 
book, and one of the key contributions 
tö political thought in the postwar 
period. ^Why is this such a vital study? 
Its significance rests in its analysis of 
one of the central notions on which 
both our political life and our political 
theories rests: ‘public opinion." 
—James Schmidt, Boston University 
$35.00 


New in Paperback 


THE SOCIAL 
CONSTRUCTION OF . 
TECHNOLOGICAL SYSTEMS 
New Directions in the Sociology 
and History of Technology 

edited by Wiebe E. Bijker, 
Thomas P. Hughes, 


and Trevor Pinch 

The 13 essays in this book draw on 

a wide array of case studies—from 
cooking stoves to missile systems—to 
outline an original research program 
based on a synthesis of ideas from the 
social studies of science and the his- 
tory of technology. 

$12.95 paper 


FRENCH MODERN 
Norms and Forms of the 
Social Environment 


Paul Rabinow 

“This path-breaking book opens up 
topics for some new, contemporary 
analysis of modernity that go well 
beyond its immediate occasion in the 
colonial city. Rabinow links space itself, 
and bureaucratic ideologies, with the ` 
nascent biological sciences, with the 
history of the fine arts in the period, 
and with concrete biographical trajec- 
tories and destinies. It is a stimulating 
and exciting performance.” 

—Fredric R. Jameson, William A. Lane Jr. 
Professor of Comparative Literature, 
Duke University 

$35.00 


HIGH-TECH SOCIETY 

The Story of the Information 
Technology Revolution 

Tom Forester 

The most definitive account available 
of the technology revolution that is 
transforming society and dramatically 
and maybe even think. 
$9.95 paper 
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The MIT Press 


55 Hayward Street, Cambridge, MA 02142 








NEW TITLES 


Black Lives, 
White Lives 
Three Decades of Race 


Relations in America 


BOB BLAUNER f 
“This is one of the finest books I’ve read in 
many years—an exceptionally moving and 
thoughtful presentation of ourselves to 
ourselves, so to speak.” 
—Robert Coles, Harvard University 

“Necessary equipment for navigating the 
great divide: race.” —Todd Gitlin, 

author of The Whole World is Watching 
Black Lives, White Lives is a pathbreaking study 
of racial experience and a unique blend of 
social history, sociology, and narrative that 
brings alive the connections between social 
and personal change. $25.00 


The Making of 


Rehabilitation 


A Political Economy of Medical 
Specialization, 1890-1980 
GLENN GRITZER and 
ARNOLD ARLUKE 
New in paperback—“Gritzer and Arluke per- 
suasively argue that the development of 
rehabilitation medicine has as much to do 
with the governmental and medical market- 
places as it does with scientific advancements. 
Specialization, in other words, does not reflect 
the inevitable logic of science so much as the 
perceived ability of a group of doctors to meet 
current needs. .. . Provocative and 
informative.”—Contemporary Sociology 
Comparative Studies of Health Systems and 
Medical Care, 15 

$9.95 paper 
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Social Order/ 


Mental Disorder 


Anglo-American Psychiatry in 
Historical Perspective 


ANDREW SCULL 
“Sculls scholarship is the best work now 
being done—and the best work ever done— 
on the historical sociology of mental illness. It 
is insightful, balanced, and extraordinarily 
well-documented.” 

—John Monahan, University of Virginia 
Social Order[Mertal Disorder represents a 
provocative and exciting exploration of social 
responses to madness in England and the 
United States from the eighteenth through the 
twentieth centuries. 


Medicine and Society, 3 $25.00 


Into One’s Own 


From Youth to Adulthood in 


the United States, 1920-1975 


JOHN MODELL 
“With remarkable ingenuity and attention to 
detail, Modell charts the rise of dating as a 
social institution during the 1920s and the 
decline of courtship in the following decades, 
as well as the reverberating effects of hard 
times and wars on family formation. The 
story ends in the midst of a rising divorce rate 
(the mid 1970s) and a shift to the later marital 
age of the prewar years. For the very first 
time, we can move beyond ‘then-now’ com- 
parisons to a genuine understanding of the 
connecting pathways across the twentieth 
century. A splendid achievement!” 
—Glen H. Elder, Jr., 
The University of North Carolina 
at Chapel Hill 
$40.00 
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It Did Happen Here 


Recollections of Political 
Repression in America 
BUD SCHULTZ and 
RUTH SCHULTZ . 
With a foreword by Victor Navasky 
^Deeply moving testimony, seldom heard, of 
survivors who were pillored by our witch- 
hunters. It is a piece of history uncovering a 
shameful moment. More important it is a 
necessary work.”—Studs Terkel 
“A book of major interest and importance—a 
wonderful recapture of a dark history and a 
small marvel of adept and literate editing. It’s 
publication will help not a little to ensure that 
this part of our past is past.” 

: —John Kenneth Galbraith 

$22.50 


The Comparative 


Method 

Moving Beyond Qualitative and 
Quantitative Strategies 
CHARLES C. RAGIN 

New in paperback—"Without a doubt, Ragin 
has made an important contribution with this 
book. ... This book opens a new field of com- 
parative methodology. It will create new 
research possibilities. It will stir new 
methodological debates. Most important, it 





will allow us all, in a more sophisticated way, _ 


to get down to the task of comparing cases." 
~~Contemporary Sociology 
$9.95 paper 
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Best Friends and 

e 
Marriage 
Exchange Among Women 
STACEY J. OLIKER 
In this fascinating book, Stacey Oliker delves 
into the intimate realm of women’s friend- 
ships and explores the complex relation 
between friendship and family life. Based on 
a series of interviews with women from the 
middle and working classes, this work reveals 
the distinctive values of best friendship and 
marriage, how husbands and their wives’ 
friends feel about each other, and how women 
friends talk about marriage problems. 





$20.00 


Meaning and Moral 
Order 


Explorations in Cultural 
Analysis 
ROBERT WUTHNOW 


New in paperback—“Filled with suggestive 
empirical illustrations and interpretations that 


. will be of great interest to those working in 


such fields as social movements, historical 
sociology, sociology of religion, sociology of 
science, and other substantive areas that 
overlap with ‘cultural sociology." 
— American Journal of Sociology 
$12.95 paper 
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BEYOND GLASNOST 
The Post-Totalitarian Mind 


JEFFREY C. GOLDFARB 

Foreword by Jan Josef Szczepanski 

Galdfarb uncovers the true origins of glasnost by examining 
the Polish Solidarity movement and works by Eastern 
European writers. “Beyond Glasnost raises the level of 
discourse, not only about Solidarity, but about the conditions 
of modern life as a whole. Goldfarb relates contemporary 
struggles for liberty in Eastern Europe to our own ongoing 
struggle for an E public life and a truthful political 
culture in the West. This is a provocative and an inspiring 
book. "— Marshall Berman, author of All That is Solid Melts, 
In:o Air 

Cloth $24.95 272 pages 


RURAL DEVELOPMENT IN CHINA 
Prospect and Retrospect 


FEI HSIAO-TUNG 

With a Foreword by Tang Tsou 

This collection of essays by Fei Hsiao-tung, China's most 
distinguished sociologist and anthropologist, presents a rich 
and representative sampling of the research that has . 
characterized his long career. 

Paper $10.95 (est.) 248 pages 

Library cloth edition $24.95 (est.) 


LITERATURE AND SOCIAL PRACTICE 
Edited by PHILIPPE DESAN, PRISCILLA PARKHURST 
FERGUSON, and WENDY GRISWOLD 


Diverse, sometimes contradictory approaches (Marxism, New 
Historicism, publishing history, and others) mark this lively, 
critical assessment of the sociological practices of literature. 
Contributors include Pierre Bourdieu, Terry Eagleton, J. Paul 
Hunter, Sandy Petrey, John Sutherland, Alain Viala, Robert 
Weimann, and others. 

Paper $13.50 (est.) 312 pages 

Library cloth edition $30.00 


A POETIC FOR SOCIOLOGY 


Toward a Logic of Discovery for the Human Sciences 
RICHARD HARVEY BROWN 


For too long, argues Richard Harvey Brown, social scientists 
have felt forced to choose between imitating science's 
empirical methodology and impersonating a romantic notion 
of art. Developing the idea of a “cognitive aesthetic,” Brown 
shows how both science and art—as well as the human studies 
that stand between them— depend on metaphoric thinking as 
their "logic of discovery" and may be assessed in terms of 
aesthetic criteria of adequacy. 

Paper $14.95 322 pages 
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STATEMENT OF ASA POLICY ON MULTIPLE SUBMISSION 


Submission of manuscripts to a professional journal clearly implies commitment to publish in that 
journal. The competition for journal space requires a great deal of time and effort on the part of 
editorial readers whose main compensation for this service is the opportunity to read papers prior to 


publication and the gratification associated with discharge of professional obligations. For these 
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PROTESTANT AND CATHOLIC: 
IS THE ANALOGICAL IMAGINATION EXTINCT?* 


ANDREW GREELEY 
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The notion of the “Protestant Ethic” is reconceptualized to reflect theoretical 
expectations about the difference between the “Dialectical Imagination” and the 
“Analogical Imagination.” Protestants are expected to be individualist because of 
an imagination of society as “sinful and God-forsaken.” Catholics are expected to 
be communitarian because of an imagination of society as sacramental, that is, 
revelatory of God. The research question is whether the analogical/communitarian 
imagination can persist in a world of industrialization, modernization, 
urbanization, and homogenization. The issue was tested against data from seven 
nations (36 variables) with the null hypothesis that denominational background 
does not predict value orientation. The null hypothesis was rejected. A replication 
was attempted with a different data set from seven nations (36 variables) and the 
null hypothesis was again rejected. Further testing with data and 13 hypotheses 
from only the United States tested the null hypothesis that differences in religious 
imagery could not account for some of the variance between Protestants and 


Catholics on value measures. This null hypothesis was also rejected. 


INTRODUCTION 


Modern sociology emerged in substantial part 
from a discussion of the differences between 
Catholics and Protestants. Weber (1958) thought 
that the "communitarian" ethic of the former 
seemed to impede educational and economic 
achievement while the "inner worldly asceti- 
cism" of the latter with its emphasis on indi- 
vidual achievement facilitated the success of 
the latter. Durkheim (1961) believed, on the 
other hand, that the "individualism" of Protes- 
tants induced higher suicide rates among them 
while the “communitarian” control available to 
Catholics tended to reduce suicide rates. 

The theories of these two giants certainly fit 
the pattern of religious emphasis of the two 
denominational traditions. For Protestantism 
the tendency has always been to emphasize the 
relationship of the individual with God while 
for Catholicism the tendency has always been 
to emphasize the individual relating to God as 


* Funding for data collection in the United 
States as well as early analysis was provided by 
CARA (Center for Applied Research in the 
Apostolate). David Barker, Gordon Heald, Sey- 
mour Martin Lipset, Tom Smith, and Fran 
Gillespie helped in the search for errant data files. 
Sean Durkin labored mightily in the preparation of 
the data for analysis. Michael Hout and Robert 
Michael offered important statistical advice. 
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a member of a community. Suicide rates and 
educational achievement rates would be two 
manifestations (and perhaps not the most im- 
portant) of fundamental orientations which per- 
meate the two traditions. 

Quite apart from achievement! and suicide 
rates, then, the question remains whether, at 
the end of the century which began with The 
Protestant Ethic and the Spirit of Capitalism 
and Suicide, the distinctive orientations of the 
two heritages persist. 

Progress made in the theory of the 
sociology of religion since the publication of 
these two works makes it possible to describe 
more precisely the nature of these two 
orientations. Religion (Geertz 1968) is a set 
of symbols which provide answers to issues 
of the ultimate meaning of life, symbols 
which explain imaginatively what the world is 
about and provides templates for shaping 
human response to the world and thus the 
shape the world in which humans live. 
Religion is an imaginative "cultural system" — ` 
a collection of directing “pictures” through 
which humans organize and give meaning to 
the phenomena which impinge on their 
consciousness, especialy insofar 
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phenomena require some explanation of the 
ultimate purpose of life. 

While these “pictures” may produce theo- 
logical and ethical codes, they are pre- 
propositional and metaphorical. The codes are 
derivative, the superstructure, if one wishes, 
built on an imaginative and preconscious 
infrastructure. The fundamental differences 
between Catholicism and Protestantism, then, 
are not doctrinal or ethical. The different 
propositional codes of the two heritages are 
but manifestations, tips of the iceberg, of 
more fundamental differing sets of symbols. 
The Catholic ethic is “communitarian” and 
the Protestant "individualistic" because the 
preconscious "organizing" pictures of the two 
traditions which shape meaning and respond 
to life for members of the respective heritages 
are different. Catholics and Protestants "see" 
the world differently (or “saw” it differently). 
These preconscious “worldviews” are not, of 
course, totally different, but only somewhat 
different, different enough to produce differ- 
ent doctrinal and ethical codes, different 
church structures, and different behavior rates 
(or to have once produced different rates). 

The central religious symbol is God. One's 
“picture” of God is in fact a metaphorical 
narrative of God's relationship with the world 
and the self as part of the world. Tracy (1981) 
has studied the "classics" of the two 
traditions to discover the underlying imagery 
which shapes these crucial works. On the 
basis of his study, he suggests that the 
Catholic imagination is “analogical” and the 
Protestant imagination is "dialectical." The 
Catholic "classics" assume a God who is 
present in the world, disclosing Himself in 
and through creation. The world and all its 
events, objects, and people tend to be 
somewhat like God. The Protestant classics, 
on the other hand, assume a God who is 
radically absent from the world and who 
discloses Herself only on rare occasions 
(especially in Jesus Christ and Him Cruci- 
fied). The world and all its events, objects, 
and people tend to be radically different from 
God. 

The word tend in the previous paragraph is 
used advisedly. Zero-sum relationships do not 
exist in the world of the preconscious. The 
analogical and the dialectical imaginations 
exist side by side in the personalities of the 
authors of the classics, opposing one ancther 
but also complementing one another. Rarely 
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does one encounter a religious imagination 
that is purely analogical or purely dialectical. 

Tracy contends that two approaches to 
human society of the respective traditions are 
shaped by these imaginative pictures. The 
Catholic tends to see society as a “sacrament” 
of God, a set of ordered relationships, 
governed by both justice and love, which 
reveal, however imperfectly, the presence of 
God. Society is “natural” and “good” 
therefore for humans, and their "natural" 
response to God is social. The Protestant, on 
the other hand, tends to see human society as 
“God-forsaken” and therefore un-natural and 
oppressive. The individual stands over against 
Society and is not integrated into it. The 
human becomes fully human only when he is 
able to break away from social oppression and 
relate to the absent God as a completely free 
individual. 

Tracy's analysis complements and adds to 
the earlier insights of Durkheim and Weber. 
One may continue to speak of a “Protestant” 
ethic, but now one must understand the ethic 
as a set of fundamental world-explaining and 
world-shaping interpretative pictures. In this 
sense ethos, as Geertz (1957) has observed, is 
the flip side of mythos. Formal ethical codes 
are derivatives of interpretative pictures. The 
higher suicide rates and the higher achieve- 
ment rates of Protestants in Turn-of- 
the-Century Europe are the result ultimately 
of different symbol systems, of, if one may 
say it, different stories of God.? 

In this perspective the more “conservative” 
Catholic stands on morality and doctrine are 
not so much the starting point of the Catholic 
tradition as rather the results of images of 
society and religion that are latent beneath the 
formal teaching of the tradition. Because 
communities are pictured by Catholics as 
sacramental, threats to communities must be 
resisted both by a reassertion of the values 
which seem to protect the communities and 
by support for doctrinal notions on which the 
communities appear to have been based. The 
Protestant heritage, freed from such powerful 
concern about the preservation of traditional 
communities because it images the communi- 
ties to be sin-filled and God-forsaken, can 
perhaps afford to be more flexible in matters 
of morality and doctrine and human relation- 


? Note that the stories are not totally different. 
Some Catholics committed suicide and some 
Catholics achieved more than Protestants. 
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ship (so long as the traditional, community- 
protecting norms are observed). ' 

Tracy's work provides background for 
reconsidering the insights of the Two 
Founders, but the issue that remains is 
whether the two different imaginations still 
persist in an urbanized, industrialized, ecu- 
menized world of the End of the Century. Do 
Catholics: still see the world somewhat more 
analogically than Protestants? Do Protestants 
still see the world somewhat more dialecti- 
cally than Catholics? Do Protestants therefore 
still tend to emphasize individual values? Do 
Catholics still tend to emphasize communal 
values? Does the Catholic value orientation 
still reflect an image of God as present in 
creation? Does the Protestant value orienta- 
tion still reflect an image of a largely godless 
creation? . 

To put the issue bluntly: since the Reforma- 
tion ethic can fairly be assumed, at some high 
level of generality, to have shaped the spirit 
of modern capitalism, is the dominant social 
trend away from communal ethic and towards 
individual ethic? Do the combined forces of 
modernization, urbanization, industrializa- 
tion, ecumenization, and religious homogeni- 
zation threaten the analogical imagination 
with extinction? Has the Catholic imagination 
already become or is it in tlie process of 
becoming a casualty of the modern world? 

If one wishes (with a wary eye on the data 
available) to formulate expectations of how 
the different imaginations would shape val- 
ues, one might very tentatively express the 
following “predictions”: 

Catholics will be more likely to value 
social relationships than will Protestants 
because they see not sin but sacramentality, 
however flawed, in such relationships. The 
analogical imagination which pictures hu- 
mans as integrated into social networks— 
networks. which in fact reveal God— will 
stress those values and behaviors which 
contribute to the building up and strengthen- 
ing of those networks. The dialectical imagi- 


- nation, which pictures the individual as 


struggling for his personal freedom over 
against the sinful oppression of social net- 
works, will stress those values and behaviors 
that contribute to the strengthening of per- 
sonal freedom and independence from group 
control. 

Catholics will be more likely to value 
equality over freedom than Protestants be- 
cause equality makes for smoother social 
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relationships. Moreover, because social com- 
plexity and diversity will also be seen as 
sacramental, Catholics will be more tolerant 
than Protestants of diversity in their commu- 
nities; Catholicism, as James Joyce remarked, 
means “here comes everybody." 

Because Catholics view society as a 
community of communities, they will be 
more likely than Protestants to advocate 
decentralization of control to the smaller 
communities; hence the famed Catholic prin- 
ciple of “subsidiarity” —nothing should be 
done by a larger and higher organization 
which can be done as well by a smaller and 
lower organization—is a philosophical articu- 
lation of the Catholic “image” of society. 

Protestants will value in their children the 
virtues of initiative, integrity, industry, and 
thrift more than Catholics, while Catholics 
will ‘value loyalty, obedience, and patience. 
Protestants will be especially likely to deplore 
vices which diminish personal integrity, 
honesty, and sense of duty. Catholics will be 
especially likely to be offended by actions 
which seem to violate relationship networks — 
adultery, prostitution, suicide. Catholics will 
stress the importance of common background 
in the choice of marriage partners; Protestants 
will stress compatibility of individual interests 
and. personal fulfillment. Catholics will em- 
phasize more than Protestants institutional 
religion, religious devotion, and doctrinal 
orthodoxy. Catholics will be less likely than 
Protestants to feel lonely and constrained by 
social relations (which Catholics do not take 
to be oppressive). Since Protestants are more © 
likely to emphasize personal responsibility 
(Weber's “worldly asceticism”) than Catho- 
lics, they will also be more likely to 
emphasize a “work ethic" than Catholics, 
who will be more likely to work because they: 
have to than because they want to. . 

Finally, because Catholics picture God as 
revealing Himself in society, however flawed 
the revelation might be, they will be more 
likely to advocate social change so that the 
disclosure of God will be improved. Protes- 
tants, on the other hand, will be more likely 
to despair of society ever being anything but 
God-forsaken and sin-ridden and hence will 
be less optimistic about and less supportive of 


„social change. 


In describing these expectations of a 
communal ethic against an individualist ethic, 
an analogical imagination against a dialectical 
imagination, a religious vision which pictures 
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God in society against one which pictures 
God as radically “other” from society, one is 
not predicting anything more than differences 
in emphases and tendencies. To predict (as I 
will shortly) that Catholics are more likely to 
be willing to accept heavy drinkers in their 
neighborhoods than Protestants are is not 
necessarily to say that the majority of 
Catholics will tolerate drunks and the major- 
ity of Protestants will not. Rather it would be 
altogether possible for the majority of both 
groups to reject such deviants, but the 
Catholic majority, according to the predic- 
tion, would be smaller than the Protestant 
majority. : 

To state the problem in terms of null 
hypotheses: 


(1) Denominational affiliation will be found 
to have no significant relationship to values. 

(2) To the extent that one might discover 
significant relationships between denomina- 
tion and values, there will be no pattern in 
such relationships that corresponds to expec- 
tations based on the work of Weber, 
Durkheim, and Tracy. 

(3) To the extent that there still seems to be 
a Protestant ethic and a Catholic ethic, a 
dialectical and an analogical imagination, the 
differences will be seen as diminishing over 
time— more likely to exist among an older 
generation than among a younger generation 
and among those with lower levels .of 
educational attainment than among those with 
"higher levels of attainment. 


Or in Tracy's terms the null hypothesis 
could be stated as follows: If it is not already 
extinct, the analogical imagination is well on 
its way to extinction. 

This analysis will proceed in three phases: 
The first will consider data from many nations 
to investigate the possibility of significant 
denominational differences in values. The 
second will attempt a replication of the rirst 
analysis with a different set of data, also from 
many countries. The third will concentrate on 
one nation to see if denominational value 
differences can be accounted for by differ- 
ences in religious imagination. , 


DATA AND HYPOTHESES: PHASE ONE 


I propose to test the expectations described in 
previous paragraphs against data collected in 
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the International Study of Values (ISV).3 This 
project was inaugurated in 1981 by a 
consortium of research organizations under 
the loose direction of British Gallup and a 
committee of scholars assembled in Amster- 
dam (Abrams, Gerard, and Timms 1985; 
Calvaruso and Abbruzzese 1985; Fogerty, 
Ryan, and Lee 1984; Halman, Hennka, 
Moor, and Zanders 1987; Harding, Phillips, 


‘and Fogerty 1986; Kerkhofs and Rezohazy 


1984; Noelle-Neuman and Kocher 1987; 
Orizo 1983; Petterson 1988). 

Similar (thought often not identical) survey 
instruments were administered into national 
probability samples in various countries. The 
present analysis is limited to five English- 
speaking countries: Australia (n = 1228), Can- 
ada (n=1254), Great Britain (n=1231), 
Ireland (n=1529),4 and the United States. 
Each of these countries has sufficient num- 
bers of Protestants and Catholics to examine 
the possibility that denominational sub- 
cultures might cross national lines. On the 
other hand, because they share a common 
language? and a common heritage (in varying 
degrees of completeness), they represent a 
congeries in which there is enough unity to 
facilitate a reasonable examination of diver- 
sity. At the conclusion of this phase of our 
analysis, it will be asked whether predictions 
can be made on the basis of the findings in the 
five English-speaking nations which will 
accurately anticipate findings in the two 
Continental European nations (Holland 
n=1221; and Germany n= 1230) that have 
enough Protestants and Catholics to make 
comparisons possible. 

Comparisons were made in the analysis 
between Catholics and Protestants only. 
There were not enough cases and not enough 
similarity in coding systems to make compar- 
isons among Protestant denominations. Those 


? The ISV data set on which this analysis is 

based (a subset of a study of more than 30 
countries) is available from the archivist at NORC, 
1155 East 60th, Chicago, Illinois 60637. 
- * The 32 counties: which constitute the whole 
island. Separate surveys, administered in the 26 
counties of the Republic (n — 1217) and in the 6 of 
the North (n=312), were combined for this 
analysis. 

5'Two of the five countries are formally 
bilingual and a third (USA) is informally bilingual. 
Nonetheless, English is an official language of all 
five countries. 
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with Anglican affiliation (19 percent of the 
sample) were excluded because it was impos- 
sible to sort out those with Catholic (High 
Church) and those with Protestant (Low 
Church) leanings and because in Great Britain 
“C of E” is often a residual category, albeit 
one which encompasses three-quarters of the 
population.® Similarly, Jews and those with 
other and no religious affiliation were ex- 
cluded from the analysis both because they do 
not fit the Protestant/Catholic paradigm on 


. which this project is based and because these 


groups are not large enough to be suitable for 
the research strategy tbat is being used. The 
sample sizes for those included in the final 
sample are: 


Catholic Protestants 
Australia 312 216 - 
Canada 543 387 
Germany (F.R.G.) 522 640 
Great Britain '143 179 
Ireland 1231 138 
Holland 387 312 
United States 696 1304 
Total 3834 3186 


The questionnaire items were designed to 
measure moral, social, political, familial, and 
religious values. Factor analyses (Principal 
Component with varimax rotation) were 
performed on value clusters which either were 
asked as part of the same question or seemed 
to have some logical coherence. Thirty- seven 
variables emerged Hom this pane of data 
preparation. 


Equality versus Freedom Factor 


This variable is composed of responses to two 
questions: The first question asks about a 
choice between freedom and equality." The 
second question asks about changing society. 


$ A separate analysis was performed on each of 
the variables in this paper in which Anglicans were 
included with Protestants in an "Other Christian" 
category. There was virtually no variation in the 
findings from those reported here, with one 
exception. In Great Britain the Anglican responses 
to the family network variables discussed in Phase 
Two of this paper were similar to the Catholic 
response. Thus it would appear as a tentative 
generalization that the Anglican imagination is like 
the Protestant imagination save in family network 
formation in Great Britain. 

7 Wording of questions is available from the 
£uthor upon request. 
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I -expected that Catholics will emphasize 


„equality more than will Protestants and will 


be more likely to endorse change. 


Fairness Factor 


Three variables which measure "fairness" in 
the work world were found to constitute a 
single factor measuring attitudes towards pay 
scales, control of business firms, and the right 
of workers to understand before obeying. In 
line with their egalitarianism and their 
convictions about decentralization, Catholics 
can be expected to score higher on this factor 
than Protestants. 


Social Change Factors 


Three factors measured the respondent's 
opinion about desirable social changes. 


. Changel: Loadings on this factor stressed 
changes which would improve respect for 
authority and family life. 

Change2: Loadings on this factor stressed 
changes in emphasis on money, work, and 
lifestyle. 

Change3: Loadings on n this factor stressed 
changes in more respect for the individual and 
less reliance on technology. 


My expectations incline me to predict 
higher scores for Catholics on the first and 


second factors and for Protestants on the 
third. 


Confidence in Social Institutions Factors 


Two factors measured confidence in social - 


institutions: army, police, church, legislature, 
and the civil service on the first factor; 
unions, press, the legal system, and education , 
on the second factor. Since Catholics are less 
likely to see society as evil, ny expectations 
were that they would score higher on both 
factors. 


Work Factors 


To measure the "work ethic," respondents 
were asked how they would spend their free 
time if their work week was reduced to three 
eight-hour days with the same pay they now 
receive for five days of work. Three factors 
emerged from an analysis of the responses: 
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Workl: Emphasis on family, hobby, 
relaxation, study. 

Work2: Emphasis on community service 
and small business of one's own. 

Work3: Emphasis on avoiding boredom 
and finding another job. 


On the basis of Max Weber's theorv, I 
predict that Catholics will score high on the 
first 'scale and Protestants on the third. 
Catholics would also presumably score high 
on the second scale with its emphasis on 
community involvement. 


Tolerance Factors 


Tolerancel: Highest loadings on rejection of 
immigrants, unwed mothers, members of 
sects, students, large families, and students as 
neighbors. 

Tolerance2: Highest loadings on rejection 
of political deviants of the left and the right. 

Tolerance3: Highest loadings on rejection 
of heavy drinkers, criminals, and emotionally 
disturbed neighbors. 


Hypotheses based on the theoretical expec- 
tations described in previous paragraphs lead 
me to predict that Catholics would be more 
likely to accept deviant neighbors than would 
Protestants and hence score higher on all three 
scales. 


Morality Factors 


"Morall: Highest loadings on disapproval of 

homosexuality, prostitution, abortion, di- 
vorce, suicide, mercy killing, adultery, pre- 
marita] sex, and drug usage.‘ This variable 
‘might be called a “life ethic” factor. 


Moral2: Highest loadings on disapproval 
of lies, bribes, cheating on tax, welfare, and 
transportation fare, and buying of stolen 
property. This variable might be called a 
“personal ethic” factor. 

Moral3: Highest loadings on disapproval 
of strike breakers, political assassins, joy 
riding, fighting with police, not reporting an 
auto accident. This variable might be called a 
“social ethic” factor. 

Moral4: Disapproval of killing in self- 
defense. This variable might be called a 
pacifist factor. 


My expectations lead me to hypothesize 
that Catholics will score high on factors 1 and 
3 and Protestants will score high on factcr 2. 
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There is no reason to expect either group to 
score high on factor 4. 


Spouse and Parent Factors 


These two factors measure what commonal- 
ties respondents reported between parents and 
children and between husband and wife. In 
both factor analyses to produce these vari- 
ables, only one factor emerged: the scales are 
in effect a count of the number of commonal- 
ties a respondent reports. It is expected 
therefore that, since Catholics are more 
concerned about community relations, they 
will be more likely to score high on both 
factors. 


Marital Success Factors 


Four factors emerged in response to a 
question about the qualities required for a 
successful marriage: 


Marryl: Highest loadings on similarity of 
background, religion, politics, taste, and 
attitudes regarding children, sharing of chores, 
and adequate income. 

Marry2: Highest loadings on tolerance and 
respect. 

Marry3: Highest loadings on satisfactory 
sex and living apart from in-laws. 

Marry4: Loading on faithfulness. 


My expectations are that Catholics would 
score high on the first factor and Protestants 
on the second and the third. There is no basis 
for making a prediction on the fourth factor. 


Divorce Factors 


Two factors emerged in an analysis of 
responses to a question about when divorce 
was justified. 


Divorcel: High loadings were on drink, 
violence, unfaithfulness, absence of love, 
unsatisfactory sex, and personality conflict. 

Divorce2: High loadings were on sickness, 
dislike of each other’s relatives, and financial 
problems. 


Very hesitantly one might expect higher 
Catholic scores on the first factor and higher 
Protestant scores on the second. 


Child-Rearing Factors 


Six factors emerged from a question about the 
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traits a respondent wished that a child would 
learn at home. 


Childi: Displayed high positive loadings 
on independence, determination, leadership, 


and imagination and high negative loadings 
on manners. 


Child2: Revealed high positive loadings on _ 


politeness, work, and thrift and high negative 
loadings. on respect. 

Child3: Loaded positively on obedience 
and unselfishness. . 

Child4: Loaded positively on religion. 

Child5: Loaded positively on self-control 
and patience. 

Child6: Loaded positively on loyalty and 
duty. 

The first two factors seem to be appropriate 
for the Protestant ethic, the last four for the 
Catholic echic. 


Family Revolution Factors 


A factor analysis of six variables which 
measured attitudes towards sex and family 
life produced two scales. 


Family1: Emphasis on sexual traditional- 
ism. 

FamilyZ: Emphasis on traditional relations 
between parents and children. 


Because of their emphasis on communities 
and the desirability of sustaining them, 
Catholics can be expected to score higher than 
Protestants on both these scales. 


Religious Factors 


Scales were constructed from three sets of 
questions about religion, each scale the result 
of a factor analysis which produced a single 
factor. 


Church: Measures satisfaction with the 
church's response to human needs. 

Devotion: Measures religious behavior. 

Doctrine: Measures acceptance of tradi- 
tional relizious beliefs. 


Since formal religion is more important to a 
religion of community than to a religion of 
the individual, we may expect Catholics to 
have higher scores on both doctrine and 
devotion. However, precisely because Catho- 
lics are more likely to expect leadership from 
their church, they are also more likely to be 
disappointed, while Protestants, expecting 
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less because the church is less necessary, are 
less likely to be disappointed. 

A final two variables were added to the list 
to be analyzed. In deference to Max Weber, 
“postsecondary” education was examined 
with the expectation, based on Weber, that 
Catholics would be less likely to seek 
postsecondary education than Protestants. In 
deference to Emile Durkheim, it was hypoth- 
esized that Protestants would be more likely 
to be lonely than Catholics. 

Of the 38 variables described in previous 
paragraphs, it was possible to make predic- 
tions about 36 from the perspective of the 
“ethic/imagination” theory postulated in this 
essay. Whether the analogical imagination is 
becoming extinct depends on whether signifi- 
cant relationships are absent between denom- 
ination and some of these 36 variables. 


ANALYSIS: PHASE ONE 


For important analytic questions must be 
answered for this enterprise to be successful — 
for a decision to be made about the null 
hypotheses. 


(1) Are there significant relationships in 
the English-speaking nations, net of country, 
between denomination and values? 

(2) Do these relationships persist across 
national lines or are they the result of deviant 
behavior in one or two countries? 

(3) Are these relationships diminishing 
with time? Since there is no longitudinal data 
available, one is forced to ask if the 
differences diminish among younger respon- 
dents, conscious that if this phenomenon 
occurs it might be a life cycle and not a trend 
phenomenon. On the other hand, if there is 
little difference between younger and older 
respondents, then there is no evidence to 
sustain a hypothesis of a trend towards 
convergence of values. 

(4) Do the relationships which exist in the 
English-speaking countries enable us to make 
predictions about the two non-English- 
speaking countries in the present analysis? 


Analysis of Variance (ANOVA) with the 
Multiple Classification Analysis option (MCA) 
appeared to be the appropriate technique to 
answer these questions. Table 1 presents a 
summary of the findings that resulted from 
the application of this technique to the 36 
variables about which predictions were made. 

The seventh column in the table reports the 


492 


AMERICAN SOCIOLOGICAL REVIEW 


Table 1. Values by Denomination by Country (English Speaking) 





USA GB IRL CAN 
Equality 09^ 13 48 14 
Fairness 34 27 30 31 
Changel 06 —04 10 29 
Work2 17 —03 06 —08 
Toler2 12 37 20 36 
Toler3 10 07 15 09 
Moral? 00 03 30 07 
Moral2 16 26 44 71 
Moral3 —10 42 16 10 
Spouse 16 36 29 25 
Parent 21 20 16 12 
Maryl 13 37 04 12 
Marry3 09 10 12 21 
Child2 06 10 17 li 
Child4 05 12 19 01 
Child6 13 07 —33 30 
Familyl 06 24 -32 37 
Family2 08 17 16 31 
Church 13 18 06 24 
Devotion 06 32 39 17 
Doctrine il 34 00 19 

* Not significant. 


! For Country. 
? For Denomination. 
? For Denomination under 40 years old. 


AUS BETA! BETA? BETA? 

.13 .07 .08 
.13 ll 10 
.08 .08 .07 
01 .60 .04 .04 
04 .03 Jl .09 

-12 .19 .05 .02* 
32 24 .07 .07 
48 JH 22 .20 
34 09 09 .08 
.09 .10 .06 
At .09 Al 
Al .06 .06 
21 .06 .06 

.09 .05 .03* 
23 .05 .05 
.16 .05 .09 
.07 .08 .08 
.10 .05 .05 
AD .08 13 

19 10 .03* 
43 28 .06 ll 


* Difference in standardized points (Z score) in direction of the hypothesis. 


beta (standardized eta) between denomination 
and variable, net of country. Only those 
variables which related significantly to denom- 
ination are listed in the table. All the 
relationships are significant at a level of .01 
or greater, all but four at the level of .001 or 
greater. Of the 36 variables, then, 21 relate at 
levels of considerable statistical confidence 
with denomination—all of them in the 
predicted direction. Only in the divorce and 
confidence in social institution categories was 
there not at least one statistically significant 
relationship. There were no significant corre- 
lations in the opposite of the predicted 
direction. 

It is worth noting that neither on the work 
ethic measure nor on the postsecondary 
education variable was support found for 
specific hypotheses drawn directly from 
Weber. (Only in the Federal Republic was a 
statistically significant relationship observed 
between Protestant and postsecondary educa- 
tion: in that narrow sense, then, was the 
original Weber finding replicated. In the ISSP 
data to be analyzed subsequently, in which a 
much better educational code is available, 
there is no significant difference between 
Catholics and Protestants in educational 
achievement, even in Germany.) 


Moreover, there was also no significant 
relationship found in the sample, net of 
country, between being Protestant and being 
lonely. Durkheim’s finding is not replicated 
in most countries; only in the Federal 
Republic and Canada are Protestants signifi- 
cantly more likely to be lonely than Catholics. 
The two theories of the Founders seem to 
apply together only in Germany. 

The first five columns of the table report 
the differences in the predicted direction 
between Protestants and Catholics (in Z-score 
points) in each of the countries. A minus sign 
indicates that in a particular country and on 
the given variable, the direction of the finding 
runs in the opposite of the predicted direction. 
Minus signs appear only once in the United 
States, Canada, and Australia and twice in 
Great Britain and Ireland—in 7 of the 88 
relationships. Thus the beta in the seventh 
column indicates not only a significant 
relationship when country is held constant but 
also a relationship which exists—to a greater 
or lesser extent—in all the countries being 
studied. 

Moreover, when the analysis is repeated 
with the population under 40 (the last column 
in the table), the relationships continue to be 
Statistically significant on 18 of the 21 
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variables—religious devotion, tolerance for 
neighbors with drinking and emotional prob- 
lems, and polite and thrifty children being the 
only exceptions. Since religious devotion is 
closely related to age (independent of cohort), 
the change on the first of the three variables 
may be illusory. Moreover, the magnitude of 
the betas between denomination and value is 
not notably altered when the analysis is 
limited to those under 40. Thus the Catholic 
ethic/Analogical Imagination and, con- 
versely, the Protestant ethic/Dialectical Imag- 
ination are not withering away. 

Thus in all countries Catholics are more 
likely to emphasize “fairness” and “equality” 
while Protestants are more likely to empha- 
size “freedom” and “individualism” in the 
workplace. With the exception of Great 
Britain, Catholics are also more likely to 
advocate strengthening of authority and of the 
family. And in the United States and Ireland 
they are also more likely to say that if they 
didn’t have to work five days a week they 
would devote themselves to the community 
and to a small business of their own. 

In all five countries Catholics are more 
willing than Protestants to accept political 
extremists of either the left or the right into 
their neighborhoods. With the exception of 
Australia, they are also more likely to accept 
those with drinking and emotional problems. 

In Ireland and Australia and to some extent 
in Canada (though not in the United States 
and Britain), Catholics are more likely than 
Protestants to have strong positions on issues 
of “life ethics" (abortion, extramarital sex, 
euthanasia, suicide, etc.). In all five countries 
Protestants are more likely than Catholics to 
emphasize issues of “personal ethics" — 
lying, cheating, stealing, bribing. In all the 
countries but the United States, Catholics are 
more likely than Protestants to disapprove of 
"socially disruptive" behavior such as joy 
riding, union busting, fighting with police, 
and failure to report damage to another's car. 

In all four countries Catholics are more 
likely than Protestants to report agreement on 
crucial issues with parents and spouses and to 
emphasize the importance of shared back- 
ground (religion, politics, tastes) as condi- 
tions for successful marriage. Protestants, 
however, are more likely than Catholics to 
insist on the importance of sexual fulfiliment 
and living apart from parents and in-laws as 
conditions for a successful marriage. 

In all four countries Protestants are more 
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likely to value industry and thrift in their 
children, and Catholics more likely to value 
religious faith and a sense of loyalty and duty 
(with the exception of Ireland where Protes- 
tants rate loyalty and duty higher than do 
Catholics). These findings are not necessarily 
inconsistent with those reported by Alwin 
(1986), who found a convergence between 
Catholic and Protestant child-rearing values in 
the United States. The difference between 
Protestants and Catholics on the importance 
of thrift and industry is modest in the United 
States in Table 1. Moreover, that variable is 
one of the three in which there is no 
statistically significant relationship with de- 
nomination for those under 40 years old. 
Finally, Alwin in working with time-series 
data from the General Social Survey and can 
therefore write with much more confidence 
about converging child-rearing values. 

In all the countries Catholics are more 
likely than Protestants to emphasize tradi- 
tional family values; and in all countries but 
Ireland Protestants are more likely than 
Catholics to be tolerant of "sexual revolu- 
tion" behavior. In Ireland Protestants disap- 
prove of such behavior even more strongly 
than do Catholics— which is strong disap- 
proval indeed. 

Finally, as predicted, Catholics are more 
likely to be devout and to be doctrinally 
orthodox while Protestants are more likely 
than Catholics to be satisfied with the 
response to tbeir needs provided by their 
churches. 

In general, differences between Protestants 
and Catholics, as measured by the scales 
available to this analysis, are more modest in 
the United States than in the other four 
countries. Perhaps "ethnic differences" ac- 
count for the greater diversity in Ireland, 
Canada, and Great Britain —Northern Ireland 
versus Southern Ireland, French versus Brit- 
ish Canada, lrish immigrant versus host 
culture in Britain. However, such an explana- 
tion would not account for the considerable 
differences between Protestants and Catholics 
in Australia on the morality scales—on the 
average the largest differences on these scales 
in any of the five countries. 

Can we apply the predictions which were 
so successful in studying English-speaking 
respondents to Continental respondents? The 
data summarized in Table 2 suggest that, 
while the English-speaking findings are useful 
in predicting differences between Catholics 
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Table 2. Values by Denomination by Country (English 











Speaking and Continental) 
(Z Scores)! 
Germany 
English (F.R.G.) Holland 

Equality 26 01 05 
Fairness 33 -24 08 
Changel 15 —18 02 
Work2 60 01 02 
Toler2 21 —03 03 
Toler3 24 -il 06 
Morall 16 29 —07 
Moral 41 07 29 
Moral3* 

Spouse 26 02 -02 
Parent 24 03 05 
Marry! 09 03 20 
Marry3 29 —19 02 
Child2 02 16 01 
Child4* 

Child6 12 03 22 
Familyl 13 29 -1 
Family2 12 29 -11 
Church 35 —37 50 
Devotion 14 —44 20 
Doctrine 20 —57 66 

* N.S. 


! Difference between Protestant and Catholic in 
direction of hypothesis. 


and Protestants in Germany and Holland, the 
theoretical orientation of this paper is much 
less successful in the Continental countries. 
In 8 of the 21 variables the differences 
between Catholics and Protestants in the 
Federal Republic run in the opposite of the 
hypothesized direction. In several other vari- 
ables the differences between Protestants and 
Catholics are slight. However, on issues of 
family values, "life ethics," and "personal 
ethic" moral questions, the differences be- 
tween Protestants and Catholics are larger 
than in the English-speaking world. 

The first four variables in the table might 
be described as measures of the "Catholic 
ethic" as the opposite of the Protestant ethic 
in Max Weber's classic sense of the term. 
Interestingly enough, therefore, the Protestant 
ethic is less likely to be found in Germany 
than in the English-speaking countries— even 
if German Catholics seem less likely to seek 
postsecondary education. 

In Holland there are only four variables in 
which the findings go in the opposite direction 
from the predictions; on two of these variables 
(Family Values) the differences between Dutch 
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Protestants and Catholics are exactly the op- 
posite of the differences among their West Ger- 
man neighbors. Similarly the Dutch respon- 
dents provide strong support of our religious 
predictions — again just the opposite of the Ger- 
mans, among whom Catholics are more satis- 
fied with their church than are Protestants but 
less devout and doctrinally orthodox than Prot- 
estants. Finally, while the Dutch sample pro- 
vides confirmation for some predictions about 
moral values and child-rearing values, the con- 
firmation occurs precisely on scales where there 
is either no or little support in the German 
sample (Mora, Marryi, Child6) and not on 
the scales where confirmation can be found in 
the German sample (Morall and Child2). 

The English-speaking findings then provide 
a useful road map for a tour of the Continent, 
but part of the map applies in Holland and a 
different part applies in the Federal Republic. 

Presumably such differences among the 
differences can be accounted for by variation 
in the history, geography, and political and 
social structures of the two countries. Attempt- 
ing to explain them in detail is beyond the 
purposes of this paper and probably beyond 
the capacity of the ISV data set to provide the 
raw material for explanations. 


DATA AND HYPOTHESES: PHASE TWO 


The previous analysis involved data from six 
English-speaking countries— Great Britain, 
Ireland, Canada, the United States, Australia, 
and New Zealand—and two Continental 
countries — West Germany and the Nether- 
lands. By turning to the International Social 
Survey Project data for 1985 and 1986, one 
may attempt to replicate the previous findings 
with different samples and completely differ- 
ent questions. The ISSP surveyed four of 
those countries (Great Britain n— 1530; the 
United States n — 677; Australia n — 1528; and 
Germany n= 1048) and two others (Austria 
n::987 and Italy n= 1580) in 1985. More- 
over, the next phase of the ISSP in 1986 
involved seven countries (Great Britain 
n=1416; United States n= 1470; Australia 
n=1250; Hungary n=1747; Germany 
n=2808; Austria n=1027; and Italy 
n= 1027). The 1985 ISSP study focused on 
attitudes towards government, and the 1986 
study attempted to measure social support 
networks. The ISSP data also provided an. 
occasion to test the possibility that education 
can diminish the differences between the 
Protestant and Catholic imagination. 


PROTESTANT AND CATHOLIC 
It should be noted that there is no 


significant correlation, net of country, be- 


tween Catholics and Protestants either in 
education or in self-perceived social class; 
thus whatever differences may emerge cannot 
be accounted for by educational or social 
class variables. 

Twenty-two variables which seemed appro- 
priate for study were created by factor 
analysis. 

(1) A factor based on measures of respect 
for laws and contractual responsibilities. It 
seemed reasonable to expect that Catholics 
would score lower than Protestants on this 
variable. (OBEY) 

(2) Two factors were constructed from 
variables concerning social protest. The first 
with high loadings on the first four items 
represented protests which did not involve 
violence, the second factor with high loads on 
the fifth and sixth protests represented 
protests which involved violence. Despite 
their respect for society—and possibly be- 
cause of it—Catholics, I hypothesized, would 
be more likely to approve of resistance to 
oppressive government. (In an old and 
long-forgotten debate of the Reformation 
days, Catholic theologians argued for the 
possible legitimacy of tyrannicide, Protestants 
against it.) In the ISV data, Catholic 
willingness to resist oppression was indeed 
stronger than Protestant. (PROT) 

(3) Two factors were computed from 
responses to questions about civil liberties: 
The first seemed to measure respect for 
freedom of speech, the latter freedom of 
publication. The Catholic church has often 
opposed both sets of liberties and Protestant- 
ism has always emphasized freedom of 
conscience. On the other hand, the Catholic 
imagination is more open to diversity, and the 
Protestant imagination more likely to respect 
law. Because imagination is more powerful 
than institution, the tentative hypothesis is 
that Catholics will be more likely to advocate 
civil liberties, just as they will be more likely 
to approve revolutionary action. (LIB) Two 
other factors measured responses to feminism: 
the first emphasized the fact of injustice 
against women, the second the state's obliga- 
tion to provide extra support for women. 
Other research (Greeley 1988, 1989) indicates 
that Catholics are more likely to imagine God 
as woman and to support feminist positions. 
(FEM1) 

(4) However, Protestants would, I suspect, 
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be less likely than Catholics to approve of 
tight police supervision and restraint of 
criminals and suspects. The Catholic might 
support the revolutionary against an oppres- 
sive government but she or he would not 
support the criminal who threatens the 
networks of ordinary life which for the 
Catholic are sacramental. Thus I expect 
higher Protestant scores on the two factors 
which indicate opposition to police power 
against both suspects and criminals. The first 
factor reflects opposition to trailing ‘and 
intercepting the mail and the phone calls of 
suspects and criminals, the second opposition 
to holding them overnight. (CRIME) 

(5) Two factors:emerged from an analysis 
of responses to questions about political 
efficacy. The first had high loadings on 
responses concerning despair about dealing 
with government. The second emphasizes 
participation in government. One would 
expect higher scores from Protestants than 
from Catholics on feelings of oppression and 
lower scores on participation in government. 
(POLIT) 

(6) Three variable constitute a single 
variable of support for aid to students. 
Catholics, more inclined to support welfare 
and government intervention in support of 
social goals, might be expected to score 
higher on this factor than Protestants. (AID) 

(7) Three factors were extracted from a 
series of questions about government interven- 
tion in the economy. The: first measures 
attitudes towards job and hours, the second 
attitudes towards wages and prices, and the 
third opposition to government regulation. 
Catholics might reasonably be expected to 
Score higher than Protestants on the first two 
factors, Protestants more likely to score 
higher than Catholics on the third. (GOVT) 

(8) Two factors record reactions to govern- 
ment spending programs: the first emphasizes 
spending on social problems, the second 
spending on police and the military. Catholics 
are more likely to approve of government 
intervention than are Protestants, so should 
score higher on both these factors. (SPEND): 
Another factor measures attitudes towards 
taxation (TAX) on which Catholics, because 
of their belief in government intervention and 
the fundamental goodness of society, would 
be expected to have higher scores. 

(9) A single factor, based on the three 
variables concerning the power of business, 
labor, and government, measures a respon- 
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dent’s feeling that there is too much power 
concentrated in social institutions—a classic 
measure, it would seem, of the view that 
society is sinful and God-forsaken. Hence 
Protestants might be expected to score higher 
on this measure than Catholics. (POWER) 

(10) On the other hand, a single factor 
which measures support for government 
ownership of industry would also appear to be 
a measure of the classic Catholic propensity 
to support government intervention in eco- 
nomic life. Hence Catholics will be more 
likely to have high scores on this scale than 
Protestants. 

(11) Yet another single factor measures 
attitudes towards Government responsibility 
in economic life. Again one would expect 
Catholics, with their greater confidence in 
society, to be more likely to favor these 
government interventions than Protestants. 

(12) A final factor was designed especially 
to measure the Catholic egalitarianism re- 
ported in the previous analysis. Hence 
Catholics can be expected to score higher on 
this scale. (EQUALITY) 

The “communitarian” emphasis of Cathol- 
icism and the “individualist” emphasis of 
Protestantism, it was suggested, are the 
. behavioral results of these different pictures 
of social reality. The Catholic propensity will 
be to see virtue as integration into social 
realities while the Protestant propensity will 
be to see virtue in the struggle for freedom 
from the constraints of social realities. 
Salvation, both religious and personal, for the 
Dialectical Imagination is an individual ef- 
fort. For the Analogical Imagination it is a 
social effort. 

(13) Therefore, in particular with the data 
of ISSP 86 in mind, Catholics will form more 
intense family networks. They will live closer 
to and visit more often relatives than will 
Protestants. (VISIT, DISTANCE). 

Thus the prediction is that net of the 
influence of the various countries Catholics 
will be more likely to live close to and visit 


their mother, father, sister, brother, daughter, 


son, and other relatives than will Other 
Christians. 


ANALYSIS: PHASE TWO 


Table 3 presents results of an analysis of 
attitudes towards government intervention 
and the family nétwork patterns of Protestant 
and Catholics. "Of the 22 attitudes towards 
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Table 3. Religion and Attitudes towards Government 
and Family Networks by Education 








Beta net of country 








More than 
. All 11 EDYEARS 
Attitudes towards Government 
Obey! C6 .05 
Prot2 06 .06 
Lib2 C5 .00* 
Fem! C4 .05 
Crime2 5 .06 
Govtl C4 .10 
Poll C6 .04 
Tax cs 04 
Power G7 .04 
Ownership 06 .06 
Welfare 07 .09 
Equality 05 .06 
Average 06 .06 
Visits 
Mother 04 ‘21 
Father .04 24 
Sister .04 .04 
Brother .06 .01 
Son .07 .04- 
Relative .04 AL 
Distance 
Mother 04 18 
Father .03* 18 
Sister 05 19 
Brother .07 48 . 
Daughter .05 .05 
Son .05 .03 
Relative .04 04 


* Not statistically significant. 


government predictions, 12 were supported at 
levels of statistical significance (all but 2 with 
a probability of error less than .001). None of 
the predictions was refuted by significant 
evidence in the opposite direction. Only in the 
categories of SPEND and AID was there no 
evidence to support hypotheses. 

Thus Protestants are more likely than 
Catholics to support obedience to laws; 
Catholics are more likely than Protestants to 
approve violent protests and to support 
freedom of publication and to recognize 
inequal treatment of women; Protestants are 
more likely than Catholics to resist temporary 
arrest for suspects and criminals, to feel 
oppressed by the power structures of society 
and by the lack of power in dealing with 
government. They are also more likely than 
Catholics to think taxes are too high. 
Catholics are more likely than Protestants to 
support government intervention in the econ- 
omy, government ownership of industry, and 
equalization of income. 


PROTESTANT AND CATHOLIC 


The Catholic attitude towards the role of 


government may seem paradoxical—on the 
one hand supportive of more intervention and 
on the other more likely to approve of violent 
resistance. Perhaps the reason is that the 
Catholic imagination inclines people to expect 
the government to be good, modestly and 
imperfectly good perhaps, but still good. 
When the flaws in government become 
intolerable, those who believe government 
can be a positive good are more likely to take 
to the streets than those who take it for 
granted that governmental power is always 
evil. i 

To repeat a theme of the previous analysis, 
the “liberal/conservative” paradigm cannot 
cope with the Catholic propensity to support 
“liberal” policies of government intervention 
and egalitarianism and conservative policies 
on response to crime and criminals. However, 
a paradigm based on a theory of different 
“imaginations” (or “ethics”) can easily 
account for and find consistency in the 
patterns reported in this paper: Catholics tend 
to picture society as supportive and not 
oppressive; Protestants tend to picture society 
as oppressive and not supportive. 

The second and third sections of Table 3 
present the results of an analysis of the family 
network variables in the ISSP 86 study. Of 
the 14 items—7 for visits and 7 for 
distance —analyzed in ANOVA equations, 13 
correlate at levels of statistical significance 
with religion when the effect of different 
national cultures is removed. Catholics are 
more likely to visit parents, children, sib- 
lings, and other relatives than are Other 
Christians. They are also more likely to live 
close to everyone (but their fathers) than-are 
Other Christians. Net of national differences, 
then, Catholics do form more intense family 
networks than do Other Christians, as the 
theory suggested they would. Moreover, the 
propensity to visit is additional to and 
independent of the propensity to live near 


relatives (except for brother and other rela- 


tives). . 

The second column in Table 3 presents the 
beta correlation in ANOVA analysis for 
country and religion for those respondents 
who have had 12 or more years of education 
to test the possibility that differences between 
Protestants and Catholics may diminish among 
those who have higher education. Differences 
in scores on the LIB disappear but there is an 
increase in differences on political efficacy 
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and welfare. The average correlation is the 
same for those who have had .advanced 
education as it is for the rest of the sample. 
Among those who have advanced education, 
Catholics are even more likely than Protes- 
tants to visit mother, father, and relative, and 
live close to mother, father, sister, and 


 brothet in comparison with the general 


population. Perbaps the increased educational 
attainment provides the affluence required for 
such choices. . 

Thus 26 of the 36 hypotheses fashioned for 
testing against the ISSP data are sustained. 
There is no evidence to support any of the 
contrary hypotheses. Analysis if ISSP data, . 
then, neatly replicates the analysis of the ISV 
data. In two separate samples, taken by 
different organizations and using different 
sets of questions, evidence is found to reject 
the null hypothesis that denominational differ- 
ences do not affect attitudes and behavior. 
Moreover, since the theory of the analogical 
imagination enables one to make 47 correct 
predictions out of 72 possible about the 
direction of the differences (and in none of 
the others is the difference significant in the 
opposite direction), one must also reject the 
hypothesis that the differences between Prot- 
estants and Catholics cannot be accounted for 
by different ways of imagining the world and 
human society and communities. 


DATA AND HYPOTHESES: 
PHASE THREE 


There are no variables in the ISV which 
enable us to ask whether the differences 
between Protestants and Catholics recorded in 
the first two tables of this paper can be linked 
to differences between analogical and dialec- 
tical imaginations. However, the special 
religion module of the.1988 General Social 
Survey does provide items which measuré 
both values and the religious imagination. 

The null hypothesis for this second phase 
of analysis is that the correlations which exist 
between denomination and values in the 
United States can not be attributed in whole 
or in part to differences in religious imagina- 
tions between members of the two denomina- 
tions. : 

The following variables seemed appropri- 
ate for analysis: 


Feminism. Factor analysis of five items | 
measuring attitudes towards equality for 
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women were found to constitute one scale. 
Because of Catholic acceptance of the good- 
‘ness to be found in the complexity of human 
relationships, it is expected that Catholics will 
be more tolerant of multiple roles for women 
and hence score higher on this scale than 
Protestants. 

Welfare. A single factor was discovered 
with loadings on four welfare items. It was 
expected that because of their communitarian 
. orientation Catholics would score higher on 
this scale than Protestants. 

Capital punishment. A single item mea- 
sures respondents attitude towards capital 
punishment. It is expected that Catholics, 
with greater tolerance for the complexity of 
the human condition, would be more likely to 
oppose capital punishment than Protestants. 

Presence of God. Because they are more 
likely to see God as an intimate other with 
whom one can disagree instead of a distant 
and absent God and because they believe that 
God, present in the world, is responsible for 
the world, Catholics are likely to score higher 
. on a factor that measures anger towards God 
and doubts about Her. 

Devotion. A factor emerged which empha- 
sized being born again, reading the Bible, 
inviting someone else to accept Jesus as 
savior, and saying grace. Such devotions 
seemed clearly Protestant in their orientation. 

Moral rigidity. Four items which stressed 
rigid moral decision making constituted a 
single factor which seem to reflect a dialecti- 
cal approach to reality. Hence Protestants 
were expected to score higher on it than 
Catholics. 

Doubts. Four questions were asked about 
events which created doubt about religious 
faith. These also clustered on one factor. 
Since Catholics, I have hypothesized, are 
more likely to doubt, it is expected that they 
will score higher than Protestants. 

Faith. Four items were asked about phe- 
nomena which strengthened religious faith; 
they too clustered on a single scale. Since 
Catholics see the world and its events as 
revelatory, it was expected that they would 

score higher than Protestants on this scale. 
. Life decisions. Two factors emerged in an 
analysis of four items about influences on life 
decision making: one with high loadings on 
the Bible and church leaders, the second with 
loadings on self and others. It was expected 
that Protestants, with their emphasis on 
individual decisions and the Bible as road 
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map, would score higher than Catholics on 
both these factors. 

Personal goodness. Four questions were 
also asked about what constitutes the good 
person. All four clustered on the same factor. 
With the emphasis on following one's own 
conscience in this factor, it was expected that 
it would also produce higher scores for 
Protestants. 

Altitudes towards science. Two factors 
were developed from four questions about 
Science, one emphasizing the positive aspects 
of science, the other the negative. Since 
Catholics are more likely to see the world as 
sacramental, they would be more likely to 
appreciate the scientific study of the world 
and score higher than Protestants on the 
positive scale and lower on the negative scale. 


ANALYSIS: PHASE THREE 


There are 13 variables to be tested for 
denominational difference. Once a difference 
is discovered, it must then be asked whether 
the religious imagination items developed for 
the General Social Survey (Greeley 1988, 
1989) can account for some of the differ- 
ences. Catholics are more likely than Protes- 
tants to see God as an intimate other—lover, 
friend, spouse, and mother—and the world 
and human nature as basically good, as the 
theory of the analogical imagination predicts 
they would. Does this difference account for 
at least some of the denominational value 
differences which exist in the General Social 
Survey? 

The first column in Table 4 reports the 


Table 4. Values, Denomination, and the Religious 











Imagination 

r! beta? 
Feminism 10 .05* 
. Welfare .10 .06* 
Capital Punishment 1 .06* 
Presence of God .12 .08 
Devotion 31 .26 
Moral Rigidity 15 10 
Good Person —.10 —.05* 
Life Decisionsl 21 .16 
Sciencel 1 .08 .03* 
Doubts Al f .09 





* Not statistically significant. 
! Correlation with denomination in the direction of the 
hypothesis. 
Correlation with denomination net of religious 
imagination. 


PROTESTANT AND CATHOLIC 


Significant differences between Protestants 
and Catholics in the GSS data. Only Science2, 


Faith, and Life Decision2 did not correlate . 


significantly with denomination. The Good 
Person scale relates in the opposite of the 
predicted direction: Catholics are more likely 
than Protestants to emphasize all the charac- 
teristics of the good person. 

The second column in the table presents the 
standardized correlations between denomina- 
tion and the value scales when the religious 
imagination is taken into account. All the 
correlations decline, half of them into statisti- 
cal insignificance. 

Thus Catholics are more likely than 
Protestants to support feminism and welfare. 
Protestants are more likely than Catholics to 
engage in certain devotional practices (such 
as reading the Bible and being born again), to 
support capital punishment, and to disapprove 
of science. Catholics have lower scores than 
Protestants on moral rigidity and are less 
likely than Protestants to rely on the Bible and 
an authority for moral decisions. Protestants 
are less likely to have doubts and to be angry 
at God than are Catholics. All these differ- 
ences are reduced, some to the point of 
insignificance, once one takes into account 
the differences in religious imagination be- 
tween the two denominations. 

The IVS and ISSP data show denomina- 
tional value differences in nine countries. The 
GSS data show that somewhat parallel 
differences in one country can be accounted 
for in part by differences between the 
analogical and dialectical imaginations. The 
final null hypothesis must therefore be 
rejected. 


DISCUSSION 


Granted that all but one of the differences 
reported in Tables 1, 3, and 4 are in the 
hypothesized directions and that they are 
Statistically significant, how important are 
they? Only seven of the standardized etas for 
denomination are higher than .1, only one 
higher than .2. Are not such differences 
trivial in the general order of human diver- 
sity? Granted that they exist, what difference 
do they make? 

One notes—as sociologists always do when 
such an objection is raised—that most social 
variables, even those that everyone agrees are 
important, explain rather little of the variance 
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in human behavior precisely because humans 
are so variegated and complex. 

One also adds that there ought not to have 
been any reasonable expectation of enormous 
differences between the two denominations. 
As was observed earlier, the two different 
“imaginations” represent tendencies and not 
zero-sum views of reality. Moreover, both 
Protestants and Catholics are part of the 
Western cultural heritage, the North Atlantic 
tradition (which can for present purposes be 
extended to include Australia), and the 
“English-Speaking World.” One could not 
expect enormous differences between denom- 
inational communities which in all but one of 
the countries continue to coexist more or less 
amicably with one another (and do so even in 
the Republic of Ireland, whatever may be 
happening in the Six Counties). 

On a list of predictor variables, then, where 
might one rank denomination? How important 
is it relative to other variables? 

In Table 1, with six exceptions, the beta 
between variable and country is larger than 
that between variable and denomination. The 
average beta for country is .16, the average 
for denomination is .09. However, the 
average beta for the relationship between 
gender and the 20 variables in Table 1 is 
.045. How important then is denomination in 
predicting values (of the sort measured by 
ISV)? Not as important on the average as 
nation but more important on the average than 
gender. 

One leaves denomination out of design and 
analysis, then, with less reason than one 
would exclude gender. 

However, such a mechanical approach to 
the cross-national importance of denomina- 
tion is artificial. In the real world one would 
want to know who the people were, where 
they were from, and what the relevant value 
was. If one was, for example, dealing with 
issues of tolerance of deviants in a neighbor- 
hood or of moral decision making or of the 
sexual aspects of family revolution or of the 
"fair" use of authority, one would want to be 
very sensitive to nuances of denominational 
difference because they would likely be as 
important as the differences in national 
background. 

Thus Table 5, which replaces ANOVA 
with simple cross tabulations on specific 
items, demonstrates that the differences 
between Protestants and Catholics on certain 
matters can be important to policymakers— 
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Table 5. Differences between Protestants and Catholics 
(English Speaking) 
Catholics Protestants 
(96) (99) 
Same pay 34 22 
Owners run 53 61 
Obey without 
understanding 53 66 
Reject 
Drinkers 49 59 
Disturbed 29 35 
Right extremists 23 29 
Left extremists 29 36 
Disapprove Strongly! 
Bribe 80 88 
Dole 75 84 
Bus fare 67 78 
Euthanasia 51 41 





! Ten and nine on a ten-point scale of approve/ 
disapprove. 


and possibly dangerous for the policymakers 
who have persuaded themselves that in a 
modernized, urbanized, and industrialized 
society, differences in denomination ethic/ 
imagination no longer matter. 

Thus, in the work situation, Catholics are 
13 percentage points less likely to accept 
orders they do not understand. They are 12 
percentage points more likely to demand 
equal pay for equal work, regardless of the 
quality of the latter. They are 8 percentage 
points less likely to take for granted the right 
of owners to manage a firm. They are 7 
percentage points more likely to choose 
equality over freedom. A factory manager or 
trade union leader with equal proportions 
Catholic and Protestant in his constituency 
would be ill-advised to ignore such differ- 
ences. 

In neighborhood situations, Catholics are 
more likely to be willing to tolerate those with 
drinking problems and emotional disorders 
and political extremists of either hue, than are 
Protestants. A neighborhood leader, opting 
for increased diversity, should be aware of 
where he is likely to find allies and where 
adversaries. 

Moreover, a political reformer should also 
understand that while the majority of Catho- 
lics are likely to oppose bribery and cheating 
on welfare and on public transportation, their 
opposition is not as strong as is the opposition 
of Protestants. Reform historically has Bpen 2 
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word with which Protestants are more com- 


fortable than Catholics. 
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Finally, a college edministrator would want 
to take into account that more than half of the 
Catholics with advanced education in the 
ISSP sample report that they had visited their 
mother and/or father at least several times a 
week as opposed to a third of the Protestants. 
An increase of freed time for family visits 
might improve the morale of Catholic stu- 
dents more than tbe morale of Protestant 
students. 

These four sets of differences, easily 
predictable from the theoretical orientation of 
this paper, demonstrate how useless it is to try 
to pin the labels “liberal” or “conservative” 
on either denominational heritage. On issues 
of sexual morality and family life, Catholics 
are clearly more "conservative" (because of 
their imaginative predispositions, I have 
argued). But on issues of social justice and 
neighborhood community, they are just as 
clearly more "liberal." And on matters of 
"corruption" whether they are more "toler- 
ant" or more "corrupt" probably depends on 
which denominational heritage provides the 
Observer's perspective. 


SUMMARY AND CONCLUSION 


In summary, the four null hypotheses can be 
safely rejected. There is no evidence in the 
data analyzed in this paper that the analogical 
imagination is either extinct or becoming 
extinct. The Protestant ethic and the Catholic 
ethic are alive and well, though perhaps not in 
their original form in the land where the idea 
originated. Both ethics seem to result from 
preconscious narrative symbols. 

Modern empirical social science emerged 
just at the time when change from an old 
order was everywhere (in Europe) to be 
observed, a change already far along accord- 
ing to the men who founded our tradition. 
The "mechanical" scciety was being replaced 
by the “organic” society, "gessellschaft" was 
replacing "gemeinschaft," "association" was 
replacing "community." The Protestant ethic— 
broadly defined as the "individualist" ethic— 
was replacing the Catholic ' ethic —broadly 
defined as the “communal ethic." Human- 
kind (European and North Atlantic) was on 
pilgrimage from one end of what Talcott 
Parsons would later call his pattern variables 
to the other end. 

Since the end of the Second World War, 
scholars have refined and nuanced that vision 
and concluded that, while change has oc- 
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curred and continued to occur, it is far more 
complex, uneven, multidimensional, and multi- 
directional than appeared a hundred years 
ago. 

Despite contemporary sophistication about 
the evolutionary changes which the Founders 
noted, there still is often a moment of surprise 
when one finds that in yet another matter, the 
old, the archaic, the communal manages to 
survive, especially when that matter is 
religious. For example, virtually all the 
individual reports from the ISV assume as a 
given, without time-series evidence, the 
disappearance of old value structures; the 
possibility that denominational differences 
across national lines can persist despite the 
forces of modernization seems too absurd to 
consider. 

Ought not the Catholic ethic, clearly a relic 
of earlier, more rural, less industrialized 
times, gradually erode in the face of the 
irresistible forces of individualism and social 
and geographic mobility? How does one 
explain the residual strength of this suppos- 
edly outmoded worldview? How does a value 
orientation rooted in an agricultural and 
feudal experience manage to survive in an 
utterly different environment? Did not even 
Weber himself predict that industrialism 
would eventually eliminate differences be- 
tween Protestants and Catholics? 

The answer seems to be that worldviews 
are not neatly written propositional para- 
graphs that can be. explicated and critiqued in 
rational and discursive fashion. Rather they 
are, in their origins and their raw, primal 
power, tenacious and durable narrative sym- 
bols that take possession of the imagination 
early in the socialization process and provide 
patterns of meaning and response that shape 
the rest of life. The theory of different 
imaginations of society, encoded in different 
stories of God’s relationship with the world, 
on which this paper is based can explain the 
durability of separate worldviews among 
Protestants and Catholics. The differences 
. will disappear only when Protestants become 
more likely to picture God as present in the 
world and/or Catholics more likely to picture 
Her as absent from the world. 

Surely there are convergences, as Alwin 
has established, among American Protestants 
and Catholics in their child-rearing orienta- 
tions. But how central to the religious 
imagination are expectations about child 
rearing? Or to put the question at a more 
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general level, what is essential to an imagina- 
tion tradition and what is peripheral? What 
changes more or less easily and what resists 
change, perhaps implacably? What values 
flow almost inevitably from one’s image of a 
God present in society or absent from society? 

At the present time there are no data to 
answer that question and perhaps no theory to 
provide the questions to ask of a data set. The 
items in the ISV study were there to be used, 
but there is no reason to believe that of 
themselves they can do any more than force 
us to reject the null hypotheses of the present 
paper. They do not necessarily reflect what is 
essential to the different religious imagina- 
tions, save perhaps in remote and indirect 
ways. 

The GSS findings, limited to one country, 
give a hint that images of God and world can 
indeed account for some of the value 
differences between Protestant and Catholic, 
but such an exploration must remain tentative 
until the technique for measuring religious 
imagery becomes more sophisticated. 

The Analogical Imagination may be alive 
and well, but there is much work to be done 
before we begin to understand it and the 
Dialectical Imagination, its counterpart and 
complement. 
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Do intraclass struggles shape the political terrain on which ensuing struggles, 
within and between classes, are fought and resolved? Out attempt to answer this 


question focuses on the rivalry among the factions and parties involved in . 


organizing American industrial workers from the late 1930s on. We assess how the 
political practices of the Communists and their rivals determined which political 
camp won power in the new CIO unions. A logit model shows that two ensembles 
of political practices “loaded the historical dice” in favor of the Communists. The 
chances that Communists would win union leadership were far higher: first, if the 
union had seceded from the AFL and joined the CIO from below, in an insurgent 
workers’ movement, rather than from above, in a revolt of its top officers; and 
second, if the union had been organized independently, rather than by a CIO 
“organizing committee.” Two other political practices indirectly favored the 
Communists; earlier Red union organizing in the industry (although its effects were 


contradictory); and forming the union as an amalgamated rather than as a unitary - 


organization. 


Men and women make their own history, 
although not just as they please nor under 


circumstances they choose, but under “circum-. 


stances directly found, given and transmitted 
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from the past” (Marx [1852] 1963, p. 15). But 
these circumstances, Marx forgot to add, 
have also been shaped both by the attempts of 
previous generations to make history and by 
the effects of their own earlier political 
struggles. We shall try to show that this is.so 
through a multivariate analysis of the indepen- 
dent effects of political struggles in a specific, 
theoretically relevant, historical case, namely, 
the organization of American industrial work- 
ers from the late 1930s on. 

Thus, our analysis addresses the theoretical 
problem of the "relative autonomy of poli- 
tics." This refers, in our formulation, not 
merely to the possible autonomy (or origina- 
tive potential) of the "state," but rather, 
comprehensively, to the possible independent 
effects of political phenomena in the shaping 
and transformation of basic social relations. 
Of course, this is within the objective limits 
imposed and the objective alternatives made 
possible by the existing circumstances. Our 
originating question, in particular, is whether 
political struggles have relatively independent 
effects on class and intraclass relations. This 
covers such substantive questions as the 
effects of political struggles on the develop- 
ment of capitalism, on the emergence of the 
democratic or authoritarian state, on the labor 
process, and on class formation (see, e.g., 
Moore 1968; Aminzade 1981; Zeitlin 1984; 
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Skocpol 1980; Stark 1980; Griffin, Wallace, 
and Rubin 1986). 


THE INTRACLASS STRUGGLE. WITHIN 
THE CLASS STRUGGLE 


' Every class struggle is simultaneously an 


intraclass struggle. For no class, as Max 
Weber remarks (1968, p. 930), is infallible 
about its interests. What are the real interests, 
immediate or historical, of a class? What is 
the class struggle really about? Indeed, is it a 
‘class struggle? These questions are always at 
‘issue among the contending factions and 
parties of a class as they struggle to define its 
interests and what has to be done to protect 
and advance them. The process of "self- 
organization” of a class, then, involves 
concrete political struggles within it—and 
within its organizations—over what its class 
interests are and who should organize and 
lead it. Classes do not simply organize 
themselves. They become organized in a 
particular way and by particular leaders, 
factions, or parties, with particular theories, 
social objectives, and political/organizational 
strategies. 

Political struggles, therefore, are never 
mere effects of structures. Nor are their 
consequences pregiven by the functions of 
structures.! Rather, the critical issue is, given 
certain circumstances, or. objective condi- 
tions, how do men and women make their 
own history? The existing historical situation 
and social relations (economic, political, and 
cultural) constitute a “realm of possibilities” 
or a “structure of choices” that is given to 
men.and women, individually or collectively, 
at a particular historical moment (Przeworski 
1977, p. 377). These relations impose limits 
on the political options and, consequently, on 
the political struggles that are possible. But 
they also simultaneously make possible these 


‘As in the “structuralist” theory of Nicos 
Poulantzas (1973, esp. pp. 85-98), where a 
political “practice” is the mere expression of its 
“specific place and function . . . which are its 
objective" (1973, p. 42). So, the consequences of a 
"practice" are pregiven and known a priori, even 
though the theory rhetorically affirms the "relative 
autonomy of the political." So, "history . . . 
becomes a history that proceeds," as Przeworski 
(1977, p. 368) correctly remarks, "from relations 
to effects without any human agency." 
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options and the struggles resulting from 
choosing one or another of them. 

Thus, political struggles, and the political 
relations they create, are no less real than 
other so-called material conditions or struc- 
tures. Whatever limits the underlying social 
relations impose, political struggles have their 
own "relatively autonomous" consequences 
for the circumstances under which subsequent 
political struggles are waged. These conse- 
quences, in principle, can be revealed empir- 
ically. So, to the extent that intraclass 
struggles result in new political relations 
among the factions and parties of a class, they 
also determine who organizes and leads it. In 
turn, both the form of organization and type 
of leadership—its specific capacities and 
political consciousness (or ideological com- 
mitments) — affect the way that class engages 
in the struggle against other, dominant or 
subordinate, classes.? These propositions form 
the guiding thread of our study. Thus, our 
central theoretical question is: how and to 
what extent do concrete intraclass political 
struggles independently shape the political 
terrain on which, as time goes by, subsequent 
struggles, within and between classes, are 
now fought and resolved? 


ANALYTICAL OBJECTIVES 


This is an empirical analysis of the relatively 
independent political effects of concrete 
political struggles within the emergent orga- 
nized segment of the industrial working class 
in the United States at a decisive historical 
moment (i.e., the Great Depression cum New 
Deal). We hope to reveal how and to what 
extent the political strategy and practices of 
class organization of the rival factions and 


? We do not assume that intraclass struggles in a 
subordinate class and efforts to organize it are 
equivalent to, or have the same sorts of determi- 
nants and effects as, those within a dominant class. 
For, unlike workers, capitalists do not have to be 
organized to act as a class. Capitalists as 
individuals, outside of any "peak association," 
employers' organization, or combination (cartel) 
have the power that capital-ownership gives them 
over individual workers and the "right," in 
exchange for wages, to dispose of their labor time. 
But workers can organize themselves as a class 
only if they are already organized by capitalists, 
that is, as employees of capitalist enterprises (cf. 
Offe and Wiesenthal 1980, pp. 70-72). ` 


WHO GETS THE BIRD? 


parties of the Congress of Industrial Organi- 
zations (CIO) determined who won the 
leadership of its emergent unions. Our 
analysis focuses on the Communists—who at 
that time constituted “the most important 
minor party in the union world” (Mills 1948, 
p. 23)—and their political rivals. 

If unionization is an indispensable organi- 
zational means for workers to achieve their 
common interests, those interests are not 
simply given. Nor, when recognized, do they 
automatically translate into class organiza- 
tion. On the contrary, their “translation” into 
class organization inescapably involves com- 
bat between the conscious agents of capital 
and of labor (Griffin et al. 1986, p. 148). So, 
the “interests” that workers recognize and act 
upon are affected both by struggles between 
them and capital and, we wish to emphasize, 
struggles among them, over the definition and 
organization of their common class interests 
(cf. Przeworski 1977; Stark 1980, pp. 97— 
98). 
For these reasons, the political effects of 
the political practices involved in organizing 
CIO unions constitute a critical, theoretically 
relevant historical case. Put otherwise, histor- 
ical materials on CIO organizing are what 
Robert K. Merton terms "strategic research 
materials (SRM),” for they clearly “exhibit 
the structure and workings of the phenomena 
to be understood" (1987, p. 11): namely, the 
independent effects of intraclass struggles, in 
this instance, among contending working- 
class political factions and parties. Vying for 
political power in the emerging organized 
industrial segment of the working class, they 
had sharply different conceptions of working- 
class interests and the political ies aie to 
protect and advance them. 

The assorted run-of-the-mill unionists, so- 
cialists, Catholic activists, radicals, and 
Communists involved in organizing the CIO 
unions were engaged in a simultaneous fight 
on two fronts. The main front was their 
common struggle against capital to organize 
and reorganize workers into.industrial unions. 
The second front was the struggle among 
these political rivals to win political leader- 
ship of the new industrial unions— and thus to 
actively define the interests and shape the 
cohesion and self-consciousness of this orga- 
nized segment of the working class. 

Asking our empirical question, that is, 
whether the political practices of the CIO's 
contending political camps determined which 
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of them won the leadership of its unions, and 
asking it in this way, allows us to identify a 
critical *pocket of theoretical neglect" and of 
"specifiable and specified ignorance" in 
political sociology (Merton 1987, p. 11). This 
type of question cannot be posed within the 
prevailing paradigm in political sociology. Its 
central question is: What are the "social 
bases" of the aggregated individual political 
attitudes and political behavior of various ` 
categories of workers? (See, e.g., Form ` 
1985, chaps. 8-10; Goldthorpe, Lockwood, 
Bechhofer, and Platt 1968, 1969; Hamilton 


.1967; Leggett 1968; Lipset 1960, chap. 7; + 
Zeitlin, 1967.) 


Its unexamined premise, however, is that 
the social bases themselves somehow also 
account for the existing balance of political 
forces. How the contending parties and 
factions were able to establish themselves and 
thus viably compete to influence workers 
politically and win their’ votes in the labor 
movement and in the wider society is simply 
ignored.? This, then, is the rationale of our 
study: We both pose a central theoretical 
question that has rarely even been asked in 
political sociology and try to answer a crucial 
empirical question derived from it.4 

We focus on the political impact of 
political struggles, not on their sources; so we 
do not examine the ideological, program- 
Matic, or strategic issues over which the rival 
political camps fought. Among these issues, 
of course, was the Communists’ defense of 
“Stalinism” and the Soviet dictatorship. For, 
whatever the courage and admirable personal 
qualities of individual Communist labor ` 
organizers and leaders, even the most saintly 
of them was linked politically to that 
dictatorship (Zieger 1984, p. 300). But “most 
labor leaders actively or passively [fought] . 
Communists,” as even an anti-Communist 


3 Of course, a few sociologists who have 
conducted studies in this tradition (e.g., Lipset et 
al. 1956, p. 443; Zeitlin 1967, p. 6) have also 
emphasized that “politics” matters in determining 
workers’ consciousness and organization. But this 
observation is not integrated into their own 
empirical analyses. 

Griffin et al. (1986) is perhaps the only such 
previous study. Although it is not explicitly 
formulated in terms of the relative autonomy of 
politics, its aim clearly is to assess the independent 
effects of political practices on the organized (or 
political) relations between classes (cf. also Rubin, 
Griffin, and Wallace 1983). 
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radical such as C. Wright Mills (1948, pp. 
190-91) observed, not out of principled 
opposition to “Stalinism,” but simply “to kill 
off troublesome factions within their unions” 


' or to solve a problem in “public relations.” 


During the “Red decade” of the 1930s and 
beyond, through the early postwar years, the 
Communists and their allies led a “powerful 
and pervasive radical movement in American 


` life" (Starobin 1972, p. x). In particular, they 


established a broad base in the insurgent 
industrial unions organized by the CIO. At 


, the height of their strength among U.S. 


industrial workers, they and their allies led 
nearly half of the CIO unions and officially 
represented at least two million workers, or 
over 30 percent of the CIO's members.5 


SOURCES AND METHODS 


The 38 CIO unions included here comprise 
almost all of the known. durable CIO 
* international" unions. Of the 40 internation- 


5 Obviously, these figures depend on how the 
"Communist camp" gets defined. Of the 40 CIO 
unions Kampelman lists as of 1946, he classifies 
18 (or 45 percent) in the "Communist camp" 
(1957, pp. 45-47). (Here he follows Avery 1946.) 
RIA (1946, pp. 17-18) lists 36 CIO unions, 17 of 
which it classifies as "left-wing unions" (which 
coincide with 17 of Kampelman's Communist-led 
unions), and another 5 of which it classifies as 
"probably left-wing unions" (of which one is 
classified as Communist-led by Kampelman). 
Barrington Moore, Jr.'s assessment in 1945 was 
that “the Communists and their sympathizers are 
quite evenly matched within the CIO” (1945, p. 


*. 37). Mills (1948, pp. 195, 308) relies on the 


estimates of Avery (1946) and the RIA (1946). 
According to Seidman (1950), Communists led 
unions representing a third of the CIO membership 


` in 1946. This is what we calculate from the CIO’s 


union-by-union 1944 membership figures (Huber- 


. man 1946, pp. 161—80): each camp had roughly a 


third of the CIO's total membership. Kampelman 
estimates that 25 percent of the CIO's members 
were in Communist-led unions in 1946, down to 
15 percent by 1948; but he also says elsewhere 
"that Communist-led unions in 1949 claimed a 
membership of more than two million" (1957, pp. 
157, 249), which would amount to about a third of 
the CIO's members. These estimates of the 
membership of Communist-led unions do not take 
into account the members of Communist-led locals 
in other, CIO unions or of unions in which 
Communists were very influential, although not 
strong enough to "dominate" them. 
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als at the first postwar CIO convention in 
1946 (Kampelman 1957, pp. 45-47), we - 
omitted one for lack of data, and another 
because it lasted only a year. All 36 of the 
unions listed by the Research Institute of 
America (RIA) (1946, pp. 17-18) are 
included among our 38.5 We drew our data on 
the political practices of each union from 
published historical works. We reviewed 
every relevant work in search of information 
that would allow us to categorize each CIO 
union on our four major independent political 
variables.? 

Obviously, the attempt to construct the 
major dependent variable, that is, the “politi- 
cal camp" of the union's leaders, and 
especially the decision to categorize a union 
as "Communist-dominated," is inherently 
controversial. *Red-baiting," or charging that 
a union organizer or leader was a "Red" or 
*Communist," was long a standard antiunion 
tactic used by American capitalists. Many of 
the old American Federation of Labor (AFL) 
unions also prohibited Communists from 
holding union office, or even from union 
membership. Such prohibitions were also not 
uncommon in CIO unions, even years before 
the CIO expelled a dozen so-called "Commu- 
nist-dominated unions" in 1949 and 1950 
(Saposs 1959, p. 121; Taft 1953, p. 23). 
Communist unionists’ habit -of concealing 
their membership in the party was therefore 
not merely a Leninist reflex. It was a matter 
of both principle (“don’t let Red-baiting 
break you up”) and political if not physical 
survival. 

Thus, classifying the politics of CIO union 
leadership is to some degree guesswork. 
Because Communist union leaders seldom 
publicly acknowledged their party member- 
ship, classifying unions as Communist-led or 


Leo Troy (1965, pp. A20-A23) lists 11 
short-lived CIO unions, founded sometime during 
the era (1937~1950) when Communist-led unions 
were still a potent CIO force. None of the: 11, 
however, is on Max Xampelman’s list (or ours); 
only 4 of them lasted more than three years. 

A chart is available from the authors citing 
specific sources and pages and briefly summarizing 
the information on which our classifications on the 
political variables are based. Aside from the, 
sources already cited, these include: Brown 1947; 
CIO Proceedings 1933; Galenson 1940; Filipelli 
1970; Fink 1977; Lens 1949; Pinsky 1947; Schatz 
1977; Stolberg 1938; Twentieth Century Fund 
1945; Weinstein 1975. 
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“dominated” involves something of a distor- 
tion of political reality. 

The classification of the unions into 
“political camps” used here is taken from 
Max Kampelman (1957, pp. 45-47). His 
classification was based mainly on other 
(anti-Communist) sources (e.g., Avery 1946; 
RIA 1946, pp. 17-18). For instance, RIA 
explains its (somewhat more cautious) label- 
ing of unions as “left-wing” (rather than as 
“Communist-dominated”) as follows: “ ‘Left- 
wing’ [unions] have espoused causes or taken 
positions similar to the Communist Party 
positions as revealed by the Daily Worker. 
Whether this is coincidence or is the result of 
Communists within the union can best be 
determined bv one who deals with them over 
a period of time. Nor is there any attempt to 
distinguish here between those unions whose 
action is caused by the fact that the officers 
are Communist [sic] and those unions whose 
policy is set by the fact of their having either 
a majority Communist membership or a small 
but active group of Communist members” 
(1946, p. 16). 

Kampelman also classifies unions politi- 
cally on the basis of the political issues 
raised, causes advocated, and positions taken 
by union leaders. In particular, most of his 
evidence that a union is in the Communist 
political camp is drawn from the CIO's 
so-called formal cases against the unions 
accused of being "Communist-dominated" 
(1957, pp. 121-40, 167—224). He rarely 
presents any evidence of party membership. 
The second political camp consists of the 
unions led by "uncertain and shifting coali- 
tions": Communists were at times highly 
influential in the coalition holding union 
office or in the coalition forming the main 
opposition to the current leadership. The 
unions classified in the "anti-Communist" 
political camp were regularly led by officials 
who considered the Communists an illegiti- 
mate political force. 

This classification of the CIO's internal 
political divisions is partly a product, there- 
fore, of what historian Robert Zieger (1986, 
pp. 131-32) calls the CIO’s “own dispiriting 
version of the red scare that dominated 
American politics in the early 1950s." The 
CIO put 11 of its affiliated international 
unions on “trial,” based on elaborate pseudo- 
legal "cases," mainly because of the dissent- 
ing foreign policy positions taken by their 
officers. That leaders of the expelled unions 
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had often “parroted the pro-Soviet line" was 
the main "evidence" used against them. 
Moreover, “the political struggle within the 
unions [often] led to suspicion of anyone with 
a dissenting reputation. . . . Many [anti- 
Communist radicals who] supported at least 
the original efforts to discredit the pro-Soviet 
elements, found themselves . . . frozen out of 
union politics, and often hounded out of the 
labor movement because of their alleged 
'subversiveness'." 

Yet, if both analytical tendentiousness and 
political repression were involved in creating 
Kampelman's classification of the unions into 
political camps, we found that (except for 
minor differences) it accords with our own 
study of the historical materials. 


IS THERE A PATTERN? 


How did the Communists and their allies win 
the leadership and secure a political base in 
CIO unions? The broad historical answer, and 
the starting point of our empirical analysis, is 
that the Communists won "positions of power 
and trust in the CIO by the standard method 
of gaining power in U.S. labor unions: by 
being the organizers" (Mills 1948, p. 196). 
"The main . . . source of CP [Communist 
Party] strength in the CIO," Irving Howe and 
Lewis Coser (1957, p. 375) emphasize, “was 
the participation of thousands of its members 
in the organizing drives of the late thirties. If 
there was dirty work to do, they were ready. 
If leaflets had to be handed out on cold winter 
mornings before an Akron rubber plant or a 
New York subway station, the party could 
always find a few volunteers. If someone had 
to stick his neck out within the plants, a 
Communist was available. . . . Never were 
the Communists more than a minority among 
the CIO organizers. . . . Plenty of other 
people, ranging from run-of-the-mill union- 
ists to left-wing Socialists, worked hard and 
took chances. But the Communists were the 
best-organized political group within the CIO. 
. . . The devotion, heroism, and selflessness 
of many Communist unionists during these 
years can hardly be overestimated" (cf. also 
Moore 1945, p. 37). 

Although historians of American labor 
recognize that organizing battles among CIO 
rivals mattered in shaping the CIO's internal 
political lineups, they deny that any pattern 
existed in who won and who lost from union 
to union. Ín his recent CIO history, for 
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instance, Harvey A. Levenstein argues that, 
“as for patterns in the response to Communist 
overtures among American workers, the most 
striking feature is a lack of apparent pattern.” 
Where the “CP was strong” in a union, he 
suggests, it was “mainly because a few 
leaders . . . happened to be in the right place 
at the right time, creating a following for 
themselves and their factions through their 
determined leadership under fire. . . . There 
is more accident than pattern in CP strength in 
the . . . CIO unions" (1981, pp. 71, 55).8 
Nathan Glazer (1961, p. 120) makes a similar 
observation, and concludes: "In the light of 
the various histories of many unions, large 
generalizations appear to be crude and 
clumsy, scarcely helpful in explaining any 
single outcome. . . . In the end it would seem 
to be the organizational factors that predomi- 
nate; the skill and training and luck of 
Communists and their opponents." 

Although these observations are correct, 
they are also incomplete. For if historical 
accidents (like being “in the right place at the 
right time") or contingent "organizational 
factors" count in the making of history, the 
question remains, “Why?” If accidents or 
contingencies matter, what makes one rather 
than another contingency matter? Are appar- 
ent historical contingencies (at least partly) 
socially determined? We shall try to show that 
this issue —the social determination of histor- 
ical contingency — is closely involved with the 
issue of the relatively independent effects of 
political struggle. Our analysis will reveal an 
indelible pattern of relationships among 
earlier political practices, emergent political 
relations, and subsequent political alignments 
in the international unions of the CIO. 

We abstract here from the sorts of 
"objective conditions" or structural factors 
that are usually the focus of sociological 
explanations of workers' political attitudes 
and behavior. No doubt they affect workers' 
Struggles and the types of unions and parties 
which emerge and gain workers' adherence. 
But they do not determine how these struggles 
are waged nor who wins and who loses. In 


8 Important insights concerning the general 
political processes involved are contained in the 
narratives of specific struggles by Levenstein and 
other labor historians cited in our list of references. 
We have reformulated some of these particulariz- 
ing and unsystematic observations as explicit 
hypotheses. 
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this analysis, we assume that such objective 
conditions remain constant; and we explore 
how the political relations resulting from 
political struggles (whether wittingly or not) 
become integral components of the emergent 
Objective conditions for subsequent struggles.? 

The CIO was itself the proximate historical 
product of intraclass political struggles, which 
took place in two phases. The first phase, 
from the early 1920s on, was the fight of 
Communists and other radicals for industrial 
unionism both from within and outside the 
AFL, which refused to organize except along 
craft or trade lines. The second was the 
radical response to the labor insurgency of the 
early 1930s, when top AFL officials and 
many AFL unions split over the issue of 
whether and how to organize the unorganized 
in the mass production industries. 

With the birth of the CIO, rival factions 
and parties now vied with each other for 
power in the fledgling organization and its 
constituent unions. These political struggles 
and the resultant political relations can be 
seen as constituting and being constituted by 
four bundles of historical events: (1) whether 
there had been earlier Red organizing in an 
industry; (2) whether the union seceded from 
the AFL from above, in a revolt of its top 
officers, or from below, in a workers’ 
insurgency; (3) whether the union was 
originally organized independently or under 
the aegis of a CIO "organizing committee"; 
and (4) whether the union was formed as an 
amalgamated or as a unitary organization. We 
emphasize that these historical bundles repre- 
sent crucial types of political practices and 
sets of internal relationships among contend- 
ing political forces.!? So, these bundles are 


? We did try to assess the independent effects on 
the unions' political alignments of some of the 
“objective economic conditions" of the industries 
they organized and of some demographic charac- 
teristics of the workers in these industries. We 
discuss this in Appendix 1. 

10 Our focus on these “historical bundles" is not 
arbitrary. They are sets of events, as should 
become clear from the following discussion, 
representing crucial formative intraclass struggles 
over class organization. But of course they do not 
exhaust the universe of prior intraclass struggles 
which could also have shaped the CIO's internal 
balance of political forces. Among these, for 
instance, are the IWW's organizing efforts in the 
decade or so before World War 1! and the 
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meant to be “relational concepts,” and our 
attempts to measure them, “relational vari- 
ables.” 

Our empirical analysis is presented in four 
steps: (1) Explication of the hypotheses and 
assessment of the evidence, in a set of four 
bivariate contingency tables, concerning the 
expected direct effects of each of these 
political variables. (2) Estimates, through a 
“logit” analysis, of their independent direct 
effects. (3) Explication of a theoretical model 
illustrative of both their direct and indirect 


effects. (4) Assessment of the evidence, in a. 


set of five bivariate contingency tables, 
concerning the indirect effects posited in this 
theoretical model.!! 

RED UNIONISM 

If being in the “right place at the right time” 


with the "skill and training" is what matters - 


in making history, it was neither mere luck 
nor an accident that thousands of skilled and 
trained Communists were in the right place 
(in the thick of organizing struggles), at the 
right time (during the workers' insurgency of 
the 1930s), when others were not. On the 
contrary, "Red unionism" and the cause of 
industrial unionism (organizing all workers in 
an industry, across trade or craft lines) had 
been almost synonymous for many years 
before the CIO took up the call. After the 
decline of the ““Wobblies” (Industrial Work- 
ers of the World [IWW]), the Communists 
were the main carriers of the ideas of militant 
action and industrial unionism. So in the early 
days of the CIO's split with the AFL, the 
Communists were skeptical or even hostile 
toward CIO efforts. "In a way, the Commu- 
nists looked upon the CIO as a rival that was 
capitalizing on some of its issues, particularly 
that of industrial unionism" (Saposs 1959, p. 
123). 


spawning of "employee representation plans" or 
"company unions" in the postwar period of 
“welfare capitalism." How did the earlier organiz- 
ing struggles of the Wobblies or the existence of 
company unions affect the chances years later that 
the Communists or other factions would win power 
in the emergent CIO unions? We are engaged in 
research to answer these and other closely related 
questions on the political determinants of the 
CIO’s political lineups. 

‘In Appendix 2, we discuss our use of 
significance tests and logit modeling. 
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From 1920 on, when the Trade Union 
Educational League (TUEL) called for orga- 
nization of the unorganized and for industrial 
unionism, the Communists had been trying to 
organize some of the same industries and 
plants that the CIO later targeted for organiz- 
ing (Foster 1947, pp. 198ff.; Keeran 1980b, 
p. 137). “The AFL made so little effort to 
organize the unorganized,” Theodore Draper 
(1972, p. 374) emphasizes, “that there was 
plenty of room for a ‘dual union,’ even if the 
AFL wasn’t destroyed in the process.” These 
Red organizing drives took place in two 
phases. Until the Autumn of 1929, the TUEL 
tried to “bore from within” to create 
“revolutionary oppositions” within the major 
AFL unions. Then, until late 1935, its 
successor, the Trade Union Unity League 
(TUUL), tried to organize independent indus- 
trial unions and wage a struggle of "class 
against class."!? So, on the eve of the CIO's 
birth, several thousand Communists already 
had fought for years trying to organize 
independent unions and had won a reputation 
in several industries as militant partisans of 
industrial unionism and workers' rights. 

As a result, they won some supporters 
(historians differ on the numbers) among the 
workers in these industries. They also pre- 
pared many others for the coming wave of 
industrial unionism. In auto, for instance, the 
Communists had started agitating for an 
industial union in 1925 and soon were 
printing shop papers and distributing them to 
a dozen of Detroit's major auto plants 
(Keeran 1980a, p. 37). "These little four- 
page sheets, sold for a penny or given away 
by Communist distributors at plant gates... 
provided the only news of conditions and 
grievances inside the plants available to 
workers” (Cochran 1977, p. 63). 

Red unions led some heroic, fiercely 
fought and bloodily suppressed strikes. But 
many, if not most, of these unions "went to 
pieces afterward" (Klehr 1984, p. 133). For 
“all unions were fought bitterly in those days. 


12 Other than Draper's 1972 article, as he notes 
(p. 371), "not a single book, dissertation, or 
article, scholarly or otherwise, has ever been 
devoted to [the TUUL or] . . . to any of its 
constituent unions." Nor, to our knowledge, has 
such a work been published since then. On the 
TUEL and TUUL, see Cochran 1977; Draper 
1957, 1960, 1972; Foster 1937; Klehr 1984; 
Starobin 1972. 
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But the most brutal terror was reserved for the 
Communist unions." With no "revolutionary 
wave" to ride, "the TUUL had to depend on 
the straining, sweating, and plodding of its 
own organizers. They were too few, and they 
tried to do too much" (Draper 1972, p. 392). 

As TUUL’s Jack Johnstone lamented in 
early 1930, “the objective conditions were 
never better for building militant revolution- 
ary unions, but objective conditions do not 
create organizations” (quoted from Labor 
Unity, in Klehr 1984, p. 41, italics added). 
By 1932, even the few more or less durable 
Red unions became "for ‘all intents and 
purposes, moribund." Seeing that the upsurge 
of labor was bypassing their "dual" unions, 
the TUUL officially disbanded in mid-1935, 
and its unions “faded away” (Klehr 1984, pp. 
47, 133). 

In some struggles (such as in Harlan 
County, Kentucky), Draper suggests, the 
revolutionary intransigence and plain incom- 
petence of the Communists had been so 
disastrous for the workers involved that once 
the Communists were driven out, they could 
never return; in the end, “for all their 
fortitude and determination, they had nothing 
concrete to show for their efforts” (1972, pp. 
392, 389). But in other struggles in auto, 
transport, electric, lumber, and shipping, for 
instance, Communists apparently succeeded, 
argue Howe and Coser (1957, p. 373), in 
putting together and holding on to “a kind of 
skeleton apparatus. In this way the Commu- 
nists were able to begin functioning in the 
CIO with an embryonic structure of organiz- 
ers who knew each other from ‘the old days’ 
and, though assigned to different industries, 
could help one another with regard to both 
party interests and their own status.” If 
nothing else, the TUEL and TUUL experi- 
ences served “as a training ground for the 
Communist unionist in organization tech- 
niques and in administering unions” (Taft 
1964, p. 16). Also, “aside from these 
organizational advantages," as Glazer (1961, 
p. 111) emphasizes, "the Communists were 
in fact founding fathers, with all the moral 
authority that gives a leader." 

On balance, even if nearly all of their 
“revolutionary unions" were stillborn, and 
some struggles the Communists led were 
politically disastrous for them, these hard 
years of Red organizing also probably created 
in some industries a cadre of experienced 
Communist organizers and effective, even 
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charismatic, leaders with a legitimate claim to 
many workers' support. If this is so, then the 
Communists should also have had a better 
chance of winning the leadership of the CIO 
unions established in the industries where 
party members had tried to organize Red 
unions in earlier years (i.e., before the CIO's 
founding), than in unions established in 
industries where no such earlier Red organiz- 
ing hàd occurred. 

Our findings are more or less consistent 
with this hypothesis. First, over half of the 
CIO unions in the industries penetrated by 
earlier Red union organizing were later led by 
Communists, as compared to 4 out of 10 of 
the unions in the industries where Red 
unionists had not actively organized in earlier 
years. Second, the contrast is sharper when 
we examine the effect of earlier Red organiz- 
ing on the subsequent success of the anti- 
Communists: the percentage of  "anti- 
Communist” ‘unions in industries the Reds 
had not tried to organize in the pre-CIO era 
was nearly twice as large as in those they had 
tried to organize (Table 1, Part a).!? 


AFL SECESSION: "FROM ABOVE" OR 
“FROM BELOW" 


In the fall of 1936, the AFL "suspended" 10 
unions affiliated with the CIO (then still the 
“Committee on Industrial Organization") on 
charges of "dual unionism" (the same charge 
the AFL used to throw out the adherents of 
the TUEL 11 years earlier) and of “fomenting 
insurrection." The 10 unions immediately 
started making their per capita payments to 
the now independent CIO (Bernstein 1970, 
pp. 422-23). These founding unions of the 


13 As we shall see, Red unionism also had 
significant indirect effects on the chances the 
Communists would win union power in later years. 
We can think of no convincing line of argument 
that the other insurgent political practices which 
are examined in the following analysis (insurrec- 
tion from below, independent organizing, and 
amalgamation) themselves somehow "reflected" 
the specific “objective economic conditions" or 
the social characteristics of the workers in an 
industry. But in the case of Red union organizing, 
it is plausible to argue that an industry's structure 
did have a bearing on the very existence of that 
insurgent organizing practice and on the success or 
failure of the Communists in penetrating that 
industry. We briefly discuss this issue in Appendix 
3. 
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Table 1. CIO International Unions in the 1940s Led by 
Communists, Shifting Coalitions, and Anti- 
Communists, by Types of Political Practice (in 











percent) 
Anti- 
Com- Shifting Com 
munists Coalitions munists (N) 
Earlier Red 
organizing* 
Some -> 52 29 19 QD 
None . 41 23 35 (17) 
Source of secession^ 
Worker's insurgency 72 22 5 (18) 
Top officers’ revolt 25 30 45 (20) 
Organizing strategy" —— 
Independent 61 19 19 (26) 
CIO committee 17 42 42 (12) 
Union formation? 
Amalgamation 67 17 17 (2 
Unitary ` . 38 31 3 (26) 





"log odds ratio (uniform association) = 0.339; 


> Log odds ratio (uniform association) = 1.521; 


' s.e. = .525; p<.01. 


“Log odds ratio (uniform association) = 1.010; 
s.e. = .456; p<.05. 

“Log odds ratio (uniform association) = 0.665; 
s.e. = .468; p<.08. 


CIO, and a few others that soon followed 
them, came into the new industrial union 


" movement as the result of what we term “a 


revolt from above." Their top officers broke 
away from the AFL and joined the CIO with 
their staff and organizational hierarchy — and 
much of their union jurisdiction — intact. As a 
result, they *had a continuity of leadership," 
Tack Barbash suggests, "that [was] proof, by 
and large, against Communist domination" 
(1956, p. 342). . 

In contrast, most other CIO unions grew 
out of local and district battles between craft 
and industrial unionists over the control of 
their AFL precursors. Such workers' "insur- 
gencies from below" split many AFL unions. 
The workers in these AFL locals and districts 
then came into the CIO to form the core of 
new international unions. This happened, for 
instance, in the AFL's Upholsterers Interna- 
tional Union in 1937, where a number of 
locals defected from the AFL and combined 
with some other independent craft unions and 
a few CIO locals to form the CIO United 
Furniture Workers (Peterson 1944, p. 135). 
Other struggles "from below" took place in 
the newly chartered AFL “federal labor 
unions," thàt is, the newly organized locals 
given a temporary AFL charter to “store 
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workers” until they could be “parcelled out” 
to AFL craft affiliates (Bernstein 1970, p. 
355). Some seceded from the AFL to become 
the locals of a new CIO union, rather than be 
parcelled out and subordinated to craft 
control. Of the 18 CIO unions in our study 
that seceded from the AFL as the result of an 
insurgent workers’ movement, 4 originated in 
battles in federal unions, the other 14 in 
rebellions in various locals and districts of 
existing APL unions. . 

Often the leaders of the local rebellions 
against the craft leadership were radicals of . 
various stripes, including Communists and 
their allies. Unlike the situation in the former 
AFL unions that came into the CIO from 
above, these left leaders had the opportunity 
to gain secure political bases in the new CIO 
unions they built in struggles from below.’ 
This was the pattern among longshoremen, . 
for example, with the Australian seaman 
Harry Bridges. An outstanding leader in the 
epic 1934 West Coast maritime strike, and in 
the San Francisco general strike, Bridges 
rose, with Communist support, from ordinary 
dockworker to president of the San Francisco 
local and then, in 1936, to president of the 
entire Pacific Coast district of the Interna- 
tional Longshoremen's Association (ILA). ' 
The next year, he led 17,000 West Coast. 
dockworkers out of the ILA into the CIO, to 
form the International Longsboremen's and 
Warehousemen's Union (ILWU) (Levinson 
1956, pp. 262-63). 

For these reasons, we suggest that far more 
of the unions that had been born from below 
than from above would be Communist-led. 
We find, indeed, . that the numbers of 
Communist-led unions in each category differ 
sharply. Nearly three-fourths of the CIO 
unions whose secession from the AFL had: 
come through workers' insurgence were later 
led by Communists and their.allies; but fewer 
than one-sixth of the AFL internationals 
whose top officers had broken away to join 
the CIO were later Communist-led. Seven 
other CIO unions were independent non-AFL 
unions before the CIO was established (e.g., 
the Federation of Architects, Engineers, 
Chemists, and Technicians) or were orga- 
nized in an industry that had no prior AFL 
union (e.g., the Farm Equipment Workers 
[FE]). Because they also joined the CIO with 
their organizational hierarchies intact, we 
included them in the “from above" category, 
which raises the proportion of Communist-led. 
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unions in that category to one-fourth. The 
effect of the type of secession on the chances 
that anti-Communists would win union lead- 
ership is especially sharp. Nearly half of the 
unions that joined the CIO from above were 
in the anti-Communist camp compared to 
only one-twentieth of those that joined from 
below (Table 1, Part b). 


INDEPENDENT ORGANIZING 


Forged in the earlier Red union organizing 
drives, thousands of experienced Communist 
organizers dedicated to industrial unionism 
formed a ready but recalcitrant, perhaps even 
politically dangerous, reservoir of organizers 
who could be tapped by the CIO's founders. 
CIO President John L. Lewis and other CIO 
leaders "had no choice but to accept the 
support of the Communists,” as Saul Alinsky 
gives Lewis’s thinking on the matter. “Even 
after the debacle of 1933 and 1934, when the 
American Federation of Labor smashed the 
spirit of unionism, it was the left-wingers who 
zealously worked day and night picking up 
the pieces of that spirit and putting them 
together” (quoted in Cochran 1977, p. 97). 
“John L. Lewis was forced to employ many 
‘young Communists as organizers for the 
_ .C.1.0. when it first started,” Seymour Martin 
Lipset (1960, p. 386) also emphasizes, 
“because they were the only people with the 
necessary skills who were willing to take the 
risks involved for low pay.” 

The CIO’s founders, notably Lewis, tried 
both to use Communist organizers and to 
hobble them so that they could not take power 
in the new CIO unions. Responding to 
warnings that hiring Communists meant 
trouble for the CIO, Lewis asked sardoni- 
cally, "Who gets the bird, the hunter or the 
dog?” (Cochran 1977, p. 97). But to prevent 
the “dog” from straying, Lewis and other 
CIO officials exerted tight control over the 
“organizing committees” they put in charge 
of organizing the unorganized in an industry 
(Taft 1964; Bernstein 1970, p.616). 

What happened to Communist organizers 
in the Steel Workers Organizing Committee 
(SWOC) exemplifies the process of control: 
“With Philip Murray [later head of the CIO] 
and his superbly competent, experienced, and 
anti-Communist lieutenants in charge of the 
steel organizing campaign, when 
Communists.were spotted, or became too 
dangerous a threat, they were discharged" 
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(Sapòss 1959, p. 122). SWOC organizers 
were all hired, paid, and fired by the head 
office.‘ When they organized a local, the 
SWOC moved them to another area, allowing 
SWOC officials to take control (Taft 1964, p. 
57). “Thus, a Communist who had helped to 
organize twenty-five SWOC locals lamented,” 
says Levenstein (1981, p. 51), “that despite a 
few successes, Communists ‘weren’t too 
successful’ in wooing the leaders of the locals 
they helped to organize.” Communists played 
an important role in the four-year battle to 
organize “Little Steel,” the violently anti- 
union steel companies that held out long after 
the US Steel Co. capitulated. "But when the 
struggle was over, they were quietly fired by 
SWOC head Philip Murray.” 

In auto, in contrast, a host of contending 
radical, Communist, Socialist, Coughlinite 
Catholic, and other factions competed to or- 
ganize the unorganized and to win power in 
the new CIO union. The UAW financed much 
of its own organizing drives by collecting dues 
from the workers. Although the CIO also con- 
tributed money and organizers to the Ford 
drive, top CIO officials had little direct influ- 
ence on the conduct of the campaigns against 
the big auto companies. As a result, “even at 
the height of CIO influence in the internal af- 
fairs of the UAW," the CIO was unable to 
impose "outside leadership" (Galenson 1960, 
p. 133). Also, the major auto companies bit- 
terly resisted unionization. GM, for instance, 
agreed to bargain with the CIO union only 
after a tenacious and often violent struggle with 
the workers. In these battles, and 1n some of 
the decisive sit-down strikes —for example, in 
Flint, “the first great victory” for the UAW 
“and one of the epic confrontations in Amer- 
ican labor history" (Zieger 1986, pp. 46— 
47)— Communists gained a reputation as su- 
perb organizers and combative and courageous 
leaders. Consequently, they were able to cre- 
ate strong rank-and-file groups in the auto in- 
dustry (Galenson 1960, p. 150). 

We suggest that the chances that Commu- 
nists could build their own base and later win 
the union's leadership would be higher where 
they were involved in independent organizing 
than where they organized as employees of a 
CIO organizing committee. Aside from the 
moral authority they commanded as militant 
organizers in the fight against the companies, 
the Communists could, where there was no 
CIO organizing committee to stand in their 
way, "bring in reinforcements on the lower 
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levels who could provide a solid layer of 
support for its people on top” (Howe and 
Coser 1957, p. 377). 

For these reasons, we expect that far more 
of the unions that had been built by 
independent organizers than by CIO organiz- 
ing committees would be led by Communists 
and their allies. We find that, indeed, the 
Communists won leadership in one-sixth of 
the unions organized by CIO organizing 
committees compared to two-thirds of the 


independently organized unions. Also, the: 


percentage of unions organized by CIO 
organizing committees that later were led by 
“anti-Communists” was over twice the per- 
centage among independently organized unions 
(see Table 1, part c.).!4 


AMALGAMATION 


Whatever their origins, the formation of unions 
as organizations tends to follow roughly either 
a “unitary” or a “federated” (or “amalgamat- 
ed”) path. Unitary organizations (i.e., central- 
ized and typically hierarchical) tend, as they 
grow, to incorporate new members and locals 
into the union’s existing structure, “with the 
new subordinate officials and groups deriving 
their authority from the summits of the orga- 
nization.” In contrast, an amalgamated orga- 
nization grows through the merger or combi- 
nation of a number of existing independent 
locals or groups of leaders (Lipset, Trow, and 
Coleman 1956, p. 442).!5 


‘4 A CIO organizing committee sometimes fell 
into the hands of men who were not hostile to the 
Communists, or it was short-lived (or both). In 
either case, in consequence, Communists were 
able to form their own nuclei of support in the new 
union. For instance, this appears to have happened 
with the Shoe Workers Organizing Committee. Its 
CIO-appointed head, Powers Hapgood, was will- 
ing to work with Communists (Levenstein 1981, p. 
108), and the organizing committee lasted barely a 
year before establishing the fledgling union itself. 

15 Amalgamation refers to the merger in the 
1930s of several independent units to form one 
CIO union. So, unions are not classified as 
amalgamated if they were formed out of the merger 
of AFL federal locals alone; in general, these 
federal locals had little if any prior independent 
organizational existence. If a union already had 
been an established AFL union before its merger 
with another union or unions, it is classified as 
amalgamated only if this merger led to a 
substantial reorganization of the union's adminis- 


51: 
In the early days of the CIO, some union: 
amalgamated because their "jurisdictions," 
whatever a union's CIO charter, were stil 
mixed and shifting. Sometimes several union: 
were organizing in different branches of the 
same "industry." Sometimes a single unior 
branched out and organized locals in severa 
closely related "industries." (Analysis woulc 
show, we suggest, that what came to b: 
considered an "industry" in capital-labo: 
relations was itself at least in part a politica 
artifact of these organizing struggles.) It wa: 
thus always a political question as to how anc 
to what extent CIO officials could impos 
their demands on these new unions: that is 
whether they allowed (or compelled) a unior 
to retain and amalgamate such mixed indus 
trial locals into its permanent structure, o 
allowed (or compelled) several unions ir 
different parts of an industry to amalgamate 
into a single union. It was, of course, also : 
political question, as well as a specific matte 
of organizing strategy, for the unions anc 
locals involved.!$ 


trative or political structure after it joined the CIO 
Only three unions had any record of mergers afte: 
joining the CIO: (a) Mine, Mill, which added : 
Die Casting Division in 1942, five years after i 
bolted the AFL to become a charter member of the 
CIO; (b) the Fur and Leather Workers; and (c) the 
Amalgamated Clothing Workers. Because tht 
Mine, Mill merger occurred long after its politica 
structure had been established, it does not qualify 
in our terms, as an amalgamated union. (Classify. 
ing it as amalgamated would strengthen ou 
findings.) As to the Amalgamated Clothing 
Workers (ACW), it became a unitary highly 
centralized union long before 1936, when i 
absorbed the Journeymen Tailors Union (and it: 
some 6,000 members), and later on, the CIC 
Laundry Workers (and its nine locals). N: 
significant reorganization of the ACW's adminis 
tration occurred after these mergers (Galenso: 
1960, p. 285; also see Bernstein 1970, pp.73ff) 
So it does not qualify as an amalgamated union. Tt 
contrast, the merger of the International Fu 
Workers Union in March 1939 with the Nationa 
Leather Workers Association resulted in it 
restructuring. The new union was constituted o 
two relatively independent divisions, fur arm 
leather; each elected its own officers and manage 
its own finances. Their combined executive board: 
constituted the executive body of the new amalgam 
ated International Fur and Leather Workers Unior 
(Foner 1950, p. 556; Brown 1947, p. 135). So, i 
is classified as an amalgamated union. 

‘6 For example, the CIO ordered the Unite 
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By bringing their leaders, members, and 
finances into one organization, a strategy of 
amalgamation among independent unions 
strengthened the new, larger, that is, amal- 
gamated, union. Of course, amalgamation 
also meant a redistribution of power within 
the new union. Some officials of the 
previously separate unions were now reduced, 
at best, to secondary officers of the new 
international or even to officers of a local, 
while others emerged as top officers of the 
international. 

But whatever the outcome for individuals, 
the amalgamation of once-independent unions 
tended to preserve autonomous centers of 
power in the new union, and thus fostered 
political competition if not factionalism. In 
turn, this tended to enhance the chances for 
union democracy. Two of the “big three” 
CIO unions, for instance, in the auto and 
electrical manufacturing industries, came into 
existence through amalgamation. The UAW 
“was formed out of an amalgamation of a 
number of existing automobile unions, and a 
number of its other local units were organized 
independently of national control. . . . Most 
of the factional leaders in the UAW were 
leaders in the early organizational period of 


Electrical, Radio, and Machine Workers (UE) to 
relinquish the locals it had been organizing in the 
utilities industry. The UE had been organizing 
utility workers with CIO assistance for over a year. 
But CIO officials: created a new jurisdiction and 
established a separate Utility Workers Union 
(Galenson 1960, p. 253). The officials of the 
Utility Workers union were to be among the most 
vociferously anti-Communist officials of any CIO 
union. As Kampelman (1957, p. 46) observes, 
"Recently granted [its] CIO charter, [the Utility 
Workers'] new constitution contained one of the 
most drastic bans against Communists ever adopted 
by a labor union, specifically excluding Commu- 
nists and providing for their expulsion from 
membership.” The UE leaders probably accepted 
the CIO order to stop organizing utility workers 
because, although UE was in charge, paid CIO 
organizers were doing most of the organizing; and 
the rival Association of Catholic Trade Unionists 
(ACTU) was also organizing them. Under these 
circumstances, fighting to hold on to the utility 
workers would have been both costly and destruc- 
tive of labor unity. Relinquishing jurisdiction over 
them may also have been a political gesture by UE 
Communists to their new CIO allies, in tune with 
the “Popular Front” line of the party. The same 
occurred in auto and steel (see Keeran 1980a; 
Levenstein 1981, p. 51). 


AMERICAN SOCIOLOGICAL REVIEW 


the union, and the different factions have 
largely been coalitions of the groups headed 
by these different leaders jointly resisting 
efforts to subordinate them to the national 
administration” (Lipset et al. 1956, p. 443). 

For the same reasons, where radical or 
Communist-led locals or unions were incorpo- 
rated into the new union through amalgama- 
tion, their leaders also had a chance to 
compete for its international leadership. “A 
strong local dominated by the party,” Philip 
Selznick (1952, p. 213) points out, “may 
become the [Communist] fraction’s base of 
operations, its officers assuming a guiding 
role in relation to other party-led groups in the 
union.” i 

In the UAW, one of the most influential o 
the organized factions was led. by Commu- 
nists. They were among the leaders of several 
important locals, including “the world’s 
biggest local,” Local 600 at Ford’s River 
Rouge plant. They also had political bonds to 
powerful non-Communist allies forged during 
the sit-down strikes. Although they were 
highly influential, the Communists made no 
effort to take over the union’s leadership 
(Glazer 1961, p. 112; Keeran 1980a). Not 
only the Communists, of course, but other 
factions (including the one led by the Reuther 
brothers and the Association of Catholic 
Trade Unionists [ACTU]) were successful in 
winning local centers of power, and using 
them as bases for further operations, in an 
effort to forge alliances and build their own 
machines within the union (cf. Selznick 1952, 
p. 213). : 

In contrast, in the relatively diversified 
electrical industry, the United Electrical, 


‘Radio, and Machine Workers Union (UE), 


whose name reflects its amalgamated origins, 
was organized and led mainly by Communists 
and their allies. Among the latter were many 
older radical and socialist workers. Reds. had 
organized one of the major unions (the 
TUUL's Steel and Metal Workers Union), 
and the TUUL had some members and 
significant influence among several other 
independent and local unions (e.g., in GE's 
Schenectady plant) that amalgamated in 1936 
to form the new union. Communists conse- 
quently succeeded in winning the new union's 
international leadership. "Because it was 
essentially a coalition of independently- 
organized unions," as Levenstein (1981, p. 
62) explains, "the UE had a relatively 
democratic, constitution with many features 
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ensuring local autonomy and decentralization 
of power. The districts into which UE was 
organized were exceptionally powerful, pay- 
ing the salaries of their own elected officers. 
With much of the support for the Communists 
resting at the local level, . . . the task of 
overthrowing [their] leadership would have to 
involve arduous battles for local after local, 
for district after district." 

The amalgamation of several unions, we 
suggest, would tend to improve their leaders' 
chances to retain local political bases within 
the new union, from which they could try to 
extend their influence and contend for leader- 
ship. So, the Communists should have had a 
better chance of winning union leadership in 
amalgamated than in unitary unions. We find, 
in support of this reasoning, that whereas 
two-thirds of the amalgamated unions ended 
up in the Communist camp, just over a third 
of the unitary unions did, with the rest split 
evenly between the shifting and anti- 
Communist camps (Table 1, part d). 


LOGIT ESTIMATES OF 
DIRECT EFFECTS 


What were the independent direct effects of 
each political practice, taking the effects of 
the others into account? We constructed both 
a logit and a probit model to measure the 
independent effects. The substantive results 
of both models are identical: only secession 
from below and independent organizing had 
substantial (and statistically significant) inde- 
pendent direct effects in determining which 
CIO unions would be in the Communist 
political camp. We present only the results of 
the logit model (Table 2). Logit modeling 
provides not only coefficients interpretable as 


Table 2. Logit Estimates of the Direct Effects of 
"Insurgent Political Practices" in Determining 
Which CIO International Unions in the 1940s 
Were in the Communist Political Camp (Dichot- 
omizing Unions in the Communist Camp and 





All Others) 

Logit Odds 
Insurgent Practice Coefficients Multipliers 
Earlier Red organizing 19 1.21 
Workers’ insurgency 1.95* 7.05 
Independent organizing 2.14* 8.51 
Amalgamation 21 1.23 
Intercept —2.T]* 


Likelihood ratio chi-square (df) 14.67 (4) 
* Significant at the .05 level. 
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precise measures of effect, but also (unlik: 
probit) coefficients that can be restated ii 
"everyday language" as the comparative odd 
of alternative political outcomes (Swaffon 
1980, p. 672). 

Controlling for the effects of the other thre 
political practices, we find that the odds tha 
unions would later be led by Communist 
were 8.5 times greater if they had bee 
independently organized than if they had beei 
organized by a CIO organizing committee 
Similarly, the odds that unions would later bi 
led by Communists were about 7 time 
greater if they had seceded from the AFI 
from below rather than from above. On th 
other hand, the odds that unions would be le: 
by Communists were only 1.2 times greate 
for an industry with earlier Red organizin, 
than for an industry without it. The odds wen 
also 1.2 times greater for amalgamated thai 
for unitary unions. 


A THEORETICAL MODEL OF DIRECT 
AND INDIRECT EFFECTS 


So, neither earlier Red organizing nor amal 
gamation had sizeable direct effects on th 
odds that the Communists would win unioi 
power. What do these findings imply? Wen 
these two insurgent political practices of littl 
import in shaping the CIO's internal politica 
lineups? 

Most important, were the long, hard year 
of Red union organizing wasted, scarcel 
leaving any working-class legacy of industria 
unionism for the CIO organizers who late 
followed in their footsteps? In the end, did thi 
Red unionists, as Draper suggests, hav 
almost nothing concrete to show for thei 
efforts? The short answer is a sort o 
complicated “no.” Our “theoretical model’ 
suggests (see Figure 1) and our findings ten 
to confirm that these "premature" struggles ti 
build industrial unions had complex an 
sometimes contradictory indirect politica 
effects in shaping the two decisive politica 
practices —secession from below and indepen 
dent organizing—that allowed the Commu 
nists to win the leadership of the new CIC 
unions. 

We offer three related hypotheses about th 
indirect effects of earlier Red organizing 
First, it probably had contradictory effects o: 
independent organizing. On the one hand, th 
presence of some experienced Red organizer 
in an industry should have improved th 
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Figure 1. Theoretical Model of the Direct and Indirect Effects of Insurgent Political Practices on the Odds That 
Communists Would Win the Leadership of CIO Unions. 


chances for the independent organizing of a 
CIO union there. On the other hand, the 
chances for independent organizing in that 
industry were probably reduced because of 
the past battles there between Red unionists 
and the then-AFL officials who later founded 
the CIO. (During the Reds’ so-called “Third 
Period” of “revolutionary upsurge,” when 
the TUUL defined the AFL as “fascist,” they 
could call even a militant AFL leader like 
Sidney Hillman, a “fascist gangster leader” 
[Draper 1960, pp. 302-06; TUUL 1930, p. 
17; Klehr 1984, pp. 39, 17].) Remembering 
those battles and knowing the Reds had been 
active in an industry, top CIO officers (e.g., 
John L. Lewis, Philip Murray, Sidney 
Hillman) may well have tried to put a CIO 
organizing committee in charge of that 
industry's campaign in order to prevent 
Communists from winning power in the new 
unions being built there. Consequently, much 
of the impetus that earlier Red organizing 
might have given to subsequent CIO indepen- 
dent organizing was probably nullified. Our 
findings support this hypothesis. CIO organiz- 
ing committees were established more fre- 
quently in industries where there had been 
earlier Red organizing than in those where 
there had been none (Table 3, part a). 
Second, we suggest that amalgamation was 
more likely in industries that had been 
penetrated by earlier Red unionism than in 
those that had not been. Some former Red 


unions or their remnants which retained 
sufficient independence and cohesion (e.g., 
James Matles's Metal Workers) were able to 
amalgamate with other such remnants, AFL 
federal locals or independent unions as a 
strategy to build new CIO unions. This, we 
suggest, enhanced the capacity of these 
unions to go it alone (i.e., without CIO 
tutelage) in unionizing their industry. In turn, 
establishing a union through independent 
organizing would increase the Communists' 
chances to win its leadership. Our findings 
strikingly support this reasoning. Over half of 
the unions in industries penetrated by earlier 
Red organizing were formed through amal- 
gamation. But over nine-tentbs of the unions 
in the industries untouched by earlier Red 
organizing were formed as unitary organiza- 
tions (Table 3, part b). In turn, amalgamated 
unions were far more likely than unitary 
ones to have been organized independently. 
Indeed, over twice as many of the unitary 
unions were established by CIO organizing 
committees (Table 4, part b). 

Third, and perhaps most important, by 
creating local Red bases within many AFL 
unions, earlier Red organizing made possible 
subsequent local union insurrections and 
secessions from the AFL. The Communists 
had won adherents and forged a significant 
base within the AFL itself in the earlier TUEL 
phase. Although the main Red organizing 
efforts after establishing the TUUL in 1929 


Table 3. The Effects of Earlier Red Organizing on Other Types of Political Practice in CIO International Unions in 


the 1940s (in percent) 
































Organizing Strategy* Source of Secession? 
E —M — " H b 

Earlier Red Ind. CIO Union Totmation Workers’ Officers’ 
organizing Org. Com. (N) Amalg. Unit. (N) Insurgency Revolt (N) 
Some 62 38 21) 52 48 (21) 62 38 (21) 
None 76 23 (07) 6 94 (17) 29 7i (17) 

* Log odds ratio = —.693; s.c. = .727; p<.20. 

b Log odds ratio = 2.890; s.e. = 1.120; p<.05. 


€ Log odds ratio = 3.900; s.e. = .696; p<.01, 
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Table 4. The Effect of the Source of Secession on How the Union Was Formed, and the Latter's Effect on the 
Organizing Strategy, in CIO International Unions in the 1940s (in percent) 


Union Formation? 
Source of secession Amalg. Unit — (NW) 
Workers' insurgency 50 50 (18) 
Officers’ revolt 15 85 (20) 


* Log odds ratio = 1.73; s.e. = .609; p<.05. 
> Log odds ratio = 1.14; s.e. = .870; p<.10. 


were aimed at building independent “revolu- 
tionary unions,” many Reds continued to 
organize oppositions and to try to win 
leadership in existing AFL unions. 

In 1934, a year before the dissolution of the 
TUEL and the decision to return to the AFL, 
a confidential Communist Party memorandum 
reported that Communists were in the leader- 
ship of 135 AFL locals, with 50,000 
members, and of “several” entire union 
districts. The Communists also led organized 
opposition groups in another 500 locals 
(Klehr 1984, p. 225). The next year, with the 
TUUL’s dissolution, these Communist bases 
in hundreds of AFL locals were reinforced, 
and new ones were established when its union 
remnants rejoined the AFL, as units if 
possible, or as individuals if necessary. 

Thus, we suggest that these hundreds of 
Communist-led locals and opposition groups 
were probably among the main centers 
involved in what AFL officials condemned as 
fomenting insurrection against them and 
leading the workers out of the AFL into the 
burgeoning CIO unions. Again, our findings 
strongly support this hypothesis. In the 
industries that had been a focus of earlier Red 
union drives, six-tenths of the CIO unions 
were born in rebellions from below, while in 
the industries that had not been penetrated, 
seven-tenths were led into the CIO from 
above (Table 3, part c). 

A union's secession from the AFL in a 
workers' insurgency, as we have seen, 
increased the chances that the Communists 
would win the leadership of these unions. 
But, as our theoretical model suggests, this 
also indirectly favored their winning union 
leadership: the unions that seceded from 
below were also far more likely than those 
that seceded from above to amalgamate and 
then go on to organize independently (see 
Table 4); in turn, this increased the chances 
that the Communists rather than other factions 
would win union power. This means that if 
earlier Red unionism had contradictory effects 


Organizing Strategy” 
Union formation Ind. Org. CIO Com. (N) 
Amalgamation 83 17 (12) 
Unitary 61 38 Q6) 


on independent organizing, it also had a 
positive effect on the Communists' chances of 
winning power, both through secession from 
below and amalgamation. 


CONCLUSION 


Our analysis has focused on the intraclass 
struggle for intraclass leadership. It has 
therefore neglected the question of how the 
main struggle against capital affected the 
intraclass struggle (and vice versa). American 
capitalists waged a “long and implacable 
fight . . . against the recognition of labor's 
democratic rights" (U.S. Congress 1937-38, 
Part 3, p. 5)—with a ferocity unparalleled 
elsewhere in the West. They deployed their 
own private armies and had the assistance of 
local and state police, the national guard, and, 
occasionally, the U.S. Army itself, to repress 
their workers’ strikes and efforts to unionize 
(Taft and Ross 1969; Goldstein 1978). But if 
many employers fought unionism with every 
weapon at their command (Taft and Ross 
1969, p. 289), some were readier to compro- 
mise with the new unions. 

Assessing the effects of the employers’ 
responses to unionism would be necessary to 
“round out” our analysis. How did the 
varying levels and kinds of employer resis- 
tance or accommodation to unionization 
affect the political practices of the contending 
CIO political camps? How, in turn, did this 
affect the odds that Communists, rather than 
their CIO rivals, would win the workers’ trust 
and take power in the new unions? To provide 
a systematic empirical answer to these 
questions will require primary historical 
research on, at least, the stance taken toward 
unionization by each of the major corpora- 
tions in all of the industries organized by CIO 
unions during the wave of unionization in the 
late 1930s. In steel, for example, U.S. Steel 
(“Big Steel") suddenly and surprisingly 
capitulated to the Steel Workers Organizing 
Committee in 1937, while the so-called 
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"Little Steel" companies engaged in a bitter 
four-year battle before grudgingly recogniz- 
ing the union. On the basis of such 
industry-by-industry research, it would be 
possible to assess the relatively independent 
effects of both class and intraclass struggles in 
determining the political alignments within 
the CIO. 

In any event, our analysis reveals that 
concrete intraclass struggles can create, wit- 
tingly or not, new objective political relations 
which tend in turn to shape the organized 
working class' internal political alignments. 

If our analysis has shown the relative 
autonomy of political struggles in making 
history, it emphatically does not imply a 
“voluntaristic theory" of politics or of social 
change. The original historical opening for 
the left in the working class was provided by 
an extraordinary crisis of American capital- 
. ism, the consequent political upheavals at all 
levels of government—local, state, and fed- 
eral, legislative, executive, and judicial —and 
the unprecedented, primarily spontaneous, 
“labor upsurge” of the 1930s (Brody 1980, 
pp. 103, 130-44). “Such times are rare and 
certainly not of anyone's deliberate making" 
(Piven and Cloward 1977, p. 173). And it 
was in these rare times that "the few 
Communists who had been working in 
factories and mines and shops found them- 
selves, . . . carried like corks riding a flood to 
top positions in a host of unions" (Glazer 
1961, pp. 100-101). 

But the Communists and other radicals of 
the 1930s involved in the leadership and 
organization of the working class were not 
mere "corks riding a flood." They were 
active, self-conscious men and women, and 
they were not merely "riding" but struggling 
to give shape to the sudden social eruption in 
which they were leading participants. This 
social eruption, this "flood," did not deter- 
mine what happened, except in the sense that 
it constituted, as we emphasized at the outset, 
a realm of immanent historical alternatives. 
Which alternative was suppressed or realized 
was determined by the active intervention and 
struggles of real actors (collective and individ- 
ual). Indeed, this realm of alternatives (as we 
have seen) was itself the partial creation, 
within (unknown) objective limits, of what 
these and other men and women already had 
done in the past, as leading participants in 
earlier workers' struggles. 

The balance of political forces in the 
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organized industrial segment of the working 
class was the result, partly revealed here, of 
the political practices of the factions and 
parties contending for the workers' leader- 
ship. These concrete political struggles and 
their consequences (both intended and unin- 
tended) "loaded the historical dice" (to 
borrow Weber's metaphor), and thus deter- 
mined the odds that, at that historical 
moment, the Communists or their rivals 
would win power and trust in America’s 
industrial unions.!7 


EPILOGUE 


On hearing the phrase in this article’s title, 
“how the Communists won power and trust in 
America’s unions,” a colleague quickly sug- 
gested adding “—and abused both.” What 
was the impact of the Communists on 
organized labor during their historically brief 
sojourn at the helm of so many CIO unions? 
Did they abuse the positions of power and 
trust they held in American unions? These 
questions, to say the least, still arouse 
passions 40 years after the expulsion of the 
Communist-led unions from the CIO.!8 

On the eve of the expulsions of the 
Communist-led unions from the CIO, the 
“primary charges” made by liberal and 
left-wing opponents against the Communists 
in organized labor were summed up as 
follows: “First, the turns of these U.S. 
Stalinists from leftward to rightward, and 
back again, have been determined not by their 
judgment of the changing needs of the 
working people, or by pressures from these 
people, but by the changing needs of the 
ruling group in Russia. Second, the ways for 
maintaining power which are habitual with 
the U.S. Stalinists include personal defama- 


17 We have not tried to assess how “unintended, 
unanticipated, or unrecognized consequences of 
action,” in Merton’s well-known phrase, bear on 
our substantive analysis. We have avoided this 
question intentionally, because we cannot address 
it empirically, while signaling our awareness of it 
by an occasional parenthetical phrase such as 
“witting or not.” 

1? Witness, for instance, the debate between 
Theodore Draper (1985a,b,c,d) and several “new 
historians” of American Communism whom he 
charges with “political partisanship” and “histori- 
cal bias.” See also the articles and commentary in 
Zeitlin and Kimeldorf 1984. 
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tion and intrigue, carried, if need be, to the 
point of wrecking a man or a labor union... . 
Third, Communist rule within the U.S. 
unions they control~is dictatorial; although 
they talk the language of democracy they do 
not believe or practice democratic principles. 

. . Fourth, the existence of Communist 
factions, and their lack of independence, is a 
strong deterrent to . . . any genuine leftward 
tendencies of labor in America” (Mills 1948, 
pp. 199-200). 

Implicit in these charges are critical 
political and historical/sociological questions. 
What did the Communists do with their power 
in CIO unions, once they won it? What was 
the comparative impact of Communist leader- 
ship? These questions represent still another 
theoretical question about the relative auton- 
omy of politics: What are the political effects 
of political leadership? How, in this instance, 
does the political consciousness (or ideology) 
of a union’s leadership and the practices it 
inspires affect the achievements of specific 
unions and the entire labor movement? Two 
“primary charges” against the Communists 
can be answered empirically. Was Commu- 
nist “rule” of CIO unions “dictatorial”? 
Were the “needs” of their union members 
subordinated to the "needs" of the Soviet 
regime? Concretely, how different were the 
formal constitutions and actual internal polit- 
ical life of the unions in the Communist and 
other political camps? And how different 
were the contracts and the actual working 
conditions they won? 

Answering these questions — which is the 
aim of our continuing research — will, in turn, 
enhance our understanding of the historical 
meaning of the CIO's expulsion of the 
Communist-led unions (and the subsequent 
dissolution of all but a few of them). What 
were the effects of this expulsion on the 
vitality of organized labor and on its subse- 
quent political power and social achievements 
in American life? Put differently, what did 
this mean for the making—or unmaking —of 
the American working class in our times? 


APPENDIX 1. 
On Measuring "Objective Conditions" 


Since union-specific data do not exist for the 
"objective economic conditions" and demographic 
characteristics of industries in the 1930s or 1940s, 
we had to use U.S. Census industry-level (four- 
digit SIC) data. But many unions lack such data. 
Moreover, we doubt that measures based on 
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industry-level data are reliable and valid measures 
of union attributes. First, the jurisdictions and 
memberships of CIO unions often cut across the 
boundaries of several SICs. Many of the unions 
had to be categorized, on the basis of our best 
estimates, into several different SICs. We assigned 
very rough estimates of industry weights in the 
absence of union-specific data. We examined the 
relationships with and without these weights and 
found no substantively important differences. 
Second, unless an industry is quite homogeneous 
on a given variable, it may not be correct to infer 
from an industry characteristic to the union. For 
example, a union in an industry with a high 
proportion of women may have few women 
members. Similarly, the industry may contain a 
high proportion of small shops, but the union may 
have organized only the largest ones, etc. Third, 
we have many missing cases for the economic and 
demographic variables; the number of unions for 
which we have adequate industry-level data on 
these variables ranges from 20 to 29. 

Withal, we examined the bivariate relationships 
between each of the following industry-level 
economic and demographic variables and the 
union’s political camp: (1a) the industry’s “sensi- 
tivity to depression” (relation of employment level 
to gross consumer income, 1936 dollars) (N= 26); 
(1b) the industry’s ‘level of unemployment (men 
only) (N=29); (1c) the size-distribution of the 
industry's establishments (96 large establishments; 
% of workers in large establishments) (N= 20); 
(1d) the industry's level of concentration (four- 
firm) (manufacturing only) (N=22); (2a) the 


` industry's skill composition (96 skilled craftsmen; 


men only) (N—28); (2b) the industry's sexual 
composition (% women) (N=31); (2c) the indus- 
try's racial composition (% “nonwhite”; men only) 
(N— 28); (2d) the industry's age composition (96 
under 25; men only) (N —28). 

We found no theoretically relevant bivariate 
relationship between any independent variable and 
the union's political camp, with one exception. 
Unions in industries with a high proportion (25 
percent or more) of skilled craftsmen were less 
likely than low ones (under 10 percent) to be in the 
Communist political camp; but the relationship is 
curvilinear. Unions in industries with "medium" 
proportions (10—24.9 percent) skilled craftsmen 
were by far the least likely to be led by 
Communists. That Communist-led unions were 
less likely to be in high-skilled than low-skilled 
industries—given the, AFL's preeminence among 
craftsmen—is what we would expect. But it could 
be misleading because, for instance, a CIO union 
in an industry with a high proportion of skilled 
craftsmen might, nonetheless, have few skilled 
members. These craftsmen could be members of 
one or more long-standing AFL unions. (This was 
true of Mine, Mill and the AFL unions in 
nonferrous mining.) 
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But the question remains: Did the strategy of 
class organization and the consequent political 
practices of the organizers have different political 
Consequences, depending on variations in the 
economic conditions in an industry and/or in its 
internal social composition? That is, was there 
interaction (or specification) among the variables, 
such that the “economic” or “social” conditions in 
an industry determined the measurable effects of 
the political practices? Alas, we can have no 
confidence in the results of our efforts in this 
regard because these variables are at best crude and 
at worst misleading indicators of the actual 
conditions in the industry encountered by union 
organizers. Further, not only are there many 
missing cases, but the number of cases with 
adequate data varies considerably from one 
variable to another. Moreover, the sparsity of cases 
varies at different values of these variables. Such 
unavoidable flaws in the data make an analysis of 
interaction effects worthless. 

Yet, for variables with enough cases, we did 
examine several theoretically relevant logit models 
which also include economic and demographic 
variables. We found that, controlling for these 
variables, the direct effects of the political 
variables did not meaningfully differ from the 
original logit model. The % skilled craftsmen has a 
small negative effect on the odds of Communist 
union leadership; but it does not remain significant 
in a reduced model, that is, one omitting control 
variables lacking significant effects. Unexpectedly, 
the % nonwhite has a small significant negative 
effect. These logit tables are available from the 
authors. 


APPENDIX 2 


On the Appropriateness of Significance Tests and of 
Logit Modeling 


Significance Tests. Although we report signifi- 
cance levels in our tables, we are not convinced 
that tests of statistical significance are appropriate 
for this empirical analysis. In particular, in Tables 
1.a, 1.d, 3.a, and 4.b, we have allowed ourselves 
to think seriously about theoretically salient 


bivariate relationships which fall below the conven- 


tional acceptable .05 confidence level. 

We question whether it is appropriate to use 
significance tests bere to reject a given finding. 
First, the small number of cases (38) in the 
population of CIO unions means that even strong 
relationships could be rejected at the .05 confi- 


dence level. Second, causal relationships are not ' 


uncovered by tests of significance. A large enough 
sample can yield small effects that are statistically 
significant but substantively unimportant (Swaf- 
ford 1980, p. 687). Rather, causal connections are 
revealed by demonstrating, in successive approxi- 
mations to the underlying realities, the existence of 
8 coherent set of theoretically relevant empirical 
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relationships. After providing estimates of the 
direct effects of the political variables, we also 
present a "theoretical model" to illustrate the 
presumed causal connections. Third, this study is 
an exploratory and not a confirmatory study. 
Indeed, it is "designed to find out what was not 
even guessed at before" (see Lipset et al. [1956, p. 
483]). In trying to explore a new, theoretically 
relevant subarea of inquiry, it makes sense to try to 
think through the implications of all theoretically 
salient, empirical relationships. . 

Logit Modeling. Using a logit model may not be 
fully appropriate here because our data do not meet 
all the assumptions for such modeling. In particu- 
lar, a much larger numter of cases than ours (38) is 
necessary for the approximations to be adequate. 
Our N of 38 is far from "asymptotic." 

But the logit formulation provides a convenient 
technique for describing relationships among 
binary events; and the variables in our analysis are 
both theoretically and empirically binary. Further, 
the logit analysis is not meant to “confirm” formal 
hypotheses, but to enhance the credibility of our 
substantive theory. In addition, the results of the 
logit analysis are consistent with the results of a 
conventional examination of contingency tables. 
On balance, then, we have decided to present the 
results. of the logit analysis of independent effects. 
We have not applied a full pathlike model because 
its applicability to our data is especially problem- 
atic. To apply structural equations and path 
analysis to discrete date, we would have to make 
several untenable assumptions. For instance, the 
models developed by Winship and Mare (1983) 
assume a recursive structure, but, again, it is not 
possible to test this assumption. In addition, in 
their "Model Four" (i.e., the model for binary 
variables without an underlying continuum), which 
is the closest to our own situation, it is assumed 
that no measurement error exists. But this would 
be a silly assumption in a sociohistorical analysis, 
especially one based on secondary sources. 
Overall, the use of logit modeling to describe 
independent effects stretches our data, but permis- 
sibly so. But a full pathlike model would force us 
to make assumptions that we are not willing to 
make. 


APPENDIX 3 
On Red Unionism 


Did an industry's structure have an important 
bearing on the success of early Red union 
organizing? The Communists were inveterate 
seekers after the right “objective conditions" in 
which to carry out their activities. So, the TUEL 
and its successor TUUL targeted certain industries 
because they considered organizing them critical in 
the struggle to organize all industrial workers 
(e.g., such core. mass production industries as 
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steel, auto, chemicals). In short, it could be argued 
that earlier Red union organizing had an impact 
precisely where “objective conditions” favored it. 
If so, the apparent indirect effects of earlier Red 
unionism on the chances of subsequent Communist 
union leadership could be spurious. They might 
reflect instead the “objective conditions” of the 
industries at the time the Red unionists originally 
penetrated them. 

Yet Communists often tried to organize indus- 
tries that were neither “strategic” nor characterized 
by objective conditions that might favor industrial 
unionism. As Levenstein (1981, p. 71) empha- 
sizes, "Often their egalitarian impulse led [the 
Communists] to expend inordinate energy on 
organizing those least powerful and least strategi- 
cally placed: tragic cases such as the migrant 
workers, ‘losers’ such as southern textile workers, 
the infinitely replaceable Macy's salesclerks, or 
hospital workers." Howe and Coser (1957, pp. 
257, 272), assessing the Red unionists’ activities 
during the revolutionary “Third Period," make the 
same point. In their view, the Communists bowed 
to Comintern decisions that ignored the real 
situation in the United States. “TUUL leaders and 
members often displayed a heroism and self- 
sacrifice which no amount of political disagree- 
ment should deter anyone from admiring." But 
time after time, in one industry after another, from 
coal mining to textiles, say Howe and Coser, .the 
Reds led workers into disastrous strikes and 
senseless efforts, where the objective conditions 
were heavily, and obviously, against them. 

For these reasons, we doubt that "objective 
conditions" in the extraordinarily diverse indus- 
tries penetrated by the TUEL and TUUL could 
account for the enduring impact of their years of 
Red union organizing. Indeed, Communists went 
on to win the leadership of unions in a broad range 
of entirely different industries, and to lose and win 
in others whose objective conditions appear to 
have been quite similar (also see Glazer 1961, p. 
120). 

Most important, 
TUUL leader Jack Johnstone said, do not organize 
workers. The Red unionists’ decision to try to 
organize an industry, for any reason ("strategic," 
"revolutionary," or “egalitarian impulse"), is 
itself a political act which has its own independent 
political consequences, whatever the existing 
“objective conditions." To assess the effects of 
industry structure on earlier Red unionism will 
require primary historical research on the TUEL 
and TUUL's varying organizing successes and 
failures and on the nature of the industries in which 
they occurred. ; 
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DENSITY DEPENDENCE IN THE EVOLUTION OF POPULATIONS 
OF NEWSPAPER ORGANIZATIONS* 
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A model of density dependence is proposed to explain regularities in the growth — 
and decline of organizational populations. Density—the number of organizations— 
is assumed to be a function of the social processes of legitimation and competition. 
At low density, the model predicts that the legitimation process will dominate and 
will lead to high organizational founding rates and low organizational mortality 
rates. At high levels of density, competition will dominate, and consequently 
founding rates will decline and mortality rates will rise. The model is tested with 
hazard function models using data from nine newspaper populations spanning the 
19th and 20th centuries and covering over 5200 newspapers. Analysis of founding 
rates provides strong empirical support for the model. Analysis of mortality rates 
reveals that the model holds for large populations. 


There appears to be a common pattern in the 
evolution of diverse organizational popula- 
tions over time. Railroads, labor unions, 
banks, and newspapers each emerge as 
organizations in small numbers, increase 
rapidly in population size, and then stabilize 
or decline in numbers (Carroll 1984; Hannan 
and Freeman 1989). This paper investigates a 
model of organizational evolution that ex- 
plains such trajectories by integrating ele- 
ments of institutional and ecological theories 
of organization. The model holds that both 
rates of founding and mortality in organiza- 
tional populations are nonmonotonic func- 
tions of the number of organizations in the 
population (density). We analyze rates of 
founding and mortality in nine populations of 
newspaper organizations in three countries to 
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learn whether the model of legitimation and 
competition (1) fits the data, (2) improves 
over simpler models that can also explain the 
general pattern of population growth, and (3) 
holds cross-nationally. 


LEGITIMATION AND COMPETITION IN 
ORGANIZATIONAL EVOLUTION 


Why does the number of organizations in a 
population typically follow a concave pattern 
of growth and decline? It could be that 
organizational populations exploit ephemeral 
resources, growing rapidly while the resource 
abounds and then declining as it fades. For 
instance, firms that serve immigrant commu- 
nities may flourish during periods of peak 
immigration and then decline in numbers as 
immigration slows and ethnolinguistic popu- 
lations assimilate. Such a process can appar- 
ently account for the history of non- 
English-language newspapers (Park 1929) 
and local life assurance societies (Lehrman 
1986). 

A second possibility is that organizational 
forms embody historically specific social and 
material technologies (Stinchcombe 1965), 
Older populations eventually lose their com- 
petitive edge as new technologies emerge and 
social conditions change. For example, wide- 
spread distribution of electricity apparently 
eliminated populations of many kinds of firms 
such as ice houses and local breweries. 
Changing political conditions and legal rul- 
ings eliminated others such as slave planta- 
tions and opium dens. 


American Sociological Review, 1989, Vol. 54 (August:524-541) 
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A third possibility is that the characteristic 
growth trajectory reflects the operation of 
opposing processes of legitimation and com- 
petition. At low density, growth in numbers 
mainly legitimates a population and the 
organizational form it uses. But when density 
is high relative to resources, increases in 
density mainly strengthen processes of com- 
petition. A model based on this reasoning can 
explain well the historical evolution of 
national labor unions in the United States 
(Hannan and Freeman, 1987, 1988). 

The three scenarios are compatible with 
each other. In fact, the evolution of any 
particular organizational population likely 
depends on all three types of factors. 
Nonetheless, although some rudimentary con- 
trols for resource fluctuations and historical 
obsolescence will be introduced, we concen- 
trate here on the density dependent processes 
of legitimation and competition. We do so 
because these two processes are the main 
mechanisms of change in two of the most 
vigorous strands of contemporary organiza- 
tion theory: institutional theory (Meyer and 
Scott 1983) and ecological theory (Carroll 
1988; Hannan and Freeman in press). Our 
effort can thus be seen at a more general level 
as a synthesis of theoretical perspectives 
commonly held to be in opposition to each 
other (DiMaggio and Powell 1983). 


A MODEL OF DENSITY DEPENDENCE 
IN VITAL RATES 


The pattern of population growth described at 
the outset— gradual rise in numbers followed 
by a rapid rise and then stabilization and 
possible decline —can arise from many differ- 
ent processes. It is helpful for understanding 
the model we use to begin with a simpler, 
classic model that is also consistent with the 
main features of the growth trajectory de- 
scribed at the outset. By far the best-known 
model of growth of populations in limited 
environments is the logistic (Verhulst) model. 
One way to motivate the logistic model is as a 
compound growth process: 


dN/dt — rN, 


where the growth rate, r, is defined as the 
difference between the birth rate and the 
mortality rate, say r = ry — rg. Suppose that 
the birth rate falls linearly with density (N), r, 
= dg — a,N, and that the mortality rate rises 
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linearly with density: rg = bg + bN. Then 
substituting these expressions into the model 
of compound growth yields 


dN/dt = (ag — bg)N 
aap (a, +b,N’. 


If, as is usually plausible, the birth rate 
exceeds the death rate at low density, this 
model produces logistic population growth. 

Numerous related models without density 
dependence can produce similar qualitative 
patterns of growth. For example, a model of 
compound growth in which the growth rate 
changes exponentially over time implies 
growth trajectories that are quite similar to 
those of the logistic (Tuma and Hannan 1984, 
Chapter 14). So both a model of linear 
density dependence in vital rates or a model 
of time dependence without density depen- 
dence can explain the gross features of the 
growth trajectories we observe.! 

The model proposed by Hannan (1986) 
holds that growth in organizational popula- 
tions is more complicated than the logistic 
model. Instead of linear dependence on 
density, this model posits that density depen- 
dence in rates of founding and morality is 
nonmonotonic. The model builds on the 
assumption that processes of legitimation and 
competition for limited resources shape rates 
of founding and failure and that these 
processes are sensitive to density. 

How does the legitimacy of an organiza- 
tional population vary with its numbers? 
Institutional theory stresses legitimacy as 
taken-for-grantedness rather than legality or 
official approval (Meyer and Rowan 1977; 
Meyer and Scott 1983). That is, an organiza- 
tional form is legitimate to the extent that 
relevant actors regard it as the "natural" way 
to organize for some purpose. From this 
perspective, rarity of a form poses serious 
problems of legitimacy. When few instances 
of a form exist, it can hardly be the “natural” 
way to achieve some collective end. On the 
other hand, once a form becomes prevalent, 
further proliferation is unlikely to have much 
effect on its taken-for-grantedness. Legiti- 


! [t is worth pointing out that the models we 
estimate below allow the rates to vary monotoni- 
cally with the age of the population so that density 
dependence can be discriminated from this form of 
time dependence. 
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macy thus grows monotonically with density 
but at a decreasing rate.? 

In the case of competition, effects of 
variation in the upper range are especially 
strong. When numbers are few, adding an 
organization to the population increases the 
frequency and strength of competitive interac- 
tions slightly if at all. But when density is 
high, further growth exacerbates competition 
{adjusting for availability of resources). In 
other words, growing density increases com- 
petition at an increasing rate. 

We use a model based on functional forms 
consistent with these qualitative predictions. 
Hannan (1986) argued that the founding rate 
of an organizational population is propor- 
tional to the legitimacy of the organizational 
form and inversely proportional to the level of 
competition within the population: 


A(f) = « LIC, 


where A(t) is the founding rate, L is the level 
of legitimacy, C is the level of competition, 
and « is a function summarizing the environ- 
mental conditions that affect the rate. 

According to the arguments outlined above, 
organizational density increases legitimacy at 
a decreasing rate and increases competition at 
an increasing rate. Hannan (1986) assumed 
the particular parametric relationships: 


L= N", 
C 


0<a,<1, (2) 


(3) 


exp[a2N?], do $9 0: 


"where N denotes density. 

Replacing L and C in equation 1 with the 
functions that relate legitimation and compe- 
tition to density (equations 2 and 3) gives a 
model for the effect of density on the 
founding rate: 


A(t) = x N” exp[ — a2N?], 


O<a, <1; 


(4a) 
(4b) 
According to this model, the founding rate 


a, > 0. 


? We recognize that the conceptions of legitima- 
tion offered here are broad and that sociologists 
commonly use other, different formulations. For 
the most part, we do not think that these 
differences represent challenges to our arguments 
about the effects of density but rather suggest 
possible refinements in interpreting these effects. 
Hannan and Freeman (1989) have reviewed and 
compared several conceptions in greater detail. 
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has a nonmonotonic relationship with density: 
initial increases in density increase the 
founding rate but further increases eventually 
depress the founding rate. The lack of 
monotonicity reflects the complexity of growth 
in organizational populations. 

Parallel arguments hold for the mortality 
rate—but with the rate proportional to C and 
inversely proportional to L. This assumption, 
along with the assumptions in (2) and (3), 
leads to models of density dependence with 
the form of (4a) but the expected signs of the 
coefficients in (4b) reversed. Hannan and 
Freeman (1988) had difficulty in getting 
maximum likelihood programs to converge in 
estimating this model. So they used instead a 
slightly different model? 


p) = B exp[biN + bjN?], (5) 


where u(t) is the mortality rate, B summarizes 
the effects of enviranmental conditions, and 
the coefficients of density are expected to 
obey the restrictions bj < 0 and b, > 0. 
Models with the form of (5) fit better than (4) 
for both the founding rates and mortality rates 
of newspapers.* So in analyzing foundings, 
we estimate models of the form:? 


>There are two ways of motivating this 
alternative model. The first is to assume that 


legitimation is an exponential function of density. 


The second is to assume that legitimation is a 
log-quadratic function of density with positive 
first-order effect and negative second-order effect. 
Such a function implies that legitimation eventu- 
ally declines at high densities, which is not 
consistent with our argument. However, if we 
restrict attention to the range over which the 
log-quadratic relationship is positive, this model 
gives a potentially useful approximation to the 
relationship assumed in (2). We think that the 
second motivation is a more useful one in the 
present context. 

^ Hannan and Freeman (1987, 1989) report that 
models with the form of (4a) fit better than those in 
the form of (5) for fcunding rates of American 
national labor unions and for rates of entry of firms 
in the semiconductor industry. The two models 
differ little in terms of substantive conclusions. For 
the most part, they are simply alternative ways to 
develop parametric specifications of the argument. 

5 [n order to focus on the central issues of the 


~ paper, we drop here the notation for the functions 


summarizing environmental conditions. 
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X(t) = exp[aiN + a2N?], (6a) 
and hypothesize that 
a, > 0;a, <0; | a, | > |a |. (6b) 


In analyzing mortality, we estimate models of 
the form 


w(t) = exp[biy + bN?] (7a) 
and hypothesize that 
by <0; b»0 |b| > |b]. (7b) 


In empirical testing the key issues are 
whether these nonmonotonic models improve 
over simpler models with only monotonic 
(e.g., linear) dependence of each rate on 
density (which are generally consistent with 
the shape of the observed growth trajectories) 
and whether the inequalities implied by the 
theory, restrictions 6b and 7b, hold. A 
secondary but important issue is whether the 
point of inflection falls within the observed 
range of density. 

. This modeling strategy, which assumes that 

processes of competition and legitimation can 
be usefully represented as effects of density 
on vital rates, has obvious advantages and 
disadvantages. One possible disadvantage 
follows from its indirect nature—it does not 
rely on measurements of competition and 
legitimacy. Rather, it infers the operation of 
these processes from estimates of parameters 
of models of density dependence. But the 
strategy allows us to analyze and compare the 
dynamics of very different kinds of organiza- 
tional populations; it also facilitates cross- 
national comparisons. Ease of application to 
dissimilar populations and to similar popula- 
tions in different countries facilitates learning 
about the generality of the dynamic pro- 
cesses. 

This modeling strategy also allows us to 
study long time periods. Data on density and 
vital events can be collected for complete 
populations over long historic periods for 
which it would be extremely difficult—if not 
impossible—to reconstruct detailed and subtle 
measures of changing levels of legitimacy and 
local competition. Gaining a longer perspec- 
tive conveys strong advantages in attempts to 
understand the social processes that have 
shaped . populations of organizations and 
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increases the value of organizational research 
to work on macrosocial change. 


PREVIOUS RESEARCH ON 
DENSITY DEPENDEN CE 


Hannan and Freeman’s (1987, 1988) analysis 
of national labor unions in the United States 
over the entire history of the population, 
1836-1985, strongly supported the model of 
legitimation and competition. In analyses that 
controlled for differences among historical 
periods and for numerous environmental 
conditions, the: nonmonotonic model im- 
proved significantly over monotonic models 
of density dependence for both founding rates 
and mortality rates. The point estimates 
agreed with the predictions of the model for 
both rates. 

Prompted by these findings, researchers 
applied the model to other kinds of organiza- 
tional populations. So far several studies have 
yielded estimates of density dependence 
supporting the model of legitimation and 
competition. These are Hannan and Free- 
man's (1989) analysis of rates of exit from 
the population of American semiconductor 
manufacturers during 1945-1980, Barnett and 
Carroll’s (1987) analysis of rates of founding 
and mortality of independent local telephone 
companies in several Iowa counties during 
1900-1917,5 and Mitchell’s (1987) study of 
rates of entry into the medical diagnostic 
imaging industry during 1959-1986. In addi- 
tion, Tucker, Singh, Meinhard, and House's 
(1988) analysis of foundings of voluntary 
social service organizations in Toronto from 
1970 to 1982 provides limited support. While 
Tucker et al. (1988) report that the predicted 
pattern of density dependence holds (Table 
8-1, second equation), they conclude that this 
pattern holds only during increased public 
funding. 

Results of several other analyses run 
counter to the model. Hannan and Freeman's 
(1989) analysis of rates of entry into the 
semiconductor manufacturing industry reveals 
that density had a monotonic positive effect 
on the rate..Delacroix, Swaminathan, and 
Solt’s (1989) analysis of exits from the 
population of California wine producers from 


© The effect of density is significant only for 
their largest subpopulation, mutual telephone 
companies. 


528 


1940 until 1985 shows at best weak support 
for the model. And Tucker et al. (1988) 
report that density dependence in rates of 
mortality is nonmonotonic but with signs 
opposite the predictions of the model. 

How might the discrepancy in findings be 
explained? One of the main differences— 
perhaps the primary one—between these 
more recent studies and the Hannan-Freeman 
union studies concerns time frames. The 
Hannan-Freeman data cover the history of a 
population from its emergence to the present 
day, a period of almost 150 years. The other 
studies cover periods of only 40 years 
(semiconductor manufacturers), 17 years (tele- 
phone companies), 45 years (wine produc- 
ers), 27 years (diagnostic imaging manufac- 
turers), and 13 years (voluntary social 
services). Moreover, only the semiconductor 
and diagnostic imaging studies can claim 
without debate to have information on the 
.early history of the population. Excluding 
data on the formative period is particularly 
problematic for studying the legitimacy com- 
ponent of the density model since strong 
effects of density are predicted in this early 
period. There is no reason to expect that these 
effects should be observable at late stages of a 
population's development, because the model 
predicts that competition dominates at this 
point. A similar argument could be made 
about exclusion of the later phases of 
population evolution (whether by research 
design or by the censoring imposed by current 
time) as it pertains to the competition process. 


DESIGN OF THE NEWSPAPER STUDIES 


In view of the differences among previous 
studies, it is crucial to analyze organizatonal 
populations for which complete histories can 
be coded over long periods. Populations of 
newspaper publishing organizations are attrac- 
tive for this purpose. Newspaper publication 
leaves dated material products of direct 
interest to historians who have catalogued 
many. publishing histories. In addition, direc- 
tories compiled for advertisers have been 
published yearly since about 1870. 

Analysis of populations of newspaper 


7 While they do report estimates of two models 
whose first- and second-order effects of density 
agree with the model (Table 3, Equations 3-5 and 
3—6), estimates of many of other models show 
different, inconsistent effects, 
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publishers is also attractive because historical 
research on these industries suggests that 
processes of legitimation and competition 
have played important roles in shaping them. 
In order to place our analysis in institutional 
context, we briefly review what is known 
about such processes in the case of the 
American industry. 

Legitimation. The earliest American papers 
had trouble staying open unless they received 
the explicit blessing of authorities. The first 
known newspaper, Publick Occurrences Both 
Foreign and Domestick, published only one 
issue' in 1690 before its owner was jailed for 
printing “the truth as he saw it" (Emery and 
Emery 1984). The next paper, John Camp- 
bell’s Newsletter, began publishing in 1704 
under the condition that all copy be approved 
by the governor. Later postmasters continued 
publishing newspapers, noting that they were 
circulated "by authority." In 1721 James 
Franklin's New England Courant challenged 
the notion of official sponsorship. Although 
his efforts landed him in jail, his paper lasted 
five years and had a profound influence on 
the development of an independent press. By 
1785, ‘all 13 colonies had witnessed the 
emergence of newspapers, most of which 
were private enterprises. 

Failure rates of the early newspapers were 
extremely high. Of the more than 2100 papers 
founded prior to 1320, more than half are 
estimated to have closed within two years 
(Emery and Emery 1984). Although financial 
difficulties accounted for most failures, indi- 
cating a lack of full consumer acceptance, 
there were also legal, political, and civil 
difficulties. In particular, charges of libel 
could result in jail sentences or bankruptcy. 
Not until at least 1805 did legislation allow 
the introduction of evidence as to the truth of 
an alleged libel as legitimate defense (Mott 
1962). Previously courts had not usually 
allowed such evidence and recognized the 
principle, "the greater the truth, the greater 
the libel" (Emery and Emery 1984). 

Until 1830, American newspapers were 
generally of two kinds: mercantile papers, 
which reported commercial news, and parti- 
san papers, which printed political news and 
opinions and which usually received funding 
from political parties and candidates. Both 
kinds of enterprises were typically one-person 
operations. The emergence of the so-called 
Penny Press (named after its retail price) in 
the 1830s marked the beginning of indepen- 
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dent, mass-based newspapers (Schudson 
1978). The sales and advertising income 
which accrued from their broad social base 
(or niche) allowed these papers to gain 
independence from political parties. In edito- 
rial policy, the penny papers were conscien- 
tiously nonjudgmental: accounts of trials and 
the like were often published verbatim from 
the official records. Although such actions 
were called sensational by the established 
press of the day, at least one scholar credits 
the Penny Press with inventing the concept of 
“news” (Schudson 1978). By the late 19th 
century, newspaper reporting had become a 
profession, and press clubs were started in 
many cities, The informational component of 
journalism increasingly stressed fairness, ob- 
jectivity, and dispassion (Schudson 1978). 

Each of these developments led to in- 
creased acceptance of the newspaper, its 
taken-for-granted right of existence, and its 
value as a source of information. Yet the 
process by which this occurred was slow and 
gradual, with fits and starts. For instance, 
duels between publishers and their editorial 
targets were not uncommon until at least the 
mid-19th century (Mott 1962). Challenges to 
a free press during later periods were more 
common in the newly settled areas of the 
South and the West. While national events set 
the stage for legitimation, the battle was 
fought afresh in new local industries on the 
expanding frontier. 

Competition. Journalism history of the 19th 
century abounds with accounts of bitter 
newspaper rivalries. Yet from the perspective 
of organizational mortality, competition inten- 
sified in the 20th century. The best data on 
competition and consolidation in the newspa- 
per industry concern English-language, gener- 
al-circulation dailies (Rosse, Owen, and 
Dertouzos 1975; Emery and Emery 1984). 
The number of these newspapers grew from 
850 in 1880 to a peak of 2200 immediately 
prior to World War I, then declined to about 
1750 between World Wars I and II. It has 
remained roughly stable since then, with 
periodic small fluctuations. This national 


trend masks an underlying process of severe ` 


local competition. Within cities, there has 
been a steady downward trend toward a single 
daily. Among all cities with dailies, the 
percent with two or more competing dailies 
has fallen from 61 percent in 1880, to 57 
percent in 1910, to 20 percent in 1930, to 1.9 
percent in 1981. The usual explanation for the 
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“evolution of one newspaper cities" relies on 
economies of scale (Rosse 1978, 1980; 
Bogart 1981; Hgyer 1975). Once a newspaper 
achieves a size advantage over its local rivals 
(for whatever reason), it becomes extraordi- 
narily difficult for the smaller competitors to 
survive. In the long run, the equilibrium 
outcome of the process triggered by this 
condition is a local monopoly. Exceptions to 
the rule usually occur in only the largest 
markets or where the competitors manage to 
differentiate by serving different markets or 
developing special appeal to certain submar- 
kets. Thus, while the daily markets have 
consolidated, there has been a concomitant 
upswing in the viability of specialized news- 
papers (Emery and Emery 1984; Carroll 
1987). Density-dependent competitive pro- 
cesses appear to operate both within the 
overall market of an area and within its 
submarkets. 

Data. We analyze data fom: studies that 
attempted to define and enumerate ‘the 
complete historical populations of newspapers 
in Argentina, Ireland, and seven metropolitan 
areas in the United States. Most countries, 
including Argentina and Ireland, have news- 
paper industries that*are essentially national 
(Bagdikian 1971): By contrast, American 
newspapers: depend primarily on local mar- 
kets, usually consisting of a metropolitan 
area. So we used different geographical units 
to define populations. For Argentina and 
Ireland, we used the nation-state and its 
political boundaries; but for the United States 
we used standard metropolitan statistical areas 
(SMSAs). Carroll (1987) studied seven 
SMSAs: San Francisco-Oakland-San Jose 
(CA), Little Rock (AR), Springfield (MO), 
Shreveport (LA), Elmira (NY), Lubbock 
(TX), and Lafayette (LA). 

For each locality, archival sources were used 
to code records on complete historical popu- 
lations of newspapers. The cornerstones of this 
effort were compilations of library newspaper 
holdings prepared by councils of librarians 
(British Museum 1905; British Library 1975; 
Brigham 1947; Lathem 1972; Gregory 1937) 
and annual directories of advertising agencies 
(Rowell various years; Ayer various years). 
Area-specific sources were also used when 
available (Fernandez 1943; Galvan-Moreno 
1944; Anonymous n.d.; Daggett 1939; Tinker 
1932; Grose 1977; Wheeler 1973; Taft 1964; 
Louisiana Historical Records Survey 1941; His- 
torical Records Survey Program 1941; Histor- 
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Table 1. Summary of Nine Organizational Populstions of Newspapers 





Approximate End of 

Location Beginning Date Observation 
San Francisco- 

Oakland-San Jose 1845 1975 
Argentina 1800 1906 
Ireland 1800 1975 
Little Rock 1815 1975 
Springfield 1835 1975 
Shreveport 1840 1975 
Elmira 1815 1975 
Lubbock 1890 1975 
Lafayette 1835 1975 





ical Records Survey 1942). A more complete 
discussion of these sources and their quality 
can be found in Carroll (1987). 

These sources provide extraordinary cover- 
age. In most instances, a fairly complete his- 
torical population has been assembled, with 
the exception of Argentina, where the sources 
found cover only the 19th century. The coding 
procedures recorded every newspaper leaving 
a trace in the historical records. When avail- 
able, the exact dates of a founding and demise 
were coded. In other cases, a paper’s appear- 
ance and disappearance in a periodical direc- 
‘tory were used to infer the approximate, usu- 
ally annual, dates of founding and closure.® 
Table 1 summarizes the data. 

A measure of density was constructed for 
each population. Because of imprecision in 
some dates, this variable was calculated on an 
annual basis. It counts the number of 
newspapers recorded as existing at any time 
during the year. Figure 1 displays fluctuations 
of density from 1800 to 1975 for each of the 
nine populations. Six populations exhibit the 
concave pattern of growth and decline 
discussed at the outset. Of the other three 
populations, one (Argentina) is interrupted by 
a lack of data at what would appear to be its 
approximate peak, assuming that it follows 
the general pattern. The other two (Lubbock 
and Lafayette) have apparently been too 


5 The sources usually give dates based on the 
life histories of newspapers rather than publishing 
firms. Most of the time there is a strong 
correspondence between the two although occasion- 
ally there may not be. For Argentina and Ireland, 
we had no additional information and thus were 
forced to use the dates as given. For the American 
cities, we often had additional information about 
dates of events. 


Total No. of 
Newspapers Number 


Peak Curvilinear 


Date of Peak Temporal Trend 


D 


2169 395 ^ 1916 Yes 
1453 125 1898 No 
996 224 1914 Yes 
220 43 1909 Yes 
161 32 1894; 1895 Yes 
108 26 1932 Yes 
55 14 1891 Yes 
26 12 1961; 1963; No 
1972; 1973 
19 5 1928 No 


sparsely populated to show any kind of 
discernible pattern. 

As Table 1 and Figure | indicate, the nine 
populations differ considerably in when their 
first member appears. We constructed for 
each population a variable called “population 
age” by calculating for each spell the time (in 
years) elapsed from the date of its first 
recorded newspaper to the beginning of the 
spell. Although it is somewhat crude, we add 
this variable to our models to control for the 
possible historical obsolescence of the news- 
paper organizational form (see Hgyer 1975 
for a similar argument). This specification 
also allows us to control for general temporal 
trends in population growth. As explained 
earlier, time-dependent population growth 
represents a major alternative to density 
dependence in explaining logistic-like growth. 

Carroll and collaborators coded the history 
of political turmoil in Argentina, Ireland, and 
San Francisco.? They found that political 
turmoil coincides with increased foundings 
and that papers founded in turmoil-ridden 
periods have higher mortality rates (Carroll 
and Delacroix 1982; Delacroix and Carroll 
1983; Carroll and Huo 1986). In terms of the 
theoretical scenarios introduced earlier, polit- 
ical turmoil might therefore be considered ari 
ephemeral resource that sustains a higher 


? Histories of political turmoil were coded using 
standard historical sources for the three localities. 
In Argentina, much of this turmoil involved 
struggles between interests in Buenos Aires and 
those in outlying rural provinces. In Ireland, much 
of the strife revolved around the long-term battle 
with Britain for independence. In San Francisco, 
turmoil was commonly associated with labor issues 
and ethnic violence. The smaller cities were not 
studied because of lack of detailed source material. 
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carrying capacity of newspapers. So we use 
as a.control a dummy variable that indicates 
years of turmoil for these three areas (the only 
ones for which the time series of political 
turmoil has been produced). 

We also use the number of foundings and 
the number of closures in the immediately 
preceding year as controls for short-run 
variations in environmental conditions. On 
the basis of previous findings, we use 
quadratic specifications of the effects of these 
variables (Delacroix and Carroll 1983; Carroll 
and Huo 1986; Hannan and Freeman 1987). 
Models with these variables also allow the 
separation of the effects of transitory changes 
in density (due to foundings and closures in 
the prior year) from the effects of long-term 
(presumably more systematic) changes in 
density. This separation may be important 
because Delacroix, Swaminathan, and Solt 
(1989) have argued that the effect of 
transitory population changes dominates that 
of density for business organizations. 


METHODS 


In modeling the founding process, we take the 
population to be the unit of analysis and treat 
foundings as events in a point process for the 
population (Cox and Isham 1980; Amburgey 
and Carroll 1984; Amburgey 1986). Follow- 
ing Hannan and Freeman (1987), we use the 
durations between events in a population to 
estimate effects of covariates on the founding 
rate (the rate of transition from M foundings 
to M+1). Because we do not have any a 
priori hypothesis about the nature of the time 
dependence of the founding process, we use 
Cox’s (1975) proportional hazard model 
which allows parametric specification of 
effects of covariates and nonparametric treat- 
ment of the effects of time (since the previous 
founding). 

Specifically, the model for the rate of 
founding at historical time t given that the 
previous founding occurred at time f, is 


Alt | t) = h(t—t) expleiXi(t) +... 
+ Xtal, t 0t, (8) 
where A(t—t,) is an unspecified baseline 
hazard and the X(t,)'s are covariates evaluated 
at ty. 
For mortality, we analyze the histories of 
individual organizations (the unit of observa- 
tion is the organization rather than the 
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population). In this case, we do have a 
hypothesis about age dependence. Following 
Carroll (1983), Freeman, Carroll, and Han- 
nan (1983), and much subsequent research, 
we use the Gompertz specification of mono- 
tonic age dependence in mortality rates (the 
“liability of newness”). This model for 
mortality rate of the jth organization at age r* 
is 


wr) = exp[diX,Q*) +... 
+ d,Xj(t*)] exp[dor*] (9) 


where f* denotes the organization’s age, do 
measures the effects of age dependence, and 
the X(r*)'s are covariates whose values are 
evaluated at the beginning of the calendar 
year in which the organization reaches age f*. 

Methods. Estimates of models for founding 
rates are obtained by analyzing the intervals 
between events using partial likelihood (PL) 
(Cox 1975). The observations are the dura- 
tions between the foundings in the popula- 
tion.!° When more than one founding occur 
within a year, the durations are allocated 
proportionally. All but the last of the spells 
for each locality end with an event; the last is 
censored on the right at the end of the 
observation period. In order to dea] with 
temporally changing covariates, durations 
that exceed one year are split into yearly 
segments so that covariates. such as density 
can be updated to current values. Because all 
covariates are defined at the population level 
and are treated as constant within years, the 
artificial ordering created by the assignment 
of founding dates within years of multiple 
foundings does not pose a problem. PL 
estimates for founding were obtained by using 
the routines in BMDP (Dixon 1981). 


10 A reviewer suggested that the risk set is not 
well-defined in such analyses. This is not so. The 
time dimension in this analysis is duration 
(between point events). In constructing a risk set, 
we order episodes by duration. Consider the ith 
episode. The risk set in analyzing this episode is 
the set of all episodes with equal or greater 
duration. PL estimates the contribution of the 
covariates to explaining the probability that this 
episode was “closed” before any of the others in 
the risk set. In substantive terms, this approach 
assumes that the population is the unit at risk. For 
further discussion, see Delacroix and Carroll 
(1983) and Hannan and Freeman (1987; 1989, 
Chapter 8). 
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In estimating models of mortality, we use 
maximum likelihood (ML) procedures. The 
observations are the lengths of observed 
newspaper lifetimes and an index that distin- 
guishes censoring on the right from mortality. 
In order to allow values of the covariates to 
vary over tbe lifetimes, we broke each 
newspaper's history into a sequence of yearly 
spells with all but the last spell censored on 
the right. Age of newspaper, age of popula- 
tion, and the values of density and other 
covariates are updated at the beginning of 
each year for each newspaper. However, all 
covariates are treated as constant within 
years. ML estimates for mortality were found 
by using Tuma's (1980) special purpose 
program RATE. 


RESULTS 


Foundings. Table 2 contains PL estimates of 
models for founding rates. For all nine 
populations, density has the predicted non- 
monotonic effect. Consistent with the predic- 
tions in (6b), the estimated coefficients of the 
first-order terms are positive and those of the 
second-order terms negative. The absolute 
magnitudes of the estimated coefficients agree 
with predictions as well (the decimal must be 
adjusted on the coefficient associated with the 
second-order terms). Moreover, for six of the 
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nine populations, the estimates of both of the 
density terms differ significantly from zero as 
well. Overall, the model of legitimation and 
competition improves significantly over the 
simpler model that underlies the logistic 
growth process. 

Age of the population has a consistent 
negative effect on the founding rate, suggest- 
ing that entrepreneurial activities decline as 
the populations grow older. Political turmoil 
has positive effects, consistent with previous 
research and with the view that selection in 
newspaper populations is partly shaped by 
larger political changes. The measures of 
population dynamics occasionally have statis- 
tically significant effects. However, no con- 
sistent pattern is apparent in the estimates for 
either of them. 

Just how strong is the effect of density? 
The plots in Figure 2 show the predicted 
effects of density on the baseline rates across 
the observed ranges of the density variables 
(enclosed by the vertical dotted lines). While 
the size of the effect varies considerably 
among populations, in every instance the 


‘predicted maximum effect of density falls 


well within the observed range. We conclude 
that the founding process for newspapers is 
typically density dependent and that it be- 
haves in a manner consistent with the'model 
of legitimation and competition. 


' 


Table 2. Partial Likelihood Estimates of Effects of Density and Population Dynamics on the Rate be Newspaper 


Founding (Standard errors shown in parentheses) 








Popula- Prior Prior 
Density?/ tion Political Found- Found- Prior Prior 





Location Density 100 — Age Turmoil ings ings? Closures Closures? x? D.F 

San Francisco- .033* —.005*  —.047* .138* .012 —.0001 —.050* .001* 1272. 8 

Oakland-San Jose  (.002) (.0003)  (.002) (.005) (010) (0002) (.010) (.000) 

Argentina .053* — .030* .001 1.03*  —.157* .003* .170*  —.003* 870. 8 
(.009) (.005) (004)  (.070 (024) (.001) (.025) (.001) 

Ireland .025* —,007* — .006* -567* .227* —.008* .003  —.002 557. 8 
(.003) (.001) C001) (070) (029)  (.002) (.030) (.002) 

Little Rock .271* ~.491* —,.024* 142) —.024 —.166 .050* 342. 7 
(.040) (.080) (.004) (109)  (.013) (.089) (.013) 

Springfield .285* —.484* — .035* — 162 .036* —.197 .029* 179. 7 
(.055) (.144) (.006) (133)  (.017) (.112) (.010) 

Shreveport .306* —,848* —.019* .184  —.007 — 011 —.004 32.3 7 
(.088) (.284) (.006) (175)  (.032) (.205) (.046) 

Elmira .566*  —2.97 —.015* 1.08* —.326 —.376 .137 35.0 7 
(.234) (1.66) (.006) (457)  (.198) (.458) (.170) 

Lubbock .483 —3.76 — .004 —.546 .301 —.220 3.83 6 
(.302) (2.06) (.029) (1.09) (.702) (.619) 

Lafayette 295  -194 002 872 7154 4 
(.982) (21.7) (.009) (.648) 





* p<.05 (two-tailed test). 
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Mortality. Table 3 reports estimates of the 
models of organizational mortality. Ir this 
case, the pattern of results depends strongly 
on the size of the population. The first-order 
effect is negative and the second-order effect 
is positive for six of nine populations But 
both the first-order and second-order e:fects 
of density are statistically significant only for 
the three largest populations: San Francisco- 
Oakland-San Jose, Argentina, and Ireland. 
For the six smaller populations, only the 
second-order effect differs significantly from 
zero and then only for two cities, Springfield 
and Lubbock. So it appears that the model of 
legitimation and competition holds only in 
large populations of newspapers. We discuss 
possible reasons for the weaker (and occasion- 
ally reverse) effects for the other locelities 
below. ` 

The estimated effect of organizational 
aging is negative and significant in all cases, 
which indicates a liability of newness. Age of 
the population also consistently has a signifi- 
cant negative effect, suggesting that aging of 
a population of organizations means some- 
thing more than just historical obsolescence 
of the form (otherwise age of the population 
would have a positive effect). The indicators 
of population dynamics often fail to have 
significant effects. However, when -hese 
effects are significant, the estimates show the 
same pattern. For both prior foundings and 
prior closures, the first-order terms are 
positive when significant and the second- 
order terms negative. 

In Figure 3 we show plots of the predicted 
density effects for the three larger pooula- 
tions. Here the predicted inflection points of 
density dependence fall close to the observed 
peaks. Surprisingly, this suggests that legiti- 
mation processes have dominated competition 
processes in affecting mortality over most of 
the study periods. Still, the significance af the 
second-order term suggests that competition 
is an important part of the evolution of these 
populations of organizations. 


DISCUSSION 


We began by identifying three general 
theoretical scenarios that might accoun: for 
the pattern in the rise and decline of 
organizational populations. These included: 
(1) the exploitation of ephemeral resou-ces, 
(2) the obsolescence that comes from the 
historical specificity of organizational forms, 
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and (3) the density-dependent processes of 
legitimation and competition. Our analysis of 
nine newspaper populations focused on the 
last of these, although we attempted to control 
for the other two explanations. 

The evidence we reviewed suggests that the 
density-dependent processes of legitimation 
and competition are powerful explanations of 
organizational evolution. Even after control- 
ling for ephemeral resources and historical 
obsolescence, organizational density consis- 
tently had nonmonotonic effects of the kind 
predicted. For foundings, the expected pattern 
was found for all nine populations, although it 
was statistically significant in only six. For 
mortality, the predictions were upheld in six 
of the nine populations, but statistical signifi- 
cance for both relevant coefficients was found 
only in three. Comparison of these findings 
across the two types of processes suggests 
that nonmonotonicity in density dependence 
characterizes founding processes more consis- 
tently than mortality processes. The pattern of 
effects suggests that legitimation affects both 
founding rates and mortality rates strongly, 
but that competition has more powerful 
effects on founding than on mortality. 

The nonsignificance of the density esti- 
mates for some of the populations might be 
explained by a variety of factors. First, note 
that the estimates for the larger populations 
are always significant and in the predicted 
directions but those for smaller populations 
are sometimes not. This pattern suggests that 
density dependence might operate only in 
populations that become large and crowded. 
Second, the smaller populations operate in 
small metropolitan areas. Since these areas 
are the most likely to be penetrated by outside 
media and newspapers, local density may not 
be the proper measure for these populations. 
Third, several of the smaller populations 
began later than the others. Their develop- 
ment may have been truncated by the research 
design and before the competition process has 
come to dominate. Yet San Francisco's 
population (for which the model works well) 
was also a later starter. Fourth, the lack of 
significance is most common for the U.S. 
localities where the predicted initial effect of 
density is to lower mortality rates via 
increased legitimacy. Undoubtedly, some— 
and perhaps the most important —legitimacy 
battles were fought at the national rather than 
the local level. If so, this would suggest that 
many local American populations began with 
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substantial—although varying (depending on 
date of initiation) —levels of legitimacy. 

Analysis of the smaller populations in this 
study therefore suffers from design problems 
that might plausibly explain the poor fit of the 
model of density dependence for these 
populations. So, as with the studies reviewed 
earlier that exclude data on the formative 
period (or a later period), we conclude that 
the mixed evidence from analysis of the small 
populations does not compromise our argu- 
ments seriously. We think that the evidence 
from studies of full histories of populations 
(and on processes of legitimation and compe- 
tition) generally supports the model of 
nonmonotonic density dependence. More- 
over, the processes appear to be consistent 
across disparate types of organizations and 
national contexts. Documenting such a nonob- 
vious empirical pattern provides strong sup- 
port for the theoretical arguments we have 
advanced. It also presents a challenge to those 
who subscribe to other theoretical viewpoints. 
While other explanations for the nonmono- 
tonic effect of density on both rates may be 
forthcoming, we have not learned of any 
other parsimonious explanations of the pat- 
tern. 

The empirical standing of the density 
model has important theoretical implications. 
As we noted at the outset, processes of 
legitimation and competition constitute the 
dominant mechanisms of change in two 
leading perspectives on organizations, institu- 
tional theory and ecological theory, respec- 
tively. The density model (and the reasoning 
behind it) integrates these two theories but 
with the twist of recasting institutional 
processes at the population level. This shift in 
level is warranted, we think, by the some- 
times contradictory pressures of institutional 
processes at different levels of analysis. For 
instance, when a small publisher challenges 
accepted notions of freedom of speech, the 
action may be financially devastating for the 
firm and force it to close. But the institutional 
consequences of the action may very well 
benefit publishers who come later and attempt 
to do exactly the same thing which had 
previously been considered unacceptable. 
Separating the effects of processes at the 
different levels of analysis involved in 
organizational research makes it easier to 
build systematic theory. Moreover, the inte- 
gration of population ecology and population 
institutional theory yields a unified model of 
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the evolution of organizational populations. 
We think that this is a significant achievement 
for organizational sociology. 

The population dimension of the model 
examined here brings us back to our opening 
puzzle, namely, the sources of the long-term 
pattern of growth, stability, and decline 
observed for many organizational populations 
including newspapers. The rate estimates 
reported here can be used to project popula- 
tion trajectories, although the task is not 
trivial. While the founding rate functions are 
estimated at the population level, the mortal- 
ity rates are for individual organizations. So 
to predict population change, the mortality 
rates must be aggregated and then compared 
to the founding rates. But when a population 
is heterogeneous, aggregating means sum- 
ming over the individual cases at risk of 
mortality after conditioning for age and 
covariates. We have done this for the various 
components of the model (e.g., density) in 
order to learn which factors can account for 
the various aspects of the population trajecto- 
ries. 

The results of these projections show that 
density alone can explain the growth and 
stability parts of the trajectory, but not the 
eventual decline. The estimated nonmono- 
tonic density dependence estimated here 
produces roughly logistic growth to a stable 
equilibrium. It is clear that some other 
process is needed to explain eventual decline 
in numbers. Among the variables examined 
here, population age is the most powerful in 
this respect. The negative effect of age of the 
population on the founding rate exerts a 
gradual but accelerating downward pull on 
population size. So, even though the density 
model performs impressively, a number of 
important questions about growth trajectories 
in organizational evolution remain to be 
addressed. 
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COMBINING INSTITUTIONAL THEORY 
AND POPULATION ECOLOGY: NO 
LEGITIMACY, NO HISTORY! 


(Comment on Carroll-Hannan, ASR, 
August 1989) 


LYNNE G. ZUCKER 
University of California, Los Angeles 


Amid signs of increasing rapproachement between 
the two newest areas in organization theory 
(Carroll and Huo 1986; Zucker 1987a and b; 
Hannan and Freeman 1984; Carroll and Hannan 
` 1989), a number of major differences in both 
theory and method have emerged that threaten 
convergence. These differences in some cases are 
simply matters of emphasis and non-overlapping 
development that is to be expected in two areas 
that derive from disparate intellectual traditions. In 
Carroll and Hannan’s preceding paper, “Density 
Dependence in the Evolution of Populations of 
Newspaper Organizations,” however, the two 
major areas of divergence result in inadequately 
specified tests of institutional theory: (1) use of 
“legitimacy” as an unmeasured construct to 
explain increasing rates of organizational found- 
ing, along with “competition” to explain decreas- 
ing rates (with the opposite relationships holding 
for exits of organizations); and (2) the omission of 
historical context from models of founding. These 
raise broad issues of research strategy and of 
appropriate analytic models. 

It is the spirit of increasing convergence, and not 
competition, that motivates me to write this 
comment. Population ecology, as an area, has been 
productive of insights and research; Glenn Carroll 
and Mike Hannan, along with John Freeman and 
Nancy Tuma, share the honors for the early 
development of one branch of the paradigm. Thus, 
I offer this comment in the spirit of community 
development of important intellectual traditions, 
and as a friendly amendment—not a mean-spirited 
reply—to their article, which as usual in their 
path-breaking research makes a number of impor- 
tant contributions to our understanding of organi- 
zational populations, especially in the area of 
measured density dependence. What it doesn’t do, 
however, is move the merger with institutional 
theory ahead, since the variables of interest to 
institutionalists, though promised, are simply 
missing. 


MODELING UNMEASURED VARIABLES 
In the Carroll and Hannan paper, theoretical 


l Kazuo Yamaguchi provided detailed com- 
ments and notes on my initial critique of the 
method. Comments by Art Budros and Pamela 
Tolbert were also important in shaping the 
argument. 
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connection between population ecology and insti- 
tutional theory rests on hypothesized relations 
between organizational births/deaths and two 
major theoretical variables: legitimacy and compe- 
tition. Increasing legitimacy, early in the develop- 
ment of an organizational population, is asserted to 
increase density (defined as the number of 
organizations) by causally producing high organi- 
zational founding rates and low organizational 
mortality rates. Later, when the organizational 
population is well-established, having reached a 
point of “high density,” competition is expected to 
decrease density by causally producing low 
founding rates and high mortality rates. As a 
theoretical formulation, this meets the test of 
parsimony and is intuitively appealing. 

But in their article there is no real test of these 
formulations. Neither legitimacy nor competition 
is measured; they are replaced in equations with 
functions of density, defined as “the number of 
organizations in the population.” There are several 
problems with this approach. First, no prior tests of 
the relationship of either legitimacy nor competi- 
tion to density have been conducted. At least for 
legitimacy, available evidence suggests a steep 
rather than a gradual rise, closer to zero/one than 
the gradual increase suggested here (e.g., Tolbert 
and Zucker 1983). At one point, Carroll and 
Hannan appear to support this view: “once a form 
becomes prevalent” it is legitimated, with “further 
proliferation . . . unlikely to have much effect on 
its taken-for-grantecness." Although Carroll and 
Hannan might argue that the task of measuring 
legitimacy and competition over so many organiza- 
tions over a long time period is not feasible, some 
empirical grounding of their constructs is necessary 
for an initial test. A second problem is that neither 
of the theoretical relationships, even with the 
substitution of density for legitimacy and competi- 
tion, is actually tested, but rather simplified 
models that seem to fit the data better (Equations 6 
and 7). 

There is no evidence presented in their analysis 
that either supports or refutes their claims that 
legitimacy increases with the number of organiza- 
tions in the population, but at a decreasing rate.? 
However, there is some evidence, at least for 
organizational mortality rates, that suggests com- 
petition (as measured by "density") may not be a 
sound explanation of the findings. Note that 


? Legitimacy may also retain importance later in 
an industry's develcpment. For example, decreas- 
ing legitimacy after just six years of activist 
intervention by animal rights activists in the 
Netherlands the fur industry had virtually disap- 
peared (Los Angeles Times, April 30, 1989): 
"Nearly 400 furriers have dwindled to just 32. 
. . . Since 1982, retail fur sales have dropped 
more than 90%. . . . No department store even 
carries fur anymore and hardly anyone wears it." 
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density is not related to any specific spread over a 
given geographic region, but rather the total 
number of organizations across highly varying 
geographic units. Carroll and Freeman provide a 
number of plausible explanations for the weak 
findings for the smaller geographic areas, but they 
fail to note that these smaller geographic areas 
should theoretically involve the most intense 
competition, since they are tightly bounded 
resource arenas. As they put it “When numbers 
are few, adding an organization to the population 
increases the frequency and strength of competitive 
interactions slightly if at all. But when density is 
high, further growth exacerbates competition 
(adjusting for availability of resources).” Yet the 
number of organizations in existence alters organi- 
zational mortality least reliably in the smaller 
geographic units, where the direct competition 
argument is in fact most plausible. For the six 
cities, there are no consistent significant effects of 
density on the rate of organizational mortality, and 
the effects are the reverse of what is expected in 
half of the cities (Table 3; Figure 3). 

Further, density dependence is defined within 
populations, not between them. Newspapers may 
at one stage be competing within population (with 
other newspapers), but at a later stage be 
competing for both consumers and advertisers with 
radio, news magazines, and television news, none 
of which is entered into density as measured by 
Carroll and Hannan. Even at the end of the time 
period in question, the advertising share of 
newspapers is continuing to drop as a proportion of 
total advertising expenditures, whereas radio and 
television continue to increase. Just between 1960 
and 1975, the proportion of advertising expendi- 
tures for television and radio increased from 
roughly 19 to 27 percent of the total; newspapers 
experience about a 3 percent drop over the same 
period. 

Finally, there is no necessary relationship 
between nonmonotonic form and the opposing 
processes of legitimacy and competition. The 
nonmonotonic relationship is not “non-obvious,” 
but can be predicted by a number of other 
formulations. This same effect might be produced 
by any external shock to the system at the 
beginning of the founding process. This shock 
might take the form of new inventions or major 
scientific discoveries, as in biotechnology, or may 
involve political and/or social changes. In the case 
of newspapers, changes in transportation, creation 
of national markets, and other broad economic 


3 Geographic boundaries may shift across his- 
toric time; marked changes in the population of 
potential readers (literate population) occur. Changes 
in the number of potential local advertisers also are 
not controlled. For the U.S., Census data available 
from ICPSR in machine-readable form can provide 
good population estimates for the full period. 
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changes might create more of a demand for timely 
and accurate information, though in small satellite 
cities this demand may not be met by a small local 
newspaper. Initially, the external shock pushes the 
founding rate up sharply and it may continue to 
rise as entrepreneurial talent is trained at preexist- 
ing newspapers and initial success spurs additional 
capital investment. Then the founding rate gradu- 
ally decays. Further, it is likely that an external 
shock that increases births will also decrease exits 
from the population; shocks that decrease births 
will also tend to increase deaths, though not so 
sharply because it is relatively more difficult to 
discard organizations, even when they are “perma- 
nently failing” (Meyer and Zucker 1989). 

In fact, Carroll and Hannan appear to ignore. 
evidence of an exogenous shock that might explain 
the apparent tipping point in the roughly logistic 
curves shown in Figure 1. Among the three more 
complete histories, with an average of 135 years in 
the series, the date of the peak number of 
newspapers varies by only an average of 12 years. 
Two of three peak at about the same time. The 
smaller city samples show more variance, though 
four of the six also peak at around 1900 
(Shreveport in its first of two peaks). The San 
Francisco area may be exhibiting the William 
Randolph Hearst effect, but I leave it to Glenn 
Carroll, with his detailed knowledge of newspaper 
history, to identify the most plausible historical 
events. 


TIME DIMENSIONS: WHAT HISTORY? 


Most of the weight of the Carroll/Hannan argument 
rests on the founding equations, since the majority 
of the equations on organizational mortality do not 
demonstrate significant effects of density. Problem- 
atic aspects of their analysis arise primarily from 
the particular model they adopt. The model is 
defined by equation (8), as follows: 


Alt) = ACt~f,) exp 
KIGE. . eX t9] £1 


where t is the general notation for historical time 
and f£, is the historical time of the n-th birth. 

This model assumes that every time. interval 
between births must be independent, except for 
effects modeled by the covariates.* Implicitly, 


* The application of Cox partial likelihood to the 
data of organizational birth intervals: as a set of 
duration data implies that the authors assume a 
modulated renewal process (Kalbfleisch and Pren- 
tice 1980:185-86), with a common baseline hazard 
function across spells such that the duration of 
multiple spells are identically and independently 
distributed, controlling for covariates they enter 
into the model. The assumption of conditional 
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Carroll and Hannan appear to view organizational 
births as similar to births of children, where society 
is the mother who only bears one child at one time 
and who repeats the birth process all over again 
after each birth. The society (region, city) is at risk 
for one birth, and, like a mother, when one 
“child” is born, the hazards start all over again at 
time zero. This would be a plausible model of 
organizational births only if actors in the society 
(region, city) who were planning to found a firm, 
and/or had already accumulated resources to found 
a firm, all canceled their plans and started over 
from nothing at each time another firm (from that 
same population of firms) was born in that society. 
In reality, actors found firms, and there is always a 
probability of multiple actors planning to found a 
firm at any given historical moment. Competing 
events such as the founding of newspaper A and 
newspaper B may occur at a certain historical time, 
either of which can occur earlier. 

The model further assumes that the time 
between births is the major time dimension, not 
historical time. But time between births has little 
substantive meaning in an organizational context. 
Thus, h(t— t,) from Equation 8 does not appear to 
be a meaningful baseline hazard function. Simply, 
it assumes that a change in the timing of one birth 
shifts all the other births in the future; very small 
changes in t, cause major changes in the hazard 
rate. Under this assumption, if an organizational 
birth happens to occur in the society (region, city) 
in March instead of February, the whole society/ 
city changes its structure of risk at this moment in 
a very systematic way such that one month delay 
or acceleration in the timing of birth always makes 
the same change in the underlying structure of 
baseline hazards for the rest of the historical 
periods, regardless of when in historical time the 
birth occurs. For example, if the founding of Firm 
A is delayed for three months, from February to 
May, then all succeeding foundings are subject to a 
baseline hazard that likewise shifts three months 


independence among spells of birth intervals that 
was implicit in the model ànd becomes explicit in 
the use of Cox's technique indicates that Carroll 
and Hannan assume that the historical segments 
given as birth intervals of organizations are 
independent in determining the hazard rate of a 
birth, controlling for a few covariates that they 
include in the model. Such a conditional absence 
of interdependence among historical segments is 
quite unrealistic, because it implies the absence of 
any other factors that are common among some 
segments and influence the hazard rate of organi- 
zational birth. Technically, the interdependence 
among spells may be captured in part by including 
correlated error terms across spells as Flinn and 
Heckman (1982) advocate for the analysis of 
repeatable events. Then the model would require a 
method of estimation other than Cox's method. 
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(e.g., baseline risk of one-month shifts from 
March to June). A 

If an historical period has an independent effect 
on the time interval, as I have argued above for the 
period around 1900, then the model is misspeci- 
fied. The hazard rates, as defined by Equation 8, 
do not depend on historical time itself; there is no 
direct dependence on t, over and above those that 
are characterized by the effects of covariates 
measured at fa. Yet Carroll and Hannan are 
analyzing exceptionally long periods, punctuated 
by major events such as the industrial revolution 
and invention of radio and television. There is also 
major growth in both the population of consumers 
and the number of wealth of advertisers. Thus, 
assuming that covariates measured at /, can 
adequately capture all of the information that the 
history can provide on the hazard rates of 
organizational births is very unrealistic. 


IMPLICATIONS 


The strong claims made for institutional theory, 
regrettably, are not supported by any empirical 
measure of legitimacy. I think it is highly unlikely 
that increasing legitimacy accounts for more than a 
modest initial increase in the rate of founding in 
most organizational populations, though I admit it 
is tempting to claim more. In any case, we know 
no more about the effects of legitimacy on 
founding after reading the Carroll and Hannan 
paper than before. It is population ecology, 
specifically theories of population density, defined 
as the total number of organizations in a 
population, that is being explored. In order for 
there to be an ongoing dialogue between popula- 
tion ecology and institutional theory, there must be 
a serious attempt to develop both, not just an 
attempt to borrow from one as a "gloss" for 
theoretical lacunae in the other. 

We also have little new information on some of 
the basic questions that elicited this study. Why do 
new firms get founded? Is it because they are seen 
as legitimate, or is it because they provide an 
economic opportunity (through capital investment 
or employment) for those who found them? Now 
population ecology is confronted with the pathos of 
institutional arguments (DiMaggio 1988): where is 
the actor? Who founds an organization, and why? 

Organizational births are seldom independent, 
and they are more often altered by historical time 
than by the time elapsed between births. Thus, the 
occurrence of a previous birth cannot be regarded 
as a substantively meaningful starting point of the 
risk period for the next period, contrary to the 
assumptions made by the model specified in 
Equation 8 by Carroll and Hannan. The model of 
founding is unrealistic at best, and possibly 
seriously misspecified. If the historical period has 
an independent effect, any delay or acceleration 
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will change the whole function associated with 
historical time. 

Again, a brief comment on the reasons behind 
this reply. Despite- these critical observations, 
earlier research in both the population and 
institutional traditions has given strong impetus to 
rapproachement of the two newest theoretical 
approaches to organizations. Realizing that con- 
structing such a bridge is not easy, I would be the 
first to admit that Glenn Carroll and Mike Hannan 
make an important contribution to the research on 
organizations simply by allowing us to debate 
issues that pave the way for further development. 
They have some significant and provocative ideas 
that now need to be tested using actual measures of 
legitimacy and competition, in more realistic 
models of the founding process. 


Lynne G. Zucker is Associate Professor of 
Sociology and Program Director for Organizational 
Research at the Institute for Social Science 
Research, both at UCLA. Her current research 
interest include empircally grounded integration of 
sociological and economic theories, development 
of formal structures in which action is embedded, 
and modeling of time-distributed future intended 
actions, such as intentions to leave one’s current 
employer. Recent publications include an edited 
book, Institutional Patterns and Organizations: 
Culture and Environment (Ballinger, 1988) and 
Permanently Failing Organizations (Sage, 1989), 
coauthored with Marshall W. Meyer. 


ON USING INSTITUTIONAL THEORY IN 
STUDYING ORGANIZATIONAL 
POPULATIONS 


(Reply to Zucker, ASR, this issue) 


GLENN R. CARROLL 
University of California at Berkeley 


MICHAEL T. HANNAN 
Cornell University 


‘Zucker claims that the model of density depen- 
‘dence in organizational evolution makes inappro- 
priate use of institutional theory. Although the 
comment makes several methodological criticisms, 
it primarily takes issue with out interpretations. 
The main complaint is that we test the implications 
of a theory of competition and legitimation for the 
relationship between density and rates of founding 
and mortality. Zucker prefers that we estimate the 
effects of direct measures of legitimacy and 
competition on these rates. 

We argue that our use of institutional theory 
reflects accurately its core ideas and that success in 
testing the model of density dependence does not 
hinge on direct measurement of legitimation and 
competition. Moreover, we show that our research 
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Strategy conforms to accepted practice among 
researchers working within the institutional perspec- 
tive. 


SPECIFYING INSTITUTIONAL PROCESSES 


The underlying rationale for the density, model 
derives from assumptions about general sociologi- 
cal processes of legitimation and competition. 
Density drives both processes, though in different 
ways. Density increases legitimacy at a decreasing 
rate but increases competition at an increasing raté. 
Legitimation increases founding rates and de- 
presses mortality rates. Intensification of competi- 
tion lowers founding rates and heightens mortality ` 
rates. Taken together, these arguments imply that 
the legitimation process dominates in a region of 
low density while the competition process domi- 
nates at high density. According to the model we 
use, the relationships between density and the rates 
have specific nonmonotonic forms. Founding rates 
rise and then decline as a function of density. 
Mortality rates drop and then increase with rises in 
density. Empirical testing focuses on these.straight- 
forward, falsifiable predictions. 

Zucker claims that this formulation yields 
"inadequately specified tests of institutional the- 
ory' because legitimation and competition are not 
measured directly. Our article defends this feature 
of the model on the grounds that it preserves 
comparability of procedures and findings across- 
organizational populations and historical contexts. 
Indeed, in studying newspapers it allows us to use 
data from three countries over two centuries. 

The comment gives the impression that we have 
ignored a well-codified approach to direct measure- 
ment of legitimation. But this is hardly the case. In . 
a recent review article on institutional theory, 
Zucker (1987a: 447) herself points out, "Because 
of the ‘social fact’ quality, indicators of institution- 
alization are more indirect than, say, measures of 
resource dependence" and that "the empirical 
work examines, the theory piecemeal, seldom 
testing the causal predictions." 

Consider, for example, Zucker's (1987b) find- 
ing that hospitals adopting innovations late had 
lower rates of mortality than those adopting early. 
She explains this difference as the result of the 
innovation becoming institutionalized over time. 
The logical structure of the argument resembles 
ours: the relationship between two: observable 
variables is explained by the process of legitima- 
tion, which is not measured directly. 

Of course, current practice alone does not justify 
a research strategy. For this reason we included a 
historical overview of the newspaper industry, 
intended to provide “empirical grounding" for the. 
model. We sought to establish the plausibility of 
the argument that legitimation drove the early (low 
density) evolution of the population while compe- 
tition dominated in the later (high density) period. 


546 


The critique does not take issue with this historical 
account. 

The key research design question is whether it 
makes sense to try to measure legitimation directly 
in research on the evolution of organizational 
populations (as neither ecologists nor institutional- 
ists have yet done). We do not think so. A direct 
measure of legitimation as taken-for-grantedness 
requires learning whether individuals take a 
particular organizational form for granted. Leaving 
aside the obvious problems of reliability and 
validity in asking people questions about things 
they may take for granted, such a research strategy 
limits severely the possible historical scope of 
organizational studies. 

A less direct approach identifies one or more 

idiosyncratic events thought to be associated with 
legitimation of particular historical instances of an 
organizational form. For example, Singh, Tucker, 
and House (1986) use listing a social service 
agency in a charitable registry as a measure of 
legitimation that is specific to the organizational 
form and the period of study. This indirect 
approach has some appeal. Yet we worry that it 
may be too tempting to choose such indicators post 
hoc. 
. We are interested in developing and testing 
general arguments, ones that apply to all kinds of 
populations in all kinds of contexts. Reliance on 
conceptions and measures of legitimation that are 
idiosyncratic to populations and time periods 
impedes comparative analysis, which is essential 
for evaluating claims to generality. 

Moreover, it may well be that processes of 
legitimation and competition can be .studied 
systematically and comparatively only by testing 
implications of theories about the relationships 
between other observables. If so, it makes sense to 
concentrate on observables that can be evaluated 
over long periods and in many contexts, such as 
density and vital rates. That is, the substantive 
problem may require what seems to be an indirect 
approach. 

This point is perhaps easier to comprehend for 
competition (although the same logic applies to 
legitimation). Other disciplines, such as economics 
and population biology, have long relied on 
indirect evidence about competition processes. 
Competition is defined as the process whereby the 
expansion of one unit (firm or population, 
depending on the application) diminishes the 
viability and growth rates of others. Evidence 
about competition typically involves showing that 
growth of one firm or population negatively affects 
the relative size of another. Such an effect is often 
summarized in one or more parameters of an 
estimated model (similar to what we have done). 
Overtly "conflictual" behavior, which economists 
often refer to as rivalry rather than competition, 
may or may not correspond with competition. 
Simmel made a similar point in distinguishing 
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conflict, a social relation, from competition. 
Whether two or more social actors dependent on 
the same limited resources recognize that they are 
competitors is an interesting question. But the 
absence of such recognition surely does not mean 
that competitive constraints cease to operate. 

Human ecology has also relied on a research 
strategy of analyzing the implications of competi- 
tion, and we see our work as following in this 
tradition. We suspect that institutional theory is 
moving in this direction as well in studying 
legitimation. Zucker's (1987a) review notes that 
“research on causes of institutionalization has been 
eclipsed by study of its consequences or outcomes 
(p. 449).” 


METHODOLOGICAL ISSUES 


In testing one aspect of the model, we specified the 
population as the unit at risk for founding. The 
organization observed to be founded cannot be the 
unit since “non-events” (the absence of foundings 
in some period) are as important as observed 
foundings from the perspective of testing theories 
about founding rates (for further discussion see 
Delacroix and Carroll 1983; Hannan and Freeman 
1987, 1989). So, we treat foundings as a 
realization of a point process for the population, as 
a flow of events in continuous time. 

The comment rejects this reasoning and suggests 
that the appropriate unit is the potential founder. In 
the simplest case, tke potential founders are all 
individuals. Then the risk set consists of all 
members of the relevant social system. Narrowing 
the set further potentially creates selection bias. 
Following this proposal would require collecting 
information on the millions of Argentines, Irish 
and American city dwellers who did and did not 
start newspapers since 1800. Furthermore, attempt- 
ing to specify the set of actors at risk in this way 
would require considering itinerant printers who 
can and do migrate to a city or nation and begin a 
newspaper. Ánd, nct all potential founders are 
individuals: partnerships and collectives start 
newspapers. So one must consider all pairs, 
triplets, and so forth, meaning that the size of the 
risk set quickly becomes impractically large. 

Fortunately, these issues are problematic only to 
those who want to study theories that specify 
which individuals, collectives, and corporate 
actors start organizations. As long as attention is 
focused on the types of macrosociological issues 
we treat, nothing is lost by considering the 
population to be the unit. 

The comment also gives the impression that we 
ignored historical time. In fact, we used two time 
scales in analyzing each outcome, one tracking 
historical time. For foundings the dominant time 
scale is the waiting time between foundings, the 
interarrival time; the secondary time scale is 
historical time (time elapsed since the first 
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founding). In analyzing mortality, the dominant 
time scale is age of organization; the secondary 
time scale is again historical. In each case, we 
represent short run time variations in rates in terms 
of the dominant time-scale and long-term varia- 
tions in terms of the historical time scale. This 
approach allows us to combine local contingency 
and historical trend. 

Zucker prefers that only historical time be used 
and claims that “the time between births has little 
substantive meaning in an organizational context.” 
We disagree. We think that it makes good 
sociological sense to assume that events are 
contingent. The dependence among events may be 
either positive, as would be the case with processes 
of imitation and contagion, or negative, as would 
be the case if there were a finite number of 
business licenses or printing presses. Analyzing 
social processes in terms of interarrival times does 
indeed have substantive meaning because it allows 
us to represent such dependence in a natural way 
(Coleman 1981; Hannan 1989; Olzak 1989). 

In arguing against the use of interarrival times in 
analyzing foundings, Zucker also claims that a 
change in one time of occurrence affects the rate 
for “the rest of the historical periods, regardless of 
when in historical time the birth occurs.” This 
claim may confuse those unfamiliar with event 
history analysis. The rate or hazard summarizes the 
local behavior of the flow of events in time (in any 
chosen time scale). If one could intervene and 
change the timing of one or more events, this 
would change the hazard ipso facto. This is not a 
special feature of our research. Moreover, use of 
historical time appears to be more sensitive than 
use of interarrival times to misclassification of one 
or more event times, although this depends on 
exactly how historical time is used. In any case, 
the tradeoffs between the two approaches are much 
more subtle and complex than the comment 
indicates. 


EXPLAINING EFFECTS OF DENSITY 


Zucker’s boldest claim is that nonmonotonic 
density dependence is “not non-obvious” and that 
it “can be predicted by a number of other 
formulations.” To illustrate, she sketches a story 
intended to show how obvious is the pattern we 
study. She suggests that any external shock can 
cause organization founders to rush in, build 
organizations at a rapid rate, and then dissipate. 
As stated, this story concerns time dependence. 
It has nothing to do with density dependence— 
much less nonmonotonic density dependence. 
Zucker’s confusion on the difference between 
density dependence and time dependence suggests 
that our argument may not be so obvious. 
Discerning density dependence in vital rates is 
not a simple matter. Nor is discriminating density 
dependence from time dependence without a model 
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and good data. The difficulties are well illustrated 
by the course of similar research in demography. 
There, despite the well-known patterns of logistic 
population growth, a major debate has raged on for 
at least 15 years as to whether human fertility is 
density dependent, and if so exactly what is the 
functional form of density dependence (Lee 1974, 
1987; Frauenthal and Swick 1983; Tuljapurkar 
1987; Wachter 1988). The pattern underlying the 
dynamics turns out to be far from obvious. 

Our article concludes by noting that the 
estimated model can explain the patterns of growth 
and stabilization in numbers but not the decline 
from a peak before stabilization. Zucker suggests 
we should try to identify specific “exogenous 
shocks” in the periods where density peaks to 
explain the downturn. We disagree. Our current 
research focuses instead on general explanations of 
the decline from the peak. A recent paper (Carroll 
and Hannan 1988) finds strong and consistent 
evidence that density at time of founding has a 
persisting positive effect on organizational mortal- 
ity rates above and beyond the contemporaneous 
effects reported above. This is so for the three 
large populations of newspapers as well as for 
American labor unions and brewing firms. Such 
effects may account for some portion of the 
observed population declines and may generate 
population cycles as well. 

We have also continued to collect life-history 
data on other organizational populations in order to 
test the generality of the density model. One of 
these, the historical population of American beer 
brewers (studied over the period 1633-1988), 
experienced an extraordinary external shock, the 
national Prohibition. Yet the predicted pattern of 
density dependence in rates of founding and 
mortality is strong and statistically significant 
(Carroll and Swaminathan 1989). The pattern also 
holds for founding rates (the only outcome yet 
studied) of banks in New York City during 
1790-1980 and American life insurance companies 
during 1759-1937 (Ranger-Moore, Holl, and 
Hannan 1989). 

Far from finding the density results obvious, we 
are not yet ready to rule out the possibility that we 
are wrong about the population pattern or the 
processes that generate it. However, we continue 
to encounter positive empirical support. The latest 
of which we are aware is Aldrich et al.’s (1989) 
study of density dependence in mortality rates of 
business interest associations. 

So it appears that we may be on to something 
important. And, given that density rarely figured 
as a variable in previous organizational studies, we 
believe that this is a new contribution. 
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Japanese women participate in the labor force at rates similar to women in 
Western industrial nations, but gender stratification patterns are sharper. Women 
in Japan are less apt to work as employees, a tendency that increases with age. 
Likewise, female employees tend to shift from larger to smaller firms across the life 
cycle, whereas male employees do not. These aggregate patterns imply that 
Japanese women are seldom placed in career-track positions in large firms early in 
their careers. Analyses on labor market entry data from the 1984 “Survey on Work 
Patterns” substantiate this view. Although Japanese men and women enter large 
firms at equivalent rates upon leaving school, 22 percent of men and only 7 percent 
of women enter career ladders. The majority of women enter large firms as clerical 
workers, three-quarters of whom are in low-level "assistant clerical" positions. 
Causal processes governing entrance to large firms and career tracks are 
examined in the paper, with particular attention to the relative returns to different 


GENDER STRATIFICATION IN CONTEMPORARY URBAN JAPAN* 


levels and types of education for Japanese men and women. 


INTRODUCTION 


In recent years, Western social scientists have 
produced many historical and sociological 
studies of Japanese industrial relations. This 
body of research has contributed to a greater 
understanding of the origins and functioning 
of the Japanese “permanent employment 
system.” But these studies have focused 
almost exclusively on the small proportion of 
the labor force actually covered by permanent 
employment policies: male full-time workers 
in large firms. 

The rate of female labor force participation 
in Japan roughly parallels that of Western 
industrial nations, yet the role played by 
women in the economy remains largely 
uninvestigated. Researchers routinely cite the 
low status of women in the Japanese 
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workplace (Clark 1979; Cole 1979; Rohlen 
1974) and link it inferentially to the perma- 
nent employment system. The permanent 
employment system refers to the internal 
labor market structure of the large firms in the 
economy, where workers are hired directly 
upon graduation from school and move up 
career ladders through principles of seniority 
and merit.! Observers of the Japanese econ- 
omy allege that women do not have access to 
these career ladders, and case studies of work ` 
organizations support this view. But organiza- 
tional case studies leave a number of 
questions unresolved. 

First, we do not know what proportions of 
women enter large firms. Second, we have no 
estimates of the number of women who gain 
access to career ladders in these firms. Thus it 
is unclear whether Japanese women are 
excluded from the large-firm sector of the 
economy or, alternatively, participate heavily 
in this sector but are in non-career-track 
roles. Third, we do not know the extent to 
which exclusion from career ladders is based 
on human capital considerations (e.g., sex 
differences in education and ability) or on 


! The terms career ladder and career track are 
used synonymously throughout the paper to 
indicate a systematic progression of hierarchically 
ranked jobs in one firm. This term is used in 
preference over internal labor market, which has a 
somewhat broader meaning. See Althauser and 
Kalleberg (1981) and Osterman (1984) for discus- 
sions of internal labor markets and career ladders. 
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other considerations such as discriminatory 
hiring and job assignment practices. 

This paper investigates these questions. 
Since little has been written about Japanese 
women’s status in the economy, I first use 
aggregate data to give an overview of gender 
stratification patterns in Japan and other 
industrial economies. I also examine the 
divergence in men’s and women’s work 
patterns in Japan across the life cycle. 
Attitudinal data and ethnographic information 
illuminate processes of sex discrimination in 
Japanese recruitment and hiring. In the 
second half of the paper I present a microlevel 
analysis of labor market entry using data from 
the "Survey on Work Pattems" for two 
cohorts of urban Japanese men and women. I 
estimate the relative proportions of men and 
women who enter large firms upon leaving 
school and the proportions who enter career- 
track positions. Finally, I examine the 
variables governing these processes for men 
and women, to assess whether the returns to 
family background, ability, education, and 
other characteristics differ for the sexes. 


JAPANESE WOMEN'S PARTICIPATION 
IN THE ECONOMY 


As shown in Table 1, Japanese women 
exhibit a labor force participation rate similar 
to that of women in Western industrial 
nations. With 49 percent of adult females in 
the labor force, Japan stands between the high 
rates of North America and Scandinavia and 
the somewhat lower rates of Western Europe. 
However, an examination of the relative 
patterns of men's and women's participation 
in the economy shows Japan to be an outlier 
among industrial economies.. 

Among industrial economies, Japanese 
women make up the lowest proportion (36 
percent) of paid workers relative to men. In 
Table 2 I show how the sexes are distributed 
by employment status (employee, self- 
employed, and family enterprise worker) in 
various industrial economies. Compared to 
other nations, Japan displays the largest gap 
between the proportions of men and women 
who work as employees. The sex distribution 
of the self-employed is fairly even in Japan; 
but 20 percent of women work as family 
enterprise workers as opposed to only 3 
percent of men. In short, the comparatively 
low rate (67 percent) at which Japanese 
women work as employees in complemented 


AMERICAN SOCIOLOGICAL REVIEW 
Table 1. Female Labo: Force Participation Rate in 





Industrial Economies 
Japan 48.6 
United States 52.7 
Canada 54.3 
West Germany (1984) 41.1 
England (1984) 37.4 
France (1984) 43.1 
Norway 59.8 
Sweden 68.1 
Denmark 59.0 
Australia 46.2 


Notes: (a) All figures are for 1985, except where 


(b) Figures are calculated as (total number of 
women in the labor force/total female population age 15 
and above) X 100, with the following exceptions: For 
England, the denominator is the total female population 
(thus rendering the figure relatively low); for Norway and 
Sweden, the denominator is the total female population 
age 16 and above. 

(c) Countries in Tables 1—3 were chosen on the 


basis of geographical representation and comparability of 
data. 


Source: International Labor Organization (1986); 
Japanese Ministry of Labor (1987). 


by the high rate of unpaid family enterprise 
workers in small family-run businesses or 
farms. Even in the nonagricultural popula- 
tion, Japanese women exhibit a much higher 
rate of family enterprise employment than 
women in other countries. Although Japanese 
men exhibit a somewhat higher rate of such 
employment than men in other nations, the 
magnitude of the difference in much lower 
than for women. 

As shown in Figure 1, the proportion of the 
Japanese female labor force who work as 
employees is highest at young ages (20-24) 
and declines monotonically except for a slight 
increase in the age-group 40—44, when 
women's childcare responsibilities typically 
decrease. In short, even among working 
women, the proportion of paid employees 
decreases sharply with marriage and childbear- 
ing in the late 20s and early 30s and does not 
again approach the earlier level. Although not 
shown here, data for 1970, 1975, and 1980 
indicate that this is largely an age rather than 
a cohort effect. While the proportion of 
employed women in each age-group who 
work as employees has increased over time, 
the shape of the curve remains the same: large 
numbers of women move from employee to 
nonemployee status after age 25. These data 
may reflect conflict between women's family 
responsibilities and the demands of paid 
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Table 2. Employment Statuses of Workers in Industrial 





West Germany 88.1 4.8 7.0 


Economies 
Family 
Em- Self- ^ enterprise 

: ployee employed worker Other 

Japan 67.2 12.5 20.0 — 
] (78.9) (179 | (2.8) _. 

Nonsgricuitural 74.3 12.0 13.5 -— 

population only (83.9) — (13.8) (0.9  — 

United States 93.6 5.7 0.8 — 

89.8) (10.1) (0.2) — 

Canada (1984) 91.2 6.8 1.9 — 

(88.7) (10.9) (0.4) _ 

England (1980) 96.0 3.8 — 0.2 
88.0) (10.0) — Q.1) 

France (1975) 83.2 9.3 7.5 — 

: (81.2) (17.5) (1.2) — 

Norway 90.1 3.9 36 24 
(84.5) (13.0) (0.9 | (1.7) 

Sweden 95.3 4.3 0.4 — 

(90. 4 (9.4) (0.2) — 

Denmark 92.0 3.1 4.9 — 

(85.3) (14.5) (0.2) — 

Australia 80.7 11.0 0.5 7.9 


(76.5) (15.8) (0.3 0.4) 


Notes: (a) Upper figures are for women; figures in 
parentheses are for men. 

(b) All figures are for 1985, except where 
indicated. 

(c) "Other" signifies workers whose status 
was unknown. The table excludes workers 
classified as unemployed. 

Source: International Labor Organization (1986); 
Japanese Ministry of Labor (1987). 


labor, or reflect employer discrimination as 
women reach marriageable age (Japanese 
women’s mean age at marriage is currently 
25.5 years). Both explanations likely come 
into play. 

Comparable figures for men (also in Figure 
1) show that men’s rate of labor force 
participation as employees declines slightly 


with age. But while women employees . 


evidence- a 26-percentage-point drop between 
ages 20-24 and 35-39, men show only a 
7.5-percentage-point drop. 

Aggregate data provide clues to the limited 
opportunities for female employees to work 
on a long-term basis in large firms and to 
benefit from the high wages and promotion to 
managerial ranks that such firms offer. 

Data in Table 3 show that young men and 
women employees (ages 20-24), who have 
generally just left school and entered the labor 
market, have very similar rates of entrance to 
work organizations of different sizes. In fact, 
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Source: Japanese Ministry of Labor (1987). 
Figure 1. Proportion of Female and Male Labor Force 
Working as Employees, by Age Group (1985) 


female employees are’ slightly underrepre- 
sented in the smallest firms and overrepre- 
sented in the largest ones. But men’s and 
women’s representation in firms of different 
sizes changes dramatically in the older 
age-groups. By age 45-49, women’s partici- 
pation rate in small firms in one and a half 
times that of men. Men’s participation rate in 
small firms remains extremely stable across 
age-groups. Conversely, women age 45-49 
participate in large firms at a rate approxi- 
mately three-fifths that of men (and also 
three-fifths the rate for 20-24 year-old 


Table 3. Distribution of Female and Male Employees 
across Firms of Different Sizes, by Age-Group 





(1985) 
Firm Size 
Tiny Small Medium Large 
(1-9) (10-99) (100-999) (10004) 
Age-group 

20-24 - 13.8 28.4 24.9 33.0 

(15.4)  Q9.7) (23.6) (31.3) 
25-29 17.4 26.9 19.3 36.7 

(14.4 (27.0) (22.2) (36.3) 
30-34 23.0 30.3 17.1 30.3 

45.8 (27.5) (20.6) (35.8) 

. 35-39 25.0 34.3 17.2 23.0 

(45.9) (27.0) (20.4) (36.5) 
40-44 24.0 36.5 18.3 21.2 

(15.3) (27.5) (19.8) (37.1) 
45-49 22.4 38.8 19.1 20.2 

(14.9) (30.0) (19.5) (35.6) 
50-54 21.7 38.5 18.2 21.7 

(15.5 | (30.6) (18.8) (34.7) 
55-64 28.2 39.7 13.7 18.3 


(17.60) | (34.9) (19.6) (28.1) 


Note: Upper figures represent proportion of employed 
women in each age-group who work for firms of varying 
sizes; figures in parentheses represent proportions of 
men. 

Source: Office of the Prime Minister, Japan (1985). 
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women). Men age 45—49 are represented in 
large firms at a slightly higher rate than men 
age 20-24.2 

These trends could be produced either by 
an age (life cycle) effect or by a cohort effect. 
That is, the data could be indicating that 
employed women move out of large firms and 
into small firms later in life (an age effect), or 
that older employed women are overrepre- 
sented in small firms because they started out 
in small firms (a cohort effect). In order to 
assess these two explanations, similar tables 
were constructed for 1972 (the first year such 
data are available), 1975, and 1980. The 
same age pattern also appeared in these 
former periods: women employees began to 
work in large firms at a rate somewhat higher 
than men, but the firm size pattern by sex 
reversed at older ages. This suggests a life 
cycle interpretation whereby women employ- 
ees leave the large firms they initially entered, 
either dropping out of the labor force or 
shifting to smaller firms. Whichever of these 
two courses they pursue, it is clear that 
women are not moving up in career ladders in 
the internal labor markets of large firms. 

Managerial and wage data support this 
view of limited female participation in career 
ladders. International comparability of both 
occupational and wage data is limited, but a 
few illustrative figures may be given. The 
ratio of Japanese female to male employees in 
managerial ranks is only 13 percent, com- 
pared to 68 percent in the United States, 63 
percent in Canada, and 41 percent in West 
Germany (International Labor Organization 
1986). The overall female/male wage ratio for 
full-time workers in 1983 ranged in Western 
industrial nations from a low of 66.5 (weekly 
rate) in the United States to highs in the 
84-89 percent range (hourly) in France and 
Northern Europe. Wages in Japan are typi- 
cally reported as monthly rates, and the 
female/male ratio in 1983 was 55.5, substan- 
tially lower than in any other industrial 
country (Japanese Ministry of Labor 1987). 

Attitude survey data and ethnographic data 
supplement these aggregate labor force fig- 
ures. A national opinion poll of 3000 people 


? Table 3 is restricted only to employees, which 
means that these trends are not produced by 
women who work as unpaid workers in small 
family enterprises. Rather, the trends are produced 
only by women who work in the formal (paid) 
sector of the economy. 
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conducted by the Yomiuri newspaper in Japan 
in April 1984 indicated a widespread percep- 
tion of sex discrimination in Japanese work- 
places: 80 percent of respondents believed 
that women are treated in a “disadvanta- 
geous” way in regard to hiring, and 84 
percent felt that this situation also applies to 
job rotation and promotion. 

Young Japanese women and their parents 
take note of the restrictive ‘recruitment 
policies for women. Especially marked is the 
perception of low returns to university- 
educated women in the labor market. People 
interviewed during my fieldwork in 1984 
indicated that education for a daughter was 
well and good to a point; once that point was 
reached, education didn’t help and could 
actually hurt a young women’s chances of 
getting a job: This was summed up in the 
comments of one puzzled mother, a middle- 
aged employee at a suburban branch of a 
Tokyo bank: 


Now my daughter is debating about whether to 
go to junior college or to university. She says 
that getting a university education will be a 
handicap (furi) when she looks for a job; it's 
true that the situation for women university 
graduates is very bad and that close to 100 
percent of women junior college graduates can 
get jobs. But even so, I think she should go 
ahead and go to university. It’s a hard 
situation —and it's hard for me to give advice to 
my daughters. (Brinton 1986) 


A 1984 survey asked young Japanese 
women's opinion on the correspondence 
between education and success in their job 
search. Respondents were women who were 
to graduate the following March, and thus had 
either completed or were in the midst of job 
seeking. One-fifth of the university women 
felt that in the final analysis their education 
was more of a disadvantage than an advan- 
tage, compared to only 12 percent of junior 
college women. Conversely, half of the junior 
college women felt that their education helped 
them on the job market, a proportion similar 
to that for university women. 

Japanese employers have customarily asked 
women during the job interview whether they 
plan to quit when they marry.? Many young 
women reported in my fieldwork interviews 


? Such questioning is now formally illegal as a 
result of the Japanese Diet's passage of an Equal 
Employment Opportunity Law in 1985. 
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that previous job seekers advised them to state 
an intention to quit, because this would 
heighten their chances of being hired. By 
quitting in their mid- to late 20s, Japanese 
women provide employers a cushion against 
too many "permanent" employees. Yet at the 
same time that employers encourage this 
behavior, many claim that they do not give 
equivalent training and promotion opportuni- 
ties to men and women because they fear 
women will get married and leave the labor 
force. The employer's returns from the 
investment in women workers' training will 
then be lost. This constitutes one of the 
contradictions of female employment in 
Japan, a link in the vicious circle of strongly 
gender-defined expectations on the employer 
side and conforming behavior on the labor 
supply side. 

‘Japanese employers are strongly motivated 
to hire employees who plan to remain with 
the firm over a long period, so that the costs 
of on-the-job training will not be wasted. Past 
experience has taught them that women are 
more likely to quit and that investment in 
women's training is therefore riskier than 
investment in men's. In this way, the 
pervasiveness of the internal labor market 
structure and accompanying firm-specific 
training constitutes an institutional barrier to 
women.’ And as poignantly illustrated in the 
ethnographic material above, employer expec- 
tations and behaviors feed back to young 
women’s attitudes and behavior as well as to 
their parents’. This is especially meaningful in 
a cultural setting such as Japan, where parents 
are almost uniformly responsible for financ- 
ing children’s education and are therefore 
intimately involved in educational decision 
making and investment (Brinton 1988). 

The employment practices outlined above 
can be analyzed with. microlevel data on 
Japanese men's and women's entry into the 
labor force after leaving school. In the next 
section, I make predictions concerning the 
relative proportions of men and women 
entering large firms and the proportions 
entering career-track positions in such firms. T 
then make a number of predictions about the 
factors important in the process by which 





* See Roos and Reskin (1984) for a general 
discussion of institutional barriers. For discussions 
of internal labor markets, promotional tracks, and 
on-the-job training in Japan, see Cole (1979), 
Koike (1983), and Shirai (1983). 
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individuals enter large firms and career 
ladders, paying particular attention to the 
translation of education into employment 
outcomes for men and women. 


ENTRANCE INTO LARGE FIRMS AND 
CAREER LADDERS 


Descriptive Predictions 


I make two descriptive predictions concerning 
Jabor market entrance of men and women. 


(1) Japanese men and women will have roughly 
equal probabilities of initially being hired into 

- large firms upon school graduation; women’s 
probability may be slightly higher. 


Why? Japanese employers in large firms 
have only a limited number of career-ladder 
positions to offer; career lines represent 
“packages of training and jobs” (Miyahara 
1988, p. 41). Hiring women over men into 
non-career-track positions in the firm repre- 
sents savings in wage costs for employers, for 
these women will “retire” in a few years and 
be replaced by new school graduates at 
starting wages. If this is true, we should 
expect to see large numbers of women 
entering large firms, but we should expect to 
see them entering different slots than men. 
This leads to the second prediction: , 


(2) Significantly more men than women will 
enter career-track positions on their first job. 


Causal Predictions 


I make several predictions about the process 
of entering large firms and career tracks, and . 
differences in the process for men and 
women. A number of factors can be hypoth- 
esized to be important: (1) the individual’s 
educational attainment and the quality of 
education; (2) ability; (3) labor force attach- 
ment (relevant for women); (4) the historical 
timing of labor force entry, represented by 
cohort; and (5) family background variables 
(father's employment status and mother's 
labor force participation). Because particular 
features of the Japanese social-institutional 
context affect both measurement and predic- 
tion, these issues are also discussed below. 
Educational level and quality. Educational 
attainment is typically used as a measure of 
human capital in the prediction of job 
placement. The hierarchical structure of the 
Japanese educational system and the strong 
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link between schools and work organizations 
(Miyahara 1988; Rohlen 1983; Rosenbaum 
and Kariya 1987) suggest that “quality” of 
schooling in addition to level is an important 
predictor.5 Further, the effects of educational 
level and quality may differ for males and 
females. Specifically: 


Educational attainment should be positively 
related to entrance into a large firm and into a 
career track for males. However, the effect is 
not necessarily a linear one: male high school 
graduates may fare better than those with the 
next level of schooling (junior college/ 
vocational school). 


Among males who continue on to higher 
education after high school (currently about 
38 percent), 95 percent are in four-year 
universities and only 5 percent are in 
vocational schools. Given the strength of the 
recruitment relationship between large firms 
and high schools (Rosenbaum and Kariya 
1987) as well as large firms and universities, 
a young man's probability of entering a large 
firm from high school or university should be 
higher than the probability attached to voca- 
tional school. The survey to be used for the 
analysis distinguishes the rankings of schools, 
and I expect that more of the males who 
attended top-ranked high schools and univer- 
sities will be recruited into large firms and 
into career-track positions.$ 


A linear relation is expected between females' 
educational attainment and probability of enter- 
ing a large firm, with only a slight gain for 
women with a university education. For entrance 
into a career-track position, education will be 
significantly less helpful to women than to men. 
Further, only the most highly educated women 
(graduating from top universities) will have any 
chance of entering a career-track position. 


Women's educational attainment in Japan 


5 [n Japan, both high schools and universities 
are hierarchically ranked, with entrance to each 
level regulated by a highly competitive examina- 
tion system. Rosenbaum and Kariya (1987) found 
that among high school students, school rank was a 
significant predictor of placement in clerical jobs 
in large firms, the placement most preferred by 
males and females in gene?al and commercial high 
schools. 

Career ladders exist for both blue- and 
white-collar jobs. Therefore, I predict that gradua- 
tion from a prestigious high school or university 
will heighten men's probability of placement in a 
career ladder. 
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differs significantly from men's. While about 
one-third of women continue their education 
after high school, nearly two-thirds of these 
go on-to junior college and the rest enroll in 
four-year universities. Since junior college 
represents the most "typical" route after high 
school and since such women largely follow a 
sex-typical curriculum, graduates represent a 
ready pool of labor for large firms’ low-level 
clerical needs. University education will not 
represent a great improvement in women’s 
chances of being hired into large firms. For 
entrance into career tracks, I predict that only 
graduation from a top university will increase 
women’s chances. 


Ability. Ability is hypothesized to be a predictor 
of placement in both large firms and career 
positions for men and women. 


Work plans. No relationship is predicted be- 
tween women’s work plans and placement in a 
large firm, but a positive relationship is expected 
between women’s placement in career-track 
positions and plans to work across the life cycle. 


Japanese employers frequently claim that it 
is not economically rational to place women 
in career-track positions and on-the-job train- 
ing programs because of their high propensity 
to quit upon marriage or childbirth. This is 
difficult to test without a longitudinal study, 
but an indirect approach is to examine the 
relationship between first job and women’s 
retrospective report of their adolescent plans 
for future work life. If the claim of Japanese 
employers is correct, we would expect to find 
an association between women’s plans for 
continuous work and their placement in a 
career-track position. And if the arguments in 
this paper are correct, we would also expect 
that women’s plans for continuous work do 
not have anything to do with placement in a 
large firm per se, because it is likely that they 
will be in positions that neither require nor 
encourage long-term employment. 


Control variables 


Cohort. It is predicted that members of the older 

cohort (40-44 years of age) in the sample 

experienced greater opportunities initially to 
enter career-track positions than members of the 
younger cohort (25-29 years of age). 

The sample includes two cohorts. Members 
of one (40—44) entered the labor market in the 
early 1960s during a period of rapid economic 
growth, and members of the other (25-29) 
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entered in the late 1970s, a recessionary 
period. Some scholars have suggested that 
Japanese women are the first to be affected by 
economic downturns, as employers attempt to 
protect the core male workers who are 
permanent employees (Rohlen 1979).7 If this 
is correct, then female members of the 
younger cohort would have been particularly 
subject to exclusion from career-track posi- 
tions. 


Father’s employment status, Father’s employ- 
ment status is a control variable and is not 
expected to directly influence labor force entry, 
except that children of self-employed fathers 
may be less likely to work in large firms or in 
career-track positions because their labor is 
valued in the family business. 

Mother’s labor force participation. In the highly 
sex-discriminatory environment of Japan, moth- 
ers’ participation in the paid labor force may 
exert a discouraging effect on daughters’ 
career-track entrance because daughters witness 
their mothers’ difficulties. Mother's labor force 
participation will not necessarily have any effect 
on daughter’s entrance into a large firm, a 
temporary state before marriage. No effect is 
predicted for sons. 


DATA AND METHODS 


Data for the following analysis come from a 
survey (the “Survey on Work Patterns”) I 
conducted in three urban locations in Japan 
during 1984: Sapporo, Kodaira (a suburb of 
Tokyo), and Toyohashi. Up to the present 
time, no microlevel data set that includes the 
necessary variables has been publicly avail- 
able. The data were collected as part of a 
larger project on gender stratification. in the 
Japanese economy. Questions in the mail 
survey include extensive information on natal 
family, educational, marital, and work histo- 
ries, as well as educational aspirations and 
behaviors towards respondents' own children. 
A stratified random sampling procedure was 
followed, with random samples of men and 
women in two cohorts (25-29 years and 
40-44 years) drawn from the senkyonin 


7 Bowman notes the high demand for both male 
and female clerical workers in the mid-1960s in the 
Japanese economy. She adds, “But recent history 
suggests that net movements of men out of clerical 
employment into higher level jobs will persist, 
while women will continue to fill the clerical 
posts" (1981, p. 214). 
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meibo (voting registration records) in each 
city. These records constitute a complete 
current listing by household of all adults over 
age 20 in the city. À 13-page questionnaire 
was mailed to the sample, with an overall 
response rate of 50.1 percent for the three 
cities. This response rate, while respectable 
for a mail survey, nevertheless may entail 
biases that could affect the statistical results. 
Therefore, a number of checks were carried 
out to assess the degree of response bias. 
Comparison of the respondents to the 
populations from which they were drawn 
indicates a slight overeducation bias. Weights 
were applied to equalize the educational 
distribution of the age/sex groups in each city 
with the populations from which the samples 
were drawn.? The overall sample was then 
compared with the general urban population 
(or, where possible, with the figures for the 
three sampled cities) on labor force status, 
employment status, and industry—employ- 
ment indicators available in published census 
materials. The distributions on all three 
variables are very similar for the respondents 
and the population.? Further, the proportions 
of men and women in the younger cohort 
(25-29) who entered large firms after leaving 
School are strikingly similar to labor force 
data collected by the government. Govern- 
ment statistics for the age cohort 20-24 in 
1975 (the cohort most comparable in terms of 
historical timing of labor market entrance) 
show that 33.0 percent of men and 35.2 
percent of women were employed in large 
firms. This compares to rates of 32.0 and 
35.0 for men and women, respectively, in the 
younger cohort in our sample. 
Statistics computed from this sample also 
closely match national surveys on a number 
of attitudinal variables such as perceptions of 
sex discrimination in the labor market, 
educational aspirations for sons and daugh- 
ters, and expectations to live with children in 
old age. Finally, no significant differences 
exist among the means of variables for the 
three major waves of respondents. In short, 


* The unweighted sample was used in the 
logistic regression analyses reported in this paper, 
as the analyses were done separately for each sex, 
and education and age were both used as 
independent variables. 

? Tables showing these comparisons are avail- 
able from the author upon request. 
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Cohort Dummy variable (representing historical timing of respondent's entry into the labor 


market); 0 respondent 25~29 years of age, 1 respondent 40—44 years of age. 


Father's employment Series of dummy variables for father’s employment status when respondent was 15 years 
of age: unemployed or absent, self-employed or farm, blue-collar, white-collar (omitted 
category). 

Dummy variable for mother's employment status outside home when respondent was 15 
years of age: 0 = mother was full-time housewife, family worker, or worked in a home 
handicraft job; 1 — mother was employed part-time or full-time outside the home. 
Respondent’s class rank in senior year of high school, by decile, scored 1-5. (If 
respondent did not attend high school, class rank in final year of junior high school is 
substituted.) 


Mother's employment 


Ability 


Education Series of dummy variables for respondent's educational level and, for university, 
educational quality: junior high school (omitted category), high school, junior 


college/vocational school, low-ranking university, medium-/high-ranking university." 


Dummy variable representing women's reported plans in adolescence to work 
continuously across the life cycle." 


Work plans 


Large-firm placement Dummy variable representing initial job placement in the government sector or in a firm 


employing 1000-- employees. 


Career-ladder 
placement 


* The scale of university quality was constructed with the cooperation of Professor Keiko Nakayama Watanabe, a 
member of the Social Stratification and Mobility (SSM) project in Tokyo. Universities were ranked into three groups: 
the former seven Imperial Universities, Tokyo Institute of Technology, and Hitotsubashi University (Group 1), a larger 
group of public and private universities (Group 2), and lower-ranked private universities (Group 3). A complete listing 
of universities and their rankings is available from the author. 

> The survey question was phrased in the following way: "When you were about 13 years old, what combination of 
work and family life did you most hope to have in the future?" The proportion of wamen reporting the expectation of 
working continuously throughout the life cycle was only 15.9 percent; this category was combined with the category 
of women who expected that they would quit work temporarily with childbirth and later reenter the work force. The 
combined category represents 39.6 percent of women. 


Dummy variable representing initial job placement in a career ladder (see text for details 
on measurement). 





all checks suggest confidence in the represen- 
tativeness of the data. 

Measurement of the variables is described 
in Table 4, and means and standard deviations 
are shown in Table 5. (Means for dummy 
variables are proportions of respondents who 


Measurement of the two dependent vari- 
ables— placement in a large firm and place- 
ment in a career-track position in such a 
firm— merits separate discussion below. 


Operationalizing Firm Size and 


' scored “1” on those variables.) Ninety-three 
percent of the 1154 respondents have had 
some work experience, yielding a total of 
1075 respondents in the present analysis. Of 
these, 96 percent entered the labor market 
within two years after leaving school, and 
there is no differentiation in this timing by 
sex. Thus, gender differentiation in employ- 
ment outcomes is not due to women’s delay 
of their initial labor market entry. In prelimi- 
nary analyses, a dummy variable was created 
to reflect whether the respondent entered the 
labor market within two years after school 
graduation. This variable was insignificant in 
the models. Less than 5 percent of either 
males or females were married at the time of 
entry into the labor force, and a dummy 
variable measuring marital status was also 
insignificant in all models. 


Career-Track Pasition 


A dummy variable was created to measure 
initial placement in a large firm, with large 
firm considered to be one that employs at 
least 1000 people, or the government sector. 
This measure is in accord with Hashimoto and 
Raisian’s (1983) conceptualization and is 
consistent with classic studies of the Japanese 
economy such as Cole’s (1979) and with pub- 
lished government statistics on firm size.!° 


10 A further justification for classifying large 
firms as those employing 1000 or more workers 
stems from Hashimoto and Raisian’s findings on 
median: tenure and estimated eventual tenure of 
currently held jobs in medium (100-999 employ- 
ees) and large (1000+) firms: median tenure 
increases by a factor of 1.5 and estimated eventual 
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Males Females 
(N= 424) (N=651) 
Mean S.D. Mean S.D. 
Cohort .587  .493 .570 .495 
Father's employment 
Unemployed or absent .083  .276 .077 .266 
Self-employed .425 495 .421 .494 
Blue-collar .252  .435 .258 .438 
White-collar .241 .428 .244 .430 
Mother's employment 
outside home -217  .408 .241 .421 
Ability 3.234 1.018 3.315 .899 
Education 
Junior high .160  .367 .137 .344 
High school 373 .484 .47] .500 
Junior college/ 
vocational school .090 .286 .252 .434 
Low-ranking 
university .226 .419 .094 .292 
Medium-/high- 
ranking University — .151 .358 .046 .210 
Work plans 
(in adolescence) — — .396 .490 
Large-firm placement .307  .462 .300 .458 
Career-ladder 
placement .219 .414 .069 .254 


The variable measuring initial career-track 
position was constructed as a composite of 
several indicators. Respondents provided the 
following information on the first job they 
held after leaving school: employment status 
(employee, manager, self-employed, family 
enterprise worker, piece-rate worker), size of 
firm (1-9, 10-99, 100—299, 300-999, 1000 
or more employees, government), working 
status (full-time, part-time, temporary worker), 
occupation (10 categories), industry (9 cate- 
gories), and self-assessed promotional possi- 
bilities in first job (none, supervisor or 
foreman, section chief, department head, 
higher than department head). Individuals 
were classified as starting their work life in a 
career ladder if they (1) started as full-time 


employees or managers in a firm (not an ` 


individual establishment) of 1000 or more 
employees or in the government sector, (2) 
reported having perceived some possibility 
for promotion (to section chief or any 
category above), (3) were neither in an 
agricultural occupation nor in an "assistant 





tenure by a factor of 1.3 between medium and 
large firms. The differences between medium, 
small, and tiny firms are of a much smaller 
magnitude. 
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clerical" position, and (4) were not working 
in primary industry (agriculture, mining, or 
fishing). 

The principal unconventionalities in the 
construction of the career ladder variable are 
the use of perceived promotional possibilities 
and the limited use of occupation and 
industry. These decisions are related to the 
Japanese case but also have relevance for the 
measurement of career ladders in general. 
The present study follows the spirit of several 
previous approaches (Spilerman 1977; Ville- 
mez and Bridges 1988) but makes special use 
of the standard hierarchy of positions in 
Japanese enterprises to construct the measure 
of career ladder placement. !! 

Positions in large Japanese work organiza- 
tions are arranged in vertical tiers, each 
corresponding to a span of control (1) 
supervisor or foreman (kakaricho), generally 
in charge of up to 5 subordinates, (2) section 
chief (kacho), heading a section of approxi- 
mately 10 subordinates, (3) department head 
(bucho), supervising a department (typically 
consisting of three or four section), and (4) 
the highest level administrative positions, 
above department head. This structure is 
common in large firms across industries, and 
it represents both the factory and office 
systems of titles and statuses: "The standard 
ranks also supply a common frame of 
references for people in different departments 
or sections doing different kinds of jobs. A 
worker on the shop floor can look forward to 
being first a sub-section head and then a 
section head, with exactly the same privileges 
as sub-section heads and section heads in the 
sales department or the accounts section" 
(Clark 1979, p. 108). The terms for rank even 
appear on the ubiquitous Japanese business 
card. 

The survey question that addressed the 
issue of position/status was as follows: “At 


11 In an influential article on career ladders, 
Spilerman (1977) suggested replacing status and 
earnings of first job by representative measures of 
the career line’s worth. He proposed measures of 
expected lifetime earnings and expected status 20 
years after career-line entry, proposing a simula- 
tion procedure for estimating values on such 
variables. More recently, Villemez and Bridges 
(1988) constructed a composite variable measuring 
internal labor markets based on workers’ and 
employers’ perceptions of how given jobs are filled 
and what their promotional possibilities are. 
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the workplace where you held your first job 
(after leaving school), what was the highest 
promotional possibility for you and for others 
doing the same type of work as you?” The 
question was phrased in such a way as to 
inquire about the promotional possibilities 
inherent in the position rather than for the 
specific respondent.!? All those who reported 
having promotion possibilities to the second 
level (kacho) or above were considered as 
fulfilling this measurement criterion for career 
track. 

Occupation and industry. Occupational and 
industrial classifications were used in only a 
limited fashion to construct the career ladder 
variable because both occupations and indus- 
tries are quite heterogeneous (Baron and 
Bielby 1980). Blue- as well as white-collar 
jobs may be structured in career ladders 
(Hartmann 1987; Doeringer and Piore 1971) 
and this is particularly true in Japan. Salary 
and status differences are also lower between 
blue- and white-collar workers in Japan than 
in the United States. Thus any distinctions 
based on a blue-/white-collar criterion would 
be erroneous. The measure used in this paper 
employs occupational and industrial classifi- 
cations in a discretionary way, using only 
those categories that clearly distinguish work- 
ers as being in or out of a career track. 

When promotional possibilities are cross- 
classified with occupational categories, the 
occupations that emerge as having very low 
promotional possibilities are agricultural activ- 
ities and the assistant clerical category. Other 
occupational categories such as sales work, 
general office work, and services demonstrate 


12 A comparison of the response to this question 
with the response to a question later in the survey 
that asked about actual promotional experience of 
the individual respondent yielded the following: 
Among respondents who reported having been in a 
position with no promotional possibilities, only 1.8 
percent of women and 6.1 percent of men reported 
that they had actually received a promotion in their 
first workplace. However, much greater variability 
in actual experience was reported among respon- 
dents who said that they were initially in a position 
with promotional possibilities. These results are 
consistent with the intention of the question on the 
structure of promotional possibilities: One would 
expect very few people who say they are in a 
position with no upward trajectory to ever 
experience such a trajectory, whereas people who 
say their position can lead to promotion may or 
may not actually experience that outcome. 
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a wider distribution across promotional cate- 
gories. Only two occupational categories 
were therefore used to sort respondents into 
career ladders: people in the combined 
category including agriculture, fishing, and 
mining, or in the category of assistant clerical 
work were classified as non-career-position 
incumbents. The latter was a category used in 
the questionnaire in addition to "clerical" in 
an attempt to distinguish among white-collar 
workers. The length of career ladders in 
entry-level white-collar jobs in large firms 
varies widely (Halaby 1979; Hartmann 1987; 
Kanter 1977; Kelley 1982; Rosenbaum 1985), 
and “assistant clerical" was intended to 
capture those low-level white-collar or pink- 
collar jobs with limited career potential. 
Industrial classification was used only as a 
check to classify people whose work lives 
began in primary industries (agriculture, 
fishing, mining) in the non-career-ladder 
category and to include government sector 
employees in the career-ladder category (if 
they fulfilled the other qualifying criteria 
listed above). Checks were carried out to see 
whether further use of industrial classification 
would refine the composite measure of career 
ladder based on employment status, full- 
time/part-time work, firm size, and occupa- 
tion. Industries were classified into core and 
periphery sectors.?? The mean promotional 
possibility reported by full-time employees 
and managers was computed for each sector, 
as well as the mean promotional possibilities 
for full-time employees and managers in large 
firms (1000 or more workers). When firm 
size is left unspecified, the difference in mean 
promotional possibility is significant (t-test, 
p.001) between the core and periphery. But 
when the analysis is restricted to large firms, 


13 The classification developed for Japanese data 
by Kawashima and Tachibanaki (1986) was used 
for this purpose. They classified manufacturing 
industries as being in the competitive or noncom- 
petitive sector based on the production share of the 
10 largest firms in each industry. Nonmanufactur- 
ing industries were considered to be in the 
noncompetitive sector if at least 30 percent of 
workers in that industry were employed in large 
firms (1000 or more workers). In the data set at 
hand, manufacturing, finance/insurance/real estate, 
transportation and communications, utilities, and 
the civil service were classified in the core 
(noncompetitive sector). Agriculture, construction, 


- wholesale and retail sales, and services were 


classified in the periphery (competitive sector). 
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the mean promotional possibility in the two 
sectors is nearly exactly equal. So, specifica- 
tion of economic sector based on industry 
does not add information above and beyond 
specification of firm size. The reverse does 
not hold. The mean promotional possibilities 
for firms of different sizes (small, medium, 
and large) within industrial sector are signifi- 
cantly different.!+ Given these results, data on 
industry were used only to make certain that 
agriculture/primary industry and government 
employees had been properly classified. 

Logistic regression methods were em- 
ployed for the analysis. The dichotomous 
nature of the two dependent variables (large 
` firm or government versus small firm; 
career-track position/non-career-track posi- 
tion) violates the assumptions of ordinary 
least squares regression. In this case, and 
especially when the split on the dependent 
variable is rather extreme (as in the case of 
females' entrance into career ladders), logistic 
regression is advisable (Hanushek and Jack- 
son 1977). 


RESULTS 


Virtually equal proportions of men and 
women (30.7 percent vs. 30.0 percent) in the 
sample entered a large firm upon leaving 
school (see Table 5). This result replicates the 
aggregate government statistics reported ear- 
lier. As predicted, there is no significant 
difference in the rate at which men and 
women enter large firms. 

But the proportions of men and women 
whose first job was a career-track position in 
a large firm are significantly different: 22 
percent of men and only 7 percent of women 
started their work lives in a career ladder. 
Stated differently, fully 71 percent of men 
who started out in large firms were in 
career-track positions, in contrast to 23 
percent of their female counterparts. This is 
brought into sharp relief when we consider 
the fact that 60 percent of the women who 
entered a large firm when they left school 
entered as clerical workers. Among these, 
three-quarters were "assistant clerical" posi- 
tions. Ín contrast, one-third of the men who 





1^ Tiny firms (those employing less than nine 
workers) were excluded from this analysis because 
of the very limited promotional possibilities: within 
them. i 
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entered large firms were placed in clerical 
jobs and only 7 percent of these (one-tenth the 
proportion of women) were “assistant cleri- 
cal” positions. 

In Table 6 I show the results of regressions 
for placement in a large firm at the start of 
one’s work life. Two models were run for 
women, an initial one that omits women’s 
retrospectively reported adolescent work plans 
and a second one that includes this variable. 
This permits comparison of the initial model 
for men and women and comparison of the 
second model with the first one for women, to 
see if adolescent work plans add significantly 
to the explained variance. 

The process of entering a large firm is 
similar for males and females. Ability and 
education are positively associated with 
entering a large firm for both sexes, and 
family background variables do not exert 
significant effects. High school education is 
worth more than junior college/vocational 
school ‘or low-ranking university for males, 
whereas progréssively higher educational at- 
tainments increase women’s probability of 
entering a large firm.!5 For women, mother’s 
labor force participation is not related to 
entrance into a large firm. Nor are work 
plans— Model 2, which includes this variable, 
does not represent a significant improvement 
over. Model 1. These findings are as pre- 
dicted. 

When the coefficients for men and women 
are.compared,!6 high-ability men are signifi- 
cantly more likely than high-ability women to 
enter large firms. Male high school graduates 
are also significantly more likely to start out 
in large firms than are female high school 
graduates. 

Further sex differences emerge when the 
process of entrance into career-track positions 
is examined. Human capital variables signifi- 
cant for men’s placement in a career ladder 





r 157 originally ordered high schools into three 
tiers, expecting to insert rank order into the 
equations. But virtually all individuals who 
attended top-ranked or second-ranked high schools 
Continued on to university, leaving these cells 
nearly empty. The number of respondents who 
attended top-ranked universities is also low in a 
sample of this size, so the categories of top- and 
second-ranked universities were combined. 

16 Analyses were carried out on the full sample 
(men and women), with interaction terms between 
sex and each variable included. 
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Table 6. Logit Coefficients Describing the Effects of Individual Characteristics on Entrance into Large Firms after 











Leaving School 
Males Females 
Model 1 Model 1 Model 2 
Constant —4.909** —2.557** —2.545** 
(.867) (.545) (.550) 
Cohort .074 —.234 —.236 
E: i (.254) (.202) (.203) 
Father's employment 
Unemployed or absent —.574 i .127 .127 
(.518) (.358) (.358) 
Self-employed —.497 — .309 —.310 
. (.317) (.229) (.229) 
,  Blue-collar —.215 —.251 —.251 
(.333) (.258) (.258) 
White-collar -— — 
* Mother's employment outside home — .268 — .047 — 045 
(.299) (.232) (.232) ` 
Ability '596** .269** .269** 
, ; (.130) (.103) (.104) 
Education 
Junior high school — — — 
High school . ' 2.525** 1.088** 1.085** 
: (.744) (.380) (.380) 
Junior college/vocational school 2.138** 1.192** : 1.190** 
i (.838) (.407) (.408) 
Low-ranking university 2.429** 1.600** l.604** . 
(.765) (.468) (.469) 
Medium-/high-ranking university 3.283** 2.135** 2.140** 
: i (.787) (.539) (.540) 
. Work plans in adolescence — —.032 
S ; (.186) 
Maximum likelihood x? 432.64 744.20 744.16 
D.F. 413 640 639 


Note: All significance tests are one-tailed; * p<.05, ** p<.01. 


. are’ ability and attendance at a medium-/ 
high-ranking university (Table 7). The coef- 
ficients for education suggest a monotonic 
effect of educational attainment, but the only 
educational credential that significantly af- 
: fects probability of placement in a career 
‘ Jadder is graduation from a medium- 
/high-ranking university. These follow our 
predictions. Sons with self-employed fathers 
are less likely to enter a career track than 
others. 

Ability exerts a'strong effect on women’s 
probability of placement in a career ladder 
and the coefficients for educational attainment 
suggest a linear effect for education, but the 
similarities in the process of placement in a 
career ladder for the two sexes end there. 
Graduation from a medium-/high-ranking 
university has a positive but insignificant 
effect on women’s probability of entering a 
career track, and graduation from junior 
college has a negative, though insignificant, 
effect. Graduation from a low-ranking univer- 


sity has a positive insignificant effect. The 
pattern of these coefficients shows that 
university graduation does not substantially 
increase women's chances of entering a 
career-track position; junior college gradua- 
tion may actually damage those chances 
(whereas the effect of junior college on 
entering a large firm is positive and signifi- 
cant). 

Family background variables also exert 
some play in determining women's probabil- 
ity of entering a career ladder. Having a 
self-employed father depresses a woman's 
probability of entering a career track, and 
having a mother in the paid labor force 
dampens the probability. 

Women's adolescent plans for labor force 
participation exert a predicted strong positive 
effect on probability of entrance into a career 
track (whereas there was no effect on the 
probability of entrance into a large firm). 
Causal interpretation of this variable must be 
somewhat tentative because of its retrospec- 
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Table 7. Logit Coefficients Describing the Effects of Individual Characteristics on Entrance into a Career Track after 


Leaving School 
Males Females 
Model 1 Model 1 Model 2 
Constant —2.769** —4.9T7]** —5.297** 
(.540) (.843) (.870) 
Cohort — .005 .656* .738* 
(.274) (.390) (.393) 
Father’s employment 
Unemployed or absent —.312 — .800 — 860 
; (.522) (.688) (.694) 
Self-employed — .644* —1.015** —1.013** 
(.338) (.410) (.412) 
Blue-collar — .278 — .007 — .059 
(.348) (.428) (.429) 
White-collar — — 
Mother's employment outside home —.080 — 1.283* — 1.325* 
(.315) (.591) (.595) 
Ability .453** .725** .713** 
(.133) (.192) (.194) 
Education i 
Junior high/high school — — 
Junior college/vocational school .134 —.325 —.338 
(.500) C443) (.444) 
Low-ranking university .353 .209 .088 
(.327) (577) (.580) 
Medium-/high-ranking university, 1.176** .556 .384 
(.354) (.555) (.562) 
Work plans in adolescence — .758* 
(.333) 
Maximum likelihood x? 399.62 289.56 284.32 
D.F. 414 641 640 


Note: All significance tests are one-tailed; * p<.05, ** p<.01. 


. tive nature. Nevertheless, the importance of 

.the variable, demonstrated by Model 2's 
significant addition to explanatory power, 
` offers support for the notion that there is a 
strong fit between women's entrance into 
career ladders and their intention to work 
continuously across the life cycle. But other 
significant coefficients in the model remain 
stable when the variable measuring work 
. expectations is added, indicating that work 
: motivation is not the mechanism through 
which those variables operate. 

Finally, cohort exerts a significant effect on 
.women's, but not men's, probability of 
entering a career ladder. Women in the older 
cohort had significantly higher chances of 
initially entering a career track upon leaving 
School than women in the younger cohort. 
Between 1973 and 1978, when most of the 
younger cohort in our sample entered the 
‘labor market, Japan experienced its worst 
recession in the post-World War II period. 
While permanent employment for men was 
largely preserved, large numbers of women 


exited from the labor force and employers' 
willingness to allocate career-track slots to 
women became even slighter than in the era 
of high economic growth (Rohlen 1979). The 
empirical finding of a cohort effect for 
women is understandable in this context. 
Despite the differences in independent 
variables’. effects for men and women, the 
only coefficient that differs significantly for 
the sexes is that for mother’s labor force 
participation (a predicted effect).!7 
Differences in males’ and females’ process 
of entering career-track positions could be 
due partly to sex differences in the propensity 
of blue-collar and white-collar workers to 
enter career ladders. That is, if male blue- 
collar workers are more likely than female 


17 While the coefficient for medium-/high- 
ranking university is much higher for men than 
women, there is no statistically significant interac- 
tion effect with sex. It may be that this is due to the 
small numbers of women in this category (and the 
large standard error). 
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blue-collar workers to enter career ladders (or 
similarly for white-collar workers), then the 
determinants of career ladder entrance may 
‘differ for the sexes because we are dealing 
with different types of career ladders.!? I 
investigated this further. 

Equivalently low proportions of blue- and 
white-collar women enter career tracks (7 
percent). White-collar men, on the other 
hand, are significantly more likely to enter a 
career ladder (27 percent) than blue-collar 
men (14 percent). Given this, I ran regres- 
sions for white-collar men's and women's 
entrance into career ladders to see whether the 

‘ results differed from the previous results for 
all workers. The results for white-collar 
women are the same as for all women. The 
results for white-collar men are also the same 
as the general results for men, with one 
modification: father's employment is not 
significant for white-collar men. A regression 
on blue-collar men shows that men with 
blue-collar or white-collar (non-self-em- 
ployed) fathers have higher probabilities of 
entering caréer-track positions. For young 
men entering the labor market, then, the 
effect of father’s self-employment differs 
according to whether the young male is in a 
blue- or white-collar occupation. It may be 
that the career-track fate of young men in 
blue-collar occupations is affected by father’s 
self-employed/employee status because it is 
young blue-collar men who would be most 
apt to work in a family business if such an 
opportunity existed. 


‘CONCLUSION 


The first half of the paper documented 
Japanese women’s participation in the econ- 
omy and investigated gender stratification 
patterns using available aggregate data. Japa- 
nese women participate in the labor force at 
rates similar to women in other industrial 
nations but are less likely to work as 
employees. And among the employee labor 
force, the male-female wage gap is greater in 
Japan and women’s representation in manage- 
rial ranks is lower. As women move across 
the life cycle, they are more likely to work in 
family businesses; older women who are 
employees are also more likely than younger 


1$ T owe this modification to the comments of an 
anonymous reviewer. 


AMERICAN SOCIOLOGICAL REVIEW 


women to work in small firms. These findings 
from aggregate government statistics coupled 
with ethnographic evidence suggest that very 
few Japanese women find their way into 
career ladders at the start of their work lives. 
Rather, they are apt to work as temporary 
employees in low-level clerical positions. 
The second half of the paper probed these 
findings using microlevel data. While about 
30 percent of each sex enter large firms when 
they initially start working, women are much 
less likely to enter career ladders (the 
Japanese "permanent employment system") 
than men: 22 percent of men and only 7 
percent of women have such an experience. 
Among the population entering large firms, 
twice as many women as men enter as clerical 
workers, and the majority of these women are 
in low-level "assistant clerical" positions. 
Although not always statistically different 
for men and women, the effect of human 
capital variables (ability and education) on 
men's entrance into large firms and career- 
track positions is generally stronger than for 
women. High-ability women have a higher 
probability of entering career ladders than 
other women, but larger increments of 
education do not significantly help women. 
Family background variables do not exert 
significant effects on entrance into large firms 
for either sex. But children of self-employed 
fathers are significantly less likely to enter 
career-track positions in large firms. This 
could be due to some unmeasured disadvan- 
tage or to less interest on the part of these 
individuals in the long-term participation in 
corporate or factory work that a career-track 
position implies. This will be interesting to 
investigate in future work, as will the 
negative effect of mother's employment on 
daughter's probability of entering a career- 
track job. Finally, employers’ 'oft-cited reluc- 
tance to hire women into career ladders 
because of women's lack of labor force 
attachment is supported by a test of that 
hypothesis. f 
The paper demonstrates that the strong 
gender stratification patterns in the Japanese 
economy are alreacy apparent at the begin- 
ning of individuals’ work lives. But we would 
not have seen this were our focus only on the 
distribution of men and women in the large- 
and small-firm sectors of the economy when 
they start working, for this sex distribution is 
even. This points to the importance of 
attempting to measure career-ladder position 
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and constructing such a measure in a 
culturally appropriate fashion. Researchers 
have asserted that the first job is critical for 
future wages and promotions in the Japanese 
context; future work should address the extent 
to which Japanese women are handicapped in 
later life by their initial failure to enter career 
ladders. 

On a more upbeat note, the Japanese Diet 
ratified an Equal Employment Opportunity 
Law in spring 1985, after years of debate 
within the Labor Ministry, employers’ groups, 
labor unions, and women’s associations. The 
law prohibits sex discrimination in all phases 
of the employment process from recruitment 
to retirement, but no penalties are imposed on 
employers who do not conform. A recent 
check of classified advertisements in Japanese 
newspapers shows that some employers 
continue to recruit on the basis of sex as well 
as age. The effectiveness of the law will 
depend heavily on the degree to which local 
administrative units set up by the Japanese 
government monitor businesses, and the 
degree to which firms find it in their interest 
to gradually move away from a statistical 
discrimination rule that reserves the great 
majority of career positions for men. Future 
research on gender stratification in Japan will 
need to address this new exogenous influence 
on the operation of labor markets. 

In conclusion, the findings in this paper are 
consistent with arguments in the American 
sociological literature that internal labor 
market structures are disadvantageous to 
women. Japan represents a prototypical case 
of such structures, where employers and 
newly recruited employees make an implicit 
bargain to engage in a long-term employment 
relationship. As Japanese industries face 
collective belt-tightening and permanent em- 
ployment even for men becomes less taken 
for granted, the internal labor market struc- 
ture is beginning to exhibit cracks. The effect 
of these changes on gender stratification 
patterns will be fascinating to observe. 


MARY BRINTON is Assistant Professor of 
Sociology at The University of Chicago. Her 
interests are in the areas of gender stratifica- 
tion, contemporary Japanese society, and 
rational choice theoretic applications to social 
institutional issues. She is currently complet- 
ing a book on the role of women in the 
Japanese economy. 





563 
REFERENCES 


Althauser, Robert P. and Arne L. Kalleberg. 1981. 
“Firms, Occupations, and the Structure of Labor 
Markets: A Conceptual Analysis.” Pp. 119-49 
in Sociological Perspectives on Labor Markets, 
edited by Ivar Berg. New York: Academic 
Press. 

Arrow, Kenneth. 1973. “The Theory of Discrimi- 
nation.” Pp. 3-33 in Discrimination in Labor 
Markets, edited by Orley Ashenfelter and Albert 
Rees. Princeton, NJ: Princeton University Press. 

Baron, James N. and William T. Bielby. 1980. 
“Bringing the Firms Back In: Stratification, 
Segmentation, and the Organization of Work.” 
American Sociological Review 45:737-65. 

Bowman, Mary Jean. 1981. Educational Choice 
and Labor Markets in Japan. Chicago: Univer- 
sity of Chicago Press. 

Brinton, Mary C. 1986. Women and the Economic 
Miracle: The Maintenance of Gender Differ- 
ences in Education and Employment in Contem- 
porary Japan. Seattle: University of Washing- 
ton, unpublished doctoral dissertation. 

. 1988. “The Social-Institutional Bases of 
Gender Stratification: Japan as an Illustrative 
Case.” American Journal of Sociology 94: 
300-34. 

Clark, Rodney. 1979. The Japanese Company. 
New Haven: Yale University Press. 

Cole, Robert E. 1979. Work, Mobility, and 
Participation: A Comparative Study of American 
and Japanese Industry. Berkeley: University of 
California Press. 

Doeringer, Peter B. and Michael J. Piore. 1971. 
Internal Labor Markets and Manpower Analy- 
sis. Lexington, MA: Heath. 

Halaby, Charles. 1979. “Sexual Inequality in the 
Workplace: An Employer Specific Analysis of 
Pay Differences.” Social Science Research 8: 
79-104. 

Hanushek, Eric A. and John E. Jackson. 1977. 
Statistical Methods for Social Scientists. New 
York: Academic Press. 

Hartmann, Heidi I. 1987. “Internal Labor Markets 
and Gender: A Case Study of Promotion.” Pp. 
59-92 in Gender in the Workplace, edited by 
Clair Brown and Joseph A. Pechman. Washing- 
ton, DC: Brookings Institution. 

Hashimoto, Masanori and John Raisian. 1983. 
“Employment Tenure and Earnings Profiles in 
Japan and the United States.” American Eco- 
nomic Review 75:721-35. 

International Labor Organization. 1986. Yearbook 
of Labor Statistics. Geneva: International Labor 
Organization. 

Japanese Ministry of Labor. 1987. Fujin Rodo no 
Jitsujo (Status of Women Workers). Tokyo: 
Government Printing Office. 

Kanter, Rosabeth Moss. 1977. Men and Women of 
the Corporation. New York: Harper and Row. 

Kawashima, Yoko and Toshiaki Tachibanaki. 


564 


1986. “The Effect of Discrimination and of 
Industry Segmentation on Japanese Wage Differ- 
entials in Relation to Education.” International 
Journal of Industrial Organization 4:43-68. 

Kelley, Maryellen R. 1982. “Discrimination in 
Seniority Systems: A Case Study.” Industrial 
and Labor Relations Review 36:40—55. 

Koike, Kazuo. 1983. "Workers in Small Firms 
and Women in Industry." Pp. 89-115 in 
Contemporary Industrial Relations in Japan, 
edited by Taishiro Shirai. Madison, WI: Univer- 
sity of Wisconsin Press. 

Miyahara, Kojiro. 1988. “Inter-College Stratifica- 
tion: The Case of Male College Graduates in 
Japan." Sociological Forum 3:25—43. 

Office of the Prime Minister, Japan. 1985. 
Rodoryoku Chosa (Survey of the Labor Force). 
Tokyo: Office of the Prime Minister. 

Osterman, Paul. 1984. ‘“White-Collar Internal 
Labor Markets." Pp. 163-89 in Internal Labor 
Markets, edited by Paul Osterman. Cambridge, 
MA: MIT Press. 

Rohlen, Thomas. 1974. For Harmony and Strength. 
Berkeley: University of California Press. 

. 1979. “ ‘Permanent Employment’ Faces 

Recession, Slow Growth, and an Aging Work 

Force.” Journal of Japanese Studies 5:235-72. 

. 1983. Japan’s High Schools. Berkeley: 
University of California Press. 

Roos, Patricia A. and Barbara F. Reskin. 1984. 
“Institutional Factors Contributing to Sex Segre- 
gation in the Workplace.” Pp. 192-232 in Sex 








AMERICAN SOCIOLOGICAL REVIEW 


Segregation in the Workplace: Trends, Explana- 
tions, Remedies, edited by Barbara F. Reskin. 
Washington, DC: National Academy. 

Rosenbaum, James E. 1985. “Jobs, Job Status, 
and Women’s Gains from Affirmative Action: 
Implications for Comparable Worth." Pp. 116— 
36 in Comparable Worth: New Directions for 
Research, edited by Heidi I. Hartmann, Wash- 
ington, DC: National Academy. 

Rosenbaum, James and Takehiko Kariya. 1987. 
“Market and Institutional Mechanisms for the 
High School to Work Transition in the U.S. and 
Japan." Paper presented at the Annual Meetings 
of the American Sociological Association. 

Shirai, Taishiro. 1983. “A Supplement: Character- 
istics of Japanese Managements and their 
Personnel Policies." Pp. 369-82 in Contempo- 
rary Industrial Relations in Japan, edited by 
Taishiro Shirai. Madison, WI: University of 
Wisconsin Press. 

Spence, A.M. 1973. "Job Market Signaling." 
Quarterly Journal of Economics 87:355-74. 

Spilerman, S. 1977. "Careers, Labor Market 
Structures, and Socioeconomic Achievement." 
American Journal of Sociology 83:551-93. 

Thurow, Lester. 1975. Generating Inequality. 
New York: Basic. 

Villemez, Wayne J. and William P. Bridges. 
1988. "When Bigger is Better: Differences in 
the Individual-Level Effect of Firm and Estab- 
lishment Size." American Sociological Review 
53:237-55. i 


CONCEPTUALIZING SOCIAL STRUCTURE: 
THE MISUSE OF CLASSIFICATION IN 
STRUCTURAL MODELING* 
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University of Alabama in Huntsville 


Two distinct strategies used by scientists to order observations are modeling and 
classifying. Modeling furnishes the foundation for explanation while classifying 
provides the tool through which explanation can be generalized. These two 
Strategies are often confused in the analysis of social structures and this confusion 
leads to serious epistemological flaws with far-reaching implications for the entire 
discipline. Structural as opposed to classificatory relationships as they relate to 
concepts, units of analysis, levels of scale, and causation are also discussed. 


INTRODUCTION 


The objective of this paper is to criticize the 
way many sociologists use social categories 
to conceptualize social structure and to reveal 
the epistemological flaws that lie in such 
usage. The problem to be discussed pervades 
virtually all work in sociology. Essentially it 
involves the practice of thinking of social 
structure as containing or being composed, 
partially or wholly, of social categories which 
are seen as basic component units or elements 
in a set of relationships that constitute the 
structure of society. 

For example, Peter Blau (1974, p. 616) 
states: 


[the] concept of social structure starts with 
simple and concrete definitions of the compo- 
nent parts and their relations. The parts are 
groups or classes of people, such as men and 
women, ethnic groups or socioeconomic strata; 
more precisely, they are the positions of people 
in different groups and strata. . . . In short, by 
social structure I refer to population distributions 
among social positions along various lines— 
positions that affect people's role relations and 
social interaction. 


This suggests that "gender categories," "eth- 
nic groups," and "social strata" as well as 


* Direct correspondence to Walter Gillis Pea- 
cock, Department of Sociology, University of 
Alabama in Huntsville, Huntsville, Alabama, 
35899. The authors would like to thank Charles D. 
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classification and modeling, Walter R. Sullins for 
his graphics, Thomas J. Fararo, and an anonymous 
reviewer for their helpful comments and sugges- 
tions. 
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"groups" in the sense of networks of 
interacting people, and “positions” in the 
sense of the roles that they play are all 
included as "parts" or "elements" of society 
over which people are distributed. This 
perspective is not unique. Indeed, it so 
pervasive in sociology that almost every 
introductory textbook describes the variety of 
social categories and groups that are included 
as the parts of society. The question to be 
addressed here is whether under any circum- 
stances, and from any logically consistent 
theoretical perspective, sociologists are war- 
ranted in thinking of social categories as parts 
in their conceptions of social structure 
capable of functioning or acting as parts and 
of exercising causal influence on other 
elements in the organization of society. 

This paper argues that no category, no 
matter how concretely defined (i.e., grounded 
in empirical observation), can ever be used as 
if it were an operating part or unit in'the 
structure of society. It is our contention that 
such use of a category amounts to confusing 
imaginary, abstractly defined concepts that 
refer to entirely mental phenomena resulting 
from an act of classification, with concepts 
that refer to observable phenomena which 
have objective rather than subjective refer- 
ents. Furthermore, those who define social 
structure through the use of category concepts 
fail to come to grips with.the real problems of 
structural analysis and actually treat social 
structure as a “black box” whose internal 
order is left unexplored while they examine 
the consequences of that structyre for various 
categories of people who differ in their 
relationship to society. PP A Ie 

There are three tasks that „imise, X" per 
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formed before the meaning of the above 
statements can be made clear and defended. 
The first is to explore the meaning of the term 
social structure. The second is to specify 
what is meant by structural modeling and 
classification.! Within these discussions the 
ontological status of both categories in a 
classification scheme and units in a structural 
model will be examined. The third task is to 
discuss the appropriate uses of these strategies 
in science. Only after completing these tasks 
is it possible to present a logical defense for 
the above statements. 


SOCIAL STRUCTURE AND THE 
NATURE OF THE PROBLEM 


Like all terms, social structure has a variety 
of definitions, explicit and implicit, depend- 
ing on the user. This discussion assumes 
social structure refers to the differentiated and 
bounded units that constitute the parts of 
society and the network of connections 
between and among the constituent operating 
units. Social structure consists of parts or 
units that are joined by specific relationships 
that connect them directly and indirectly into 
subnetworks and supernetworks which form 
larger and larger social units in ascending 
order of scale until an entire society is 
encompassed by a definable network of 
interrelated units or elements. Thus, the 
concept structure implies that the structural 
analyst must attend to three aspects of 
structure: (1) the parts or units that make up 
the whole, and the characteristics of these 
parts relative to others, (2) the relationships 
that join the parts to form a larger configura- 
tion, including the characteristics of these 
relationships, and (3) the boundaries that 
separate and enclose the parts in forming the 
larger whole. 

The question immediately arises, What is 
meant by a “part” or “unit” in the structure 
of society? Two contrasting options exist 
which, taken together, produce a third or 
mixed option. The first adopts the structural 
analytic framework and regards interactional 
networks as the parts of society (Bates and 
Harvey 1975; Berkowitz 1982; Wellman and 
Berkowitz 1988; Fararo and Skvoretz 1986a). 


! See Watson (1979) for additional discussions 
. of the distinction between modeling and classifica- 
tion. 
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In empirical cases, this network involves the 
actual interaction of actors that transpires in 
time and space and can be observed or 
measured. These actors occupy positions in 
relationship to one another and perform 
interacting roles. In addition to the whole 
network, this conception also includes as 
units any part of a network. For example, a 
social group such as a family or a work group 
qualifies as a social unit suitable for the 
conceptualization of social structure since 
group members interact as occupants of 
positions and are related to each other by the 


‘role relationships characterizing these posi- 


tions. But the positions occupied by family or 
work group members would also qualify as 
units or parts of structure because they can be 
shown to be involved in the interaction 
process that constitutes the group. On this 
basis, multigroup systems such as extended 
kinship systems or business organizations, 
themselves composed of many interconnected 
groups, would qualify as larger units of social 
structure since it can be demonstrated that 
they are composed of parts called “groups” 
which interact with and have identifiable 
connections to each other. 

The second option is to regard social 
categories which are usually part of a 
classification scheme rather than interactional 
networks as the units of social structure. For 
example, one might contend that the structure 
of society is made up of gender categories, 
ethnic groups, occupational categories, and 
social classes. This conceptualization de- 
mands that the analyst construct theoretically 
relevant social categories and then explicitly, 
or implicitly, utilize them as if they were 
operating units in the structure of the social 
system. Hence, the contention that society is 
composed of such categories as parts. 

The third option is to mix categories and 
interactional networks in the same conception 
of social structure. In this case, social groups 
in the interactional sense and social categories 
in the sense of classes in the classification 
scheme are treated as having equal ontologi- 
cal status. Both are thought of as real 
empirical phenomena capable of operating 
and therefore exerting causal influence on 
each other. For example, ethnic groups are 
regarded as empirical realities in the same 
sense as a family or an industrial plant. The 
theorist is comfortable thinking of the struc- 
ture of society as being composed simulta- 
neously of various groups, multigroup sys- 
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tems, and a whole host of social categories 
such as gender, ethnic, or social classes. The 
important point is that these categories are 
explicitly or implicitly regarded as being as 
real, or “concrete,” as interactional networks. 

We contend that only the first option is 
reasonable for defining the structure of social 
systems since it is so formulated as to refer to 
phenomena that transpire outside the mind of 
the observer and are capable of being 
observed and of operating in a causal sense. 
The second option depends upon concepts 
that result from mental processes which 
separate them from and make them indepen- 
dent of the empirical world and observations 
of it. We will show that it is reasonable to 
study categories, but that it is an epistemolog- 
ical error to consider them part of social 
structure. Because the second option is 
flawed and the third option uses concepts that 
depend upon it, mixing concepts with entirely 
different ontological status, it is also inappro- 
priate for the study of social structure. 


MODELING AND CLASSIFICATION AS 
INTELLECTUAL STRATEGIES 


A discussion of the nature of classification 
and modeling as intellectual strategies is 
mandatory because only by understanding 
these strategies can we hope to understand the 
difference in ontological status between social 
categories and social units as defined above. 


Structural Modeling 


Many distinct meanings exist for the word 
model in sociology (cf. Willer 1967; Blalock 
: 1969). We do not intend to discuss models in 
general, but only models created for the 
expressed purpose of representing social 
structure. Such models are used to construct 
an intellectual image of how a society or one 
of its component parts is "put together" or 
"structured." Models therefore are used to 
represent the structure of a social system, 
- facilitate the creation of theories to explain its 
operation, and derive hypotheses to be tested 
concerning it. Although diagrams are often 
used to represent aspects of structure, in 
general, models of social structure at this 
stage of development are usually put together 
using symbols and verbal definitions to stand 
for the parts of structure used in the model 
and to symbolize the relationships that pertain 
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to the placement im a conceptually defined 
configuration. 
Structural models represent the structure of 
a phenomenological target by analytically 
breaking it down into its component parts, 
defining the characteristics of these parts that ' 
are relevant for a structural representation, 
and then specifying how they are joined or 
connected in a web of relationships forming a 
bounded whole. A structural model of a social 
group identifies the units of group structure, . 
social positions, describes the nature of these 
positions by specifying the kinds of roles 
associated with them, and then specifies how 
they are joined in interactional interdependen- 
cies. For example, relationships may be 
described in terms of power or in terms of the 
types of exchanges that take place among 
members or in terms of the flow of 
information.? ; 
The relationships constituting the connec- 
tions among units are not relationships as in : 
multiple regression equations where the 
elements are variables and the relationships ' 
represent covariation. Instead, they represent 
connections, bonds, or ties (Bates and Harvey 
1975; Berrien 1968; Buckley 1967; von 
Bertalanffy: 1968; Berkowitz 1982; Wellman 
and Berkowitz 1988).3 These connections 


? We do not imply that communications or 
exchanges are equal, harmonious, or system 
maintaining in function. They may be conflictual 
as well as cooperative and may involve tension and 
stress as well as stability enhancing input-output 
relations. Le. Và 

3 Relationship as a concept in modeling has two 


' possible interpretations, mathematical and struc- 


tural. The mathematical takes the form: as A 
increases by 2 units, B will decline by a half unit. 
The structural interpretation suggests a certain 
form of linkage or connection that joins the units. 
Structural models employ the latter relationship 
concept. Mathematical or attribute models may be 
used to summarize quantitatively the relationship 
system posited in a structural model, but to do so 
they must be underpinned by a structural model 
that supplies the structura! mechanism which 
makes the mathematical statement- meaningful. 
Thus, the mathematical model is an attribute model 
because it is a theoretical statement based upon the 
structural mechanism posited by the structural 
model. Unfortunately, in sociology such models 
are often created using class-defining attributes and 
in reality examine how the attributes of classes and . 
subclasses are related to each other and not how 
the attributes of interrelated units vary in relation- 
ship to each other. When such procedures are 
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permit the flow of inputs and outputs among 
the parts and also fumish the pathways 
through which the parts affect each other as 
they interact. The positions making up the 
structure of a group simultaneously represent 
locations in a structural pattern and structural 
units with specific behavioral characteristics 
that occur at that location. Positions represent 
parts of social structure that are empirically 
observable, occupy space, and transpire as 
behavior over time. Although they may be 
described in terms of attributes that vary from 
one position to another, or from one time to 
another with respect to the same position, as 
parts of social structure positions are defined 
as units and not as variables. 

The key to understanding structural model- 
ing as an intellectual strategy is the idea of 


interconnectedness among analytically sepa- 


rate parts. In pursuing this strategy, the 
analyst makes observations on a phenomeno- 
logical target and breaks it down into its 
component parts. In so doing, the locations 
and relationships of the parts are noted and 
later represented symbolically by concepts 
referring to location and relationship, in the 
sense of connection. The objective is to create 
a mental image, using symbols, that repre- 
sents the object being modeled and preserves 
its spatial and temporal relationships. 

' Before discussing classification. we must 
recognize that the construction of a structural 
model of a particular social unit, for example, 
a particular university, is always carried on 
. using concepts which are categorical abstrac- 
tions. For example, the term social position 
refers to a category of social units which 
conform to a particular abstract definition. 
When we say: that the structure of a -given 
university includes positions such as presi- 
dent, dean, department head, and faculty, we 
‘are using category labels to refer to “types of 
positions.” However, the model constructed 
of a given university's structure is not 
constructed out of categories of positions. 
The position of president in a particular 
university is a unique, exact position and it is 
related in a model to other unique positions in 
exact locations. A model, so developed, 
represents locations of unique positions in 


based on class attributes, they fail to offer 
explanations for social phenomena. Often they 
merely reconfirm the logical relatedness of classi- 
fication criteria, an essentially circular and unpro- 
ductive procedure: 
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particular configuration where a specific 
president, deans, department heads, and 
faculty interact and play roles toward each 
other. The same applies at the group level 
where exact academic departments are placed 
in particular locations in a network of 
relations toward each other. ` 

Structural models are abstract in that only 
certain aspects of a phenomenological target 
are modeled. Other nonstructural attributes 
are “stripped away” or ignored. Thus, a 
structural model of an object represents a 
pattern underlying the surface appearances of 
the object being examined and must be 
arrived at by procedures that follow certain 
rules. These rules determine what aspects of 
the object will be examined and symbolized 
and which will be ignored as surface 
appearances. The analyst selects those rela- 
tionships that are theoretically relevant to 
understand the behavior of an object. The 
sociologist may choose to ignore unique 
personality attributes of the interacting people 
and focus instead upon characteristics that 
preserve the configuration of parts and their 
relationships within the system. This is 
analogous to the physicist who models the 
interaction of two objects and ignores their 
color, focusing instead on vectors of force 
and mass. 

Thus, structural modeling is an inherently 
theory-laden activity that requires the execu- 
tion of an intellectual procedure which first 
identifies a set or parts or units and then 
specifies how they are related, all of this 
according to a set of rules which themselves 
specify what is and is not a structural 
attribute. In other words, the attributes of 
units and relationships selected for use in a 
particular model are derived from and are 
based upon the theorist "problematic," philo- 
sophical orientation, or worldview (Hanson 
1965; Glucksmann 1974; Horan 1978). 


Classification as an Intellectual Strategy 


Classification is an intellectual procedure that 
uses an entirely different set of rules from 
those employed in structural modeling (Crow- 
son 1970; Simpson 1961; Sokal 1974; Sokal 
and Sneath 1967). These rules, because they 
violate the rules of structural modeling, result 
in conceptualizations which, although impor- 
tant and useful, are inappropriate for use in 
structural models. Classification. constructs 
categories on the basis of the mental proce- 
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dures of comparison, where similar objects 
are placed together conceptually under a 
common label.* This label implies class- 
defining attributes that operate as the criteria 
of classification. A category system can be 
either. explicit or implicit and simple or 
complex. At a minimum, it consists of two 
categories, A and non-A, although it may 
contain many more. 

Most important; classification does not 
require connections among the members of-a 
category or even that they exist contempora- 
neously. All that is required is that the 
members of a class be similar to each other 
with respect to the defining criteria. Some 
members of a class may have relationships to 


` other members of the class but these relation- 
ships are not relevant to the classification 


scheme and are merely coincidental. For 
example, in the categories “males” and 
“females” some individuals may be brothers 
or sisters, cousins, or coworkers, but this has 
no relevance to the classification “males and 
females.” Indeed, the members of a class are 
not thought of in spatial and temporal terms, 
but only in terms of similarity in class- 
defining criteria. 

This holds even when siioni. temporal, 
or spatial attributes are employed in the 
classification process. For example, it is 
possible to classify individuals or the posi- 
tions they occupy in specific business organi- 
zations on the basis of their relative control 
over the means of production and workers, 
arriving at three categories: workers, owners, 
and managers (cf. Wright 1976, 1979, and 
1980). Clearly, it is possible to say that some 
owners and managers have direct relatión- 
ships to some of the; workers so classified. 
However, we would be in error to assume that 
“owners” and “managers” as categories have 
a direct interactional relationship to “work- 
ers” as a category. 


4 In classification, real objects are not sorted 
into homogeneous bins or piles that exist as 
collectivities in the real world, but the analyst 
groups them together mentally in conceptual space: 
Thus, when we classify fruit as apples and 
oranges, we do not necessarily sort them physi- 
cally into homogeneous piles. Sorting and classifi- 

. cation are different operations, one à physical act 
and the other a cognitive one. When we classify, 
real-world objects are not affected or changed by 
the act of classification. It is only our conception 
of things that is affected. 
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The relationship between classes in a 
classification scheme always involves the use 
of contrast or opposition to define class 
boundaries and relative similarity to define 
relationship of classificatory kinship. For 
example, the class plumber is contrasted to 
the class electrician by differences in the 
class-defining attributes (e.g., working with 
pipes, electricity, wire). But plumbers and 
electricians could be subsumed under a larger 
class called “construction workers" using 
more. general criteria. Thus, plumbers are 
considered to be more closely related to 
electricians and carpenters than to secretaries, 
bookkeepers, and file clerks, who are E. 
workers.” 

In a classification scheme it is starches 
possible to think of the various classes in 
terms of closeness of relationship if relative 
similarity and difference is used as the 
‘method of defining relationship. But this sort 
of relationship, based as it is on affinity, is 
quite different from that referred to in 
structural modeling where the term means 
connectedness or articulation. The connected- 
ness referred to in models is thought of as 
existing in the empirical world and as only : 
being represented symbolically in the model. ` 
The relationship among the members of a 
class and among subclasses undér a general 
class is a result of comparison to mental 
criteria and exists only as an idea of similarity 
in the mind of.the classifier. The point is that 
the mind constructs the class on the basis of 
comparison and there is no object in the real 
world that corresponds to the class. It is a 
pure mental construct with no empirical 
referent. l 

One might argue that similarity exists in the 
real world independent of the mind’s percep- 
tion of it. This is like saying that plumbers are 
a category whether we define them as such or 
not. On this basis some argue that classes or 
Categories ate “discovered” rather than con- 
structed. There is, however, a difference 
between the idea that plumbers exist in a state 
of being alike, and the idea that a class as an 
entity exists and is part of a relationship 
system. The collective thing called the class 

“plumbers” does not exist as a collective 
entity in the empirical world. It does not 
occupy space, transpire in time, or operate 
within itself or upon anything else in nature as 
an entity. The category exists as an idea or 


` concept in the mind of the classifier representi- 


570 


ing the perception of similarity among many 


separate objects. 

Because the classes or categories in a 
classification scheme do not exist as collec- 
tive entities outside of the mind, they cannot 
' be used as units in the structure of society. 
There are no observable elements of society 
which are composed of categories as parts. 
Rather, all of the observable parts of society 
are composed of particular parts or units, not 
whole categories existing as collective enti- 
ties. A construction crew may include several 
plumbers, carpenters, and electricians, but the 
crew is made up of individual practitioners of 
these trades occupying particular positions 
connected by a network of interactional 
relationships, not of categories. Even if the 
plumbers work as a team or are organized as a 
chapter of a labor union, this does not mean 
that the construction crew contains the 
' category plumbers as a part or unit. Rather, it 
contains work teams, and the individual 
members are joined or connected by a set of 
social relationships and constitute a group, 
not a category. 


THE ROLES OF MODELING AND 
CLASSIFICATION IN SCIENCE 


The very rules which govern classification as 
an intellectual strategy and those which 
govern structural modeling result in conceptu- 
- alizations that are incompatible. Individual 
structural models are not classes in a 
classification scheme but representations of 
particular social objects often composed of 
dissimilar parts. Classification schemes are 
not models of social objects, nor can their 
component categories be used to construct 
such models because they stem from a 
procedure, that violates the rule of connected- 
ness that is central to structural modeling. 
This argument leads to the conclusion that 
under no circumstances can the structure of 
society be described as containing or being 
modeled by the use of categories or classes as 
fundamental units. What then are the proper 
roles of these two different strategies? 


Modeling 


The central role of structural models in 
Science is to provide the basis upon which 
explanation can rest (Willer 1967). Structural 
models make possible the conceptualization 
of mechanisms upon which causal relation- 
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ships are grounded. To understand this role, 
we begin this discussion by looking at models 
in terms of levels of scale and by examining 
the implications of raising the level of scale of 
a model. 

It helps to think of a continuum ranging 
from micromodels to macromodels. When we 
proceed from the level of a micromodel which 
describes or represents the structure of a small 
group to the creation of a model of an 
organization which contains several groups, 
and then to one which includes many 
organizations and groups in a community 
structure, and finally to the formulation of a 
structural model of an entire society or world 
system, we raise the level of scale of our 
models. Each successively larger scale model 
implies the need for smaller scale models to 
supply knowledge about the internal organiza- 
tion of the units that make up larger social 
structures. The problem in modeling is to 
introduce relationship concepts which will 
represent the network of connections that bind 
the smaller units together to form the larger 
one so that the various parts of the model at 
each level of scale are articulated with each 
other. 

At each level of scale an entirely new set of 
structural properties emerges out of this 
network and the interactions among parts that 
are generated by it. Groups do not relate to 
each other structurally as if they were really 
no more than individual actors. Organizations 
are not merely large groups, or simple actors 
like individual actors in a social interaction. 
There are special structural arrangements 
which emerge at each level of scale, and new 
social phenomena are encountered. Further- 
more, the networks that bind units of various 
levels of scale to each other furnish a system 
of articulation through which the effects of 
the behavior of one unit may be transmitted 
over a linkage system to impact on other 
units. These impacts are transmitted upward 
and downward in levels of scale as well as 
laterally within a given level of scale. This is 
like saying organizational level phenomena 
have an impact on group-level phenomena 
and vice versa, and that one group affects 
another as do various interrelated organiza- 
tions. 

This characteristic of structural connected- 
ness makes it possible to conceptualize events 
in one part of a system affecting events in 
other parts. Therefore, on.the basis of an 
implicit or explicit model, we formulate 


CONCEPTUALIZING SOCIAL STRUCTURE 


causal theories from which we derive hypoth- 
eses for empirical testing (Willer 1967). 
Without conceiving articulated units as form- 
ing a network, there is no basis for formulat- 
ing a theory of “causal” relationships. One 
may argue that, as attribute "X^" of unit “1” 
increases, attribute "Y" of unit "2" de- 
creases, but in order to see "X" as causing 
“Y,” we must define a connection between 
unit “1” and unit “2” through which a causal 
mechanism operates. Without a causal connec- 
tion, the relationship between “X” and “Y” 
remains a prediction rather than an explana- 
tion. The objective of science is to explain 
and not only to predict. Explanation implies 
knowledge of how an effect is produced by 
causal mechanisms and not whether it is 
produced or not. 

The articulation among units in modeling is 
precisely what classification schemes lack, 
and what is lacking among units in a 
category. It is therefore apparent that catego- 
ries in a classification scheme are not 
available for incorporation into models which 
claim to explain. For example, even if it can 
be demonstrated that membership in a class 
called proletariat is associated with lower 
income than membership in a class called 
capitalist, we cannot say that class member- 
ship is causing the difference (cf. Wright 
1979). Classes or categories are pure mental 
constructs and have no real object as a total 
category to which they refer.> What is causing 
the difference between classes must be found 
in a model which depicts how the parts of the 
social system in which income is produced 
are organized and how it operates through 
time. In the case of income differences 
between capitalists and proletarians, it is only 
that we have chosen terms taken from a prior 
structural model of the capitalist enterprise to 
base our classification on, that makes it 
appear reasonable to think that differential 
income between classes is being “caused” by 
class membership (Wright 1980, 1976). 
Membership in these classes is not member- 
ship in a group or organization, but only a 
kind of figurative membership created in the 
mind of the analyst by the act of classifica- 
tion. It is the structure of the social system 
which places individuals in certain positions 


5 This statement holds even if one predicts that 
someday the “class” proletariat will exist as a 
revolutionary organization in an interactional 
sense. 
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with less power or control over the means of 
production that determines income differen- 
tials, not class membership. 


Classification 


This does not imply that classification is 
worthless as a tool in the study of social 
structure. Rather, classification must be used 
as a Separate and complementary procedure in 
conjunction with model building but not as a 
part of it. To understand this, we must first 
discuss the classification of structural models 
themselves. 

Like any target set of objects, structural 
models may be classified in terms of 
structural similarity and difference. For exam- 
ple, suppose a set of structural models has 
been constructed to represent individual 
restaurants. If we were comparing actual 
restaurants with respect to their structure, we 
might arrive at a classification that contains 
such forms as cafeterias, drive-ins, full- 
service restaurants, and so forth. These types 
would constitute classes or categories of 
restaurants having a similar structure. The 
class-defining criteria would be the posses- 
sion of certain structural attributes. For 
convenience, a category of social units with 
identical structures will be called a social 
form.® The point is that social forms consti- 
tute classes of units with identical structures. 
It would still be incorrect to say that society's 
Structure is constructed out of social forms, 
because they are, after all, merely classes in a 
classification scheme even though the classi- 
fication is based on structural similarity. But 
now we have the advantage of being able to 
generalize statements concerning how the 
structure of a few members of a class operates 
or produces certain results, to other members 
of the same class since they would all be 
identical with respect to their structure. 


©The term structural form is used at many 
levels of abstraction by various social theorists. 
For example, Simmel proposed a number of highly 
abstract forms that reduced structural patterns to 
almost geometric sets of relationships (Wolff 
1950). Similar procedures are utilized by modern 
network theorists when conceptualizing connec- 
tions between "nodes" in a social network. These 
uses are compatible with ours but at a more 
abstract level where virtually all behavioral content 
is removed and focus is on the formal or 
mathematical properties of the network. 
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This points to the central and vital function 
of classification in the grand strategy of 
Science. Its most important purpose is to 
allow us to generalize what we learn by the 
` study of a few members of a category to other 
members of the same category. The ability to 
generalize includes the ability to predict. If 
we are able to say that something is a 
"member" of category "Y," and all Y's 
behave in such a fashion, then we can predict 
that the new object Y will behave in a similar 
fashion. But to make this procedure work 
optimally it must be grounded on valid 
structural models. These models represent 
how objects in nature are joined and how their 
parts operate in relationship to each other and, 
as a result, provide the basis for explanation 


and theory (Willer 1967; Bates and Harvey . 


1975). The purpose of classification, on the 
other hand, is to raise explanation to the level 
of a generalization to a class of similar 
objects. This is accomplished by a reiterative 
procedure in which a generalization in the 
form of a hypothesis is derived from a model 
and tested on a few selected members of a 
category. The ultimate success of the proce- 
dure depends upon preserving a clear distinc- 
tion between concepts derived from classifi- 
cation, and those derived from modeling. If 
models are constructed using categories, the 
procedure becomes circular and self- 
confirming. 


SUMMARY AND DISCUSSION 


The above argument is illustrated in Figure 1 
which, from bottom to top, represents three 
levels of abstraction, each succeeding level 
moving further from the world of sense 
impressions. At the first level of abstraction, 
structural models of three separate groups are 
shown. Only three-person (position) groups 
are shown for the sake of simplicity. Let us 
assume they are all work groups of a similar 
type, for example, physicians’ offices. 

Each group is represented symbolically by 
a set of geometric figures in a definite 
relationship to each other. The shape of the 
geometric figures represents the different 
positions and roles contained within each 
group. For example, the triangle represents 
the position of physician, the square the 
position of nurse, and the circle the position 
of office manager/secretary. Each geometric 
figure represents a unique position and is 
labeled by a subscript indicating, first, group 
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location and, second, its identity in the group. 
The triangle or physician’s position in group 
one is given the subscript 1,1 to indicate it is 
located in group | and is identified as position 
1. In group 2, the physician’s position is 
given the subscript 2,1, indicating it is 
position 1 in group 2, and so on. Inside each 
geometric figure the letters X, Y, and Z 
represent the actors occupying each position, 
and the subscript indicates group location. 
They could be read as Physician X,, Nurse 
Yı, Office manager Z,. The letters selected 
are arbitrary and they may represent a 
distinctive feature of the positional occupant 
such as sex, race, or other attribute. The 
double-ended arrows represent the unique 
relationships between actor-occupied posi- 


tions and are designated by "R" with a triple 


subscript. The first number in the subscript 
indicates the group location, and the last two 
the positions in contact. For example, Ri; 
indicates the relationship in group 1 between 
positions 1 and 2. 

It is important to note that each group is 
represented by a separate model involving 
particular people occupying particular social 
positions in association with each other in the 
context of different physicians' offices. These 
models would utilize concepts referring to 
unique social units and the relationships or 
connections that relate these units to each 
other to form an observable bounded whole. 
Although they are all "physicians' offices," 
in the real world they exist as separately 
occurring behavioral networks. 

Again, the purpose of these structural 
models is to furnish an analyst with a 
symbolic representation of an entity, a social 
group in this case, which can be used to 
formulate theories about how entity elements 
function in relations to each other. To 
formulate theories for scientific testing, at- 
tributes of the parts of the model or of their 
relationships to each other are examined in 
terms of how changes in one are related to 
changes in the others. 

In the middle section of Figure 1 we show 
three classification. schemes of positions, 
relationships, and actors or position occu- 
pants. For example, all triangles are classified 
together as physician positions and assigned 
the symbol A, squares are classified together 
as nurse positions and assigned the symbol B, 
and so on. Relationships and occupants have 
also been classified and symbolically desig- 
nated. 
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Figure 1. 


While we have represented these categories 
symbolically and given the reader some idea 
of their constituent elements, we stress that 
these categories are only mental conceptions 
concerning similarities and differences among 
individual objects with respect to selected 
attributes chosen as the basis for classifica- 
tion. Within the categories, the individual 
units being cónceived of as alike or different 
are not bound together structurally by the act 
of classification. They remain in the real 


Relationships 


C=(013,023 O33 ON3} III-(R123; R223» R3237- RN23) 





573 


Occupants 
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world where and as they were, that is, as 
separate individual units, part of an organized © 


-system in relation to other units. The logic of 


classification does not require that the object 
in the same class be linked by any sort of 
connections which can justify thinking of the 
class as an object in and of itself. Whatever 
connections do exist between units in a class 
are coincidental as far as the logic of 
classification is concemed. 

Analysis undertaken by cross-classifying 
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these categories will inform us about the 
nature of these groups. For example, we may 
choose to cross-classify our positional with 
occupant categories, thereby learning about 
the allocated process and gaining actuarial 
knowledge useful in prediction. However, it 
would be a mistake to suggest that category 
membership causes one to be allocated to 
certain positions. Rather, it is the actual 
structure of the group (i.e., the norms, roles, 
attitudes, and power in specific structural 
locations) or perhaps the group’s location in a 
larger organizational network that determines 
allocation. Further, categories of positions 
formed on the basis of power attributes and 
individual occupants can be cross-classified 
or examined through time, as Blau (1977) and 
others have fruitfully undertaken, thereby 
informing us of heterogeneity and inequality 
within a social system. However, it would be 
an error to discuss or consider these catego- 
ries as part of social structure. 

The top of Figure 1 provides three 
examples of structural forms. The problem is 
that only one, form a, is correctly conceptu- 
alized. Both forms b and c are of questionable 
utility because they treat categories as if they 
were units. In form c, the categories of 
individuals Phi, Theta, and Omega, represent- 
ing people who occupy physician, nursing, 
and office manager positions respectively, are 
treated as if they are a collective unit in 
interaction. Indeed, the relationships are also 
represented as categories of relationships. The 
problem is that this form does not represent 
any particular part or unit that exists outside 
the mind of the analyst. At best, it represents 
a very abstract generalization about the 
similarity found in relationships between 
individuals who tend to occupy similar 
positions. Logically, given the analytical 
steps necessary for the creation of forms b 
and c, there is no possibility of correspon- 
dence between these forms and the real 
world. 

Only structural form a represents a struc- 
tural form, in this case a physician's office, 
that has some utility. This form does not use 
categories as units and it attempts to preserve 
the temporal and spatial relationships between 
positions and occupants. This structural form 
is derived by stripping away any unique 
attributes incorporated in the structural mod- 
els formed at the first level of abstraction. 
Here the symbols represent idealized separate 
positions, relationships, or occupants, which, 
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although part of a class, are thought of as 
separate exact members of the category. 
Thus, the form itself represents a class of 
similar unique structural models and can now 
be utilized to generalize. If a social unit can 
be said to fall into this category (i.e., be 
classified as a structural form a), then we can 
generalize and make predictions concerning 
its behavior. While structural forms of this 
nature are useful, because of their classifica- 
tory nature, they can not be employed as 
models of social structure. 

Idealized structural forms similar to form a 
have informal counterparts in the culture of 
society that serve as information utilized by 
social actors. These abstract forms or institu- 
tions represent designs for social action in that 
they supply general rules for organizing and 
forming common types of social units and for 
behaving in social situations (Fararo and 
Skvoretz 1984, 1986b). Examples are the 
idealized forms elicited in the minds of actors 
in American society by tbe concepts family, 
fast-food restaurant, or elementary school. 
These forms provide information on how to 
organize a family or school and how to act in 
customer-seller relationships. By furnishing 
collectively understood (institutionalized) plans 
for the organization of social relationships, 
these forms insure a degree of stability and 
continuity in the social network of society. 
However, these institutionalized structural 
forms are also classificatory in nature and can 
not be employed as parts of social structures 
in the sense of actual groups or organizations. 
Indeed, these institutionalized forms repre- 
sent, in part, the contents of social structure, 
but not the structure itself.? 


7 The structure of society at any given moment 
may be thought of as an expression or embodiment 
of these institutionalized abstract forms. However, 
in the implementation and utilization of these 
forms, deviations and distortions are inevitable. 
For example, factors such as the exact location in 
the larger network where the instantiation occurs 
will constrain its implementation, especially in 
complex societies where competing forms having 
the same function are likely to exist. Further, 
having knowledge of the repertoire of social forms 
contained within a cultural system does not insure 
knowledge of social structure any more than 
knowledge of the grammar of a language will 
allow you to know an exact sentence in the context 
of a particular paragraph. It is necessary to know 
how these forms are distributed in a social network 


CONCEPTUALIZING SOCIAL STRUCTURE 


None of the structural forms represented in 
the top of Figure 1 can be properly used as 
units in a model of social structure. Our 
argument is that the proper procedure in the 
search for explanations of social phenomena 
is to start by creating an explicit structural 
model of the phenomenon to be investigated. 
Classification should follow modeling and be 
used as a means of generalizing knowledge 
obtained from the examination of a small 
number of cases to other'cases to which a 
similar model applies. In fact, the criteria of 
classification should be carefully selected 
from a consciously defined prior model so 
that classification cari assist in generalizing 
the knowledge obtained from modeling to all 
examples of the same phenomenon. 


CONCLUSIONS AND IMPLICATIONS 
FOR SOCIOLOGY 


Current fashions in sociological research 
depend heavily on the use of classification as 
a-tool in defining and operationalizing 
research problems and in formulating theories 
and hypotheses. All brands of sociological 
analysis, be they conflict or functionalist, 
Marxist or non-Marxist, or micro or macro, 
confuse categories with social units. As often 
as not, these categories are based on an 
unknown or ill-defined model of social 
systems or are merely customary ways of 
sorting individuals according to characteris- 
tics believed to have social significance as 
defined by the culture and value system 
dominant in society (Wellman and Berkowitz 
1988, p. 15; Mayhew 1980, 1981). Thus, 
sociologists create categories of people, based 
on occupation, ethnicity, gender, race, power, 
or income, as if these categories are real and 
useful for causal explanations. Then sociolo- 
gists cross-classify these category schemes, 
and when statistical analysis shows that 
individuals fall into particular cells in a 
nonrandom fashion, they conclude that dispro- 
portionate membership in these categories, as 
the parts of social structure, “causes” the 
phenomenon of concern. 

. On the basis of the above arguments, we 
argue that the structure of society does not 
include social categories, however defined, as 
units. Instead, social structure consists of 








and how they are articulated to form a larger whole 
before we can define social structure completely. 
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functioning groups and organizations such as 
families, work groups, business enterprises, 
labor unions, political parties, gangs, schools, 
friendship cliques, and so forth, which are 
intertwined through relationship. networks to 
form ever more complex structures. The 
structure of society does not consist of 
categories of people any more than organisms 
consist of categories of organs, or machines 
consist of categories of parts, or molecules of 
categories of atoms. Organisms consist of 
particular individual organs and machines of 
particular parts. In the same way, groups 
consist of the behavior of particular people, 
and organizations of particular groups. Cate- 
gories can form nothing in and of themselves, 
and category membership, because of the 
ontological status of categories, can not cause 
anything. 

We recognize that people who occupy 
positions in the structure of society have 
personal characteristics such as gender and 
race and that these characteristics are impor- 
tant in determining their behavior and the 
behavior of others toward them. This does not 
mean that social structure consists of gender 
or racial categories as parts. To utilize 
personal characteristics as the basis for 
formulating models of social structure is to : 
confuse categories with social units. It is 
certainly important that gender and racial 
categories differ in other characteristics such 
as income, education, and wealth. We can 
point out these differences and evaluate them 
on moral grounds. We can even predict them 
with confidence, but it is another thing to 
explain them. The important question for 
sociology is what operates to produce these 
differences. It is the structure of society that 
operates, and without a model of that 
structure, we cannot explain these differ- 
ences. 

Sociologists misconceive the nature of 
social structure when they treat abstract 
mental categories used to summarize the 
similarities and differencés among social units 
or actors as building blocks to construct 
structural models of society. This is simply 
wrong, because it is inconsistent with the 
assumptions upon which classification as a 
logical procedure is based. Classes in a 
classification scheme result from particular 
mental operations, and because of this they 
cannot, by any definition that remains faithful 
to the meaning of classification as a proce- 
dure, be treated as objective phenomena 
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occurring outside the mind of the classifier. 
Their only real consequences are those they 
produce as subjective factors in the minds of 
society’s members. 

The misconception of social , structure 
through the inclusion and even the concentra- 
tion of research efforts upon categories as 
units obfuscates a critical task that we, as 
sociologist, must undertake: the construction 
of structural models. Until we begin to 
develop and operate on the basis of explicitly 
defined structural models, we will remain a 
soft science. Without such models our 
investigations and analyses of social phenom- 
ena will be haphazard, will lack theoretical 
underpinning allowing for explanation, and 
will be actuarial at best. With structural 
models the possibility of our scientific 
development, while not guaranteed, is at least 
probable. Perhaps the emerging “structural 
analytic” paradigm indicates movement in 
this necessary direction (cf. Wellman and 
Berkowitz 1988; Berkowitz 1982; Bates 
1987; Burt 1982; Fararo and Skvoretz 1986a, 
1986b). 
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THE IRISH CASE OF DEPENDENCY: 
AN EXCEPTION TO THE EXCEPTIONS?* 


Dents O’ HEARN 
The University of Wisconsin-Madison 


The dependency approach has recently been criticized by authors who quote the 
“exceptional” East Asian cases. The Irish case is used to refute these new 
modernizationist arguments on two counts. (1) Countries such as Ireland, not the 
East Asian countries, have regimes with characteristics that tend to set dependency 
relations in motion. These characteristics include radical free trade, free 
enterprise, and foreign industrial domination. (2) Because of these characteristics, 
Ireland has endured economic stagnation and tendencies toward higher inequality. 
Irish economic growth under foreign-dominated industrialization was slowed by 
decapitalization and by the absence of linkages between foreign and domestic 
industry. Time-series models show that foreign penetration and free trade are 
related to slower economic growth, because of slower investment-growth and other 
reasons. Inequality increased primarily because of rising unemployment, Although 
social welfare programs reduced the effects of direct-income inequality, their 


effects have been reduced by regressive taxation and austerity programs. 


For the past 20 years, the approach loosely 
termed as dependency enjoyed popularity in 
development studies, particularly among rad- 
ical scholars. Many studies concentrated on 
the connections between foreign penetration, 
economic growth, and income distribution. 
Baran (1957) and others emphasized the 
tendencies of foreign-penetration to limit 
economic growth in the LDCs. Later works 
(Frank 1969) implied “laws” of development, 
relating proximity to the “metropole” with 
underdevelopment in the periphery. A series 
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drafts. All remaining faults, of course, are my 
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of analysts then attempted to "test" the 
hypotheses that foreign penetration caused (1) 
low growth rates and (2) inequality in LDCs 
(Chase Dunn 1975; Bornschier 1980; Bier- 
steker 1978; Bornschier and Ballmer-Cao 
1979; Evans and Timberlake 1980). 

Within the past few years some scholars 
have strongly challenged the dependency 
approach. A number of empirical analyses 
concentrated on "exceptions" to dependency, 
particularly from East Asia (for a review, see 
Chakravarty 1987). In place of dependency, 
these analysts propose a return to orthodox 
principles of neoclassical development eco- 
nomics and modernization theory. To qucte 
one of their starkest claims: 


Radical critiques employing concepts like "pe- 
riphery," "dependency," "exploitation" and 
“delinking” have lost much of their previcus 
attraction. The examples of the Pacific countres 
have demonstrated their irrelevance as analytical 
tools. (Linder 1986, p. 54; see also Keesing 
1967; World Bank 1981) 


In place of dependency, a series of trade 
theorists give the following reasons for the 
“gang of four’s” success: 

(a) maintenance of an outward-looking orienta- 

tion throughout the rapid growth phase of these 

countries; 

(b) maintenance of a very hospitable climate for 

foreign investment; 

(c) and finally, “keeping the prices right," by 

which they imply a relatively low real price of 
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labour, a relatively high real rate of interest and 
“realistic” exchange rates. (Chakravarty 1987) 


Sociological and political accounts of the 
success in East Asia concentrate less on 
openness to foreign penetration than on 
political and class variables —the nature of the 
colonial and post-colonial relationship with 
Japan, the emphasis on labor-intensive enter- 
prises, and the absence of an entrenched 
bourgeoisie. In fairness, many of these 
“non-economic” analyses have taken great 
pains to point out that South Korea and 
Taiwan are special or deviant cases and, 
therefore, do not in themselves constitute a 
threat to the dependency approach. Barrett 
and Whyte, for example, argue that Taiwan is 
a “deviant case” of dependency theory 
because foreign penetration was linked nei- 
ther to stagnation nor inequality (Barrett and 
Whyte 1982; see also Cummings 1984). 

In the present article, I will challenge the 
new modernizationism in development stud- 
ies by presenting the case study of Ireland 
(i.e., the 26 southern counties). Ireland 
represents not only an “exception” to the 
“exceptions” in the so-called gang of four. It 
is in many ways a truer representation of the 
open, foreign-dominated, free-enterprise re- 
gime that the new modernizationists pre- 
Scribe. In development terms, it is also an 
abject failure. This is especially significant 
because Ireland has for 30 years been in the 
heart of European economic integration. For 
15 years, Ireland has "enjoyed" full member- 
ship in the European Economic Community, 
one of the dominant economic powers of the 
“modern industrial West." Indeed, this close 
relationship to the core, in classical “depen- 
dency" terms, directly contributed to Ire- 
land's development problems. 


SPECIFICITY OF IRISH DEPENDENCY 


Ireland is an island about the size of the state 
of Maine, lying 30-60 miles from the British 
mainland. Its population is about four and a 
half million, well below its mid-19th century 
population of eight million. Its modern 
bistory is dominated by British occupation, 
four major waves of emigration, and several 
famines, including the famous mid-19th 
century “potato famine.” In the 19th-century, 
Irish peasants won limited ownership of farms 
and limited rights against British landlords. 
Ireland’s traditional industrial area, around 
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Belfast, was based on shipbuilding and 
linen-making for the British empire. Industry 
in the rest of the island was impeded by 
British laws, which removed Irish tariffs on 
industrial goods and outlawed certain lines of 
industry and exports. 

Ireland was partitioned in 1921, at the end 
of a war of independence. Britain retained six 
of the nine counties of the province of Ulster, 
which included industrial Belfast and the 
largest adjoining area with a built-in settler 
majority. The southern state of today, there- 
fore, comprises an area of about four-fifths of 
the island, with a population of about 3 
million. 

The new postcolonial state in 1932 em- 
barked on an attempt to build native industry 
through a classical program of import- 
substituting industrialization (SD, with high 
levels of tariff protection. Despite an eco- 
nomic war waged by Britain against Ireland, 
this program was quite successful: between 
1931 and 1947 the number of manufacturing 
establishments employing more than 10 grew 
by 63 percent, and those employing over a 
hundred more than doubled. In the same 
period, industrial employment grew by about 
80 percent, from 110,588 to 197,605 (O’ Hearn 
1988, pp. 82, 89). But in the mid-1950s, 
because of rising dollar trade deficits, exter- 
nal political pressures tied to Marshall Aid 
and European integration, and economic 
recession, the Irish regime changed the 
industrialization program from ISI to export- 
led industrialization (ELI). The new ELI 
regime had three distinguishing characteristics: 
(1) radical free trade, (2) radical free 
enterprises, and (3) foreign industrial domina- 
tion. 


Radical Free Trade 


Unlike many developing countries, which 
followed a “stop-and-go” pattern of deprotec- 
tion and reprotection, Ireland was forced to 
free its trade rapidly and totally. The removal 
of protection began after Ireland joined the 
Organisation for European Economic Cooper- 
ation (OEEC) in the 1950s—a prerequisite for 
receiving Marshall Aid—and ended when 
Ireland joined the EEC in 1972. Table 1 
clearly shows the fall of tariff receipts during 
ELI, beginning in 1959 when OEEC free 
trade pressures became severe. In 1972, tariff 
revenues as a percentage of total Irish 
government revenues fell by almost 6 per- 
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Table 1. Revenues from Customs Tafiffs, and as Percentage of Total Net Government Receipts (million Irish punts*) 








() Q) 


tariff total (1) as 
Year revenue revenue % of (2) 
1954 37 83 44.6 
1955 37 85 43.5 
1956 39 89 43.8 
1957 45 94 47.9 
1958 47 97 48.5 
1959 48 98 49.0 
1960 45 103 43.7 
1961 41 107 38.3 
1962 45 119 37.8 
1963 47 129 36.4 
1964 50 145 34.5 
1965 56 177 31.6 
1966 58 194 29.9 
1967 68 222 30.6 
1968 70 248 28.2 
1969 76 282 27.0 


Q) Q 


tariff total (1) as 
Year revenue revenue % of (2) 
1970 88 338 26.0 
1971 92 398 27.2 
1972 101 469 21.5 
1973 117 540 21.7 
1974 136 665 20.8 
1975 176 901 19.5 
1976 25 1,222 2.0 
1977 29 1,445 2.0 
1978 35 1,709 2.0 
1979 39 1,991 2.0 
1980 46 2,584 1.8 
1981 58 3,274 1. 
1982 62 4,014 1.5 
1983 T! 4,503 1.7 
1984 94 5,115 1.8 


* The punt is the Irish currency. Its value was tied to the British pound until Ireland joined the European Monetary 


System in 1979. 
Source: Irish Revenue Commissioners (various years). 


cent. Four years later, as a result of Ireland's 
terms of accession to the EEC, tariff revenues 
fell to practically nil.! 

As a result, the Irish market was penetrated 
by competing imports. Between 1960 and 
1980, imports took over the Irish market in 
nearly every category of manufactured goods. 
In footwear and clothing, imports rose from 8 
percent of domestic consumption in 1960, to 
70-80 percent in 1980. In nonelectrical 
machinery, the share of imports during the 
same period rose from 55 to 98 percent. 

The results for Irish-owned industry were 
disastrous (Table 2). Between 1973 and 1986, 
85 to 90 percent of the jobs in pre-1955 
clothing and textiles firms were lost. Three- 
fourths of the jobs in domestic miscellaneous 
manufacturing and over half of the jobs in 
domestic chemicals and metals were lost. In 
the pre-ELI Irish manufacturing sector as a 


‘Care should be taken not to confuse tariff 
revenues with categories such as "taxes on 
international trade and transactions" (e.g., in 
World Development Report). For Ireland, the latter 
is primarily excise taxes on alcohol and tobacco, a 
consumption tax which has partly replaced tariffs 
as a source of government revenues. Excise taxes 
are not a form of protection because locally 
produced alcoholic drinks and tobacco products are 
taxed at the same rate as imports. In addition, 
because excise taxes are placed on goods that are 
highly price inelastic, they have a limited effect on 
consumption of taxed items. 


whole, half of the jobs held in 1973 were lost 
by 1986 (60 percent in the nonfood sector). 
Of course, this demolition of protected 
nationalist industry could be viewed as 
"restructuring" —that is, clearing out ineffi- 
cient and unprofitable sectors to make way for 
more profitable “modern” sectors. To vzli- 
date the dependency approach, therefore, we 
must show that the new industry which 
replaced domestic manufacturing was not 
conducive to economic growth. 


Radical Free Enterprise 


From its inception, the ISI regime was based 
strictly on principles of "private enterprise." 
Interventions by the state into the "business 
of business" were few, and state industry was 
limited (in 1945 there were three infrastruc- 
tural and two industrial state companies). 
These principles were intensified under ELI. 
The state’s role was to market Ireland as a 
profitable location for business—to provide 
incentives for industry to locate in Ireland and 
to find firms that would respond to the 
incentives. After that, new foreign firms 
could avoid any kind of scrutiny. The 
disposition of profits is left entirely in the 
hands of the firm. No profits taxes are paic on 
most manufactured exports, and profits may 
be freely repatriated. Means of production are 
freely imported, and output is freely ex- 
ported. The regime does not pressure TNCs to 
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Table. 2. Changes of Employment and Number of Firms in Domestic “Old” and “Adapted” Industry from 1973 to 


1986, Ireland 
Employment 

Sector 1973 1986 

Food 27,601 17,330 
Drink 3,804 3,197 
Textiles 8,561 894 
Clothing 8,084 1,231 
Wood 5,186 2,769 
Paper ' — 8,446 4,453 
Clay 9,861 7,519 
Chemical 2,245 1,047 
Metals 12,148 5,172 
Other mfg. 3,267 . 766 
Nonfood 58,825 23,851 
TOTAL 90,230 44,378 


No. of Firms 

$6 change 1973 1986 % change 
-372 598 448 . —25.1 
—16.0 7 56 ~27.3 
— 89.6 120 43 —64.2 
— 84.8 213 78 ~ 63.4 
— 46.6 361 284 -—-21.3 
—47.3 218 176 —19.3 
—23.8 211 161 ~23.7 
— 53.4 64 28 ~ 56,3: 
— 57.4 354 239 ~ 32.5 
—76.6 132 71 —46.2 
—59.5 1,673 1,080 ~ 35.4 
— 50.8 72,348 1,584 ~32.5 ` 


Source: Author's calculations from IDA Employment Surveys. . 


use Irish inputs or to create other linkages. In 
the words of an Irish Minister for Industry 
and Commerce at the beginning of ELI, “we 
aim to convince [U.S. industrialists] that 
Ireland is the best possible location because of 
its attitude to private enterprise. . . . the more 
profits they make the better we will like it” 
(Dail Eireann 1958). 


Foreign Domination 


The Irish regime perceives foreign industry as 
a substitute for—not a complement to— 
domestic industry. An early and influential 
proponent of ELI captured the Irish attitude, 
saying, “By far the most hopeful means of 
getting good management, technical knowl- 
edge, and capital all at once is from 
subsidiaries of large foreign companies... . a 
plant which is paid for by foreign capital is a 
great deal better than one which has to be paid 
for from the scanty saving of the Republic” 
(Carter 1957). Since the inception of ELI, the 
regime relies first and foremost on the 
attraction of new foreign capital for industrial 
expansion. 

Thus, the Industrial Development Author- 
ity (IDA), the administrator of the regime's 
industrial policy, goes out of its way to attract 
large foreign companies rather than nurturing 
new lrish enterprises that may or may not 
succeed. Capital grants for foreign firms are 
much higher on average than grants to 
domestic industry (O'Hearn 1988, chap. 10). 
This trend is intensified by internal IDA 
job-creation and asset-attraction targets, which 
determine a range of things from salaries and 


promotions, to the very level of IDA funding 
by the state. 

The bias in favor of TNCs is reflected in 
their share of new industry. In the late 1950s, 
over a quarter of new manufacturing jobs 
were in TNCs. During the first half of the 
1960s, TNCs created more than half of the 
new manufacturing jobs. After 1965, at least 
70 to 80 percent of the new manufacturing 
jobs were in TNCs (O'Hearn 1987). . 

Ireland’s adherence to free trade, free 
enterprise, and foreign industrial domination 
sets it apart from "exceptional" cases, such as 
South Korea and Taiwan. Regimes in these 
countries are characterized by strong state 
intervention in business, widespread use of 
selective protection and import-substitution, 
and a definite preference for domestic indus- 
try. The "free market" in Singapore, another 
high-growth East Asian economy, has been 
described as a "myth" (Lim 1983; see also 
Rodan 1985; Islam and Kirkpatrick 1986).2 

Examples of state intervention in Taiwan 
and South Korea abound. Both countries 
nurtured their domestic machine-tool sectors 
by a system of protection against imports and 
discriminatory credits and export subsidies. 
South Korea reintroduced import-substitution 
in heavy industry in the early 1970s, and 
Taiwan in the early 1980s, with notable 
success. In South Korea, export performance 
is monitored by the regime, down to the 
individual firm, with strict sanctions for poor 


? Apart from the level of state intervention in 
Singapore, its special characteristics as a city- 
state—like Hong Kong —reduce its usefulness as a 
“model of development" (Rodan 1985, pp. 9—10). 
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% FDI Country % FDI 
7.81 Guatemala 4.67 
7.63 Spain 4.11 
7.06 S. Korea 3.29 
6.82 Greece 3.35 
6.61 Portugal 2.91 
6.57 Morocco 2.60 
5.23 Thailand 2.03 
4.80 Turkey 0.90 
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Table 3. Level of FDI in 24 LDCs, End-1978 (FDI Assets as Percentage of GDP)* 
Country % FDI Country 
Ireland 35.92 Philippines 
Jamaica 34.01 Taiwan 
Singapore 21.78 Brazil 
Malaysia 18.51 Uruguay 
Hong Kong 15.35 Mexico: 
Chile 9.54 Argentina 
Costa Rica 8.55 El Salvador 
Dominican Rep. 8.41 Tunisia 

* OECD-based TNCs only. 


Source: OECD (1980, pp. 165, 184—85, 237-40). Irish estimate is based on extrapolation of US FDI, using 
McAleese's (1977) data on percentage of new FDI in Ireland that is US-based (38.74 percent of assets). U.S. 


Department of Commerce (1980). 


performance. Both regimes exert influence 
over private industry by their monopolies of 
key economic sectors—for example, banking 
and credit—and through their activist state 
planning boards. Regime control also extends 
to agriculture, where state pricing policies 
and monopolies in key inputs are used to 
influence performance and to extract large 
economic surpluses (Amsden 1985a; Cum- 
mings 1984; Foster-Carter 1985; Kuo, Ranis, 
and Fei 1984; Mason, M. Kim, Perkins, K. 
Kim, and Cole 1980). 

Nor is foreign-penetration as extreme in the 
"exceptional" cases as it is in Ireland (Table 
3). Taiwan and South Korea both have 
relatively moderate levels of TNC-penetration 
relative to GDP. Ireland, on the other hand, is 
among the most highly penetrated economies, 
both because of the high proportion of foreign 
firms within new industry and because of the 
decline of domestic industry under ELI. 
Singapore has a much higher rate of foreign 
penetration than either South Korea or 
Taiwan, but it has little nonindustrial output. 


RESULTS I: SLOW 
ECONOMIC GROWTH 


How do the specificities of the Irish case 
affect the relation of foreign penetration to 
economic growth and inequality? Are the 
relations of “dependency” at work in Ireland? 
To show the relationship between foreign 
penetration and slow economic growth, I will 
concentrate on two basic themes of “depen- 
dency”: exploitation and linkages. 


Exploitation 


One of the frequent claims of dependency is 
that TNCs take more resources out of a 


country than they put into it (Amin 1974; 
Frank 1969). TNCs are said to drain capital 
from a host country. Proponents of decapital- 
ization argue that TNCs’ negative transfers— 
profit repatriation, high transfer prices for 
subsidiaries’ imported inputs—are greater 
than their direct investments. Free enterprise 
gives the TNC control (vis à vis the host 
country) over the size of its profits because 
taxes are low and prices are set freely or in 
the market. Free trade allows unrestricted 
repatriation of profits and manipulation of 
transfer prices and royalties. If foreign 
enterprise predominates. more firms are in a 
position to repatriate. Thus, the higher the 
stock of foreign capital, the greater the likely 
outward flow of repatriated capital.? The most 
systematic empirical evidence for decapital- 
ization is by Bornschier (1980), who found 
that the stock of TNC-penetration is nega- 
tively associated with investment rates and 
economic growth rates in a sample of 34 
larger LDCs, including Ireland. 
Decapitalization in Ireland from 1970 tc 
1982 is estimated in Table 4. To the extent 
that the model is appropriate, the Irish case 
strongly supports dependency. The Irish state 
pays for about a third of new TNC investment 
through its capital grants. Another sixth is 
financed from Irish sources and is not capital 
transfer. Profit repatriations far exceeded 


? This outward flow of capital, which is caused. 
by the stock of foreign penetration, must not be 
confused with the inward flow of foreign invest- 
ment. In the Bornschier model, the inward flow of 
foreign investment has a positive effect on capital 
formation as a flow, while the stock of foreign 
penetration has a negative effect on capital 
formation in the form of an outward flow of capital 
(decapitalization). 
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Table 4. Direct Inflows and Outflows Due n TNC Activities in Ireland, 1970-1983 (‘000 Irish punts) 
(5) | ©) (d) (e (£) 
(a) nongranted of which: nongranted repatriated capital 
TNC TNC nonlocally TNC TNC transfer 
Year inv’t investment financed reinv't profits (c)-(e) 
1970 39,359 29,615 24,679 3,300 26,000 -1,321 
1971 61,740 46,016 38,347 4,317 28,300 10,047 
1972 63,496 50,446 42,038 4,753 24,200 17,838 
1973 43,269 34,989 29,158 5,931 36,400 — 7,242 
1974 32,123 24,512 20,427 5,296 44,900 ~ 24,473 
1975 53,961 37,058 30,882 5,206 47,300 — 16,418 
1976 97,982 66,361 55,301 9,747 88,600 — 33,299 
1977 107,753 78,978 65,815 11,933, 128,000 —62,185 
1978 114,351 , 78,396 65,330 11,785 206,400 — 141,070 
1979 154,864 103,006 85,838 16,511 219,000 ~ 133,162 
1980 186,437 129,309 107,758 26,697 258,000 — 150,242 
1981 214,901 150,519 125,433 28,162 361,700 ~ 236,267 
1982 212,234 142,888 119,073 ` 39,246 498,700 ~ 379,627 
1983 na na na na 658,600 na 
1984 na na na na 940,000 na 
1985 na na na na 1,321,000 na 
1970-1982 1,382,497 972,093 "810,079 172,884 1,967,500 — 1,157,421 


Source: O'Hearn (1988, p. 330). 4 


actual TNC capital inflows (column c) except 
in 1971—72. The ratio of investment to profit 
repatriations fell sharply, from 94.9 percent 
in 1970 to 23.9 percent in 1982. In terms of 
simple decapitalization; the situation for 
Ireland is bad. and getting worse. 

The activities of TNCs may have important 
"second-round" effects, however, if they 
increase the level of economic activity, 
savings, and wealth in the economy. There is 
little evidence of large second-round effects 
in Ireland. There were few linkages between 
TNCs and. domestic firms (see below), and 
the major “second-round” effect of free trade 
was the decimation of local industry. It is 
unlikely. that second-round effects offset the 
massive net outflow of capital shown i in Table 
4. 

Vernon (1971) strongly challenges decapi- 
talization. He argues that the positive .“bal- 
ance of payments effects" of TNC- 
penetration outweigh any losses of capital. 
TNCs bring foreign exchange, a key re- 
source, into a host country through export 
earnings and by producing things that were 
formerly imported. In the, case of export- 
platforms with free trade, however, export 
earnings may be largely offset by the cost of 
imported inputs. 

A “balance of payments” model, which 
incorporates the capital flows of the decapi- 


talization model, may be expressed as 


follows: 
FE = Ing — Pos KL 
+ ME gi max - eli,max 
= Cli, max (1) 


where, FE measures the transfer of foreign 
exchange, L,. measures non-granted capital 
investments by TNCs, P, measures profits 
that are repatriated by TNCs, Kle measures 
capital imports by TNCs, ME. max measures 
the increased exports attributable to ELI, 
MI; max measures the increase of materials 
imports attributable to ELI, and Clq; max 
measures the increase of consumer imports 
attributable to ELI. 

The phrase .“attributable to ELI” is tricky, 
because it involves comparing ELI with some 
“yardstick.” In the model (Equation 1), the 
previous (ISD development regime is the . 
yardstick by which ELI is measured. Materi- 
als imports “attributable to ELI,” for exam- 
ple, is the difference between the ISI and ELI 
ratios of "imports of materials for use in 
industry" to industrial output.* In a sense, the 


* Furthermore, the model is probably an under- 
estimation of actual decapitalization, because it 
does not include the infrastructural costs of foreign 
industry that are borne by the Irish state, nongrant 
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. Table 5. Balance of Payments Effects of ELI and Foreign Penetration, 1970-82 (million Irish punts) 








Year I -" P - KI + ME - MI - CI = FE 

1970 24.7 26.0 13.9 60.6 52.5 49.9 -57.0 
1971 38.3 28.3, 21.8 64.6 55.1 57.2 —59.£ 
1972 42.0 24.2 22.4 114.8 34.5 42.4 33.2 
1973 29.2 36.4 15.3 211.8 120.7 116.9 — 48.2 
1974 20.4 44.9 11.3 327.8 384.4 116.5 —208.¢ 
1975 30.9 47.3 19.0 349.5 234.1 151.8 -TLE 
1976 55.3 88.6 34.6 576.8 430.3 248.1 —169.5 
1977 65.8 128.0 38.0 885.1 522.4 367.9 — 105.2 
1978 65.3 206.4 40.3 1,034.3 578.7 504.9 —230.7 
1979 85.8 219.0 54.6 1,312.5 863.7 701.6 — 440.6 
1980 107.8 258.0 65.8 1,613.7 1,042.7 896.0 — 541.0 
1981 125.4 361.7 75.1 2,073.3 1,291.9 1,131.7 — 662.3 
1982 119.1 498.7 74.9 2,705.0 1,211.5 1,145.7 — 106.7 


Source: O'Hearn (1988, p. 338). 


resulting estimates are an "upper limit" since, 


some import-penetration might have hap- 
pened after 1960 even without TNC- 
penetration. Many of the resources necessary 
to set up and run new Jrish-owned industrial 
firms, for example, would have had to be 
imported, at least for a time. Surely, the 
continuation of traditional ISI was not a 
practical alternative. Thus, the null hypothe- 
sis that ELI had no effect on foreign exchange 
transfers is more likely to be rejected than 
under a stricter yardstick. Unfortunately, 
however, there is simply no practical alterna- 
tive yardstick to the one employed in 
Equation 1. 

Estimates of FE in Table 5 support the 
thesis that foreign-penetration hurts the bal- 
ance of payments. While the decapitalization 
model identifies profit repatriation as the 
decisive factor responsible for exploitation, 
_ the balance of payments model emphasizes 
the loss of foreign exchange due to import- 
penetration and the high import-content of 
TNC inputs. : ` 

The negative balances for most years are so 
large that one could be skeptical that other 
development strategies, or less radical forms 
of ELI, would have performed as badly. In 
other words, the major reason for Ireland’s 
poor balance of payments position between 


forms of local financing of foreign investments, 
and the investment /ost because of the collapse of 
domestic industry. Moreover, it estimates only the 
extra imports used by TNCs —rather than subtract- 

-ing total TNC imports from exports—and ignores 
service imports. For a further discussion of the use 
of ISI as a yardstick for ELI performance, see 
O'Hearn (1988, chap. 12). - 





1955 and 1982 was its particular form of 
ELI, which incorporated radical free trade 
and allowed domestic industry to atrophy. In 
completeness, it should be noted that there 
was a rise of exports during the mid-1980s, 
which probably drove the measure FE posi- 
tive in 1986 and 1987. Rather than a 
long-term development, this appears to be a 
temporary result of several factors, including 
a massive fall of the dollar-price of imports 
and utilization of excess capacity during the 
post-recession recovery. There is no evi- 
dence, in any case, that the upward move- 
ment of manufactured exports has had more 
than a temporary effect on economic growta 
rates, and according to IDA employmert 
surveys it has had no appreciable effect on jo» 
creation. 


Linkages 


Even if the resources for investment and 
expansion are available, productive outlets for 
investment must be found. For many years, 
orthodox development theorists stressed the 
desirability of balanced growth among eco- 
nomic sectors, to avoid bottlenecks and 
realization problems (see Nurkse 1953; Lew:s 
1955). Albert Hirschman (1958) challenged 
this orthodoxy in his seminal work The 
Strategy of Economic Development. Accord- 
ing to Hirschman, investment opportunities 
are created by the imbalanced development of 
the economy. Bottlenecks and gaps induce 
productive investment to correct or fill them. . 
Hirschman lists three kinds of linkages, or 
investment inducements. (1) If a firm buvs 
inputs locally, it induces someone to produce 
them (backward lirkages). (2) If a firm makes 
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a product which may be processed further, it 
may induce a local firm to make a new 
product (forward linkages). (3) If a firm’s 
activities contribute to the state's economic 
resources (through taxes and import duties), it 
creates fiscal linkages. Hirschman’s analysis 
of linkages is behind more recent concepts 
such as disarticulation between “modern” 
and “traditional” economic sectors (Amin 
1974) and economic dualism (Myint 1970; 
Singer 1970). 

' Fiscal linkages are probably negative in 
Ireland because (1) the typical TNC pays no 
taxes due to “free enterprise” tax provisions 
and pays no import duties due to free trade, 

: and (2) large incentives and infrastructural 
costs are incurred by the state to attract 
foreign investment. Since TNCs in Ireland 

` export nearly all of their product, forward 
linkages are few. The only possible signifi- 
cant linkages from TNC operations in Ireland 
are backward linkages. 

Irish data on backward linkages are sparse. 
There were surveys. of industrial firms in 
1966,. 1971, 1974, and 1983 (Survey Team 
1967; O hUiginn 1972; McAleese 1977). 
Unfortunately, the coverage and classification 
of firms differ among the surveys, so 
longitudinal comparisons are awkward. But it 
is still possible to construct a fairly good 
picture of TNCs’ backward linkages. Buckley 
(1974) found that TNCs in 1971 purchased 
only 31.5 percent of their material inputs 
from Irish sources, while Irish firms pur- 
chased 68.3 percent locally. McAleese and 
McDonald (1978) found that new (post-1955) 
nonfood TNCs purchased 11.2 percent of 


their inputs in Ireland, while new domestic ` 


firms purchased 22.2 percent locally. Thus, 
we can conclude that (a) the rate of 


TNC-related linkages is very low, (b) domes-. 


tic firms have twice as many linkages as 
TNCs, and (c) the rate of linkages created by 
new domestic industry is also low. It is 
: , important to note that linkages were reduced 
in Ireland not only by the predominance of 
: TNCS in new industry, but also because ‘free 
.trade encouraged (or forced) domestic firms 
to. turn to foreign sources of supply. 

. Finally, results of a 1983 survey carried out 
by the IDA and made available to the author 
provide the most complete picture of linkages 
to date. The IDA Components of Sales survey 
covers firms employing 68 percent of workers 
in manufacturing, making .it possible to 
construct a complete snapshot of linkages. 


tore 


585 


Table 6. Backward Linkages in Irish Manufacturing, by 
Industrial Sector and Nationality of Ownership 








(1983) 

Irish inputs as % of output 

Irish UK Other 

Sector . firms firms ^ — foreign 
Food 65.2 48.8 63.0 
Drink 36.2 36.9 18.0 

Textiles 20.1 12.0 15.7 : 
Clothing 28.6 18.1 24.8 
Wood 34.4 12.2 43.2 
Paper 49.5 23.0 30.4 
Clay 24.3 47.7 13.1 
Chemicals 33.2 28.0 13.1 
pharmaceuticals 27.5 39.2 13.3 
Metal 26.4 27.2 9.3 
electronics 31.8 —. 13.1 
Other mfg 27:8 10.9 15.2 
healthcare — — 13.7 
TOTAL 48.0 37.2 15.5 
nonfood 31.2 30.2 12.3 


Source: Author's calculations from survey data 
supplied by the IDA (Components of Sales survey). ` 


Backward linkages as a proportion of output 
in 1983 are given in Table. 6.5 

The IDA survey results give strong support 
to theses of low backward linkages in. the 
TNCs. TNCs have lower backward linkages 
than Irish firms in all sectors except wood (a 
special case where several TNCs were given 
state guarantees of cheap wood supplies). The 
differences between TNCs and Irish firms are 
especially large.in the "modern" sectors— 
chemicals, metals, other manufacturing— 
where domestic firms bought two to three 
times as many Irish inputs (as a proportion of 
output) as TNCs. But backward linkages 
created by TNCs are low by any standard, 
regardless of the level of linkages created by 
domestic firms. The ratios of locally. pur- - 
chased inputs to TNCs' output in electronics, 
pharmaceuticals, and healthcare products 
ranged from 13.1 to 13.7 percent. Locally 
purchased material inputs were 5.5 percent, 


5 I have weighted the IDA data of Components 
of Sales per employee according to the actual 1983 
distribution of employment by nationality of 
ownership and industrial sector (from IDA Employ- 
ment Surveys). Backward linkages are defined 
here as all locally purchased materials and service 
inputs, but not wages. If wages were included, the 
level of linkages would increase somewhat. The 
difference between linkages in TNCs and local 
firms would also increase, however, because / 
wages are a much smaller part of TNC output than 
of local output. ' 
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5.6 percent, and 7.1 percent of output, 
respectively. 

TNCs' backward linkages in Ireland are 
also extremely low relative to other dependent 
countries. Table 7 compares the proportions 
of material inputs that are locally purchased 
by manufacturing subsidiaries of U.S. TNCs 
in Ireland, Brazil, and Mexico. TNCs’ local 
purchases are uniformly lower in lreland, 
particularly in the "modern" sectors. In 
manufacturing as a whole, U.S. affiliates 
purchased 76.4 percent of their material 
inputs locally in Brazil and 68.5 percent in 
Mexico, but less than 20 percent in Ireland. 
While this confirms the extremely low level 
of TNC-induced linkages in Ireland by 
comparison with other semiperipheral coun- 
tries, it also indicates the importance of 
specificities of Irish development. In particu- 
lar, the most important causes of low linkages 
in Ireland—unlike Mexico and Brazil—are 
(1) the predominance of TNC production for 
export rather than for local markets, and (2) 
the degree of free enterprise (absence of 
local-content regulations). 

In an economy such as Ireland’s, where the 
regime emphasizes the attraction of new 
foreign capital, and where imports have 
swamped the domestic consumer market, 
linkages are an important source of invest- 
ment opportunities. The scarcity of linkages 


Table 7. Locally Purchased Materials as Percentage of 
Total Material Inputs in U.S. Affiliates Oper- 
ating in Brazil, Mexico, and Ireland 


Ireland 

: US All 
Sector Brazil Mexico TNCs TNCs 
Food 89.5 96.8 73.0 70.7 
Textiles * 94.1 13.8 18.1 
Paper 9].4 93.5 33.3 55.3 
Chemicals 44.7 54.8 19.3 18.2 
'Rubber 92.7 88.5 3.1 1.5 
Clay/cement 81.7 66.7 15.3 49.6 
Metals 83.8 90.8 7.0 15.4 
Nonelectric 

machinery 63.6 45.7 12.6 15.4 
Electric x 

machinery 60.0 61.6 10.9 11.2 
Transport 97.1 60.7 8.9 18.6 
Instruments . 11.2 44.7 21.9 18.9 
Other 74.0 97.6 19.2 19.2 
TOTAL 76.4 68.5 19.7 27.7 





Note: Irish data are for 1983; Brazilian and Mexican 
data are for 1972. l 

Source: IDA Components of Sales Survey data, 
Connor (1977). ; 
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between TNCs and the local Irish economy 
means that the contribution of new TNCs to 
growth is practically restricted to the activities 
of the TNCs themselves. Foreign investmeni 
creates few multipliers that lead to the growth 
of domestic investment. 


Foreign Penetration and Growth 


The logic of the Irish development regime is 
deceptively simple, as is the logic or 
modernizationist analysis in general: attract as 
much foreign investment as possible, the 
foreign investors will export a lot, exports 
will bring new resources into the country, and 
economic development will surely follow, 
along with jobs and prosperity. 

Irish ELI was spectacularly successful at 
attracting foreign investment and increasing 
exports. Real foreign investment grew at an 
annual rate of 25 percent between 1955 and 
1983. Real exports grew by 7.5 percent 
annually between 1955 and 1985 (8.3 percent 
after 1972), and manufactured exports by 
more than 10 percent (calculated from OECD 
1987). If foreign investments and exports are 
the key to growth, the Irish economy should 
have expanded especially rapidly during the 
1970s and. 1980s. 

The actual Irish growth experience was 
very different (Table 8). Throughout ELI, 
gross fixed capital formation (GFCF) grew 
annually by only 5 percent. Since 1972 (when 
Ireland joined the EEC) per capita GFCF 
grew by less than 2 percent. Annual rates of 
growth of per capita GNP were even slower. 
At its highest, per capita GNP grew at an 
annual rate of 3.4 percent (1965-1970). 
During ELI as a whole, the annual growth 
rate of per capita GNP was a mere 2.3 
percent. Most significantly, the annual per 
capita economic growth rate fell to 0.4 
percent during the post-EEC period, and was 
negative-(— 1.25 percent) in the 1980s. 

These rates of growth are strikingly low 
compared to rates of growth in other 
countries. Ireland's annual rate of economic 
growth over the 30 years of ELI was the 
lowest in Europe, well below rates of growth 
in the European periphery (4—6 percent) and 
the average rates of growth for upper-income 
LDCs (about 5 percent), and far below .the 
"exceptional" East Asian economies (6—8 
percent). Even in its so-called "miracle 
years" of the 1960s, Ireland’s annual growth 
rate of per capita GNP never reached 4 
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Table 8. Annual Percentage Rates of Growth of Gross Fixed Capital Formation (GFCF) and GNP during ELI, 


1955-85 (constant 1958 Irish punts) 





per cap. per cap. net output 
Years GFCF GFCF GFCF (mfg.) GNP GNP (mfg.) 
1955-60 —3.91 =g] 3.60 1.22 1.83 4.40 
1960—65 11.88 11.59 14.64 3.41 3.14 7.15 
1965-70 6.34 5.87 7.82 3.90 3.41 8.23 
1970-75 3.48 1.98 6.51 3.33 1.81 5.61 
1975-80 6.89 5.34 6.61 3.53 2.16 6.99 
1980-85 —0.39 —1.02 —6.12 —0.45 —1.25 6.88 
1955-72 6.73 6.53 11.3 3.40 3.19 7.44 
1972-85 3.24 1.97 1.61 1.65 0.38 5.90 
1955-85 5.83 5.02 74 3.15 2.34 6.92 


Note: Five-year rates of growth for each variable x are calculated according to the following formula: 


[log x(t-- 5) — 


log x(t)] / 5. 


Rates of growth for longer periods are calculated by least-square regression of the log of the variable on time in 


years. 


Source: Central Statistics Office (various years); OECD (1987). 


percent over any five-year period. Can this 
poor growth performance be associated with 
foreign penetration, free trade, and free 
enterprise ?® 

Growth of output and investment became 
slower as foreign penetration became higher, 
and growth rates became particularly low 
(and finally negative) after Ireland joined the 
EEC and was forced into full-fledged free 
trade. Every industrial sector except chemi- 
cals and metals experienced significantly 
lower rates of growth of net output after 
Ireland's accession to the EEC than in the 
pre-EEC period of ELI (O’Hearn 1988, chap. 
14). Textiles and clothing experienced nega- 
tive post-EEC growth rates. Food, drink, 


© Critics of the present approach may suggest 
that Ireland's poor per capita growth performance 
is caused by its high rate of population growth. 
This is not the case. It is true that Ireland has the 
highest rate of natural increase in Western Europe, 
and that population grew more rapidly in the 1960s 
and 1970s than in previous periods (when it 
actually declined, because of emigration). Thus, 
population growth did exacerbate the slow growth 
of the 1970s. In the 1980s, however, Irish 
population is again declining —emigration is reach- 
ing its highest levels ever (see Irish Census of 
Population 1986). Most experts agree that emigra- 
tion and population decline will continue at a rapid 
rate for some years to come, barring an unforeseen 
economic recovery. While there is some reciproc- 
ity, Irish population growth rates (which change 
according to emigration rates) clearly respond to 
economic growth rates more than they “cause” 
them. 


wood, paper, and clay were stagnant after 
1972. Only metals, chemicals, and other 
manufacturing experienced annual growth 
rates of more than 5 percent after Ireland 
entered the EEC and embarked on full free 
trade. 

It should be possible to observe statistically 
any connection between foreign penetration, 
investment-growth, output-growth, and free 
trade. Elsewhere (O’Hearn, forthcoming), I 
have developed an extended time-series 
version of the cross-sectional model em- 
ployed by Bornschier (1980a) to explore the 
relationships between foreign-penetration and 
economic growth. In the model, foreign 
penetration is measured by the ratio of new 
(post-1955) foreign assets to total fixed 
capital stock in manufacturing. The flow of 
TNC-investments is measured by the ratio of 
foreign capital investments to GNP. The rate 
of economic growth is measured by the annual 
rate of growth of per capita GNP (Central 
Statistics Office, various years). Investment 
growth is measured by the annual rate of 
growth of real GFCF (OECD 1987). Free 
trade is measured by the ratio of import- 
penetration to apparent consumption. 

In time-series variables of this kind we 
cannot assume that the regression errors are 
generated by an independent process. The 
size and direction of the error in time ¢ are 
often correlated to the size and direction of 
error in time t+1. Because of this, the 
Durbin-Watson statistic (d) was used to test 
for autocorrelation in models of the effects of 
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Table 9. Effects of Dependency-Related Variables on Economic Growth (autoregressive time-series models) 


I 
Dependent variable: income growth rates (OLS) 


GFCF-growth b 0.1685 
beta 0.5023 
t (3.039)** 
TNC-penetration b —0.0779 
beta —0.3500 
t (2.228)** 
Import-penetration b 
beta 
t 
TNC-investment b 
beta 
t 
Constant 2.6059 
R 0.5058 
df 21 
d 1.7420 
Dependent variable: GFCF-growth (OLS) 
TNC-penetration b —0.2473 
beta — 0.3725 
DC (1.883)**: 
Import-penetration b 
beta 
t 
TNC-investment b 
beta 
t 
Constant 10.926 
R 0.1388 
df 22 
d 1.9403 
Dependent variable: import-penetration (autoregression) 
TNC-penetration b 
beta 
t 
Constant 
R 
df 
rho 
d 


Hu I IV 
0.1474 0.1554 0.1421 
0.4395 0.4634 0.4238 
(2.856)** (2.76T1)** (2.588)** 

—0.1736 
—0.7797 
(3.198)** 
— 16.505 —26.113 
—0.3885 —0.6146 
(2.319)** (2.847)** 
1.3045 0.7887 
0.5179 0.3131 
(2.248) "* (1.583)* 
2.2196 7.8293 10.357 
0.6055 0.5225 0.5757 
20 21 20 
1.7967 1.7432 1.7283 
— 0.3886 
—0.5855 
(1.883)** 
—55.185 —70.813 
—0.4356 —0.5590 
(2.270)** (2.146)** 
2.0386 1.3891 
0.2714 0.1849 
(0.845) (0.710) 
9.9630 28.368 32.156 
0.1671 0.1898 0.2088 
21 22 21 
1.9558 1.9417 1.9228 
0.0044 
0.8465 
(5.122)** 
0.3253 
0.9396 
2 
0.7812 
0.5154 





* p<.10 (one-tailed). 
** p«..05 (one-tailed). 


TNC-investment and TNC-penetration on 
economic growth in Ireland for 1960—83. 
Most models exhibited no evidence of 
autocorrelation and were estimated by OLS. 
There was evidence of autocorrelation in one 
model, which was then estimated by an 
autoregression, which corrects for first-order 
autocorrelation by using maximum-likelihood 
estimation of the residuals and obtaining the 
autocorrelation coefficient (rho) by the Coch- 
rane-Orcutt iterative procedure.” 


7 The estimates were generated by the SHAZAM 


The most important results are summarized 
in Table 9. Models I and II are simple models 
of the relationship between foreign invest- 





econometrics package using the OLS command 
(autoregressions used the AUTO command with 
DN and ML options). Results of autoregressions 
and OLS in these models differ insignificantly in 
terms of direction, size, or significance of the 
coefficients. Autoregression was used only in the 
model with import penetration as dependent 
variable and TNC-penetration as independent 
variable (Table 7), because the Durbin-Watson 
statistic showed evidence of autocorrelation. 
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ment, foreign penetration, and economic 
growth. Models III and IV include import- 
penetration, which I contend is a strong 
intervening variable between foreign penetra- 
tion and economic growth. The most direct 
way to relate the effects of each independent 
variable is to compare the standardized 
coefficients (beta). 

Clearly, TNC-penetration bas a strong 
negative effect on economic growth (models I 
and II). Its direct effect on growth is 
associated with a beta coefficient of —0.780, 
while its indirect effect through GFCF-growth 
is —0.257 (model ID. TNC-investment, as 
expected, has a weaker positive effect on 
growth, both directly (0.518) and indirectly 
(0.141). The direct coefficients estimate that 
each percentage rise in TNC-penetration is 
associated with a direct fall of 0.174 percent 
in per capita GNP growth, and an indirect fall 
(through GFCF) of .057 percent. 

It is possible, however, that TNC- 
penetration does not directly affect economic 
growth, or even investment growth. Instead, 
there may be important intervening variables 
that link TNC-penetration with economic 
stagnation. In Ireland's case, as I have 
already argued, the prime candidates as 
intervening variables are free trade and free 
enterprise. Competing imports were responsi- 
ble for widespread decline of domestic 
industry, while the general rise of imports 
created foreign exchange problems and the 
free exit of profits caused decapitalization. 
The association between free trade and slow 
economic growth is tested in models III and 
IV. Free trade is indicated by import- 
penetration, which is measured by the ratio of 
manufactured imports to domestic consump- 
tion of manufactured goods. 

Models III and IV indicate that import- 
penetration is a strong intervening variable 
between foreign penetration and economic 
stagnation. Import penetration has a highly 
significant negative relationship with both 
investment- and income-growth and is strongly 
correlated with foreign penetration. In model 
IV, each rise of 1 percent in import 
penetration is directly associated with a fall of 
0.26 percent in the rate of growth of per 
capita GNP. Indirectly—through GFCF— 
each percentage rise in import penetration is 
associated with a fall of about 0.10 (—0.708 
times 0.142) percent in the rate of growth of 
per capita GNP. 

These results, along with our earlier 
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observations about growth rates in Ireland 
during the period of ELI, indicate that there is 
a dependency-type relationship at work be- 
tween foreign penetration and economic 
growth. The relationship is specific, however, 
and is related to Ireland’s model of liberal 
foreign-led industrialization. Unlike cases 
where TNCS invest primarily to gain access to 
local markets, TNCs have located in Ireland 
for access to regional markets. Therefore, 
they demand a regime where they can freely 
import factors of production and freely export 
their output and their profits. Under such a 
regime, the TNC is extremely disarticulated 
from the local economy and has little reason 
to create linkages within the host country. 
Furthermore, the host has little leverage over 
the TNC to enforce more favorable arrange- 
ments —for example, local content or reinvest- - 
ment requirements, joint ventures. At the 
same time, the existing industrial firms in the 
host country are driven out of business by 
competing imports both at home and in their 
most important export markets. The result is 
economic stagnation.? 


RESULTS II: INCOME DISTRIBUTION 


The second major claim of dependency is that 


2 An obvious question arises: why does the Irish 
regime continue to pursue a policy which clearly 
has been unsuccessful in terms of economic 
development? The answer is very complex, and far 
beyond the scope of the present article. One part of 
an answer lies in the strong dominant-class and 
regime interests in the present system, which is 
very profitable for foreign capital, some sections of 
local capital that survived ELI (especially finance 
capital), and the technocrats who control the new 
regime. A second part of an answer is the degree to 
which the lrish regime became "locked into" 
European integration and ELÍ by the strong 
international forces of the postwar period. From 
early on, U.S. and European sources made it clear 
that Ireland would be isolated economically and 
politically if it attempted to "go it alone." Third, 
the strongest potential opponents of ELI (protected 
capital and the trade unions) were either defeated 
or coopted by the emerging coalition that favored 
ELI. Finally, there is a strong ideological element 
to the present regime. An important factor here is 
the degree to which ‘any attempt to devise 
alternative development strategies was identified 
with either Republicanism (IRA "terrorism") or 
communism, or both. Some of these factors are 
analyzed in greater detail in O' Hearn (1988, chaps. 
4-9). 
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foreign penetration either increases inequality 
or prevents its decrease. Here again most of 
the existing evidence is from cross-sectional 
studies, which do not adequately explain how 
the effects of dependency may unfold over 
time (for a review of findings, see Bornschier 
1980b). Dependency posits three major types 
of arguments about foreign penetration and 
inequality: (1) arguments about structural 
unemployment and poverty, (2) arguments 
about class inequality, and (3) arguiments 
about wage inequality. 

Structural unemployment and poverty. For- 
eign penetration may cause unemployment for 
several reasons, two of which are particularly 
relevant in the Irish case. (a) Radical free 
trade and discrimination in favor of foreign 
industry may cause widescale displacement of 
domestic industry. To this may be added the 
migration of farmers to the cities and towns, 
which may further swell the ranks of the 
unemployed. (b) If new foreign industry is 
more capital-intensive than domestic indus- 
try, or is concentrated in capital- (or materials-) 
intensive sectors, the employment created per 
unit of foreign investment may be too low to 
Offset employment losses associated with 
foreign-penetration (Rubinson 1976; Born- 
schier and Ballmer-Cao 1979). 

Although Irish income distribution data are 
scarce,’ structural unemployment has cer- 
tainly been a major cause of inequality since 
Ireland joined the EEC. In Table 10 I show 
the distribution of direct incomes in 1973 and 
1980. Unemployment is the source of the 
major differences between the two distribu- 
tions. In 1973, the lowest 10 percent of Irish 
households received less than 1 percent of 
direct incomes, while the next decile received 
1.7 percent. By 1980, the lowest decile 
received no direct income, while the next 
decile received a mere 0.7 percent of total 
incomes. Today, with unemployment exceed- 
ing 20 percent, the lowest two deciles receive 
no direct income. To the extent the ELI 


? The Irish census contains no questions about 
income. There are a few tax-based surveys of 
income distribution, but these have gaps in 
coverage, because many people pay no income 
taxes. A few household budget surveys were 
conducted (in 1951/52, 1965/66, 1973, and 1981) 
and these seem to provide the best information on 
income distribution. For existing studies of Irish 


income distribution, see Stark (1977) and Roche: 


(1984). 
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Table 10. Distribution of Incomes and Tax Payments, 
by Decile, 1973 and 1980 





Direct Gross Indirect 
income income tax 
Decile 1973 1980 1973 1980 1980 
1 * NIL 29 26 
2 1.7 07 44 42 10.0 
3 44 3.5 53 54 6.7 
4 61 57 65 62 72 
5 76 7.5 T5 74 8.3 
6 90 93 8.8 88 9.6 
7 11.0 11.2 10.3 104 Wa 
8 13.5 13.9 14 12.7 12.5 
9 17.0 18.0 15.5 16.1 14.6 
10 29.6 30.1 26.3 26.5 20.1 
99.9 99.9 99.9 100.0 100.1 


Source: Roche (1984); Revenue Commissioners (1985). 


regime caused unemployment in Ireland, it 
unquestionably caused higher inequality of 
direct incomes. 

Apart from industrial closures, rising capital- 
intensity is the major cause of unemployment. 
Assets per hour of labor expended in Irish 
industry grew during ELI (1955-82) at an 
annual rate of 5.3 percent (O'Hearn 1988). 
Not only did highly capital-intensive foreign 
operations penetrate lrish industry, but the 
regime's generous capital grants encouraged 
local firms to replace labor with machines. 
These factors added considerably to unemploy- 
ment in the 1970s and 1980s. 

An important difference between the Irish 
case and other situations of dependency, 
however, is the degree to which popula: 
demands for social welfare have been met. 
Ireland's proximity to England, its claim to 
full European membership and “modernity,” 
and its national aspirations to regain its 
northern territory all contributed to pressures 
on the regime to keep its social welfare 
benefits roughly in line with Britain's. At the 
same time, Britain's claim over the North o? 
Ireland necessitated the introduction of social 
welfare benefits there at the same level as 
existed in England. Thus, when postwar 
welfare reforms were introduced in Britain, 
Ireland followed with a similar, if slightly less 
*grandiose" scheme. As a result, the distribu- 
tion of gross incomes in Ireland is more 
equitable than direct incomes (Table 10). 
although the share of the bottom 20 percent o? 
families is still falling. Unfortunately, data 
more recent than 1980 are not available 
Social welfare cutbacks have increased dra- 
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matically in the 1980s because of severe 
public and foreign debt crisis in Ireland. 
When the effects of these cutbacks on gross 
income distribution are accounted, it will 
surely be shown that inequality rose signifi- 
cantly. f 

Finally,- the introduction of free trade in 
Ireland led to a massive loss of tariff revenues 
for the state. As a result, highly regressive 
value-added taxes were introduced in 1964, 
and the rates were rapidly increased during 
the 1970s and 1980s. Indirect (value-added 
and excise) taxes now account for more than 
half of government revenues. As shown in 
Table 10, the proportion of indirect taxes paid 
by low-income receivers is much greater than 
the proportion of income they receive. Since 
indirect taxes have risen considerably during 
ELI—they made up 19.8 percent of govern- 
ment revenues in 1955 and 50.5 percent of 
revenues in 1985—they have added consider- 
ably to inequality over time. Thus, free: trade 
is responsible for yet another dimension of 
income inequality in terms of spending 
power. 

Class inequality. Explanations of inequal- 
ity that concentrate on income differences 
among classes emphasize the aspects of 
foreign penetration that increase elite power 
and decrease working-class power. Some 
analysts concentrate on the tendency of local 
elites to use their alliances with foreign 
capital in order to resist popular demands for 
income redistribution (Rubinson 1976; Born- 
schier and Ballmer-Cao 1979). Others concen- 
trate on the power of TNCSs to resist labor 
demands and weaken labor organization. A 
structural argument emphasizes that the high 
proportion of unskilled workers in TNC 
subsidiaries increases the number of the 
lowest industrial wage earners, while the 
failure to locate skilled work in subsidiaries 
leaves a gap in the middle ranges of the 
income scale. At the same. time, foreign- 
penetration may encourage the expansion of 
low-paying service jobs (Evans and Timber- 
lake 1980). 

Certain specific characteristics of the Irish 
case tend to make some of these effects 
inoperable. The most obvious is the differ- 
ence between the Irish political system— 
where overt authoritarianism is largely 
avoided—and regimes in other developing 
countries. A related difference is the degree to 
which the very rich elites in pre-independence 
Ireland were absentee elites, since England 
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was so close. Very rich landlords and very . 
rich capitalists in Ireland tended to remove 
their incomes to England, where they do not 
figure in Irish income distribution statistics. 
Therefore, 20th-century Ireland never devel- 
oped the kind of measurable elite-mass 
inequality that is observed throughout the 
developing world. 

These special Irish characteristics are 
reflected in the movement of upper income 
shares between 1973 and 1980 (Table: 10). 
The increases of higher-income shares that 
are associated with falling low-income shares 
are spread throughout the top 50 percent of 
income earners. The share of the top 20 
percent of income receivers grew only a little, 
from 46.6 percent to 48.1 percent of total 
incomes. No identifiable small elite in Ireland 
appeared to benefit greatly from the worsen- 
ing position of the lowest-income receivers. 
This is: because the lost income of the lowest 
20 percent was due less to lower wages than 
to the complete loss of wages. Wage earners 
who remained employed may not be better off 
absolutely but still moved up to higher 
income deciles as the unemployed filled WP 
lower deciles. 

The lack of significant “bunching” of 
income shares in the higher deciles is also 
consistent with the structural argument that 
there are few new high-salaried professionals 
and managers in TNC subsidiaries. Evidence 
in support of this conclusion comes from 
studies of the skill structure of TNCs in 
Ireland. In the electronics industry, for 
example, Wickham (1986) finds a large 
concentration of employees in assembly 
operations, and a particular preponderance of 
women in these occupations. 

The most significant change of class 
inequality to come out of dependent Irish 
industrialization is absent from Irish income 
distribution statistics. This change “involves 
the large-scale movement of incomes from 
wages to profits, and within profits from 
domestic industry to the TNCs. The high 
levels of profit repatriations simply reflect the 
rapid rise of TNC profits, and the duality of 
profit rates between foreign-and domestic 
industry. During TNC-penetration (1955-85), 
the surplus-share of incomes rose by 2 percent 
a year in manufacturing as a whole, and by 
3.5 percent annually in electronics and 6.4 
percent in pharmaceuticals (O'Hearn 1988,. 

hap. 16). 
The level of international inequality that is 
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engendered by dependency is clearly shown 
in Table 11, where I report profit rates in 
1981 by economic sector and nationality of 
ownership. The duality of profit rates is 
particularly large in the “modern” sectors— 
electronics, pharmaceuticals, and other man- 
ufacturing—and a few “traditional” sectors 
such as drink, wood, and clay. Profits in 
certain foreign sectors—soft drinks, pharma- 
ceuticals, healthcare, computers, instrument 
engineering, and others—were well in excess 
of 25 percent of sales and in some cases as 
high as 60 percent. Overall, non-British 
TNCs’ profit rates in Ireland averaged 22 
percent, while domestic firms’ profit rates 
averaged less than 2 percent. It is clearly 
possible, therefore, that the effects of depen- 
dency on inequality among countries are 
much greater than its effects on local 
inequality. 

Finally, the relationship between elites, 
TNCs, and trade union power may differ 
significantly in Ireland from many other cases 
of dependency. The Irish working class 
developed in close proximity to the highly- 
unionized British working class. Trade union- 
ists were involved in the struggle for national 
liberation, and their support for early nation- 
alist governments was very important. By 
1958, 92 percent of Irish industrial workers 
were unionized (Registrar of Friendly Socie- 
ties 1959). At the same time, the uneasy 
alliance between governments and trade 
unions, which is an important factor that 


Table 11. Profit Rates by Sector and Country of 
Ownership, Irish Manufacturing, 1983 


(c) 


(a) (b) Other c 
Sector Irish British foreign a 
Food 2.26 3.01 523 231 
Drink 9.29 18.60 26.95 2.90 
Textiles 0.90 7.23 2.09 2.32 
Clothing —1.88. 6.71 1.33 — 
Wood 0.98 1.55 27.68 2824 
Paper 0.95 1.63 8.12 8.55 
Clay 4.13 —8.90 27.32 6.62 
Chemicals — 12.04 3.00 5.34 — 
Pharmaceutical 10.15 22.02 48.66 4.79 
Metals 1.09 5.52 10.25 9.40 
Electric —1.47 —196.00 22.65 — 
Other 3.74 5.62 25.09 6.70 
TOTAL 1.82 7.00 22.09 12.14 


. Note: Profits rate refers to pretax profits as percentage 
of total sales. 
Source: Author's calculations from IDA Components 
of Sales Survey. 
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neutralizes working class opposition to for- 
eign penetration, led most unions to become 
nonmilitant. For many years, the IDA advised 
TNCs who located in Ireland to recognize 
trade unions. An incoming TNC commonly 
made a “sweetheart” agreement with an Irish 
trade union, giving it sole organizing rights in 
return for guarantees of labor peace. Re- 
cently, however, as the Irish regime became 
desperate for new TNC investments, thev 
acceded to the wishes of the latest wave of 
entrants—the “yuppie” U.S. electronic firms— 
and dropped their pressure for unionization of 
TNC subsidiaries. The first major TNC to 
“go nonunion” was the computer firm 
Digital, which is now one of the largest 
employers in the country. The consequences 
on interclass income distribution of these 
successes of TNC power over Irish trade 
unions have yet to be worked out. 

Wage inequality. The effects of foreign 
penetration on wage differentials have proba- 
bly been the most widely discussed aspect of 
dependency and inequality. Much has been 
written about dualism, where a thriving 
foreign-dominated “modern” sector exists 
alongside a stagnant “traditional” sector 
(Myint: 1970; Singer 1970). According to this 
approach, wages in the modern" sectors are 
expected to be higher than in the "traditional" 
sectors, and this encourages wage inequality. 

On the other hand, the expectation that 
subsidiaries’ employment structures are 
weighted toward unskilled jobs, with the 
more skilled and professional positions kept 
in the parent country, contradicts the dualism 
argument. If TNCs concentrate their employ- 
ment in unskilled assembly trades, they 
decrease wage inequality, even if their 
unskilled workers are paid more than un- 
skilled workers in domestic firms. Finally, in 
a labor-rich economy, with high unemploy- 
ment—and particularly with a highly edu- 
cated working class as in Ireland — there may 
be little reason to expect the TNCs to pay 
higher wages than local firms. This is 
particularly true where centralized wage- 
setting or wage-bargaining structures exist, as 
they do not only in Ireland, but also in some 
of the East Asian economies (e.g., Singa- 
pore). 

There is little evidence that foreign penetra- 
tion caused significant wage inequality in 
Ireland. Average wage rates within industrial 
sectors in 1983 were quite similar among 
Irish, British, and other foreign firms (Table 
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12). Neither are wage rates particularly high 
in the “modern” sectors. Wages in electron- 
ics, metals, and “other manufacturing” are at 
or below the average for industry. Among 
foreign sectors, only chemicals and pharma- 
ceuticals have relatively high average wages. 
Clearly, the sectoral differentials in wage 
rates are tied to the structure of the labor force 
within those sectors. Electronics, healthcare 
products, textiles, and clothing have a high 
proportion of female assembly-type workers. 
The drink and chemicals industries are highly 
capital-intensive and have a higher proportion 
of technical and other skilled workers. 

Specific local factors are more important 
‘than foreign penetration or “duality” in 
determining the Irish wage distribution. The 
Irish wage bargaining structure — where wages 
are negotiated in a series of centralized “wage 
rounds" with tripartite participation of capi- 
tal, trade unions, and the state—is the most 
important determinant of wage distribution. It 
is in the area of wage determination that the 
specificities of dependent cases, rather than 
any general "laws" of dependency, are likely 
to be most important. 

Still, there are important dependency- 
related trends—-economic decline, capital 
intensity, and unemployment—which have a 
strong influence in favor of income (as 
opposed to wage) inequality. These “pure 
dependency” effects of foreign penetration on 
income distribution differ significantly from 
other developing regimes, where localized 
political factors — often connected with author- 
itarian regimes—play a more important role 
in determining inequality. 


Table 12. Wages by Sector and Country of Ownership, 
Irish Manufacturing, 1983 (annual wage per 
_capita in Irish punts) . 








——— 








Other 
Sector Irish British foreign 
Food 9,445 10,822 9,600 
Drink 12,694 15,840 12,652 
Textiles 6,545 7,025 9,208 
Clothing 5,619 5,567 4,838 
Wood 7,114 10,383 4,039 
Paper 12,787 9,276 9,781 
Clay ` 12,424 13,821 15,561 
Chemicals 15,455 10,191 12,925 
Pharmaceutical 9,720 10,675 13,830 
Metals 8,967 10,520 9,139 
Electronics 7,861 10,202 9,867 
Other 7,664 9,251 8,837 
TOTAL 9,502 : 9,235 9,307 





Source: See note to Table 11. 
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The "dependency" approach to development 
studies has come under fire in recent years 
because of the seemingly "deviant" cases of 
East Asia. Proponents of modernization cite 
Taiwan and South Korea as instances where 
foreign penetration was not accompanied by 
either slow growth or rising inequality. They 
imply that other developing countries may . 
also achieve rapid growth with equality by 
following a radical regime of free trade, free 

enterprise, and openness to foreign enterprise. 

Unfortunately for the new modernization- 
ists, neither South Korea nor, Taiwan is.really 
characterized by-these traits. Both regimes 
intervene strongly in business, use selective - 
protection and import-substitution, and have a 
definite preference for domestic industry. If 
South Korea, Taiwan, and even Singapore are . 
“getting the prices right," it is because of 
their regimes’ extensive intervention, not 
their reliance on "the market." 

Ireland, on the other hand, has closely ` 
followed the prescriptions of the moderniza- 
tionists. Since the 1950s, it has been highly 
penetrated by foreign industry, in preference 
to domestic industry; it removed its protective 
barriers and allows free movement of goods. 
and profits in and out of the country; it 
disdained any form of control of business, 
including the* use of indirect economic 
instruments such as credit and foreign ex- 
change manipulations. 

Ireland has also.grown at a snail’s pace 
and, recently, experienced negative economic 
growth. While the level of inequality has been 
reduced by social welfare policies—a remnant 
of the country’s proximity to Europe—there 
is an underlying trend toward rising inequal- 
ity, resulting from slow growth and tax 
policies that are forced by free trade and free 
enterprise. Since Ireland is among the top five 
debtor countries in the world (foreign debt as 
a percentage of GNP), the regime has already 
begun social welfare cutbacks that will drive 
it toward greater inequality. It is a classic case 
of “dependent” relations: slow growth and 
inequality caused by foreign penetration. !° 


10 A similar analysis could be made regarding 
the North of Ireland. It has also experienced severe 
economic decline and inequality during the post- 
war period. The reasons for stagnation and 
inequality in the North, however, are quite 
different from the South. Stagnation is mainly a 
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There are other countries that are like 
Ireland in many respects, although the 
specific nature of dependent relations always 
differs significantly from country to country. 
The present analysis of Ireland does not 
resolve the problems of using dependency as 
a general approach to the study of develop- 
ment. Rather, it demonstrates how dependency- 
type mechanisms may thrive in a liberal, open 
atmosphere. Liberalness and openness, how- 
ever, do not predominate in the periphery 
today —deviations from liberal regimes rein- 
force the need to rigorously examine the 
specificity of dependent situations. Thus, the 
Peruvian case of export-led “bonanza devel- 
opment” —where the state tries to maximize 

' its share of TNC profits—is quite different 
from Irish ELI, where the state tries to 
maximize TNC investment and depend on 
trickle-down. The Brazilian case, where the 
state attempts to maximize linkages of 
TNCs with local capital, differs from the 
Irish regime, where linkages are simply 
hoped for. 

Still, there are strong pressures afoot 
for liberalization, as proposed by the new 
modernizationists. These pressures are seen 
less in East Asia, however, than in places like 
Mexico and in the conditional lending of the 
IMF. The experience of foreign penetration, 
‘limited growth, and inequality—despite the 
exceptional cases of East Asia—is still 
much more common among developing 
countries than so-called "success." Students 
of development may have a lot to leam 
from the "exceptions" — particularly about 
ways to bring about economic development 
while avoiding the extreme authoritarianism 


result of the decline in importance to England of 
shipbuilding and natural fibers. The North has 
been allowed to decline as part of a British regional 
policy which favors Southern England at the 
expense of the so-called "Celtic fringe" and the 
old industrial regions of Northern England. In 
other words, Britain has largely withdrawn econom- 
ically from its Northern Irish colony, although it 
refuses to withdraw militarily and politically. 
Inequality in the North of Ireland, unlike the 
South, is affected not only by class relations and 
economic decline, but also by sectarianism. The 
unemployment rate among the settler (‘‘Protes- 
tant") population, for example, remains below the 
British average, at the expense of the native 
(“Catholic”) population, which suffers unemploy- 
ment averaging over 35 percent (above 80 percent 
in Some Catholic areas). 
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of the East Asian cases. They have still 
more to leam from careful comparative 
historical analyses of specific cases of de- 
pendent development. The route of free 
trade, free enterprise, foreign penetration, and 
the new modernizationism is not the way to 
go. 


DENIS O'HEARN, Assistant Professor of 
Sociology at the University of Wisconsin- 
Madison, has been working on a project on 
Irish export-led industrialization. His main 
area of interest is the interaction of class ard 
state in the implementation of industrializa- 
tion regimes, and transitions from one regime 
to another, in post-colonial societies. He will 
soon begin work on the introduction of the 
Jamaican industrialization regime during the 
transition from colonialism, and subsequeat 
pressures for change. 
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ECONOMIC DISCRIMINATION AND SOCIETAL 
HOMICIDE RATES: 
FURTHER EVIDENCE ON THE COST OF INEQUALITY* 


STEVEN F. MESSNER 
State University of New York at Albany 


This research examines the effects of economic discrimination against social 
groups on national rates of homicide. Drawing on Peter Blau’s macrostructural 
theory, I hypothesize that nations with intense and pervasive discrimination will 
exhibit comparatively high levels of homicide, and that the effects of discrimination 
will exceed those of income inequality. Regression analyses using both INTERPOL 
and World Health Organization homicide data support both hypotheses. Indicators 
of economic discrimination against social groups are significantly and positively 
related to homicide rates despite fairly extensive controls for other theoretically 
relevant national characteristics. Moreover, the effects for discrimination are 
consistently stronger than those for income concentration. These results suggest 
that the structuring of economic inequality on the basis of ascribed characteristics 


is a particularly important source of lethal violence in contemporary societies. 


A fairly large body of quantitative research on 
the determinants of national homicide rates 
has accumulated in recent years (see the 
bibliography in Krahn, Hartnagel, and Gar- 
trell 1986). This research has yielded one 
remarkably consistent finding: income inequal- 
ity tends to be positively related to homicide 
levels. The regularity of this relationship has 
promoted “consensus that the incidence of 
homicide is higher in countries with greater 
income inequality” (Krahn et al. 1986, p. 
269). 

A common feature of these studies is the 
operationalization of income inequality by 
measures of income dispersion, most com- 
monly Gini coefficients. Useful as these 
measures may be, they do not reveal the 
processes associated with the generation of 
income distributions. That is, simple mea- 
sures of income dispersion do not convey 
information about the social structuring of 
economic inequality. 

This paper goes beyond previous cross- 
national studies by exploring a relatively 
neglected aspect of economic inequality, 
namely, economic discrimination against so- 
cial groups. Drawing on the recent theoretical 
and empirical work of Peter Blau and 
associates (Blau 1977; Blau and Blau 1982; 
Blau and Schwartz 1984), I argue that 





* J would like to thank Scott South, Allen Liska, 
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economic discrimination should have an 
appreciable effect on national homicide rates. 
Furthermore, the effect of discrimination 
should be greater than the effects of income 
inequality. The basic question is thus whether 
discriminatory social processes involved in 
income inequality are as important as, or 
more important than, the level of income 
inequality in determining national homicide 
levels. 


MACROSTRUCTURAL THEORY AND 
INTERPERSONAL VIOLENCE 


Blau (1977, p. 12) has recently formulated a 
macrostructural theory in which he derives 
numerous predictions about patterns of social 
associations from a set of "primitive" assump- 
tions and definitions. The theory has been 
used to explain a wide array of social 
behavior, including patterns of intermarriage 
between social groups (e.g., Blau, Blum, and 
Schwartz 1982; Blau, Beeker, and Fitzpatrick 
1984; Blum 1984; South and Messner 1986), 
patterns of intergender support in the work- 
place (South, Bonjean, Corder, and Markham 
1982, 1983), and patterns of interracial 
violent crime (e.g., Blau and Blau 1982; 
Messner and South 1986; Sampson 1984). 
Most relevant to the present analysis is Blau's 
theory of interracial violent crime. To expli- 
cate this argument, I will first briefly 
summarize some major concepts in Blau's 


theory. 
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Blau conceives of social structure in terms 
of the distribution of, populations among 
social positions (1977, pp. 1-12). These 
social positions are structured along two basic 
types of axes or "parameters": nominal and 
graduated. Nominal parameters divide the 
population into groups with discrete bound- 
aries but with no inherent rank ordering (e.g., 
race, sex, religion). Graduated parameters 
entail ranked status distinctions that are in 
principle continuous (e.g., income, wealth, 
prestige). These two types of parameters 
underlie the two fundamental forms of social 
differentiation: heterogeneity and inequality. 
Heterogeneity refers to the distribution of the 
population in terms of nominal parameters; 
inequality refers to the population distribution 
in terms of graduated parameters. 

An important property of parameters is 
their "salience," or their social significance 
as reflected in the strength of in-group 
preferences (Blau and Schwartz 1984, p. 29). 
Blau points out (1977, p. 24) that race is a 
highly salient parameter in American society, 
whereas eye color is not. Nominal and 
graduated parameters may also be related or 
*consolidated" (Blau 1977, pp. 86-87). The 
degree of consolidation of these parameters is 
determined by the extent to which group 
membership is correlated with a status 
dimension. 

The most extensive application of these 
concepts in the explanation of interpersonal 
violence can be found in Blau and Schwartz's 
(1984) Cross-Cutting Social Circles. Blau 
and Schwartz identify two major sources of 
societal interpersonal conflict: “excessive 
ingroup salience typically rooted in a group's 
distinctive value system" (1984, p. 174) and 
inequality associated with graduated parame- 
ters. Excessive salience creates a situation 
wherein in-group preferences, defined with 
respect to a nominal parameter, lead to the 
exclusion of out-groups. This exclusion 
entails certain costs to out-groups and conse- 
quently provokes antagonism and counteran- 
tagonism (p. 175). Thus, the excessive 
salience of a nominal parameter tends to 
produce intergroup hostility and the potential 
for conflict and violence. 

Inequality is also a major source of 
conflict. Blau and Schwartz postulate that “it 
is in the interest of everybody to maximize his 
[or her] resources" (p. 176). This ultimately 
creates a "potential split" between those 
whose self-interest dictates a reduction in 
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levels of inequality and those whose sel- 
interest dictates the maintenance of inequal- 
ity. The incompatibility of objective interests 
generated by inequality, however, does not 
necessarily lead to overt conflict. A variety cf 
“counteracting social forces" may suppress 
overt conflict by promoting an isolated, 
individualistic experience of deprivation (p. 
177). 

Overt conflict becomes a likely concomi- 
tant of status inequalities only when saliert 
nominal parameters are consolidated with 
these inequalities. Under such circumstances 
the underprivileged are likely to becom2 
"aware of their collective interest in redistri- 
bution" (p. 179). In other words, consoli- 
dated inequalities facilitate the recognition of 
competing self-interests in the redistribution 
of a status resource and in this way transform 
the potential for conflict associated with 
inequality into overt forms of conflict. 

An additional argument explains the partic- 
ular form of conflict that is likely to emerge. 
While conflict produced by consolidated 
inequalities might be expressed by efforts to - 
redistribute social resources, such efforts are 
typically unrealistic given the political and 
social disadvantages of the dispossessed. 
Consolidated inequalities thus tend to produce 
“pent-up aggression which manifests itself in 
diffuse hostility and violence” (p. 180). The 
widely reported finding that criminal vio- 
lence, including homicide, is often directed at 
those in similar social positions in no way 
vitiates the logic of the mmacrostructural 
argument. The victims of criminal violence 
are likely to be convenient targets, even 
though the root cause of the violence might be 
the "diffuse hostilitv" generated by consoli- 
dated inequalities. 

On the basis of these arguments, Blau and 
colleagues hypothesize that levels of violent 
crime in metropolitan communities should 
vary positively with levels of racial inequality 
in income, an instance of the consolidation of 
parameters. Racial income inequality should 
generate the diffuse hostility that is mani- 
fested, at least in part, in criminal violence. 
Blau and colleagues (Blau and Blau 1982; 
Blau and Schwartz 1984) have analyzed 
violent crime rates for the 125 largest 
Standard Metropolitan Statistical Areas in the 
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United States and have reported results that 
largely support this hypothesis.! 

Blau’s macrostructural theory can be readily 
extended to derive hypotheses concérning the 
relationship between economic discrimination 
and criminal violence. For purposes of this 
analysis, I conceive of economic discrimina- 
tion as the denial of access to economic 
resources on the basis of largely ascribed 
social characteristics such as race, religion, 
and ethnicity. Such discrimination is likely to 
be related to both of the two major sources of 
conflict identified by the theory: excessive 
salience and consolidated inequality. Discrim- 
inatory practices based upon social character- 
istics can be expected to enhance the salience 
of these characteristics. Indeed, one may 
speculate that salience and discrimination are 
mutually causal. Salient social distinctions 
provide the basis for discrimination, and this 
discrimination increases the significance of 
these distinctions for all members of society. 
Moreover, economic discrimination is essen- 
tially a form of consolidated inequality— 
access to economic resources is correlated 
with ascribed social characteristics. 

_ Accordingly, two hypotheses are derived 

from macrostructural theory concerning the 
relationship between economic discrimination 
and national homicide rates: (1) the level of 
economic discrimination should be positively 
related to national homicide rates, (2) the 
level of economic discrimination should be a 
stronger predictor of homicide rates than is 
income inequality. This latter hypothesis 
follows from the theoretical claim that 
consolidated inequality is especially likely to 
produce the diffuse hostility responsible for 
high levels of violent crime. 


RESEARCH PROCEDURES 


The hypotheses will be tested with data for a 
maximum sample of 52 contemporary nation- 
states circa 1980. A major advantage of a 
cross-national design is that it provides for 
substantial variation in the variables. of 
interest. For example, the size of populations 
subject to discrimination ranges from zero in 
some nations to large majorities in others (see 


! See Golden and Messner (1987) for a more 
cautious assessment of the evidence on racial 
‘income inequality and violent crime rates for 
SMSAs. 
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Appendix A). ? Similarly, whereas differences 
in levels of income inequality across subna- 
tional units tend to be very modest, variation 
between nations is quite substantial (see Jain 
1975). A cross-national design thus increases 
the likelihood that there will be sufficient 
differences among the units of observation in 
the basic "parameters of social structure" to 
coriduct meaningful analyses.? 

. The primary dependent variable for the 
study is the rate of homicides known to the 
police per 100,000 inhabitants. The source 
for this measure is International Crime 
Statistics," published by the International 
Criminal Police Organization. (INTERPOL).* 
To minimize the effect of random yearly 
fluctuations, the conventional practice of 
computing averages for a multiyear period 
has been followed (cf. Kick and LaFree 1985; 
Krahn 'et al. 1986). Averages have been 
computed for the 1977-82 period or, in cases 
where there are missing data, for the subset of 
years for which data are available. Also, to 
alleviate problems of heteroskedasticity, hom- 
icide rates have, been log (natural) trans- 
formed. 

"The .adequacy of the INTERPOL data 
needs to be assessed cautiously. The general 
limitations of international crime statistics are 
well known and have been fully discussed 


? Zimbabwe’s high score (95) on “percent 
discriminated against" in Appendix A is striking. 
Because of the profound social and political 
changes that transpired in Zimbabwe during the 
general time period under consideration, measure- 
ment of discrimination for this case is particularly 
problematic. Results with Zimbabwe excluded are 
nevertheless highly similar to those reported for the 
full sample. 

? Restricted variance im levels of inequality 
across subnational units may be partially responsi- 
ble for thé inconsistent results obtained in previous 
studies of inequality and violent crime that have 
used .statés, cities, or SMSAs as units of analysis. 
Compare, for example, Bailey (1984), Blau and 
Blau (1982), Braithwaite (1979), Golden and 
Messner (1987), Loftin and Parker (1985), Mess- 
ner (1982), and Williams (1984). 

* INTERPOL employs the term murder in the 
national tables for the years under investigation. 
However, the explanatory note states that this 
category encompasses "'any act performed with the 
purpose of taking human’ life;, in whatever 
circumstances" (INTERPOL 1977-82). The cate- 
gory thus coriesponds more closely to homicide 
than to murder in the strict sense of these terms as 
reflected i in ‘U.S. laws. 
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elsewhere (e.g., Wilkins 1980). In particular, 
critics have pointed to problems of compara- 
bility in the definition of crime and to the 
possibility of cross-national variation in crime 
reporting. Proponents of comparative re- 
search, however, have suggested that the 
deficiencies of international crime statistics 
may be less severe than is often thought. For 
example, Kick and LaFree (1985, p. 42) 
address the issue of definitional comparabil- 
ity. They conclude that offenses such as 
homicide are "traditional crimes with ancient 
origins" and that these kinds of offenses tend 
to exhibit a fair degree of definitional 
consistency. 

Krohn and Wellford confront the other 
common criticism of international crime 
statistics, namely, cross-national variation in 
reporting practices. They cite evidence indi- 
- cating that crime-reporting practices do not 
appear to vary directly with the level of 
economic or urban development (Krohn and 
Wellford 1977, pp. 4-5). This finding 
suggests that cross-national variation in crime- 
reporting may be largely idiosyncratic and not 
related systematically to independent vari- 
ables of theoretical interest. 

There is, however, an important deficiency 
associated specifically with INTERPOL data. 
These data are ultimately collected by an 
informal, nongovernmental agency — the inter- 
national organization of chiefs of police. 
Hence, as Groves, McCleary, and Newman 
(1985) caution, there is no clear way “to 
establish just how ‘official’ the data are or 
how extensive the reporting base is" (p. 74). 

In view of these concerns about the 
INTERPOL statistics, data have also been 
collected from an additional source: .the 
annual vital statistics reports of the World 
Health Organization (WHO). The WHO 
compiles reports based on mortality statistics 
provided by governmental health agencies in 
participating nations. These reports contain 
information on the "cause of death," includ- 
ing deaths due to "homicide and injury 
purposely inflicted by other persons" (exclud- 
ing war). These data permit the construction 
of two additional indicators of homicide for 
the subsample of nations represented in both 
INTERPOL and WHO publications: an. aver- 
age homicide rate (log transformed) based on 
the WHO vital statistics reports for 1976— 
1982, and a combined INTERPOL-WHO rate 
(the simple average of the rates from the 
respective sources). The availability of IN- 
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TERPOL data serves as the primary criterion 
in sample selection because the vast bulk of 
past research on economic inequality and 
crime has relied on this source. 

Data for the primary independent variable — 
economic discrimination—are taken from the 
World Handbook of Political and Social 
Indicators (Taylor and Jodice 1983). These 
data are refined and updated analogues to 
measures of group discrimination originally. 
published by Gurr (1966). Construction of the 
measures involved identifying those groups 
subjected to discrimination in a given nation 
and rating the intensity of this discrimination. 
Economic discrimination refers specifically to 
“the deliberate, invidious exclusion of social 
groups from some desired economic goods or 
conditions (values) because of the groups’ 
ascribed characteristics” (Taylor and Jodice 
1983, p. 52). These “ascribed character-s- 
tics” include ethnic, linguistic, religious, aad 
regional identity (p. 71). The identification of 
groups discriminated against and the rating of 
the degree of discrimination were based on 
scholarly judgments of numerous national and 
regional studies for each polity included in the 
sample (see Gurr 1966, pp. 67-86 for 
details). i 

The World Handbook provides two indica- 
tors of economic discrimination against social -- 
groups to represent the scope and the intensity 
of the discrimination. Scope is operational- 
ized by the relative size of the population 
discriminated against; specifically, it refers to 
the percent of the total population that is 
subject to economic discrimination. Intensity 
is measured by an ordinal scale with scores 
ranging from 0 tc 4. The scale values reflect 
the kinds of economic position or classes of 
economic activity from which the specified 
groups are excluded (e.g., only higher-level 
positions, higher-level and medium-level po- 
sitions) (Taylor and Jodice 1983, p. 71). 
Scores for both indicators have been recorded 
for the most recent available date (circa 1975) 
to create an approximate temporal match with 
other variables. 

I anticipate that these two dimensions of 
economic discrimination—scope and inten- 
sity —might have mutually contingent effects 
on homicide. In other words, the impact of 
the scope of discrimination on homicide 
might be conditioned by the intensity of that 
discrimination. One way to capture these 
contingent effects is to construct a composite, 


< 
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‘ “weighted” measure by multiplying the scope 


: and the intensity scores. 


Two limitations associated with the 
weighted measure of economic discriminátion 
should be noted. First, the measure is based 


' on a multiplicative. operation with an ordinal 


measure —the intensity scale. This technically 
constitutes a violation of measurement assump- 
tions. Secondly, the multiplicative operation 


: implicitly assumes a specific functional form 


for the interaction between scope and inten- 
sity, even though there is no fitim evidence at 
present to indicate the nature of such an 
interaction. In view of these concerns, a 
second measure is also considered in the 
analysis, that is, a dummy variable with 
nations having nonzero values on the discrim- 


ination indicators assigned the score of “1” 
"and all others assigned "0." The sample 


under investigation is reasonably heteroge- 


: neous with respect to this simple dichotomous 


classification. “Nondiscriminatory” nations 
constitute about 54 percent of the sample, 
"discriminatory" nations 46 percent. 

`A second independent variable is income 
inequality. Consistent with recent prior re- 
search (Avison and Loring 1986; Kick and 
LaFree 1985; Krahn et al. 1986), income 
inequality is operationalized by the Gini 
coefficient of household or individual income 
concentration. The principal source for the 
income data is Jain (1975), supplemented by 
Paukert (1973) and Krahn et al. (1986). The 
specific dates for the income measures vary 
by nation but generally refer to the late 1960s 
and early 1970s. Although ‘there is some 
temporal discrepancy between the measure- 
ment of this variable and other variables, the 
lack of a precise temporal match should not 
be particularly troublesome given the consid- 


. erable stability in national patterns of income 


distribution.5 

Finally, data have been collected for a 
fairly large number of control: variables. 
Studies of income inequality and homicide 
rates typically distinguish between the distri- 
bution of economic resources and the overall 
level of resources to permit a separate 
assessment of the effects of "relative depriva- 


5 Muller (1988) notes that most cross-national 
Studies of income distribution assume that measure- 


‘ments are reasonably comparable for intervals of 


20 years, although he adopts a more "conservative 
assumption" .(p. 53) that measurements are 
comparable for at least an interval of 11 years. 


tion" versus “absolute deprivation" (se 
Krahn et al. 1986, for a comprehensiv 
review). Consistent with standard practice: 
Gross National Product (GNP) per capit 
serves as one indicator of "absolute depriv: 
tion.” Two additional indicators—the infa 
mortality rate and life expectancy—are cor 
sidered as well in view of concerns about th 
adequacy of GNP/capita (Conklin and Sim 
son 1985). The source for. these thre 
indicators is the Population Reference Burea 
(1983). Data refer to 1980, or to the close: 
year with available data. 

Previous researchers have also stressed th 
importance of demographic controls in tb 
analysis of homicide rates (see especiall 
Conklin and Simpson 1985). Specific me: 
sures have varied across studies (Krahn et a 
1986, pp. 272-73), but the following dimer 
sions of demographic structure are mo: 
commonly represented: population size, poj 
ulation density, population growth, and agt 
sex structure. The indicators for these aspec! 
of demographic structure are reported below 
along with the respective sources: 


total population (Population References: Burea 
1983); population per square kilometer (U.D 
Demographic Yearbook 1980); percent annu: 
rate increase in population 1975-80 (U.h 
Demographic Yearbook various years; U.M 
Statistical Yearbook 1979, 1980); 

percent of the population less than 15 years ol 
(Population Reference Bureau 1980); 

‘percent of the population male, ages 15-2 
(U.N. Demographic Yearbook various years). 


Once again, unless otherwise noted, the targ 
year for the measurement of these contr 
variables is 1980.5 


$ The target year for the measurement of tk 
control variables is 1980 because of the read 
availability of data for a large number of natior 
for this particular year. There will be a sligl 
temporal discrepancy between the controls and tk 
primary independent variable (economic discrim 
nation), which is measured circa 1975. Th 
discrepancy should not have any appreciab: 
influence on the analyses. The basic structuri 
characteristics of nations which serve as contro 
(e.g., population size, national wealth) do m 
fluctuate dramatically over relatively short perio< 
of time. To illustrate, the correlation. betwee 
Gross Domestic Product per capita in 1975 an 
1980, and between population size in those sar 
years, is .9625 and .9998 respectively for tho: 
nations with observations at both points in time 


The sources for these computations are the U.D 
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Previous research indicates the potential 
importance of three additional control vari- 
ables. One is the level of political democracy, 
which exhibits interactive effects with income 
inequality in Krahn et al.’s (1986) study. The 
measure of democracy is Bollen’s political 
democracy index (Bollen 1980). Also, the 
results of Conklin and Simpson’s (1985) 
study indicate that both urbanism and ethno- 
linguistic heterogeneity are important predic- 
tors of homicide rates. Accordingly, measures 
of the percent of the population living in 
urban areas (U.N. Demographic Yearbook 
various years; Population Reference Bureau 
1980) and the ethnolinguistic heterogeneity 
index reported in Taylor and Hudson (1972) 
will be included in the analysis.” 

Examination of the univariate distributions 
reveals high skewness for GNP/capita, popu- 
lation size, population density, and the 
weighted measure of economic discrimina- 
tion. Natural log transformations have been 
performed on these measures to facilitate 
linear modeling (Tufte 1974, pp. 108-15). 
The value of “1” has been added to all scores 
on the weighted economic discrimination 
measure to insure nonzero values for the log 
transformations. 

Data for the INTERPOL measure of 
homicide and economico discrimination are 
available for a sample of 52 nations (see 
Appendix A). The sample size is reduced to 
32 nations in analyses based on WHO 
homicide measures because of missing data. 
As noted by previous researchers (cf. Krahn 
et al. 1986, p. 277), lesser-developed nations 
tend to be underrepresented:in WHO reports. 
A number of cases contain scattered missing 
data for the other variables. Rather than 


Ww. rs 


National, Accounts Statistics (U.N. 1988) and the 
U.N. Demographic Yearbook (the 1983 volüme). 

7 Some conceptual overlap exists between the 
control variable "ethno-linguistic heterogeneity" 
and the economic discrimination variable. Obvi- 
ously, social diversity is a prerequisite for group 
discrimination. Nevertheless, as the editors of the 
Worid Handbook explain, the two measures are not 
identical and in a number of nations there are 
distinctive groups that are not "systematically 
favored or disfavored” in: economic activities 
(Taylor and Jodice 1983, p. 51). The effects of 
economic discrimination against social groups 
should be examined independent of the effects of 
the simple level of social diversity. 


AMERICAN SOCIOLOGICAL REVIEW 


discard these cases, I have performed mean 
substitution where required.? 

The sample is clearly not a random one. It 
excludes entirely communist nations because 
of data limitations (see Muller 1988, p. 52). 
Nevertheless, nations from all regions of the 
world are included, and at least in the 
maximum sample, nations at quite different 
levels of development are represented. Most 
importantly, the sample includes nations 
characterized by widely varying practices cf 
economic discrimination and is thus particu- 
larly well suited to address the substantive 
concerns of the analysis. 


RESULTS 


The first step in the analysis is to specify 
models that are suitable for statistical estima- 
tion. À persistent problem in the macrolevel 
research on crime is multicollinearity. Many 
basic structural features of population aggre- 
gates are strongly intercorrelated, which can 
create large standard errors for regression 
coefficients and unstable parameter estimates 
(Wonnacott and Wonnacott 1970, pp. 59— 
63). 

Examination of the correlation matrix for 
the maximum sample (see Appendix B) 
reveals, not surprisingly, that some pairs of 
control variables exhibit extremely high 
zero-order relationships (e.g., the correlation 
between the infant mortality rate (X4) and life 
expectancy (X15) is —.94). A useful tech- 
nique for reducing problems created by 
intercorrelated predictors is to construct 
composite indexes (Weisberg 1980, p. 185). 
Accordingly, a principal axis factor analysis 
of the full list of control variables has been 
performed. The results reveal that five control 
variables exhibit very strong intercorrelations: 
percent less than 15 years of age; the annual 
rate of population increase; life expectancy; 


8 The variables and nations for which mean 
values have been substituted for missing values are 
as: follows: 


ethnolinguistic heterogeneity — Zimbabwe; 
GNP/capita — Lebanon; 

income inequality — Belgium, Ireland, Singa- 
pore, Syrian Arab Republic, Portugal, Jordan, 
Zimbabwe; 

percent males 15—29 —Jamaica, Lebanon, Nige- 
ria; 

political democracy —Hong Kong, Egypt, Re- 
public of Korea, Zimbabwe. 
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GNP/capita (im); and the infant mortality 
rate. Each of these variables receives a rotated 
(varimax) loading of .85 or greater on the 
same factor. This underlying factor might be 
considered a general "development" dimen- 
sion, with wealthy nations (as indicated by 
bigh GNP/capita) exhibiting low infant mor- 
tality rates, high life expectancy, modest 
population growth, and comparatively small 
“young” populations. A composite "develop- 
ment index" has been constructed by combin- 
ing these five variables after z-score transfor- 
mations and the reversing of polarities where 
required to insure that positive values repre- 
sent high levels of development.? 

Another procedure recommended for deal- 
ing with multicollinearity is to experiment 
with slightly different specifications and to 
observe any fluctuations in the corresponding 
coefficients (Hanushek and Jackson 1977, p. 
90). If important changes in the effects of 
strategic independent variables are brought 
about by minor alterations in the list of 
controls, little confidence can be placed in the 
parameter estimates. A second basic model is 
thus considered in which a single variable 
from the set of five highly intercorrelated 
controls is used to represent that set of 
variables (cf. Cohen and Land 1984). The 
"representative" variable selected is “the 
percent of the population less than 15 years 
old." This variable exhibits the largest rotated 
factor loading (.97), and it also yields the 
strongest zero-order relationship with homi- 
cide rates (r — .30) of variables in the set of 
highly intercorrelated controls. i 

The adequacy of these procedures for 
dealing with problems of multicollinearity can 
be assessed with reference to “variance 
inflation factors.” The variance inflation 
factor (cjj) for a piven predictor is defined as: 
cj = (1—R7)~* where Rj is the coefficient 
of determination resulting from the regression 
of the jth predictor on the remaining predic- 
tors (Fisher and Mason 1981, pp. 105-6). 





? Unfortunately, even though there might be 
conceptual reasons to treat some of the indicators 
in the index separately (e.g., indicators of 
“wealth” versus indicators of “demographic struc- 
ture"), the strong intercorrelations preclude the 
meaningful analysis of separate effects. Because 
these variables are intended to serve primarily as 
controls, ambiguities surrounding the interpretation 
of the development index should not be problem- 
atic in the present analysis. 
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The resulting score indicates the extent to 
which the variance of the estimator is inflated 
by multicollinearity. Fisher and Mason sug- 
gest that “a good rule of thumb regarding 
variance inflation factors is that values above 
4.0 (Le., R? > .75) indicate that the 
corresponding predictor variable is involved 
in collinearity that may have serious effects 
on the OLS regression coefficient estimators 
and their standard errors. Values below 4.0 
are not considered severe” (1981, p. 109). 
In Table 1 I report the variance inflation 
factors for three different matrices. Severe 
multicollinearity problems are quite apparent 
for the original list of predictors (Matrix I). 
However, in the other two matrices, no 
variance inflation factor exceeds the sug- 
gested critical value of 4.0. Regression results 
based on models with these two “reduced” 
lists of predictors should not therefore be 
seriously impaired by multicollinearity. 
Turning to the main hypotheses, the 
macrostructural theory predicts that economic 
discrimination will be positively related to 
homicide rates and that this relationship will 
be stronger than that observed for the overall 
level of income inequality. The zero-order 
correlations are consistent with both hypothe- 
ses (see Appendix B). Both the weighted 
measure of economic discrimination and the 
dummy variable exhibit significantly positive 


Table 1. Variance Inflation Factors 


Matrix" 

Independent Variable I Hu m 
Life expectancy 15.3 — — 
Infant mortality 13.0 — — 
Pop. less than 15 12.1 — 3.5 
GNP/capita (In) | 8.3 — 
Pop. growth 6.3 — — 
Pct. urban 3.1 2.4 2.1 
Ethnolingistic 

heterogeneity 2 1.9 1.8 
Income inequality . 2.1 1.7 1.9 
Population (In) 1.8 1.3 1.5 
Pop. density (in) 1.8 1.3 1.3 
Political democracy 1.7 1.5 1.5 
Economic discrimination 

(“Weighted”) 1.6 1.3 1.3 

Pct. males 15-29 1.4 1.3 1.3 
Development index — 3.8 — 


* Matrix I includes the original list of independent 
variables. Matrix II substitutes a composite development 
index for strongly interrelated predictors. Matrix III 
includes "population less than 15" as a representative 
variable for the strongly interrelated predictors. See text 


. for explanation. 
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associations with the INTERPOL homicide 
rates (r = .31 and .34 respectively).'!° 
Moreover, although the correlation for the 
Gini coefficient of income inequality and 
homicide rates is also positive as expected (r 
= .19), it is somewhat weaker than the 
correlations observed for the discrimination 
measures. 

Additional evidence concerning the impor- 
tance of economic discrimination is provided 
by the regression analyses. In Table 2, the 
results of four regression equations are 
presented. Equation I is based on a “full” list 
of controls with the development index 
included. In this model, the weighted mea- 
sure of economic discrimination emerges as 
the only significant predictor of homicide 
rates—nations with pervasive and intense 
economic discrimination tend to exhibit high 
levels of homicide. Equation I also indicates 
that development has modest effects on 
homicide rates, with more developed nations 
characterized by comparatively low homicide 
rates. This effect is not quite statistically 
significant at the stipulated levels. 

Equation II is a “trimmed” model created 
by means of a backward elimination proce- 
dure.!! The results indicate that all variables 
other than the measure of economic discrimi- 
nation and the development index can be 
eliminated from the original model with no 
appreciable effect on predictive accuracy. 
Equations III and IV are analogous to 
Equations I and II with the exception that the 
measure of the "population less than 15" is 
included in place of the development index. 
Once again, economic discrimination clearly 


10 Given the nonrandom nature of the sample, 
tests of statistical significance are not formally 
appropriate. Nevertheless, such tests are used in 
cross-national studies of crime as a criterion for 
identifying nontrivial relationships. A liberal sig- 
nificance level (.10) is applied in the regression 
analyses, along with the more conventional (.05) 
level, because of the relatively small sample size. 
When the sign is predicted, one-tailed tests are 
performed. 

11 In this procedure, each variable in the “full” 
equation is considered for removal on the basis of 
the probability of the corresponding F-value. The 
variable with the highest probability is removed 
first, and then coefficients are reestimated. The 
procedure continues until all variables with a 
probability-of-F value greater than the criterion 
have been removed. The criterion for the analysis 
is p =.10. 
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Table 2. Regressions for Average INTERPOL Homi- 
* cide Rates (In) with "Weighted" Measure of 





- Economic Discrimination 
Equation: 

Independent Variable I 0 n IV 
Weighted economic 

discrimination .20** — .29** — p» 26** 
Income inequality .04 —  -.05 — 
Percent urban —.09 — ~—.02 — 
Ethnolinguistic 

heterogeneity —.03 — -~.02 ~ 
Population (/n) 01 — 07 — 
Population density (In) —.03 — .03 — 
Democracy index .03 — .09 — 
Percent males 15-29 — —.07 — ~.ll — 
Development index —-.16  -.19* — — 

.35* 24** 


Population less than 15 — — 
R? .15 .14 37 15 
N = 52 


Notes: Standardized regression coefficients are re- 


* ps 0l. 
** p= 05. 
“p= 051. 


emerges as an important predictor of homi- 
cide rates. The results in Equations III and IV 
also indicate that the measure of the relative 
size of the young population serves as a better 
predictor of homicide rates than does the 
development index. Moreover, this demo- 
graphic variable has effects roughly compara- 
ble to, or slightly greater than, those observed 
for the measure of economic discrimination. 
Table 3 reports regression results for 


Table 3. Regressions for Average INTERPOL Homi- 
cide Rates (In) with Economic Discrimination 
“Dummy” Variable 


Equation: 

Independent Variable I II Il IV 
Economic discrimination 

dummy .33**  ,33**  .3]** .30** 
Income inequality .07 — -—.02 — 
Percent urban —.10 —  -—,03 — 
Ethnolinguistic 

heterogeneity -.02 — -—.02 — 
Population (In) —.04 — .03 ~ 
Population density (in) 04 — 04 _ 
Democracy index 04 — .09 — 
Percent males 15-29 — .09 —  -.13 — 
Development index -.16 -1* — — 
Population less than 15 — — .34* 26* 

Rt 7 16 .20 .18 

N = 52 

Notes: Standardized regression coefficients are re- 
ported. 

* ps.10 
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equations which include the dummy variable 
for economic discrimination in place of the 
weighted measure. As in Table 2, four 
equations are provided to represent both 
“full” and “trimmed” models with either the 
development index or the “representative” 
variable. 

The pattern is essentially the same as that 
observed previously. Economic discrimina- 
tion is positively and significantly related to 
national homicide rates across all specifica- 
tions. In fact, the coefficients for the dummy 
variable are consistently somewhat larger than 
those for the weighted measure of economic 
discrimination. Also consistent with findings 
in Table 2, the development index has modest 
effects in an inverse direction, and the relative 
size of the young population exerts fairly 
substantive positive effects on homicide rates. 

To assess further the stability of these 
results, measures of Cook’s D have been 
computed for the fully specified models to 
identity influential cases (Weisberg 1980). A 
large value of D indicates that regression 
estimates depend heavily on that particular 
case. Weisberg (1980, p. 108) suggests that a 
useful criterion for determining a “highly 
influential” case is a value of 1.0 or greater 
on the D statistic. 

The largest observed value of D in the 
analysis is .21 for Ecuador. This value is well 
below the suggested criterion for a “highly 
influential” case. Nevertheless, the regres- 
sions have been re-estimated with Ecuador 
excluded (see Tables 4 and 5). The most 
noteworthy consequence of this exclusion is 
to increase the observed effect of the 
measures of economic discrimination on 
homicide rates. The reestimated standardized 
regression coefficients range from .35 to .40 
across the various equations. 

The exclusion of Ecuador also substantially 
increases the predictive power of the measure 
of income inequality. Consistent with past 
research, the Gini coefficient exhibits appre- 
ciable positive effects on homicide rates 
under certain specifications (see in particular 
Equation II, Tables 4 and 5). Even in these 
models, however, the coefficients for the 
measures of economic discrimination exceed 
those for income inequality, and the effects 
for discrimination exhibit greater stability 
across models. 

The possibility of interactions between 
economic discrimination and other national 
characteristics has also been explored (cf. 
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Table 4. Regressions for Average INTERPOL Homi- 
cide Rates (in) with “Weighted” Measure of 
Economic Discrimination (Excluding Ecuador) 








Equation: 
Independent Variable I I m IV 


Weighted economic 








discrimination .40** ,36*^ — 3g** 35» 
Income inequality .8 .23%* 2 — 
Percent urban —.12 — -—.07 — 
Ethnolinguistic 

heterogeneity ~ 01 — —.02 _ 
Population (In) -.01 — .03 — 
Population density (In) — .02 — .02 — 
Democracy index — 07 —  —.03 — 
Percent males 15-29 —.09 — -4Àl — 
Development index —.05 — — -— 

25 22 26  .24 


Population less than 15. — 
R : 
N-51 
Notes: Standardized regression coefficients are re- 








* ps.10. 
** p.05. 


Krahn et al. 1986). The procedure involves 
the computation of product terms for each 
independent variable and the dummy variable 
for economic discrimination (the more power- 
ful predictor of the two discrimination 
indicators). Additive baseline models which 
include economic discrimination, the popula- 
tion less than 15, and a given independent 
variable are then estimated, and the additional 
contribution to explained variance associated 
with the relevant product terms is assessed. 


Table 5. Regressions for Average INTERPOL Homi- 
cide Rates (/n) with Economic Discrimination 
“Dummy” Variable (Excluding Ecuador) 





Equation: 

Independent Variable I II m IV 
Economic 

discrimination dummy — .39** .36** = .38** 35** 
Income inequality 22 260" 5  — 
Percent urban x — ~.07 -— 
Ethnolinguistic 

heterogeneity —.01 — -—-01 — 
Population (Jn) ~ O07 — —-.02 _ 
Population density (In) .03 — .03 — 
Democracy index — .04 — .00 — 
Percent males 15-29 —.10 —  —.13 — 
Development index ~ .07 — = _ 
Population less than 15 — — .21 — .30** 

R 26 .22 .27 24 


N=51 
Notes: Standardized regression coefficients are re- 





- ps.10. 
** px;.05. 
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The results reveal one significant interac- 
tion, that between population density and 
economic discrimination. The sign of the 
interaction term is positive, which indicates 
that the tendency for economic discrimination 
to promote high homicide rates is greatest in 
the more densely populated nations. These 
findings are consistent with Krahn et al.’s 
(1986, p. 287) argument that high density 
might enhance the visibility of invidious 
social distinctions and in so doing increase the 
likelihood of interpersonal conflict. !? 

The equations have also been estimated 
using the WHO homicide rate and the 
combined WHO-INTERPOL measure, as de- 
pendent variables for the subsample of 32 
nations with available data (see Table 6). 
Once again, a full model is reported for each 
homicide indicator, along with a trimmed 
model created by the backward elimination 
procedure. The “dummy” variable has been 
retained as the measure of economic discrim- 
ination and the “population less than 15” has 
been selected as the representative develop- 
ment indicator, given the comparatively 
strong effects of these variables in the earlier 
analyses. 

The results in Table 6 are very similar to 
those obtained previously. Economic discrim- 
ination clearly emerges as a comparatively 
strong predictor of homicide rates. Table 6 
also reveals that the "population less than 15” 
is no longer a significant predictor of 
homicide rates, while urban development 
appears to be inversely related to homicide. 

One final set of results warrants discussion. 
As noted earlier, mean substitution proce- 
dures have been employed to maximize 
sample size. A potential problem with this 
procedure is that it might bias the comparative 
assessment of the effects of the inequality 
measures. This is because all cases in the 
sample have “genuine” values on economic 
discrimination, whereas mean values on 
income inequality have been assigned to 
seven nations (see footnote 7). To address this 
issue, the basic set of regression equations 
reported in Tables 2 and 3 have been 


12 Similar procedures have been employed to 
test for interactions between the Gini coefficient 
and all other independent variables. Using as 
baseline models the full equations with either the 
development index or the population less than 15, 
no significant interaction terms involving the Gini 
coefficient are observed. 
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Table 6. Regressions for Average Homicide Rates (Ir) 
Using W.H.O. Data and a Combined W.H.O.- 
INTERPOL Measure 








Homicide Indicator: 





W.H.O. W.H.O.- 
Homicide INTERPOL 
Rate Average 
Independent Variable I u m IV 
Economic discrimination 
dummy .42** 46** — 40** ,50u* 
Income inequality 23. — .06 ~ 
Percent urban —26 -— —.36 -.307* 
Ethnolinguistic 
beterogeneity 40 — .10 — 
Population (in) -.14 — .07 — 
Population density (in) —.03 . — .00 — 
Democracy index —.,13 — .00 — 
Percent males 15—29 02 — 17 _ 
Population less than 15 —.16 — . —.03 = 
Rm 34 22 .42 38 
N = 32 
Notes: Standardized regression coefficients are re- 
ported. 
* p=.10. 
** 05.05. 


reestimated for the subsample of 45 nations 
with nonmissing values on the Gini coeffi- 
cient. The results reveal that the standardized 
regression coefficients for economic discrim- 
ination are in all instances appreciably larger 
than those. for income inequality. The 
“dummy” variable for economic discrimina- 
tion once again yields consistently significant 
estimates across specifications, although the 
“weighted” measure of discrimination attains 
a level of significance only in the trimmed 
models. In general, the regression coefficients 
for the subsample with nonmissing data for 
income inequality are similar to those for the 
full sample, and conclusions concerning the 
comparative effect of the two forms of 
inequality remain unchanged. 

Analyses have also been performed on 
subsamples with nonmissing data on the 
control variables. When cases are deleted on 
the basis of missing data on any single control 
variable, there is little change in sample 
composition, and the results conform rather 
closely to those for the full sample. However, 
because different nations exhibit missing data 
for different variables, a strict listwise 
deletion of cases &cross all variables produces 
a substantial reduction in sample size from 52 
to 39 nations. 

Analyses based on this reduced sample 
diverge in some important respects from those 
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reported for other samples. The effects of the 
discrimination dummy variable continue to be 
positive but are modest and statistically 
significant only under certain specifications. 
The coefficients for income inequality are 
consistently nonsignificant, but so are the 
coefficients for the weighted measure of 
- discrimination. In short, the highly consistent 
findings for the main sample must be 
interpreted with the qualification that an 
appreciable change in sample size and 
composition can alter the pattern of relation- 
ships. 


SUMMARY AND DISCUSSION 


The primary purpose of this analysis has been 
to test the hypothesis derived from macrostruc- 
tural theory that economic discrimination 
against social groups is an important determi- 
nant of national levels of homicide. The 
results are reasonably consistent and clear. 
Measures of economic discrimination against 
social groups exhibit significant, positive 
effects on national homicide rates. These 
effects withstand controls for a wide array of 
national characteristics, including indicators 
of the overall. standard of living; population 
size, growth, and density; age-sex structure; 
and. political structure. The estimates for 
economic discrimination prove to be highly 
stable across varying specifications with 
different controls. Moreover, similar patterns 
are observed for homicide indicators based on 
two different data sources. 

The results also support the theoretically 
derived claim that the way in which economic 
inequality is generated, or the type of 
inequality, has an even greater bearing on 
levels of homicide than does the simple 
magnitude of this inequality. This conclusion 
is suggested by the finding that measures of 
economic discrimination are consistently stron- 
ger predictors of homicide rates than is the 
standard measure of income inequality, the 
Gini coefficient of household or individual 
income concentration. Evidently, the structur- 
ing of inequality in terms of ascribed social 
characteristics serves as a particularly impor- 
tant source of lethal violence. 

Although these findings are consistent with 
the predictions of the macrostructural theory, 
conclusions must be drawn cautiously for 
several reasons. ‘First, the measures of 
economic discrimination are rather crude, and 
their adequacy for cross-national comparisons 
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needs to be more fully established with future 
research. Second, the aggregate nature of the 
data precludes identification of the microlevel 
relationships between ‘social characteristics 
and homicide. There is no way of determin- 
ing, for example, whether those who commit 
homicide are’ drawn disproportionately from 
groups subject to discrimination throughout 
the sample of nations. Finally, the sample of 
nations is rather small and is based on the 
availability of data. Representativeness is 
accordingly problematic, and appreciable 
changes in sample composition can alter some 
of the effects. Firm conclusions must await. 
replication of these results with additional, 
preferably larger, samples of nations and with 
more refined measures of economic discrimi- 
nation. i ' 

With these qualifications in mind, it is 
interesting to note a striking affinity between. 
the implications of Blau’s recent macrostruc- 
tural theory and those of Durkheim’s classic 
remarks on social. inequality in The Division 
of Labor. Durkheim speculates that strong 
solidarity can be achieved in the “advanced” 
societies only to the extent that “social 
inequalities exactly express natural inequali- 
ties” ([1893] 1964, p. 377). He cautions that 
“external inequalities,” that is, inequalities 
rooted in ascribed characteristics, preclude 
the development of strong social bonds. . 
Durkheim does not address the topic of 
homicide directly in The Division of Labor, 
but insofar as high homicide rates are a 
reflection of weak social bonds, his argu- 
ments lead to the very same prediction 
implied by the macrostructural theory— 
discriminatory social practices that generate 
*consolidated inequality" will promote high 
levels of lethal violence. In short, the results 
of this analysis are compatible. with both 
recent macrostructural theory and classical 
Durkheimian social thought, and they under- 
score the potentially high costs of ascriptive 
inequalities in modern societies. 
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Table A-1. Sample of Nations and Economic Discrimi- 
nation Scores 


Percent Intensity 
Discriminated `of 
. Nation Against Discrimination 
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An early debate in the empirical study of political democracy concerned the 
measurement of democracy. Initial work employed dichotomous indicators and 
incorporated stability into political democracy measures. Evidence accumulated 
showing that this approach could adversely affect analyses, particularly in the 
study of income inequality. Despite this, some recent studies have renewed the 
flawed practices. We draw upon recent work to highlight the confusion that can 
result when democracy and stability are confounded and the problems with 
dichotomous indicators. We propose that stability is analytically distinct from 
political democracy and should be treated as such empirically, and we suggest 
ways to estimate incremental effects of political democracy. We further argue that 
political democracy is continuous and that measures of it should reflect this. It is 


important that the measurement history of this construct not repeat itself. 


Three decades ago, Lerner (1958), Lipset 
(1960), and Cutright (1963) pioneered the 
cross-national empirical study of political 
democracy. Initial measurement efforts were 
limited. For example, Lipset treated political 
democracy as a binary rather than a continu- 
ous variable. He and Cutright (1963) mea- 
sured democracy so that it included stability 
along with measures of political rights and 
liberties. With improvements in both the 
quality and availability of data, the practices 
of treating democracy as a dichotomy and of 
mixing the concepts of stability and democ- 
racy became less common (Jackman 1973; 
Johnson 1977; Bollen 1980). Recently, how- 
ever, the previously abandoned conventions 
have enjoyed a resurgence. 

At an intuitive level, it is appealing to 
divide the world into democracies and 
nondemocracies. And the idea that the 
persistence of democratic institutions should 
be included in any measure appears attractive. 
But the procedures have shortcomings. 

Dichotomizing democracy lumps together 
countries with very different degrees of 
democracy and blurs distinctions between 
borderline cases. For example, are democratic 
practices entirely absent from Mexican poli- 
tics? Does the return of elections to Argentina 
mean that it is a full-fledged democracy? 
Does the suppression of the Tamil separatist 
movement in Sri Lanka assign that country to 
the nondemocratic rank? The difficulty in 


*We would like to thank Mary Jackman, 
Gerhard Lenski, Fred Pampel, and Brian Silver for 
their constructive criticism and suggestions. 


612 


answering these questions reflects the inher- 
ently continuous nature of the concept of 
political democracy. Further, labeling some 
countries as democratic invites insensitivity 10 
persisting political inequalities in even these 
states, because it implies that they are fully 
democratic: democracy is specified in ad- 
vance as an all-or-nothing matter. 

The other pivotal measurement issue cen- 
ters on the fusion of stability and democracy. 
This fusion in empirical measures makes it 
impossible to interpret observed associations 
of “democratic stability” with other vari- 
ables, because it is never clear whether degree 
of stability or degree of democracy is the 
operative factor at work. For example, Moore 
(1966) and de Schweinitz (1964) hypothe- 
sized that countries that developed earlier are 
more democratic than late developers, net of 
their level of industrialization. This hypothe- 
sis is supported if one uses Cutright’s (1963) 
Stable democracy index as the measure of 
democracy, but not if one uses a measure that 
excludes the stability component (Bollen 
1980). Likewise, the confounding of political 
democracy and political stability seems par- 
tially responsible for Cutright’s (1967) judg- 
ment that democracy reduces income inequal- 
ity (see Jackman 1974). However, once one 
uses continuous measures, free of political 
stability, one rarely finds significant relation- 
ships between political democracy and in- 
come inequality (Bollen and Jackman 1985). 
Further, stratification theories like Lenski’s 
(1966, pp. 59-62, 439) distinguish the effects 
on inequality of (a) the nature of the political 
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system and (b) the duration or stability of the 
regime. To test these ideas requires distinct 
measures of each concept. 

Despite the above considerations, some 
authors have renewed the custom of dichoto- 
mizing democracy or blending stability into 
their democracy measures (e.g., Coulter 
1975; Hewitt 1977; Vanhanen 1979, 1984; 
Muller 1988). Our purpose is to demonstrate 
that these practices resurrect past problems. 
First, we show that confounding stability with 
democracy leads to uninterpretable associa- 
tions. Second, we argue that since democracy 
is conceptually continuous, it is best mea- 
sured in continuous terms. We address these 
issues in the context of the study of 
democracy and inequality. 


STABILITY AND DEMOCRACY 


Political sociologists have long debated the as- 
sociation between democracy and income in- 
equality (e.g., Lipset 1960; Lenski 1966; Cu- 
tright 1967; Jackman 1974; Rubinson and 
Quinlan 1977). Two of the most recent em- 
pirical analyses reached opposite conclusions. 
The first (Bollen and Jackman 1985) found no 
significant association, while the second 
(Muller 1988) concluded that democracy re- 
duced inequality. One key difference between 
these two studies lies in the measurement of 
democracy. Bollen and Jackman (1985) em- 
ployed a continuous measure of democracy for 
a specific time point. This measure includes 
six indicators of political rights and political 
liberties for 1965, and ranges from 0 to 100 
(for full details on the measure, its properties, 
and country values, see Bollen 1980; Bollen 
and Grandjean 1981). 

In contrast, Muller (1988) dated the“inau- 
guration” of democracy as the year of the first 
election that satisfies the following four 
criteria: *(1) the executive must be elected or 
be responsible to an elected assembly in (2) at 
least two consecutive and free and fair 
competitive elections in which (3) at least 
approximately a majority of the population 
has the right to vote, and during which (4) the 
rights of freedom of speech and assembly are 
respected" (Muller 1988, p. 54). These 
criteria form the basis of the "democratic 
experience" country scores that Muller dis- 
played in his Table 2. 

Years of democratic experience were mea- 
sured by the number of years since the 
inaugural date, after adjustments are made for 
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either “interruptions” or “breakdowns” in 
democracy. "Interruptions include the tempo- 
rary suspension of democratic liberties, as 
well as periods when the government is 
controlled by a foreign power. Breakdowns 
occur when democratic regimes are replaced 
internally by authoritarian rule or lose author- 
ity over a large proportion of the population 
due to civil war" (1988, p. 56). Where the 
criteria for democracy were never met, 
Muller assumed that democracy was never 
inaugurated, and the country is scored zero. 
Countries were also scored zero if the criteria 
were once met, but the interruption or 
breakdown lasted more than 20 years (Table 
2). Muller employed his index in two ways. 
The first is the natural logarithm of the raw 


^ democratic experience scores (plus one). The 


second is a fivefold qualitative classification: 
nondemocracies (no years of democratic 
experience); new democracies (1—20 years); 
relatively new democracies (20—39 years); 
relatively old democracies (40—59 years); and 
very old democracies (60 or more years). 

Muller's index appears to incorporate a 
stability factor (since it is a count of the 
number of years certain institutions have been 
in place) whereas Bollen and Jackman's does 
not. However, Muller asserted that "the years 
of democracy measure does not confound 
stability with democracy because stability is 
not part of the operational definition," even 
though he acknowledged that "countries with 
many years of democratic experience tend to 
be politically stable" (1988, p. 57. Emphasis 
in original). Does Muller's measure confound 
the two concepts? 

Bollen (1980, pp. 382-84) employed an 
empirical test to demonstrate the presence of 
stability in Cutright's (1963) democracy 
measure. We apply the same test to Muller's 
(1988) measure. To label the variables, we 
use the following acronyms: DEMEXP is 
years of democracy, as defined by Muller; 
POLDEM is degree of democracy, 1965, as 
measured by Bollen (1980); POLSTAB is 
political stability, as measured by Muller 
(1988, p. 57) If DEMEXP does not 
confound stability with democracy, then 
regressing DEMEXP on POLDEM and POL- 


! POLSTAB is the natural logarithm of the 
difference between the most recent date for the 
inequality data employed by Muller and the year in 
which the current constitution was inaugurated 
(from Table 2.1 of Taylor and Hudson 1972). 
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STAB will generate an insignificant parame- 
ter estimate for POLSTAB, and a significant 
and positive coefficient estimate for POL- 
DEM. If DEMEXP does confound stability 
with democracy, then the estimate for POL- 
STAB from the same regression will also be 
significant and positive. 

Regressing DEMEXP on POLDEM and 
POLSTAB produces the following estimates: 


—2.625 + .051 POLDEM 
(.431) (.006) 

--.288 POLSTAB + ej. 

(.115) 


(1). DEMEXP = 


Standard errors are in parentheses, the R? is 

.713, and N = 54. Even with the control for 
POLDEM, POLSTAB is associated with 
DEMEXP (the t-ratio for POLSTAB is 2. 5). 
In contrast, when POLDEM is regressed on 
DEMEXP and POLSTAB, the following 
estimates are obtained: 


Q POLDEM = 49.480 + 12.200 DEMEXP 


(5.402) — (1.343) 
` —.396 POLSTAB -e;. 
| (1.886) 


Standard errors are again in parentheses, the 
R? is .678, and N — 54. 

These estimates have two implications. 
First, from both equations we see that there is 
a strong association between DEMEXP and 
POLDEM, which Muller described as "the 
most reliable and valid indicator of a 
country's level of democracy" (1988, p. 54). 
Second, equation 1 shows that Muller's 
measure (DEMEXP) is a function of both 
degree of democracy and political stability. 
Hence, the conclusion is inescapable that 
DEMEXP confounds the two. In contrast, we 
see from equation 2 that POLDEM is not 
similarly confounded.? 

With this established, we can examine the 
link between democracy and inequality. Do 
we obtain comparable results with DEMEXP, 
which reflects both degree and stability of 
democracy, and with POLDEM, which indi- 
cates only the degree of democracy? To 
address this question, we need a baseline set 





? These estimates can be compared directly with 
those reported by Bollen (1980, p. 383). Muller 
(1988, footnote 22) acknowledged that Cutright's 
measure is confounded with stability. 
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of regression estimates against which the 
effects of democracv can be judged. In view 
of prior research (e.g., Ahluwalia 1976; 
Bollen and Jackman 1985; Muller 1988), an 
appropriate baseline involves regressing in- 
equality on the level of economic develop- 
ment and population age structure.? 

Since we want to compare the performance 
of DEMEXP and POLDEM, and given that 
DEMEXP? scores are available only for the 55 
countries analyzed by Muller, we confine our 
attention to those 55 countries (POLDEM 
scores are available for 54 of these). Accord- 
ingly, Table 1 reports four regressions 
employing as the measure of inequality the 
Gini coefficients reported by Muller (1988, 
Table 1). The first column shows the 
estimated baseline model; the second and 
third columns present the estimates obtained 
when the logarithm of DEMEXP and the 
qualitative measures of DEMEXP, respec- 
tively, are added to the baseline; the fourth 
column displays the estimates obtained when 
POLDEM is added to the baseline. 

. The figures for the baseline model in the 
first column of Table 1 show that the 
relationship between economic development 
and income inequality conforms to Kuznets's 
(1963) inverted-U shaped hypothesis. Fur- 
ther, consistent with Bollen and Jackman 
(1985) and Muller (1988), inequality in- 
creases with the proportion of the population 
that is young (0 to 14 years of age). These 
patterns persist when either DEMEXP or 
POLDEM is added to the model. 





? We exclude warld-system position since the 
analyses in both Bollen and Jackman (1985) and 
Muller (1988) provide no evidence for such 
positional effects. 

* Following Ahluwalia (1976) and Bollen and 
Jackman (1985), we employ GNP per capita, 
1965, as the measure of economic development 
(figures for the 55 countries are from the World 
Bank 1983). Data on the proportion of the 
population aged 0-14, 1965, are also from the 
World Bank (1983), except in the case of Taiwan, 
where we employ the mean of the 1960 and 1970 
figures reported by the World Bank (1980). Data 
on years of democratic stability are from Muller 
(1988, Table 2). Similar results are obtained when 
levels of energy consumption per capita, 1965 
(Taylor and Hudson 1972), are substituted for the 
GNP data, although employing the latter improves 
the fit a little. We exclude a dummy for the type of 
inequality data because Muller's data set includes 
very few cases with inequality data based on 
nonhousehold information. 
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Table 1: Regressions of Income Inequality (Gini) on 
Level of Economic Development, Population 
- Age Structure, and Democratic Experience 








—.528 


Constant |o—.782* —.712* 
(275) (272) (309) (270) 
In GNP/pop, 1965 — .323*  .293*  238*  .308* 
$ (.093)  (.093). (.106) | (.092) 
In GNP/pop, af 
1965, squared —.026* —.023*- —.018* —.024* 
(.008) (.008) (.009) . (.008) 
% population .006* .006*  .005* .006* 
"0-14, 1965 (002) (001) (002) (001) 
ln DEMEXP —.012 : i 
(.007) 
Relatively new ‘= ,037 
democracy (.027) 
Old democracy — .057 
- (.036) 
POLDEM — .0005 
(.0004) 
Adjusted R? .498 519 511 .497 
N 55 55 55 54 





Starred coefficients are more than twice their standard 
errors, which appear in parentheses below them. 


Turning to the second and third panels, the 
estimates for years of democratic experience 
have the sign postulated by Muller, but they 
are of borderline statistical significance. None 
of the three estimated coefficients satisfies the 
common guideline of being at least twice its 
standard error. With a t-ratio of 1.77, the 
coefficient estimate for In DEMEXP in the 
second column is statistically significant .at 
the .10 level (two-tailed test); a simultaneous 
F-test for the two coefficient estimates in the 
third column is not significant at the .10 
level. Pampel and Williamson (forthcoming) 
estimate a model that includes our baseline 
variables and Muller's measure for a different 
set of countries and with different inequality 
data. Their parameter estimate for the log- 
arithm of democratic experience is not 
statistically significant. In all, there is at best 
weak evidence for the proposition that 
DEMEXP influences income inequality.5 

The fourth panel of Table 1 displays the 
estimated impact of POLDEM within the 
context of the baseline model. That effect is 
even weaker than the DEMEXP effects. The 
estimated coefficient for POLDEM is barely 
larger than its standard error, and the t-ratio 





5 Discrepancies between these estimates and 
those reported by Muller (1988, Table 4) stem 
from the fact that he did not evaluate the effects of 
DEMEXP and POLDEM within the context of the 
simple baseline model that we have adopted. 


—.725* 
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of 1.28 is not significant at the .10 level with 
either a one- or two-tailed test. 

One argument for measures of “years of 
democracy," such as Hewitt's (1977) or Mul- 
ler's (1988), is that these measures capture 
variance in an additional trait that affects in- 
equality (see, e.g., Muller 1988). We can eval- 
uate this idea by examining possible effects of 
DEMEXP on inequality, with POLDEM held . 
constant. Regressing inequality on DEMEXP, 
POLDEM, and the baseline variables pro- 
duces the following estimates: 


(3) GINI = —.691 + .290 InGNP 
(272) (.093) 
— .022 InGNPSQ 
(.008) 
+ 006 AGEO-14 
(.002) 


— .009 InDEMEXP 
(009) — 

— .0001 POLDEM +e3. 
(0005) 


Standard errors are in parentheses, the 
adjusted R? is .497, and N = 54. The 
argument implies that the coefficient for 
POLDEM is insignificant, while that for 
DEMEXP is significant and negative., How- 
ever, the estimates in equation 3 belie this 
expectation, since neither of the coefficients 
in questioh approaches statistical significance 
(the ¢-ratio for DEMEXP is 1.01).6 As would 
be expected, we reach essentially the same 
conclusion when we residualize DEMEXP on 
POLDEM, and enter this “purged” version of 
Muller’s DEMEXP as an explanatory vari- 
able.” From the foregoing, it is obvious that 





$ Evidently, Muller (1988, p. 57, footnote 15) 
performed a similar analysis, but he reported only 
the nonsignificance of the POLDEM coefficient, 
omitting any discussion of the nonsignificance of 
DEMEXP. 

7 The estimates ‘are as follows: 


(4) GINI = —.726 + .306 InGNP 
C277) (.094) 
— .025 InGNPSQ 
(.008) 
+ .006 AGE0-14 
(.002) 


— .005 ResDEMEXP + e4, 
(.009) 


where standard errors are in parentheses, the 
adjusted R? is .483, and N = 54. ResDEMEXP is 
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Muller’s measure does confound democracy ` 


and stability and that the variance unique to 
Muller’s measure (and not shared with 
POLDEM) is unrelated to inequality.? 

Our complaint about measures that combine 
stability and democracy is not a denial that 
. Stability is potentially important. Plausible (test- 
able) propositions involving incremental ef- 
fects of both stability and political democracy 
on inequality can be constructed. For exam- 
ple, one could argue that the effect of democ- 
racy on inequality is highest where political 
institutions are most stable. Empirically, this 
might involve specifying inequality as a mul- 
tiplicative function of the degree of both de- 
mocracy and stability. If true, inequality would 
decrease as institutions become more demo- 
cratic and stable. In this formulation, democ- 
racy is still cast in continuous terms. 

We reported a specification along these lines 
in Bollen and Jackman (1985, p. 448). Using 
age of the current constitutional form (logged) 
as an indicator of political stability; we found 
rio evidence that the interaction between de- 
mocracy and stability had a significant influ- 
ence on inequality. In other words, there was 
no evidence that the impact of the degree of 
democracy on inequality hinges on years of 
experience with political institutions. 

An alternative way to model a gradual, 
incremental influence of political democracy 
on income inequality explicitly includes a 
series of lagged values of political democ- 
racy. Consider the following: 


y = a + bix, + box. 


(5) 
' +b; X2... 4 by X; -k—1 


+C + &, 


where y, is the income inequality in a given 








formed from the residuals of a regression of 
InDEMEXP on POLDEM. 

* Muller (1988, p. 50) claimed that level 
measures such as POLDEM assume instantaneous 
effects. This, however, is not always true. Indeed, 
cross-sectional data sometimes better represent 
long-run impacts than do time-series data (see, 
e.g., Intrilligator 1978, pp. 62-64; Cassidy 1981, 
pp. 77—85). The assertion that researchers using 
level measures assume immediate effects is thus 
erroneous. Further, Muller did not follow his own 
advice when he used level of economic develop- 
ment rather than “years of economic development" 
and level of income inequality instead of "years of 
income inequality" as explanatory variables in his 
regression equations. 
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year, x,..; is the value of political democracy 
in the t—i year, and z, is a vector of other 
explanatory variables (e.g., InGNP, ln- 
GNPSQ, and AGEO-14) and c their coeffi- 
cients. In this formulation, political democ- 
racy is allowed to have a gradual, incremental 
effect over a period of years. If we expect 
negligible effects until after a decade or more, 
we could include political democracy mea- 
sures beginning 10 years prior to the 
inequality measure. Or we could use any of 
the distributed lag specifications developed by 
econometricians (see, e.g., Johnston 1984). 

Contrast equation 5 with the implicit 
models in Cutright (1967), Hewitt (1977), 
and Muller (1988): 


(6) y= a + by & t xa 
+ X2 +... 
ToxX-poc mede, 


where the variables are defined as above. The 
expression inside parentheses shows that 
these authors treat democracy as a simple sum 
of yearly scores. For Cutright, each annual 
Score is a continuous measure; for Hewitt and 
Muller, democracy is scored as present (1) or 
absent (0) in each year. The influence on 
inequality of democracy in year t is implicitly 
constrained in these formulations to be 
identical for all lagged years. This follows 
because the coefficients for democracy in the 
current year (x, as well as, say, 50 years 
earlier (x, 49) are both b,. Thus, Muller’s 
statistical formulation in fact contradicts his 
argument that *a relatively immediate causal 
effect of democracy on inequality is implau- 
sible" (Muller 1988, p. 51). Muller's ap- 
proach, like that of his predecessors, actually 
treats the influence of democracy on inequal- 
ity as uniform, regardless of the length of the 
lag.? 


IS DEMOCRACY AN ALL-OR- 
NOTHING MATTER? 


We may tolerate a crude, binary classification 
for political democracies as a shorthand 
reference to clusters of countries that have 
large differences in degrees of democracy. Or 


? An analogous argument can be made for 
Muller's logged version of “years of democracy.” 
The untransformed version is given here to 
simplify the presentation. 
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we may label nations as: democratic to isolate 
countries with the highest levels of political 
democracy for more. intensive study (e.g., 
Powell 1982; Lijphart 1984). But in quantita- 
tive empirical analyses including societies 
with diverse levels of political inequality, a 
simple dichotomous distinction between dem- 
ocratic and nondemocratic is hard to justify. 
In early work such as Lipset's (1960), binary 
classifications reflected the difficulty of 
obtaining more refined information. But, 
even then, some analysts (e.g., Lerner 1958) 
found continuous proxies for democracy.!? 
Cutright (1963) was an early critic of 
dichotomous measures of political democ- 
racy. We share his premise that political 
democracy is a continuous concept and that 
our measures should reflect this. 

The analyses by Lipset (1960), Rustow 
(1967), Hewitt (1977), and Muller (1988) fail 
within this binary classification tradition. 
Muller's analysis illustrates the limitations of 
any attempt to categorize democratic regimes. 
Details on Muller's four measurement criteria 
were provided in the last section. What do we 
learn from an inspection of the country scores 
themselves? 

Of the 55 countries in Muller's analysis, 
the United States is scored as having the most 
democratic experience because democracy 
was established with the introduction. of 
universal manhood suffrage in 1870. Accord- 
ing to Muller, prior to that date the United 
States was not democratic, but as of 1870 it 
was. By contrast, Mexico had the same lack 
of democracy as South Africa and Taiwan. 
West Germany and Costa Rica became 
democracies in 1949. Note that Muller's 
approach implies that, as of 1969, the latter 
two countries had enjoyed the same amount 
of democratic experience as had the United 
States by 1890. In other words, the demo- 
cratic experience of West Germany or Costa 
Rica as of 1969 was equivalent to that of the 
"United States as of 1890, because each 
country had experienced a generation of 
democracy.!! ; 

An implicit assumption here is that the 


nature of political democracy has been 


invariant over time. But this assumption is 
implausible and misleading. Consider some 





10 Voter turnout is Lerner’s sole proxy measure 
for democracy. The validity of this indicator is 
subject to debate. . 

!! Muller defined a "generation" as 20 years. 
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concrete cases. Ás of 1949, West Germany 
had universal adult suffrage, a secret ballot, 
and votes were equally weighted. In contrast, 
although the 15th Amendment to the United. 
States Constitution (effective in 1870) did 
seek to outlaw racial discrimination in voting, 


' evasion of this Amendment was widespread. 


Poll taxes and literacy tests were still common 
in the 1960s.!2 The litany is familiar. Less 
well known, perhaps, the secret ballot was 
first introduced in the United States in 
Massachusetts in 1888, and had not been 
adopted by all states a decade later.!? In light 
of this, the claim that democracy in the 
United States was effectively inaugurated in 
1870 cannot be sustained. Similarly, Muller's 
dating of the creation of democracy in the 
United Kingdom at 1918 coincides with the 
date on which universal adult male suffrage 
was introduced. However, university gradu- 
ates and occupiers of business premises could 
vote more than once after that year, and plural 
voting was not abolished until 1948 (Butler 
1953). If one takes seriously the proposition 
that all votes should be equally counted, then 
democracy would not appear much older in 
the United Kingdom than it is in West 
Germany. : i 
An additional complication is Muller's 
reliance on universal manhood suffrage in 
dating the beginning of democracy. If we 
consider the extension of the franchise to 
women, we obtain very different “years of 
democracy” for some countries. For instance, 
Belgium’s date for the inauguration of 
democracy would be 1948 rather than 1919; 
Argentina’s would be 1947 rather than 1916; 
and France’s would be 1945 rather than 1875! 
In other cases, Muller’s dating may under- 
state democratic experience. For example, his 
date for Japan is 1952, the end of the 
American occupation. But the 1947 constitu- 
tion restored the prewar electoral system, 
among other things. Under the Meiji constitu- ' 
tion there had been considerable electoral 
competition, and universal .adult male suf- 
frage was introduced in 1925 (Ishida 1968; 


12 Oddly, Muller (1988, p. 56) used the 
presence of literacy tests in dating democracy in 
Chile, but not in the United States. 

35 Because it reduced the amount of intimidation 
at the polls, the introduction of the secret ballot in 
the United States was followed by a decline in 
turnout. For an interesting analysis, see Rusk 
(1970). . 
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Scalapino 1968). True, political parties were 
banned from 1940 to 1946, but using Muller’s 
own criteria, 1952 would seem too late a 
date. Similarly, as Muller himself notes, 
suffrage was universal in Weimar Germany, 
but that experience is not counted because 
West Germany was a new state in 1949.14 
We think it would be difficult for an 
independent researcher to reproduce Muller’s 
inaugural dates. First, his ‘criteria are not 
clearly defined. For example, what consti- 
tutes “free and fair competitive” elections? 
Similarly, Muller does not tell us when 
“freedom of speech and assembly are re- 
spected.” Even were these more clearly 
defined, there appear to be many exceptions 
to his criteria. Muller reported using Hewitt’s 
(1977) data to determine the inaugural date 
“in most countries where democracy was 
established before World War II” (1988, p. 
56). However, nowhere did Hewitt provide 
data on freedom of speech and assembly, one 
of Muller's criteria. Further, Muller's use of 
Hewitt's data is erratic. For example, Muller 
dated American democracy from 1870, when 
universal male suffrage was introduced. 
Hewitt dated it from 1904, when use of the 
secret ballot became widespread. Muller 
dated democracy in France from 1875, with 
the introduction of an elected executive/ 
legislature, but Hewitt dated it from 1913, 
with the introduction of the secret ballot.!5 
We are not suggesting that Muller has 
merely identified the wrong dates in which 
democracy was “inaugurated’ in certain 
countries. Our argument is more fundamen- 
tal. It is meaningless to claim that democracy 
was inaugurated in.a given country: on a 
single date. The very idea of an inaugural 
date before which democracy does not exist 
and after which it does is hollow. Instead, 
democratic forms have evolved considerably 
over the past century. The emphasis on 


14 There were, of course, major boundary 
changes after World War II. But Germany is 
hardly the only European country to have experi- 
enced significant boundary changes since the turn 
of the century. And if postwar Germany is to. be 
treated as & special case, how should the various 
French Republics be counted? On the evolution of 
French electoral laws since 1789, see Campbell 
(1965). 

35 Inspection of footnote 9, p. 56 reveals that 
Muller had many more exceptions to his coding 
rules. 
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inclusiveness (universal suffrage) is of recent 
vintage (Dahl 1971). That Muller used 
universal male adult suffrage as a criterion for 
democracy simply underscores the arbitrary 
and artificial nature of his coding scheme. 

Democracy is always a matter of degree. It 
was precisely in recognition of this that Dahl 
proposed the term polyarchies, defined as 
“relatively (but incompletely) democratized 
regimes" (1971, p. 8). In the United States, 
for example, there were considerable political 
rights and liberties prior to the 1960s, but 
factors like discriminatory voter registration 
requirements remained notable blemishes. 
Thus, while we might characterize the United 
States as more democratic than Nicaragua in 
the period, we would stop short of labeling 
the United States as fully democratic. Simi- 
larly, we do not find useful Muller's 
classification of Mexico, Portugal, and Spain 
as (equally) undemocratic in the 1960s. While 
we recognize that with the PRI there has been 
less political competition in Mexico than in, 
say, Holland, Mexico demonstrated more 
competition, more inclusiveness, and fewer 
restrictions on information than did either 
Portugal or Spain at the time.!6 The dichoto- 
mous approach dismisses these differences as 
inconsequential. 

Along with their intrinsic significance, 
variations in degree of democracy are at the 
heart of common arguments linking the 
expansion of political democracy to a reduc- 
tion in economic inequality (e.g., Lipset 
1960; Lenski 1966). Those arguments empha- 
size that as politics becomes increasingly 
competitive and inclusive, political inequali- 
ties are reduced. With these changes, groups 
that had previously been excluded acquire 
more political power. Where they could 
previously be ignored, they now increasingly 
have to be accommodated. Again, these 
changes did not suddenly spring forth in full 
bloom, but rather were typically slow and 
uneven: property and other economic restric- 
tions on the right to vote (i.e., political 
inequalities) were gradually removed as 
Broups of increasingly lower income were 
successively enfranchised. Indeed, standard 
arguments connecting political democracy 


16 By the middle 1980s, we would expect the 
data on degree of democracy to show that these 
contrasts have changed dramatically with both 
Portugal and Spain having become much mcre 
democratic. 
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with reductions in economic inequality as- 
sume that voters seek to maximize their 
incomes. As politics becomes increasingly 
inclusive, more such income maximizers 
enter the political arena, demanding a greater 
share of the pie. This is the critical element 
said to link democratic reforms to reductions 
in economic inequality. 

In the second empirical section of his 
paper, Muller examined the other side of the 
coin: does inequality affect democracy? He 
concluded that while inequality is unrelated to 
the inauguration of democracy, it does affect 
the survival odds or stability of democratic 
regimes. However, these empirical claims 
also suffer from the dichotomous approach to 
measurement. They falsely assume that an 
inaugural date can be identified. They further 
assume that once in place, countries are either 
democracies or they are not, and that the two 
categories are internally homogeneous. For 
1945—61, countries as diverse as Brazil, 
Guatemala, and Peru are classified as democ- 
racies equivalent to India, Italy, and West 
Germany (Muller 1988, Table 5). In a parallel 
manner, the analysis of democratic stability 
from 1961 to 1980 centers on a set of 33 
countries said to be democratic as of 1961: 
this is the population at risk of experiencing 
instability in ensuing years. Again, the 
implicit assumption here is tbat the 33 cases 
are equally democratic. For example, Muller 
acknowledged the 1960 crisis in Turkey, but 
then went on to suggest that “democracy was 
restored in 1961" (1988, p. 62). Countries 
such as Turkey, Brazil, and Panama are thus 
said to be as democratic as Canada, Holland, 
and Sweden as of 1961. These categorical 
taxonomies cannot be sustained whether 
democracy appears on the right or the Jeft side 
of the equation. !? 


17 A parallel problem is apparent in Muller's 
analysis of instability, which was also treated in 
binary terms: either democracies experienced 
instability (and were "replaced by authoritarian 
rule") or they did not. His expectations about 
instability were couched in terms of military coups 
d'etat (1988, p. 62), and a majority of events 
included are indeed coups. But others are not. For 
example, the idea that the outbreak of civil war in 
Lebanon in 1975 is an instance of a democracy 
being replaced by authoritarian rule squares 
poorly with events in that country. Counted as an 
equally severe event is the 1969 suspension of 
Parliament in Malaysia; not counted is the 1975 
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Our purpose has been to show that recent 
measures of political democracy which incor- 
porate the age of democratic institutions and 
conceive of the phenomenon in dichotomous 
terms resurrect problems that were inherent in 
older measures. First, we demonstrated that 
confounding stability with democracy leads to 
uninterpretable associations. Second, we ar- 
gued that since democracy is conceptually 
continuous, it is best measured in continuous 
terms. 

On the first point, measures of democratic 
experience like those proposed by Cutright 
(1963), Hewitt (1977), and Muller (1988) 
confound stability with democracy and are 
therefore fundamentally invalid. Moreover, 
this conceptual issue has immediate implica- 
tions for the empirical analysis of linkages 
between democracy and inequality. Although 
we do not minimize the potential importance 
of stability, if we are to discriminate effects 
with any confidence, we should eschew 
contaminated variables and concentrate on 
concepts and measures that reflect as closely 
as possible the theoretical constructs at hand. 

On the second score, treating democracy as 
an all-or-nothing dichotomy introduces insu- 
perable difficulties. Dating the inauguration 
of democracy conceived in binary terms is an 
inherently ambiguous task, as we have 
shown. In fact, it is an impossible task. The 
empirical problems we identified with the 
dichotomous approach stem from the rudimen- 
tary point that democracy is a continuous 
rather than a binary concept. And the 
dichotomous approach neglects the fact that 
the nature of political democracy (especially 
inclusiveness) has changed considerably over 
the past decades. 

In view of this, confounding stability with 
democracy ‘and treating the latter in binary 
rather than continuous terms are regressive 
steps that simply reintroduce the mistakes 
associated with research traditions of over 


“emergency” in India. We are not, however, 
proposing an alternative taxonomy. In the present 
context, instability refers to a drop in political 
democracy and it is a matter of degree; some 
declines. are much more severe than others. 
Muller's procedure disregards these variations and 
the heterogeneity they represent in favor of an 
arbitrary dichotomy that treats cases like Malaysia 
and Lebanon as equally unstable, and India and 
Switzerland a as equally stable. 
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two decades ago. This is surely a cycle that is. 


best avoided. 
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We investigate change in occupational earnings during the 1970s by comparing 
similar models at either end of the decade. Since the 1970s were years of relative 
economic stagnation, we anticipated that they would not be favorable to women. 
To suggest hypotheses we related ideas regarding workers’ status characteristics, 
human capital, market demand, and job content, the last compatible with the 
comparable worth tradition. Expectedly, economic stagnation affected women 
more unfavorably than men. Although earnings attainment generally appeared less 
dependent on workers’ status characteristics in 1980 than in 1970, incomes 
increased less for occupations where concentrations of women were high. 
Earnings returns to human capital increased during the interval, largely owing to 
Stronger returns among males. In addition, female earnings were more sensitive 
than male earnings to workers’ status characteristics. We note directions for future 


research using these indicators in contextual models. 


A relatively neglected stream of literature is 
that concerning the structure and functioning 
of occupational labor markets. Those with 
interests in occupations often study particular 
professions and white-collar occupations, 
thus producing detailed knowledge of a few 
cases at the expense of more general 
occupational dynamics. In this analysis we 
help to correct this deficit by estimating a 
broadly conceptualized model of occupational 
earnings attainment and comparing the model 
at two time points, 1970 and 1980. We draw 
on arguments regarding job content from 
traditions of comparable worth, as well as 
from literature arguing for structural determi- 
nants of earnings differences, and study the 


* Portions of this paper were reported at the 
1986 Annual Meetings of the American Sociolog- 
ical Association, New York, at the 1987 Social 
Stratification Research Committee’s Meetings, 
Berkeley, California, and at two Authors’ Work- 
shops sponsored by the National Academy of 
Sciences. We thank Chin-fen Chang and Moham- 
med Siahpush for help in data production, and 
Richard I. Haller for essential contributions to data 
configuration. The 1977 DOT data matched to 
1980 Census titles were provided by Paula 
England, and the unionization data by Randall 
Filer. Helpful comments on earlier drafts were 
provided by Linda Waite, Barbara Reskin, James 
Smith, Heidi Hartmann, James Baron, Paula 
England, Patricia Roos, and Joan Huber. Any 
remaining errors are our own. Researchers may 


request the data derived from U.S. Census sources .. 


from the first author. 
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extent to which processes of earnings determi- 
nation changed over the decade. We then use 
these models to evaluate ideas regarding 
gender inequality during the 1970s. 

We first consider the economic context 
within which these processes occurred. The 
1970s was a decade both of transformation 
and of relative stagnation for the U.S. 
economy. Cyert and Mowery (1987) argue 
that while the labor supply grew, labor 
demand increases were more modest, thus 
producing higher rates of unemployment than 
during the 1950s and 1960s. Employment 
opportunities grew substantially in the latter 
part of the decade, with both high rates of 
growth in professional and technical jobs and 
high volume of growth in lesser-skilled 
positions. While it is well known that rates of 
female labor force participation increased 
during the decade, especially for mothers of 
young children, debate continued regarding 
whether occupational sex segregation was 
decreasing (Beller 1984) or whether aggregate 
analyses were masking higher levels of 
segregation within workplaces (Bielby and 
Baron 1984). Bayes (1988) suggests that 
while women increased their relative represen- 
tation in professional and managerial occupa- 
tions, these changes did not result in major 
improvements in female/male earnings ratios, 
thus prompting concern regarding whether 
female job content was rewarded equivalently 
to male job content, that‘is, concern with 
comparable worth. Based on these facts, we 
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. anticipate that the decade was unfavorable for , 


reducing gender inequality. In this analysis 
we study how occupational variation in job 
content, social organization, and supply/ 
demand affect earnings across occupations. 
Occupational labor market theory suggests 
several hypotheses. 


CONCEPTUAL BACKGROUND 
AND HYPOTHESES 


Stolzenberg (1975) argues that labor markets 
become fragmented along occupational lines 
as a function of differing skills, training 
investments, and processes of social organiza- 
tion. Skill differentiation leads to differences 
in labor supply by occupation; he infers that 
there are differing levels of labor demand by 
occupation because returns to skills are 
reduced if occupational requirements are 
inconsistent with worker skills. Stolzenberg 
also distinguishes forces. of supply and 
demand from dimensions of social organiza- 
tion, such as unionization and race and sex 
composition.: Social organization includes 
nonproductivity characteristics of occupations 
or of their incumbents that may influence 
wages, although they have nothing to do with 
supply and demand directly. Clearly, status 
characteristics do not necessarily reflect 
productivity. 

Many studies of comparable worth docu- 
ment the effect that one dimension of social 
organization, percent female, has on earn- 
ings, independent of job content, and often 
attribute this impact to some form of 
discrimination (e.g., England, Chassie, and 
. McCormick 1982; England and Norris 1985). 
England et al. (1982) control for a variety of 
skill factors in the analysis of occupational 
earnings and find that female concentration 
strongly predicts differences. In a similar 
analysis, England and Norris (1985) interpret 
such a residual as “comparable worth discrim- 
ination," which explains almost 30 percent of 
annual earnings differences between fully 
employed men and women. Treiman, Hart- 
mann, and Roos (1984) attribute up to 66 
percent of occupational pay inequality in 
1970 to various forms of discrimination in 
their analysis of the effects of DOT-based 
factors and percent female. In this analysis, 
we interpret racial composition, marital sta- 
tuses of occupational incumbents, unioniza- 
tion, and female concentration as organiza- 
tional dimensions of occupational markets 
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that influence earnings independent of supply/ 
demand and the productivity of occupational 
incumbents. 

Studies of comparable worth have also 
suggested dimensions of job content along 
which occupations vary. These include sub- 
stantive complexity and manual dexterity that 
underlie substantively distinct bundles of 
talent, training, and job activities. By using | 
these dimensions it is possible to compare 
jobs as diverse as nurse and truck driver. 
Mariy studies use DOT data, as we do here, to 
measure job content since this is the best 
available source to capture relevant skill 
dimensions. While the data do not capture all 
relevant skill dimensions, they do provide 
interpretable and reliable measures to control 
for job content across occupations. i 

In this conceptualization of occupational 
earnings, three major categories affect earnings: 
(1) supply/demand, including measures of 
workers’ productivities or the quality of labor 
supply, (2) dimensions of social organization, 
and (3) job content. “Supply” and “demand” 
do not refer to actual amounts of labor, but to 
factors that might be expected to affect levels 
of labor demand and supply. With these 
limitations in mind, following Waldaur (1984), 
we first argue that the wider the range of labor 
market opportunities of an occupation, the 
higher its earnings level. Occupations with a 
narrow range of potential job settings are 
likely to have reduced bargaining power and 
lower earnings. Second, the higher the 
unemployment rate in an occupation, the 
lower its wages. Third, the higher the 
"reserve labor pool" for an occupation, the 
lower the occupational earnings. A reserve 
labor pool exists when eligible workers with 
Skills and recent occupational experience are 
not in the labor force. When workers in an 
occupation can be easily replaced, and when a 
large reserve pool exists, current occupational 
incumbents have reduced bargaining power 
with employers. Fourth, substantial govern- 
ment employment should increase occupa- 
tional earnings since these jobs generally pay 
better than private sector positions, particu- 
larly at entry level, although this relationship 
may not hold at higher levels of authority and 
responsibility. Our fifth and sixth hypotheses 
concern the impact of productivity factors on 
earnings. Education and tbe job experience of 
workers indicate aggregate human capital . 
investment, that is, supply factors that should 
positively affect earnings. 


-© with their husbands. 
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We extend analysis of social organization 
effects in three directions. First, we expect 
high concentrations of black and Hispanic 
incumbents to be associated with lower 
occupational earnings, while high Asian 
concentration should be positively associated 
with earnings. Educational differentials among 
these subgroups account for these differing 
. predictions and may partially explain such 
race effects. 

Second, we expect that the higher: the 
proportion of married men in an occupation, 
the higher the earnings, while the higher the 
proportion of married women in an occupa- 
tion, the lower the earnings. These opposing 
hypotheses are derived from the traditional 
model of sex role differentiation within the 
family. Employers may embrace the stereo- 
type that married men are more responsible 
than the unmarried because the former may be 
‘supporting a wife and children. Marriage may 
interrupt the labor force participation of 
women if they rear children and/or relocate 
Under this model, 
women may have reduced bargaining power 
if employers view them as less likely to 
accumulate firm-specific experience than 
males. Moreover, married women may be 
willing to work more cheaply in exchange for 
regular hours of work, a job located conve- 
niently to home, and other activities that 
interfere less with family responsibilities. 
This analysis does not consider these possibil- 
' ities. We also recognize that these stereotypes 
do not operate with consistency, and that they 
were likely changing during the decade under 
study. 

Third, following Stolzenberg (1975), Freed- 
man and Medoff (1984), and others, we 
hypothesize that unionization should posi- 
tively influence wages. We also control for 
urbanization because employees in urban 
areas will earn more than those in nonurban 
areas. Finally, given that returns to occupa- 
tional earnings should be affected by the 
content of work activities, we control for 
several dimensions of job content activities as 
specified below. 

These hypotheses have been tested for 
1980 by Parcel (1989) using U.S. Census 
data; she found that most variables were 
correlated with occupational earnings as 
predicted, that measures were interrelated as 
predicted, and that an important subset 
retained statistical significance in multivariate 
analyses. She also found that predictors of 
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occupational earnings differed for the sexes. 
with social organization characteristics being 
particularly important for women. This anal- 
ysis represented a significant improvement 
over studies such as that by Waldaur (1984). 
He argued that wages vary across occupations 
because of different supply-demand ratios. In 
a study of Ph.D.s in a variety of academic 
disciplines, he proposed that the greater the 
nonacademic opportunities, the greater the 
labor demand. However, he did not use 
multivariate analysis and restricted his study 
to a narow range of occupations. The 
usefulness of Parcel's findings suggests the 
model may also be a valid one for studying 
change. 


METHOD 


We assemble two data sets, one for each point 
in time, conduct identical analyses, and 
compare the findings. Occupation is the unit 
of analysis. There are 503 units available for 
analysis in 1980 and 424 in 1970. This 
difference occurs because the scheme for 
detailed occupational classification changed 
substantially between the 1970 and the 1980 
Censuses, with greater differentiation in 1980 
(Vines and Priebe 1988). . i 

Table 1 summarizes measures in the 
analysis and indicates how they correspond to 
the above `theoretical ideas. While most 
measures are straightforward, a few deserve 
note. First, following Treiman et al. (1984) 
and Parcel (1989), we annualized earnings as 
the dependent variable. This measure wás 
calculated by taking a weighted average of 
mean female and male earnings, where each 
mean was adjusted to incorporate occupation- 
specific average hours worked per year. To 
create each sex-specific component, mean 
annual earnings was multiplied by 2080 and 
divided by the product of mean weeks worked 
per year and mean hours worked during the 
Census survey week. The constant 2080 
reflects the assumption that full-time workers 
work 40 hours per week and 52 weeks per 
year. Annualized earnings does not confound 
time worked by gender since it estimates the 
level of earnings as if all incumbents worked 
full time. We know, of course, that for many 
occupations women work fewer weeks per 
year and/or fewer hours per week than men. 
This variable is measured in the same way at 
both time periods, as are the vast majority of 
measures of occupational labor market condi- 
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Table 1. Constructs and Measures Used in Analysis 


Occupational Earnings: A weighted average of mean male earnings and female earnings, where each sex-specific 
component was multiplied by 2080 and divided by the product of mean hours worked per week and mean weeks 
worked per year for each sex.* f 


Job Content 


Substantive Complexity: For 1970, a factor-based scale composed fo 16 Z scores including GED, SVP, 
intelligence, complexity of working with data and people, numerical and verbal aptitudes, direction, control and 
planning; form and clerical perceptions, talking, preferences for abstract and creative vs. routine, concrete activities, 
ability to deal with measurable or verifiable criteria, variety and change, communication of data vs. activities with 
things, and repetition or continous processes: a = .91. For 1980, measurable or verifiable criteria is omitted, and 
influencing people, reaching, handling, fingering, feeling, and aptitude for sensory judgmental criteria are included: 

.a = .95. 

Physical Dexterity/Perceptual Abilities: For 1970, a factor-based scale composed of 10 Z scores including finger 
dexterity; motor coordination; manual dexterity; complexity of dealing with things; form and spatial perceptions; 
seeing, reaching, handling, fingering, and feeling; setting limits, tolerances, and standards; preference for activities 
involving processes/machines vs. social welfare: « = .91. For 1980, preference for activities involving machines are 
omitted, and color discrimination, preference for tangible activities vs. those that bring prestige, and aptitude for work 
demanding measurable or verifiable criteria are included: a = .91. 

Physical Activities/Working Conditions: For 1970, a factor-based scale including 5 Z scores representing outside 
working conditions; stooping, kneeling, crouching, crawling; eye-hand-foot coordination; climbing and balancing; 
lifting, carrying, pulling, pushing: a = .82. For 1980, all of these items plus hazards; noise and vibration; clerical 
perception and fumes, odors, and dust are included: a = .89. 

Unpleasantness of Work: For 1970, a factor-based scale including 3 Z scores representing hazards; fumes, odors, 
dust, and poor ventilation; and extreme heat: a = .57. For 1980, a factor-based scale including extreme heat, extreme 
cold, and wet or humid working conditions: a = .54. 


. Quality of Labor Supply 

Years of Schooling: Mean years of schooling for experienced civilian labor force (ECLF). 5 

Labor Force Experience: A weighted average of estimated male and female mean years of experience. For males, 
experience was estimated as Age—(mean years of schooling + 6) (see Mincer 1974). For females, this figure was * 
deflated depending on combinations of the proportions of white and nonwhite occupational incumbents and on the 
proportions of current married and currently nonmarried occupational incumbents (see Beck, Horan, and Tolbert 
1980).°° * 


Quantity of Labor Supplied/Demanded 

% Self-Employed: ®ECLF self-employed. 

Reserve Labor Pool: Ratio of number not in the labor force who last worked 1975-79 or 1960-69 to the number in 
the occupation’s ECLF. 

Unemployment rate: Ratio of number experienced unemployed to ECLF. 

% Government: Ratio of number employed by state, federal, and local governments ro ECLF. 


Market Social Organization 
% Female: Ratio of number females to ECLF. 
% Black: Ratio of number black to ECLF. 
% Hispanic: Ratio of number Hispanic to ECLF. 
% Asian: Ratio of number Asian to ECLF. 
% Males Married: Ratio of number male employed persons married, spouse present to number male employed 
persons. 
% Females Married: Ratio of number of female employed persons, spouse present to number female employed 
persons. 
% Urban: Ratio of number urban employed persons to number of employed persons. 
% Unionized: Three-year moving average centered on 1980 (taken from Kokkelenberg and Sockel! 1985).4 





* Median annual earnings used in 1970, 

> Median years of schooling used in 1970. 
* Median age used in 1970. 

€ Available for 1980 only. 


tions. Earnings from 1969 are inflated to 1979 years. Measures of job content were con- 
levels using Cost of Living Indices so as.to  structed by matching Fourth Edition DOT 
allow the comparisons of unstandardized data to 1970 and 1980 detailed U. S. Census 
regression coefficients (Bureau of the Census occupational titles and by using factor analy- 
1985b). ses of those data for data reduction and to 

In a few cases measures varied in the two — construct reliable measures of job content. To 
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measure job content for 1970, we used four 
dimensions suggested by Cain and Treiman 


(1981) in their factor analysis of DOT-based - 


occupations: substantive complexity, physical 
dexterity/perceptual abilities (their name was 
Motor Skills), physical activities/working 
conditions (their name was Physical De- 
mands), and undesirable working conditions. 
We used their measures in order to maximize 
comparability with -their findings. Parcel 
(1989) describes in greater detail the principal 
components solution for the 1980 occupa- 
tional codes. Table 1 also reports items 
included in each scale and scale reliabilities. 
While job content is measured somewhat 
differently in the two time periods, the 
measures for substantive complexity and 
working conditions are nearly identical. The 
measure of physical activities/working condi- 
tions includes a greater number.of indicators 
in 1980 than in 1970, and the measures of 
unpleasant working conditions share only one 
out of three indicators, although the interpre- 
tation remains the same. Our intent, of 
course, is to provide the best possible control 
for job content in each time period, taking 
into account that basic measurement of 
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occupation had changed during the interval 
Based on Parcel (1989), we expect complex- 
ity to be positively and work unpleasantness 
to be negatively associated with earnings. The 
behavior of the remaining measures may varv 
across specifications. 

Data on annualized earnings, quantity oz 
labor supplied/demanded, quality of labor 
supply, and occupational market organization 
were derived from the 1970 U.S. Census of 
Population Subject Report, Occupational Char- 
acteristics (PC70-2-7A) and from comparable 
unpublished data irom the 1980 Census 
provided on tape from the U.S. Bureau of the 
Census. We included a measure of occupa- 
tional unionization in 1980 that was unavail- 
able at a sufficient level of detail in 1970 tc 
provide meaningful variation. 


FINDINGS 


Data in Table 2 reveal that earnings across the 
occupations rose by close to 10 percent during 
the decade, while aggregate levels of school- 
ing, experience and the reserve labor pool 
declined. Government employment increased 
while occupational urbanization declined; 


Table 2. Means and Standard Deviations of Variables Used in Analysis; Zero-Order Correlations with Annualized 


Earnings" 
1970 1980 
X S D X s Srr 
Occupational earnings 14522. 6076. 15935. 5369. 
Substantive complexity .00 10.00 .699*** — .00 12.82. 612*** 
Physical dexterity/perceptual abilities .00 6.89 .030 — .00 8.50 074A 
Physical activities/working conditions .00 3.27. —.254*** .00 6.58  —.129** 
Unpleasantness of work .00 2.19 —.166*** .00 2.17 --.086** 
Years of schooling 12.89 2.50 .7518** 12.70 2.09 626*** 
Experience 20.20 7.11 .007 17.10 4.48 22]*** 
Reserve labor pool 29.00 28.54 —.57]3*** 20.40 14.06 —.537*** 
% Self-employed 6.08 14.25 .111* 6.44 12.50 129** 
% Government 18.50 25.94 .299**» 20.18 25.87 201*** 
96 Urban 78.01 10.25 .320**9 75.42 11.61 .175*** 
% Males married 72.29 15.43 .S]9**» 65.35 13.14 .582*** 
% Females married 53.19 8.97 —.119* 52.83 6.25 —.042 
% Female 28.4 30.37 —.467*** 32.30 28.51 -—.500*** 
% Black 8.52 8.98 —.494*** 9.21 6.69 —.480*** 
% Spanish speaking 3.65 2.62 —.479*** 5.51 3.63  — 497*** 
% Asian .81 77 .188** 1.78 1.55 .096* 
% Union 26.83 19.74 .079A 
* p3.05. 
** p.01. 
*** ps 001. 
A ps.06. 


* Maximum Ns are 503 occupations for 1980 and 424 occupations for 1970. Significance tests are based on lowest 


minimum pairwise deletion Ns. 


> Inspection of covariances in the two samples suggests that their magnitudes are similar across the decade or show 
` changes consistent with changes in the correlations shown here. 


TEMPORAL CHANGE IN EARNINGS ATTAINMENT 


self-employment levels remained constant. 
On average, occupations increased in percent 
female, and gained in representation of racial 
minorities, particularly Hispanic and Asian 
concentration. Proportions of married females 
remained constant, while the average percent 
of married males decreased substantially. The 
decline in the reserve labor pool likely reflects 
measurement changes in the two time periods 
(see Table 1). The decline in experience may 
be affected by (1) changes in the aggregation 
of jobs into occupations by the U.S. Census 
between 1970 and 1980; (2) changes in the 
cohort composition of the work force to 
include higher proportions of younger work- 
ers with less experience and fewer older, 
more experienced workers. Since educational 
level of the work force rose between 1970 and 
1980, the slight decline in schooling in our 
samples may reflect changes in occupational 
aggregation and/or use of means in 1980 
versus medians in 1970. 

The zero-order correlations of each mea- 
sure with earnings suggest that they are 
consistent with our expectations. Education, 
complexity, and percent male married all 
correlate highly with earnings, although the 
Schooling correlation declines from .728 to 
.626. Earnings is strongly negatively corre- 
lated with reserve labor pool, percent female, 
and percents black and Hispanic in both time 
periods. Earnings also correlates: positively 
with percents in government and. self- 
employed. Other relationships changed over 
the decade. The relationship between-earnings 
and experience increased over the decade 
while the relationships with urbanization, 
percent married female, and percent’ Asian 
declined. The negative correlations with both 
physical demands/working conditions and un- 
pleasantness of work seemed to decline; al- 
though measurement differences for these two 
constructs call for a cautious interpretation. | 

Tales 3 and 4 summarize-regression results 
for 1970 and 1980. Both tables’ include 
variables to predict annualized-earnings/as a 
function of (1) job content, (2) occüpational 
labor market conditions, (3) and (4) job 
content and market variables. Model (5) for 
1980 is a specification that includes unionizà- 
tion. When a coefficient is significant in: both 
time periods, an: additional test for: this 
difference is performed and reported. Mini- 
mum pairwise deletion Ns are used through- 
out the analysis. 

Findings from the ` "Equations 1 suggest 


D 
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that, although substantive complexity domi- 
nates each equation, returns to complexity 
decline over the decade. While physical 
dexterity/perceptual abilities positively affects 
occupational earnings in 1970, physical 
dexterity and especially physical activities in 
1980 also increase earnings. In neither period 
does undesirable working conditions affect 
earnings. 

Considerable changes occurred in the 
magnitudes of occupational labor market 
effects during the decade. Returns to both 
human capital variables, education and expe- 
rience, increased during the interval. While 
the size of occupational labor pool reserve 
became less of a handicap over the decade, 
percentages of self- and government- 
employed became more of a handicap. 
Moreover, the advantage that urbanized labor 
markets gave to occupations in 1970 disap- 
peared by 1980. 

Changes also occurred in the bases for 
earnings inequality traceable to ascribed and 
marital statuses. The effects of percent 
married for both sexes declined over the 
decade, that for women to the point of 
nonsignificance; while that for males married 
remained positive and significant, the effect 
was weaker in 1980 than in 1970. The 
considerable advantage to working in an 
occupation with a high percent of Asians 
disappeared,over the decade, while in neither 
interval did concentrations of black and 
Hispahic minorities affect earnings net of 
other predictors. On the other hand, the 
negative effect of percent female was stronger 
in 1980:than in 1970, 

. The third' Specification combines measures 
of job content and a limited number of market 
and supply characteristics. Comparable to an 
equation: reported by Treiman et al. (1984), it 
provides an, “upper bound” estimate to the 
effects of percent: female. on occupational 
earnings,. controlling for human capital and 
job content variables. Because multicollinear- 
ity. between substantive complexity and years 
of schooling prevents . their simultaneous 
inclusion, complexity : is dropped from this 
and all remaining . Specifications. Returns to 
physical. dexterity in . 1970. exceed returns in 
1980, as do returns to physical activities in 
1970: compared to 1980. Returns to experi- 
énce increased during the decade while levels 
of returns to percent female remained similar. 
For every, 1- percent increase in female 
concentration, occupational earnings declined 
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roughly 73 dollars in both 1970 and 1980, 
while each additional year of schooling 
increased occupational earnings by close to 
two thousand dollars. We compare these 
changes with those described for specification 
2. In 1970, inclusion of additional occupa- 
tional labor market variables resulted in 
modest reductions in returns for both school- 
ing and percent female. In 1980, however, 
these same controls do not reduce payoffs to 
schooling or to percent female. Moreover, 
returns to experience were stronger in 1980 
than in 1970 in this specification as well as in 
model 2. 

The fourth model provides a more compre- 
hensive combination of job content, human 
capital, and labor market condition variables 
as they affect occupational earnings. The only 
Statistically significant job content variable is 
physical activities, and in contrast to the third 
model, physical activities (1980) evidenced 
stronger returns than physical dexterity. In the 
presence of these job content controls, most 
of the labor market variables behave as they 
did in the second specification, with the 
exception of percent female and percent 
females married. For the latter, the variable is 
not significant at either time point, while the 
earlier models suggested that for 1970 the 
greater the percent of married females, the 
lower the earnings. Regarding female concen- 
tration, similar negative effects on earnings 
were apparent at the two time points, while, 
where job content was uncontrolled, an 
increase in the negative effect was apparent at 
the end of the decade. 

The fifth model is for 1980 only and differs 
from (4) only in that it includes percent union- 
ized. Introduction of unionization noticeably 
affects just three variables: (1) it strengthens 
the effect of schooling, possibly acting as a 
suppressor effect; (2) it reduces returns to phys- 
ical activities, possibly because unionization 
is most frequent in occupations high in phys- 
ical activities; and (3) it reduces to nonsignif- 
icance the effect of males married. The zero- 
order correlation matrix (not shown) shows 
that these measures of market social organiza- 
tion are correlated (r = .47) so that those 
occupations high in unionization are also high 
in proportion of males married. While we might 
be tempted to think that employers prefer to 
promote and/or increase wages in occupations 
that contain a high percentage of married males, 
this association might actually reflect collec- 
tive bargaining arrangements. 
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In Table 5, Equation 4 is reestimated far 
both male and female annualized earnings far 
both time periods. All measures remain the 
same, except that gender-specific measures af 
schooling and experience are used. This 
model is useful for two reasons. First, it 
suggests how occupational labor market 
conditions might differentially impact male 
and female occupational earnings. Second, it 
helps to trace the changes in occupational 
earnings attainment across the decade that 
may have differently affected occupational 
earnings attainment by sex. 

Addressing the static gender difference 
issue first, the directions of the significant 
effects are consistent with findings discussed 
above. For male and female occupational 
earnings in 1970, returns vary by gender in 
magnitude as well as in statistical signifi- 
cance. While male earnings were sensitive to 
physical dexterity, female earnings were 
sensitive to physical activities of work. 
Female earnings were negatively affected by 
percent of self-employment and size of the 
reserve pool, and positively affected by the 
percent of Asian incumbents in the occupa- 
tion. Male earnings were more negatively 
affected by percent female than were female 
earnings. Male earnings showed stronger 
positive returns to schooling, experience, and 
urban location while female earnings derived 
greater benefit from high proportions of 
married male incumbents in the occupation. 

In 1980 gender differences are more preva- 
lent in magnitude than by their statistical sig- 
nificance. Female earnings suffer from size of 
reserve labor pool and the percentage Hispanic, 
and benefit from a high proportion of Asian 
incumbents and occupational urbanization, while 
these factors do not predict male earnings. The 
latter do evidence greater returns to physical 
demands, and much greater returns to human 
capital measures of schooling and experience. 
While female earnings are more hindered than 
male earnings by percent self-employed, male 
earnings are more hindered by high concentra- 
tion of government employees and by percen: 
female. Male earnings is show stronger returns 
to concentration of married men than do female 
earnings. 

These findings lay the foundation for study 
of these gender differences over time. For 
example, there is a 50 percent increase in the 
gender gap favoring males in returns to 
schooling from 1970 to 1980. Male earnings 
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are also more strongly influenced by experi- ` 


ence relative to female earnings in 1980 than 
1970. In 1980, female earnings were disad- 
vantaged by female concentration, although 
not to the extent that male earnings were; 
1980 female earnings were also disadvan- 
taged by Hispanic concentration, a finding 
not present in 1970. Earnings for both sexes 
increase less owing to Asian concentration in 
1980 than in 1970, although the effect 
remains significant for women in the latter 
period. Working in an occupation with high 
government concentration appeared as a 
disadvantage for earnings of both sexes in 
1980, while it had shown no effect in 1970. 
Male earnings appear more handicapped by 
self-employment concentration in 1980 than 
in 1970, although female earnings show an 
absolutely greater negative effect in 1980. 
Women retained a payoff to occupational 
urbanization that men lost over the decade, 
but men have gained an advantage relative to 
women in returns to concentration of married 
males and have lost the handicap implied by 
the reserve labor pool, a handicap women 
have retained. 


CONCLUSIONS 


We acknowledge several limitations to the 
analysis. First, differences in the detailed 
occupational categories used in 1970 and 
1980 Censuses may have influenced our 
findings. Treiman, Bielby, and Cheng (1988) 
have developed appropriate transfer devices 
for industry codes between the 1970 and 1980 
Censuses and are working on comparable 
devices for detailed occupations. Their tech- 
nique uses a weighted set of imputations of 
1980 industry codes from 1970 codes and 
additional respondent data on a sample of 
127,125 labor force participants in 1970. 
When comparing the results of these imputa- 
tions with actual 1980 codes assigned to the 
respondents, they find that the imputations 
are preferable. Still, at this writing, compara- 
ble technology for adequate equating of 
occupations across the two Census years is 
still being developed. The somewhat finer 
level of differentiation in 1980 may have 
increased reported levels of some occupa- 
tional characteristics: a parallel example may 
be that greater occupational differentiation 
has been associated with higher levels of sex 
segregation (Bielby and Baron 1984). How- 
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ever, no evidence is available to infer 
systematic changes in multivariate analyses. 

Second, DOT data are not ideally suited to 
cross-occupational comparisons. Ratings an 
DOT variables are created to maximize 
within-establishment rather than cross- 
establishment variation. Thus inferences across 
occupations are only approximations. Third, 
the data lack good measures of authority 
relationships within the workplace, and we 
were able to measure unionization at one 
point in time. We suspect that our findings 
probably applied to 1970 because there is no 
reason to believe that the relationships among 
unionization, marital statuses of male incum- 
bents, and earnings would have changed over 
the decade. This inference should be tested 
empirically. 

Despite these limitations, the data we have 
assembled are the best available to date and 
allow inferences that are useful and impor- 
tant. First, we have demonstrated the value of 
a conceptualization for understanding occupa- 
tional earnings attainment that encompasses 
three components: job content, market supply/ 
demand, and social organization. Indicators 
of these constructs are associated with 
occupational earnings at two time points in 
predictable ways. Occupational labor market 
conditions must be viewed more broadly than 
they have in many prior conceptualizations. 
Although female concentration is a major 
determinant of occupational earnings, its 
effects are overestimated when other mea- 
sures of market conditions are not simulta- 
neously controlled. In addition, the determi- 
nants of occupational earnings vary depending 
on whether we are concerned with female, 
male, or aggregate occupational earnings. 
There may be additional measures of occupa- 
tional market conditions not considered here 
that will be useful for additional analyses, and 
we hope the success of this investigation 
motivates researchers to construct and evalu- 
ate them. : 

We have àlso provided evidence regarding 
change in the role of market factors during tbe 
1970s. When we look at annualized earnings, 
some findings suggest an increase in the 
returns to achieved statuses, and a decrease in 
returns to ascribed statuses and social organi- 
zation. There were greater returns to educa- 
tional attainment and accumulated experience 
in 1980 than in 1970. There were reduced. 
returns to Asian concentration. The percent 
female married was no longer significant in 
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1980, and when unionization is controlled, 
the already-reduced effect of percent male 
married was further reduced to nonsignifi- 
cance. j 

However, other findings temper this con- 
clusion. First, the magnitude of the female 
concentration effect remains high and con- 
stant, even in the presence of numerous 
controls. Second, analyses of male and 
female annualized earnings suggest that the 
increase in returns to achieved characteristics 
‘has been greater for male than for female 
. annualized earnings. In 1980, female annual- 


ized earnings is sensitive to a broader range of — 


market conditions than is male annualized 
earnings. Thus, the relatively fewer signifi- 
cant market determinants of annualized earn- 
ings are a function of male annualized 
earnings dominating that dependent variable. 
Therefore, increases in returns to achieved 
characteristics and decline in the importance 
of several dimensions of ascription and social 
organization benefited men more than women. 
. Considering these findings within the 
economic context of the decade suggests two 
conclusions. Women were becoming a higher 
proportion of the labor force and facing an 
economy alternately plagued by recession and 
slow growth. These are not auspicious 
circumstances for the advancement of any 
economic minority. Indeed, our findings 
suggest that the gender gap in returns to 
human capital characteristics increased, sug- 
gesting the operation of mechanisms interfer- 
ing with the translation of human capital into 
earnings; they also suggest that female 
occupational earnings wefe more hindered 
than those for males by labor market 
conditions, again prompting concern regard- 
ing the differential effect of market conditions 
on male and female earnings. Second, the 
‘persistent strength of the female concentration 
effect in the presence of numerous controls 
suggests that despite increased progress in 
female educational attainment and possible 
reduction in occupational sex segregation, the 
classic female concentration effect endured. 
Future studies should evaluate whether this 
effect remained constant by the end of the 
1980s, a decade of greater and more consis- 
tent economic expansion, and thus ostensibly 
one in which we might expect to see a decline 
in the impact of this form of occupational 
social organization. 

Certainly the most prominent direction for 
future research using these data is in contex- 
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tual models of individual-level outcomes. 
Extensions of Stolzenberg’s arguments and 
those presented here suggest hypotheses in 
which dependent variables such as worker’s 
earnings or promotion prospects can be 
understood within the context of occupational 
market conditions. In addition, these data 
could aid in testing hypotheses regarding the 
organizational mechanisms through which 
these aggregate-level measures are operating. 
Such analyses would demand data on both 
organizational and individual characteristics 
for a broad cross section of workers. We need 
to know more regarding how employer 
decisions and organizational structure mediate 
between the occupational structures and the 
individual-level outcomes we ‘seek to under- 
stand. We look forward to research guided by 
these and related ideas as an avenue toward 
further understanding structure and change of . 
socioeconomic inequality. 
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Theory and research on the consequences of multiple roles have focused on 
psychological well-being or physical health, with studies typically employing 
cross-sectional data to examine relationships at single points in time. This paper 
investigates the implications of multiple roles for the longevity of women. Drawing 
on a two-wave 30-year panel study of 427 wives and mothers in upstate New York, 
we use event history techniques to test the effects of both the number and the nature 
of roles on the duration of women's lives. We find that social integration, defined 
by the number of roles occupied, promotes longevity, but that one form of 
integration —membership in voluntary organizations—is especially salutary. 


Durkheim ([1897] 1951) observed that women 
were less involved than men in collective 
'existence and, consequently, less likely to 
reap either the benefits or the costs of that 
involvement. But as women increasingly take 
on roles beyond home and family, the 
question is raised about the significance of 
these changes for their psychological and 
physical well-being. Studies have found that 
integration into society, in terms of multiple- 
role occupancy, reduces the likelihood of 
psychological distress among both men and 
women (Gore and Mangione 1983; Kandel, 
Davies, and Raveis 1985; Thoits 1983, 1986), 





* Direct all correspondence to Phyllis Moen, 
Director, Sociology Program, National Science 
Foundation, Washington, DC 20550. 

The research reported in this paper was 
supported by Research Grant No. R01-AG05450 
from the National Institute on Aging, U.S. 
Department of Health and Human Services. This 
study would have been impossible without the 
ingenuity and perseverance of Melody Miller and 
Maxine Schoggen in tracking down respondents. A 
Hatch grant through the College of Human 
Ecology at Cornell University using funds from the 
U.S. Department of Agriculture, State Agricultural 
Experiment Station, provided financial support for 
initially locating respondents. We appreciate the 
advice and comments of Michael Hannan, Eugene 
Litwak, John O. McClain, Stanley Presser, 
Richard P. Shore, Elaine Wethington, and the 
anonymous ASR reviewers. 


American Sociological Review, 1989, Vol. 54 (August:635-647) 


and similar findings also link multiple roles to 
physical health (Nathanson 1980, 1984; 
Verbrugge 1983, 1985). 

A logical extension of this model would 
incorporate duration of life, since a long life 
is often depicted as a primary or even the 
ultimate measure of health. If social integra- 
tion promotes mental and physical health, it 
seems plausible that it should also foster 
longevity. To test this hypothesis, this paper 
draws on data from a two-wave 30-year panel 
study of women to examine the links between 
their multiple roles and the duration of their 
lives. 


CONCEPTUAL ISSUES 


Social integration is a concept historically 
applied to collectives (Williams 1970), but 
which also can describe the multiple roles of 
individuals. Durkheim never defined the 
term, leading to considerable conceptual 
confusion and ambiguity (House and Kahn 
1985). However, a distinction can be made 
between social-psychological and structural 
integration, with structural integration denot- 
ing the concrete involvement of individuals 
with various aspects of a collectivity. Psycho- 
logical integration, on the other hand, is the 
subjective experience of that connectedness. 
Both concepts are related; both may be 
important in predicting health outcomes. But 
in this paper we examine only the structural 
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or outward manifestations of integration, that 
is, the number and types of roles individuals 
possess. This conforms to House and Kahn's 
(1985, p. 85) use of the term social 
integration or, conversely, isolation, to repre- 
sent "the existence or quantity of relation- 
ships." 

Multiple roles is a concept isomorphic with 
Merton's (1968, p. 434) status set, referring 
to the "complex of distinct positions assigned 
to individuals both within and among social 
systems." The behavior enacted in fulfilling a 
position with defined rights and responsibili- 
ties constitutes a role. Thus, one may occupy 
the position of worker, wife, churchgoer, 
friend, club member, neighbor—each with its 
own attendant behavior (role) expectations. 
Some roles, such as that of worker, are highly 
structured while others, such as friend or 
neighbor, are diffuse, with more vaguely 
defined obligations and privileges rather than 
highly specified rights and duties (Williams 
1970). Presumably, the greater number of 
roles, the greater the level of social connect- 
edness or integration. Thoits (1983, p. 178) 
employs the term role identities rather than 
multiple roles for this same notion, noting 
that “an actor who possesses many identities 
will be termed integrated," while one who 
possesses few or no such role identities is 
isolated. But identities connote a more 
subjective evaluative component rather than 
the structural aspect of role occupancy, which 
is our focus. For example, a woman may be 
employed but not identify herself by that 
social position within society. Still, following 
Thoits (1983, 1986) and Stryker (1980) we 
see multiple roles as providing to the 
occupant wider access to various components 
of society than is the case for someone with 
few such ties, thereby promoting greater 
social connectedness or integration. 


MULTIPLE ROLES AND HEALTH 


The notion of multiple roles is especially 
pertinent to women's lives, as historically 
women have had less opportunity than men to 
engage in roles beyond the domestic sphere. 
In fact, some scholars point to the deleterious 
effects of the isolation of the full-time 
homemaker (Bernard 1972; Gove and Geerken 
1977; Oakley 1974). Accordingly, the increas- 
ing involvement of women in the labor force 
has spawned a great deal of scholarly interest 
in the benefits and costs to women of the 
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employment role (e.g., Gove and Geerken 
1977; Kessler and McCrae 1982; Verbrugge 
1983). This interest has broadened into a 
wider conceptual focus on women's involve- 
ment in multiple roles rather than employ- 
ment per se (Barnett and Baruch 1985). It 
cannot be assumed that the effects of multiple 
roles or any particular role are similar for both 
sexes. Certainly the marriage role is distinz- 
tive for men and women (Bernard 1972), and 
combining parenting with employment can be 
quite a different experience for women than 
for men (Verbrugge 1983). The primacy of 
marriage and motherhood in women's lives 
makes the impact of additional roles espe- 
cially important. Do women filling these 
domestic roles benefit from additional role 
occupancy? Or do the duties of family life in 
tandem with other role obligations produce 
deleterious strains and conflicts in the lives of 
women? 

Three types of theoretical explanations 
have coupled multiple roles with both physi- 
cal and psychological well-being. The theoret- 
ical orientation most supported by empirical 
evidence (cited above) is the role enhance- 
ment perspective. This view points to the way 
in which multiple roles augment an individu- 
al's power, prestige, resources, and emotional 
gratifications, including social recognition 
and a heightened sense of identity (Marks 
1977; Sieber 1974). Support for this mcdel 
can be found in the panel data analyzed by 
Thoits (1983, 1986), who demonstrated that 
the acquisition or loss of roles over a two-year 
time period affects the psychological distress 
of both women and men. 

An alternative argument emphasizes the 
costs rather than the benefits of sccial 
integration. According to this role strain 
perspective, multiple role involvements entail 
obligations that may produce overload and 
strain which, in turn, lead to poor physical as 
well as mental health (Goode 1960; Coser and 
Rokoff 1971). This viewpoint has guided 
many of the studies examining the impac:s of 
employment on women’s lives, especially 
those women who are also wives and mothers 
(e.g., Pearlin 1975; Radloff 1975). 

Still a third viewpoint, which we describe 
as the role context approach, casts doubt on 
the essential simplicity of both the role strain 
and the role enhancement theoretical perspec- 
tives by considering not only the number of 
role involvements but their nature and cirzum- 
stance as well. Particular contingencies and 
role combinations can either promote or 
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hamper health. For example, both Gove and 
Hughes (1979) and Veroff, Douvan, and 
Kulka (1981) suggest that women’s responsi- 
bility for the welfare of children, husbands, 
and ailing relatives may have deleterious 
consequences for their own well-being. Since 
these nurturant role obligations are of an 
ongoing nature, this care giving to others may 
have a negative effect on self-care and one’s 
own health. Indeed, Nathanson (1980) found 
that the presence of children adversely affects 
women’s subjective appraisal of their own 
health. And Gove and Geerken (1977) found 
that symptoms of psychological distress 
among employed women increase as a 
function of the number of children present in 
the home. Further support for this character- 
ization comes from a prospective study 
analyzing data over an eight-year period (the 
Framingham study), which found that women 
with three or more children who were 
working outside the home were more likely to 
experience coronary heart disease than either 
homemakers with three children or working 
women with no children (Haynes and Feinleib 
1980). The significance of various permuta- 
tions in roles can be complex. For example, 
the Haynes and Feinleib (1980) study found 
that women in clerical jobs married to 
blue-collar husbands were especially vulnera- 
ble to coronary heart disease. 

But certain role contexts can prove advan- 
tageous. For instance, more informal roles 
involving primary groups, such as the roles of 
relative, friend, and neighbor, may offer 
emotional support that is beneficial to health 
(Litwak, Messeri, Wolfe, Gorman, Silver- 
stein, and Guilarte 1989). Participation in 
these roles also may serve as a source of 
information and advice, as well as more 
concrete forms of assistance (House and Kahn 
1985). Such social bonds have been found to 
both promote mental health (Brown and 
Harris 1978) and reduce mortality (Berkman 
and Breslow 1983; House, Robbins, and 
Metzner 1982). These findings suggest that 
those roles providing social support, rather 
than the total number of roles, are important 
facilitators of health. 

A similar approach posits that it is 
particular roles, such as employment, that 
matter, rather than their aggregate number. 
Employment has been shown to be positively 
linked to women’s mental and physical health 
(e.g., Gove and Geerkin 1977; Nathanson 
1980; Verbrugge 1983, 1985) and may weil 
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have greater impact on health than other 
roles. 

Still another possibility is that level of 
participation in a particular role might be 
conducive to health (House and Kahn 1985). 
In other words, women with many friends or 
those who attend church frequently may have 
a high degree of social connectedness, 
regardless of what other roles they may 
occupy. 


MULTIPLE ROLES AND LONGEVITY 


Two important panel studies have examined 
the significance of social relationships for 
mortality. The first, a study of a random 
sample of residents of Alameda County, 
California (Berkman and Breslow 1983), 
found that the risk of mortality over the 
period from 1965 to 1973 was related to (1) 
marriage, (2) contacts with close friends and 
relatives, (3) church membership, and (4) 
membership in other (nonchurch) organiza- 
tions. Among those studied, the most isolated 
were the most prone to death, even after 
controlling for measures of physical health in 
1965. The second panel study followed a 
group of men and women in the Tecumseh, 
Michigan community from 1967-69 to 1978— 
79 (House et al. 1982). It found that 
frequency of church attendance was inversely 
related to the mortality of female respondents, 
even after controlling for a range of risk 
factors. Frequency of attendance in other 
organizations also proved significant for men, 
although not for women. 

Both of these investigations analyzed the 
links between social integration and mortality 
over a relatively short span of time (8-12 
years) and studied a heterogeneous population 
(male and female, married and unmarried, 
parents and nonparents). Yet marriage and 
parenthood are such consequential social 
ties—especially for women—that those im- 
mersed in them may react differently to other, 
nondomestic roles. Women who are wives 
and mothers may be particularly vulnerable to 
role strains and conflicts accompanying mul- 
tiple roles. Finding that the longevity of 
women who are wives and mothers responds 
to multiple roles would provide support 
beyond the evidence presented in the Alameda 
County study of the significance of multiple 
roles for women in conjunction with their 
family role obligations and privileges. 
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THE FUNCTIONS OF MULTIPLE ROLES 
Proponents of a role enhancement approach 
do not always specify the mechanisms ‘by 
which multiple roles should lead to. the 
enhancement of health. But being integrated 
within a web of social networks.can promote 
information, support, choice, and guidance 
about health-related matters, in terms’ of 
individual behaviors (diet, exercise, smoking, 
drinking) and the propensity to’ seek profes- 
sional health care. Thoits. (1986, p. 259) 
claims that “roles and role. requirements 
should give purpose, meaning, and. guidance 
to one’s life" and thereby reduce "despair? as 
well as “disordered conduct." ;. 

We conclude that social integration “can 
operate both directly, by reducing isolation 
and, hence, anomie, (Durkheim [1897] 1951), 
and indirectly, by. (1) elevating self-esteem, 
which in turn fosters health (Pearlin, ‘Lieber- 
man, Menaghan, and Mullan ' 1981); (2) 
affecting preventative health behavior (Ver- 
brugge 1985); (3) enhancing health-related 
resources and choices (Verbrugge , 1985); 
and/or (4) providing social support Nathanson 
1980). 

Women who are exclusively wives ‘and 
mothers are especially vulnerable to social 
isolation. Gove and Geerken (1977, p. 73) 
describe the lives of homemakers as “immer- 
sion in a world of children with its incessant 
demands and lack of time for oneself, lack of 
adult interaction, and lack of opportunity to 
use instrumental skills.” 

The role enhancement approach sees multi- 
ple roles as the obverse of such isolation. This 
view has garnered the greatest empirical 
support and, we believe, makes the greatest 
intuitive sense in considering a long-term 
outcome like longevity. Consequently, we 
hold that number of roles—as a measure of 
social integration—should promote health, 
even for women already engaged in the roles 
of wife and mother. Whether it does so to the 
extent of affecting their ultimate longevity is 
the empirical question that we address. 


RESEARCH DESIGN 


This analysis is based on data from the 
Women's Roles Survey conducted in 1956 
and 1986 in an upstate New York community. 
A random sample of 427 women approxi- 
mately 25 to 50 years of age who were both 
wives and mothers were interviewed in 1956 
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(Dean and Williams 1956). This same grocp 
was located and reinterviewed 30 years later, 
in 1986. We were able to find 76 perceat 
(328) who were still alive in. 1986 and <o 
ascertain that 19 percent. (80) had died during 
the 30-year. period. Only 5 percent (19) of the 
original sample could not be located. This 
"catch-up" design (Kessler and Greenberg 
1981) of.the Women's Roles and Well-Beirg 
Project offers the rare opportunity to examine 
the long-term effects of multiple roles. 
Because we chose to examine duration of life, 
the morality transitions of these women — 
constitute the focus of the analysis. 
' Event history techniques offer the optimal 
method of modeling such transitions (Tuma 
and Hannan 1984). Event histories consist of a 
continuous record on.a single variable, in this 
case duration of life, with the length of that 
duration varying for each individual. Event 
history analysis requires the identification of 
three types of information: (1) transitions (in 
this case.mortality), (2) the time at which these 
take place; and (3) the censored cases, that s, 
those who could not be located and those still 
alive at the time.of the.1986 survey. 
Mortality information was garnered from a 
number of sources. Women still alive in 1986 
were coded as censored at that time. In other 
words, the duration of their lives extends 
beyond the 30-year span of the two surveys. 
For the respondents who had died we 
ascertained the date and, where possible, the 
cause of death. We do not know whether the 
small number of women (19) who could not 
be located are still alive. Consequently, we 
used the last date they were known to be alive 
to code them as censored data as well. For 
example, if we were able to locate a woman 
through 1976 and knew she was alive at that 
time, we include her as living until 1976 and 
as censored thereafter.! 


Variables Used in the Analysis 


Note that the cutcome variable is mot. 
mortality but the duration of life. This is 
computed using the date of death for the 19 
percent (80) who had died by 1986, the date 





! We determined the last date known alive 5y 
tracking women, using information from local 
telephone directories, former neighbors, acquaint- 
ances, and churches. At the point when we could 
find no further evidence, we coded a woman as 
censored. 
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. of the interview for the 76 percent (328) still 
alive in 1986, and the date last known alive 
for the 5 percent ( 19) who could not be found 
in 1986. Each equation we estimate contains 
some subset of the independent - variables 
described in the Appendix. 

The most theoretically important variable is 
ROLES, consisting of a sum of the roles of 
each individual woman at the time of the 
1956 interview. Six roles are incorporated in 
this measure: worker, church member, friend, 
neighbor, relative, and club or organization 
member. We operationalize ROLES as the 
number of such positions actively occupied 
by women. Thus it is the enactment of 
religious affiliation, by attending services on 
a regular basis, rather than mere church 
membership, that matters. However, we use 
membership in clubs or organizations rather 


than attendance since our analysis shows that _ 


such membership, unlike church member- 
ship, is virtually synonymous with atten- 
dance. This ROLES measure is similar to that 
employed by Thoits (1986), except that in this 
sample there is no variance in marriage and 
motherhood since all the respondents are 
married and have children present in the home 
in 1956. 


Analytic Strategy 


The goal is to advance our theoretical 
understanding of the link between multiple 
roles and the duration of life. The dependent 
variable is the rate at which individuals in the 
1956 survey experience death from that time 
until 1986, using maximum likelihood tech- 
niques to estimate their longevity. Such 
procedures are good with censored data, 
producing estimates that are asymptotically 
unbiased and normally distributed (Kalb- 
fleisch and Prentice 1980). We examine the 
effects of number of roles in 1956 as well as 
the independent effects of particular roles, 
such as employment, on the improvement in 
the estimate of longevity. Since longevity is 
obviously time dependent (its likelihood 
decreasing with age), we include AGE as a 


crucial covariate. We also include two ' 


measures of location in the socioeconomic 
structure, educational level, EDUC (six 
levels) and husband’s occupational status, 
HOCC (two levels: professional/nonprofes- 
sional).. Educational achievement and occupa- 
tional prestige have been shown to be highly 
correlated with subjective health appraisals 
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(Nathanson 1980) and mortality (Nathanson 
and Lopez 1987). Moreover, lifestyles incor- 
porating risk factors such as smoking have 
been linked to social class, with working- 
class women slightly more likely to smoke 
than higher status women (Nathanson and 
Lopez 1987). 

The covariates are all fixed, drawn from 
the 1956 survey. We use a monotonic, 
transformation (log) of both AGE and ROLES, 
since we expect neither to have a linear 
relationship with survival rates. We also 
estimate a Weibull model of age dependence, 
presupposing that the longer time alive the 
more likely the experience of mortality. 

Specifically, the model considered is: 


Y = log(AGE AT DEATH). 
= a + bILOGAGES6 
+ b2HOCC + b3EDUC 
+ b4LOGROLES +W, 


where w is a disturbance term with an 
extreme value distribution (Kalbfleisch and 


` Prentice 1980).? 


Using these dynamic event history tech- 
niques tells us more than we could get from 
cross-sectional data examining multiple roles 
and health since the least healthy — those who 
died— would be missing in a cross-sectional 
sample. Moreover, we are able to estimate 
not merely whether women live or die as a 
function of their multiple roles but how these 
roles alter the duration of their lives., 


RESULTS 

The first model tested is the null model that 
longevity is independent of all structural: and 
cultural role considerations save age and 
social class. As expected, the longevity of 
women in the 1956 sample is a function of 
their age (Estimate = —2.44, p = .000; 
Loglikelihood = — 224.12). Socioeconomic 
status, as measured both by the women’s 
education and her husband’s occupational 
level, does not affect longevity.? This finding 


? We use PROC LIFEREG in SAS (1985) to 
obtain maximum likelihood estimates from the 
censored data. 

? We used a dichotomous measure of husband's 
occupation because we felt that middle- 
class/working-class distinctions were more salient 
to longevity than incremental gradiations between 
occupations. However, a nine-level measure, 
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is similar to that reported in the Alameda 
County study (Wingard 1982) and may 
suggest that women’s position in the stratifi- 
cation system is less consequential for 
mortality than is men’s. But it may also 
reflect the relatively homogeneous nature of 
this sample of middle- and: working-class 
women in upstate New York in the mid- 
. 1950s. = 

Next we test a model including ROLES as 
a covariate, finding that the number of roles 
. in 1956 does alter the expected duration of 
life and in the hypothesized direction. Women 
occupying more roles in 1956 are likely to 
live longer than those with fewer roles (Model 
1 in Table 1). This supports the role 
enhancement perspective, documenting that 
multiple roles in the prime: of adulthood 
.promote health to such a degree as to extend 
* the lifetime of individuals. 

Some might argue that the relationship 
between number of roles and longevity is 
spurious, confounded by factors affecting 
both multiple roles themselves and life 
expectancy. Accordingly, in separate analy- 
ses (not shown) we used multiple regression 
procedures to estimate the determinants of 
number of roles held in 1956. Three factors 
proved significant in estimating ROLES as an 
outcome variable: HOCC, husband's occupa- 
tional status (negative relationship); EDUC, 
educational level (positive relationship); SELF, 
a self-esteem measure (positive relationship). 
None of these factors was statistically signif- 
icant in estimating longevity; moreover, the 
ROLES variable continued to be a significant 
predictor of longevity even after controlling 
for these background variables. Other poten- 
tially confounding variables, such as general 
life satisfaction and a traditional view of 
women's roles, also proved insignificant in 

‘estimating the duration of women's lives 
(GSAT and TRADFEM, described in Appen- 
dix). These results offer support for the role 

= enhancement perspective—linking multiple 
roles to health and, hence, to longevity. 
Still, the fact that we have no health 
measure in 1956 introduces the question of 
causal ordering. How can we conclude that 

ROLES is a contributor to, rather than a 

consequence of, health? Three pieces of 
information lend credence to our causality 


ranging from professional to laborer, also had no 
significant effects on longevity. 
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interpretation. First, we do have 1956 mea- 
sures of mental well-being (SELF, GSAT), 
and there is a well-documented relationship 
between physical and mental health. Includ- 
ing these measures in our estimations in no 
way reduces the power of the ROLES 
variable. Second, our sample consists of 
women who were wives and mothers in the 
prime of adulthood. Women in poor health 
are less likely to marry or to bear children, so 
our 1956 sample of young adult women is apt 
to include predominately healthy individuals. 
Using life history data collected on tke 
survivors in 1986, we were able to construct 
health measures for. both 1956 and 1986 for 
those survivors who agreed to be interviewed 
in 1986 (N — 313). Only nine women reported 
any health problems in the 1956 period. 
Third, two longitudinal studies have found 
that social relationships reduce the likelihood 
of mortality, even after controlling for initial 
health status (Berkman and Breslow 1983; 
House et al. 1982). While ROLES as a proxy 
for health status in 1956 cannot be categori- 
cally ruled out, we feel that (1) the age, 
marital and parental status of the women in 
1956, (2) the nonsignificance of the mental 
health measures, and (3) the corroboration of 
other studies provide supporting evidence for 
our causal interpretation. 


Contextual Issues 


Thus far, we have documented the impor- 
tance of multiple roles in estimating longev- 
ity. But the role context orientation argues fer 
a more complex view, that attributes of roles, 
as well as their number, may affect surviva:. 
We examine the four contingencies discussed 
earlier: (1) that care-giving roles reduce 
longevity, (2) that emotionally supportive 
roles promote longevity, (3) that particular 
roles, such as employment, have more impact 
on longevity than do others, and (4) that it is 
the level of involvement in certain roles that 
matters. 

To evaluate the potentially deleterious 
effects of women's care-giving role obliga- 
tions within the home a NURTURE variable 
was constructed, including the présence cf 
preschoolers, having three or more childrer, 
and other persons living in the household.4 





* Since the mean number of children present 
was 2.5, we used the presence of three or more 


- These are Weibull estimates. N = 427. Censored 
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: Table 1. Maximom-likelihood Estimates of the Effécts of Women’ 8 Role Involvements in 1956 on Their Survival 


Probability 1956-1986" 




















Models ` ; 

I i nt (IN V VI ^. VI 
Log Roles” = ,505* 554444 .585** .443* .638** 376* | Be : 
Nurture .144. p Ud 
Others . ` ` .339 
Preschool —.135 g x 

, Kids .331* 101+ .10* | 094 ` .108* 
Social Support ..366 : 
Employment .290 . 
Club/Org .256* — 1.215* 
Club/Org-Log Role Interaction f .—0.831* 

-0 .588 .587 .585 ' .593 .592 .594 585: 
Loglikelihood —220.97 —219.73  —217.76 —217.82 -217.414  —218.37 -216.16 





* Controlling for age (LOG AGE 56), husband's occupational status (HOSO: and woman’s education (EDUC). 


= 347. 


> Includes all roles except those considered separately i in Models IV, V, VI, ad VII. For example, in Model V TOG 
ROLES does not include employment, which is entered separately. 
* Interval variable NUMKIDS used in Models IV, V, VI, and VI. 


* p=.05. 
** px 01. 
* * ps.10. 


This draws on Gove and Hughes's (1979, p. 
133) notion of nurturant role obligations, 
which they describe as “role obligations that 
require constant ongoing activities vis-à-vis 
spouse, children and others (such as parents) 
living in the home." They argue that these 
obligations prevent women from taking proper 
care of themselves and, consequently, impair 
their health. However, NURTURE had no 
significant effect on longevity (Model 2, 
Table 1). Neither did subjective dissatisfac- 
tion with the wife/homemaking role (WIFE- 
ROLE, not shown). But some facets of care 
giving can be beneficial: For example, 
children were seen by Durkheim as a 
protective, integrative factor. Accordingly, 
we disaggregate the NURTURE variable into 
its component measures. We find that number 
of children is positively related to longevity 
(Model 3, Table 1). Thus, at least with 
respect to longevity, the nurturant role 
hypothesis of Gove and Hughes (1979) is not 
supported. Indeed, children appear to pro- 
mote rather than detract from longevity.5 
Another consideration borne of a contex- 
tual approach concerns the salutary effects of 
roles that provide individuals with an infor- 





children as a measure of heavy cit dnte obliga- 
tions. 

5 Both the number of children and the dichoto- 
mous measure of three or more versus fewer were 
positively related to longevity. 


mal network of emotional support. Social , 


support is frequently operationalized in terms 
of ties to primary groups (kin, friends, 
neighbors), what Litwak and his colleagues 
(1989) categorize as "informal social sup- 
ports." To test the significance of these ties, 
three roles— relative (RELAT), friend 
(FRIENDS), and neighbor (NEIGH) — were 
combined into a measure. of SOCIAL, SUP- 
PORT. But this variable too had no signifi- 
cant effect on longevity (Model 4, Table 1). 
To examine the potential significance of 
employment, club membership, and. other | 
particular roles, we disaggregated the ROLES 
variable into its components and entered each 
separately into the equation. We also included 


‘in each equation a modified ROLES variable 


with the particular role of interest omitted. 
Despite the theoretical and empirical attention 
given to the significance of women's ‘labor 
force participation for health, the employment ` 
role in 1956 (EMP) had no effect on 
subsequent longevity (Model 5, Table 1). In ~ 
fact, the only single role that mattered was 
CLUB/ORG, a measure of membership in 
voluntary organizations or activities, which? 
was positively related to longevity. This: 
finding held true even. after controlling for the 
sum of other roles, education, husband's 

occupational status, and number of childreri 
(Model 6, Table 1). Moreover, we found an 
interaction between CLUB/ORG and ROLES, 

such that the effects of the number of roles 
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were contingent on participation in clubs and 
organizations (Model 7, Table 1). 

' We also tested for the level of participation 
in each role, controlling for occupancy of 
other roles. We found that level of participa- 
tion, as measured by frequency of attendance 
in churches, clubs/organizations, frequency 
of seeing neighbors or relatives, number of 
friends, and part-time versus full-time employ- 
ment, had no significant effect on longevity 
(not shown). What did matter was number of 
roles and, in particular, membership in clubs 
or organizations.® 

Figure 1 clarifies the apparent effects of 
CLUB/ORG; membership in clubs or organi- 
zations had the same longevity-enhancing 
effect as possessing four other roles. The 
salubrious effects of the CLUB/ORG vari- 
able—regardless of the number of other 
roles—are both surprising and puzzling. This 
finding could reflect the voluntary choice 
aspect of membership in clubs and organiza- 
tions. Religious membership as well as 
involvement with neighbors and kin may be 
more socially constrained. Similarly, employ- 
ment of wives and mothers in the 1950s—a 
time of traditional gender role divisions — may 
reflect family financial need more than 
personal preference. The significance of 
CLUB/ORG could also mean that while roles 
generally may contribute both stress and 
support to women's lives, participation in 
voluntary associations may produce more 
benefits than costs. : 

Membership in voluntary organizations 
was common among wives and mothers in the 
1950s; indeed fully 70 percent of our sample 
reported such participation in the 1956 
survey. Not surprisingly, middle-class women 
(those: with husbands in professional or 
managerial occupations) were more likely 
than those in the working class to be 


$ We tested as well the possibility of a 
curvilinear relationship between roles and longev- 
ity (using both ROLES and ROLES squared) but 
found no such relationship. The log transformation 
of ROLES corresponds to the theoretical notion of 
.the integrative function of roles with the protective 
effects rising steeply with the first few roles and 
then beginning to level off. The negative coeffi- 
cient for the CLUB/ROLES interaction suggests 
that the combination of ROLES and CLUB/ORG is 


less beneficial than the sum of their separate. 


effects. However, the combined effect is still 
positive (see Figure 1). 
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“joiners” (84 percent to 60 percent). But 
there was no difference in CLUB/ORG 
membership by employment status, suggest- 
ing that unpaid voluntary activities were not a 
substitute for paid labor force participation. 
Moreover, the effect of CLUB/ORG is not 
confounded by social class, since two con- 
trols for class (HOCC and EDUC) are 
included in the models estimated. 

The association of early adult CLUB/ORG 
membership with subsequent longevity re- 
mains intriguing. Obviously it could reflect 
self-selection, with healthier women more 
likely to be active in organizations. But healtk 
also can be expected to be related to 
employment, and employment does not man- 
ifest the same protective function’ Another 
possibility is that participation in clubs or 
organizations may influence mortality by 
encouraging preventative health behavior and 
enhancing health-related resources and choices. 
Or it may signify the exercise of choice and 
control at a time in women’s lives when 
marriage and motherhood, in the family- 
oriented milieu of the 1950s, severely circum- 
scribed their options. 

Subsequent analysis of the roles of the 
survivors who were interviewed in 1986 gives 
credence to the view of membership in clubs 
and organizations as being a voluntary 
choice. The life history data reveal that 
women in 1956 were involved in clubs and. 
organizations such as the PTA, hospital 
auxiliary, community chest and other fund 
drives, community theater, church groups, 
the Junior League and other women’s groups, 
and Brownies and other Scout groups. While 
there may have been some community or 
family pressure to participate, the qualitative 
life history material paints a picture of choice 
and of satisfaction. For example, one woman 
who recounted her heavy involvement in 
volunteer activities said she “could do this 
because [she and her husband] only went out 
once a year—on [their] anniversary.” An- 
other who was involved in a book drive for a 
local health center notes, “it was satisfying 

. everybody was ‘book crazy’ for four 
years.” Although the kinds of group member- 
ship change over time, CLUB/ORG in 1986 
is positively correlated with CLUB/ORG in 
1956 (r = .17, p = .003), suggesting a 
degree of continuity in this role. 

Some may challenge the stability of role 
occupancy and, therefore, question the import 
of multiple roles measured at only one point 
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MULTIPLIER 
OF 
SURVIVAL 


FUNCTION 


*Excluding Club and Organizational memberships. 
**Using the natural logarithm of roles (LOGROLES). 
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ROLES* 


Controlling for age (LOGAGES6), husband's occupational status (HOCC), woman's education (EDUC), and 
(NUMKIDS). 


number of children 


Figure 1. Effects of CLUB/ORG Memberships and Other Roles on the Longevity of Women** 


in time for subsequent longevity. We know, 
for example, that women have typically 
moved in and out of employment over the 
course of their lives (Moen 1985). However, 
our findings affirm that those occupying 
multiple roles at one point in time are more 
likely to do so at other points in time. Among 
the survivors, ROLES as measured in 1986 
was positively correlated to ROLES in 1956 
(r = .20, p = .002). (This group of survivors 
averaged 4.2 roles in 1956 and 3.6 roles in 
1986.) In fact, the only role not positively 
correlated with its counterpart 30 years later 
was employment, with women more likely to 
be employed in 1986 than in 1956. 


CONCLUSIONS 


The major aim of this paper is to contribute to 
the literature on social integration and well- 
being by testing and evaluating evidence on 
one critical facet of well-being: longevity. 
The legacy of a long life is first and foremost 
a result of choosing the right parents; 
biological predispositions are unquestionably 
indispensable. But mortality is more than a 
consequence of biology and of aging. Using 
event history techniques and estimation pro- 
cedures, we have documented the relationship 


between women's social integration — in terms 
of multiple roles— during "early" adulthood 
and their longevity over a 30-year period. 
Moreover, we have established that this 
relationship is not an artifact of social status 
or of personalitv dispositions (self-esteem, 
gender role traditionalism, general life satis- 
faction). Finally, we have identified participa- 
tion in clubs or other voluntary organizations 
as especially important for subsequent longev- 
ity. 

By documenting the link between multiple 
roles and longevity, this analysis provides 
support for the Durkheimian notion of the 
protective function of social integration and 
its corollary, the role enhancement perspec- 
tive.” Social integration appears to operate, at 
least in part directly, by reducing isolation. It 
apparently does not operate solely through 
self-esteem or social support, since neither of 
these factors promoted longevity. But since 
level of involvement in any single role is not a 
significant factor, there is more to the 
multiple roles concept than the prevention of 


? However, as acknowledged earlier, the lack of 
a 1956 measure of health makes it impossible to 
rule out a possibly spurious relationship between 
roles and longevity. 
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isolation. This calls forth Coser’s (1975) 
notion of roles as a “seedbed of complexity,” 
wherein varied social ties promote autonomy 
and, conceivably, health. The multiple roles 
effect could also reflect the value of a diverse 
social network in promoting health-related 
behavior. Wives and mothers with few roles 
beyond the family may be either more 
vulnerable to risk factors or more exposed to 
hazardous life-styles than those more socially 
integrated. 

We have also presented evidence as to the 
import of some roles above others, suggesting 
the utility of the role context approach. Why 
participation in clubs or organizations partic- 
ularly enhances longevity is far from obvious. 
But this finding points to the need for 
research specifically examining the relevance 
of choice of role attachments for health, and 
again invokes Coser’s (1975) emphasis on 
autonomy. Participation in formal or informal 
voluntary activities is typically discretionary. 
That very discretion may well lead to the 
salutary results we have observed. 

Theory and research must also address the 
dynamics of social integration. Do multiple 
roles that persist over time promote greater 
longevity than do more sporadic role involve- 
ments? What is the significance of the timing 
of role acquisition and loss over the life 
course? Clearly the connections between 
social integration and health constitute a 
stimulating and fruitful area of sociological 
inquiry. 
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Variables Frequency/Mean 
CONTROL VARIABLES 
AGE Age (year to nearest decimal) in 1956, Log of Age 
(LOGAGES6) used in analysis 36.8 
HOCC  Husband's occupational status in 1956 coded as: 
0 = nonprofessional, 0=58% 
1 = professional/managerial 1=42% 
EDUC Educational level in 1956 
1 = eighth grade or less 1=5% 3.28 
2 = some high school 2= 18% 
3 = graduated high school 3=43% 
4 = some college, 4=18% 
5 = graduated college 5=13% 
6 = advanced study 6=3% 
ROLE INVOLVEMENTS 
ROLES! Summed number of role attachments in 1956 $ 
includes the following six items: 4.11 
CLUB/ORG— memberships in clubs or organizations 70% 
EMP —employment status 22% 
RELIG — attending religious services 7196 
RELAT — seeing nearby relatives 84% 
NEIGH — visiting with neighbors 79% 
FRIENDS —having one or more close friends 95% 
NURTURE Sum of the following three items: 1.11 
OTHERS Others, besides immediate family 0=90.9% 
living in household (0=no, 1 = yes) 1=9.1% 
PRESCH Preschool children in the home 0=41.2% 
(0 = no, 1 = yes) 1=58.8% 
KIDS Number of children in household 0=57.1% 
(0 = one or two, 1 = three or more) 1242.99 
NUMKIDS Number of children in household 
(1 = one to 6 = six or more) 2.52 


WIFEROLE? Mean of three standardized questions on 
satisfaction with wife role, housework, and 


housecleaning ability — ,0005 
SOCIAL SUPPORT Sum of three variables included 

in Roles: RELAT, NEIGH, FRIENDS 2.49 

ATTITUDINAL VARIABLES 

SELF? Self-esteem measure, mean eight items (range 1-3) 2.32 
GSAT* General life satisfaction, mean 

three items (range 1-3) 2.68 
TRADFEM?  Sex-role beliefs, mean of 

nine items (range 1-3) 1.68 


Standard 
Deviation 


7.52 


1.14 


1.05 


.82 


1.27 


36 


ATT 





! Questions asked in 1956 survey — ROLES: 


CLUB/ORG Did you belong to any clubs or organizations during the past year? (0 — no, 1 — yes) 
EMP Do you work for pay now? (0 — no, 1 — yes) 


RELIG How often do you go to religious services? (0 = once a month or less, 1 = more than once a month) 
RELAT How often do you see Elmira relatives? (0 = infrequently or never, 1 = frequently — more than once a 


month) 


NEIGH At this time of year, about how often do you visit with any of your neighbors? (0 — less than once a 


week, | = once a week or more) 


FRIENDS When you think of all the friends you have, at this moment, about how many of them are people you 


consider really close friends of your own? (0 = none listed, 1 = listed one or more friends) 
? Questions asked in 1956 survey— WIFEROLE — Cronbach's alpha = .60: 


How well would you say that your house is kept, compared to other homes that you know about? 
How satisfied are you with the way you are meeting your own standards as a wife? 


When you are keeping house, how do you usually feel at the end of the day about what you have gotten done? 


continued 
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3 Questions such as the following were asked in 1956 survey —SELF — Cronbach's alpha = .61: 
(Scale based on count of agreements with these statements. Complete listing available from authors.) 


Below are statements women we have talked with have made about themselves. For each statement, please tell 
. whether you think what it says is more like you or more unlike you yourself. Now tell for each statement whether 


you would want to be like this. 


I am pleased with myself. ` 
I have faith in myself. 
I am attractive. 


* Questions asked in 1956 survey—GSAT —Cronbach's alpha = .62: 
How often do you find yourself feeling disappointed about the way things turned out for you? 
All in all, how much happiness would you say you find in life today? 
On the whole, how satisfied would you say you are with your way of life today? 


5 Questions such as these were asked in 1956— TRADFEM — Cronbach's alpha = .77: (Complete listing available 


from authors.) 


Here are some things you may think are all right for a woman to do, and others you may think are not proper for 
a woman to do. Is it all right or not all right for a woman to 


supervise the work of men, 
take an active part in politics, 
become an electrical engineer? 
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. We report results from a large-scale survey of the effects of unemploymént on 


emotional functioning. The study design explicitly considers the possibility that 
prior emotional functioning may influence the risk of unemployment and chances 


_ for reemployment. We document substantial effects of unemployment on distress as 


well as positive emotional effects of reemployment. Contrary to initial 
expectations, the probability of reemployment is unrelated to baseline emotional 
functioning. The elimination of elevated distress after reemployment suggests that 
the worst psychological effects of job loss can be minimized if opportunities exist 


for (REDI RIEN 


Qualitative accounts of the personal and 
family suffering, caused by unemployment 
have been available for nearly half a century 
(e.g., Angell 1936; Bakke 1934; Komarovsky 


^* 1940). Large-scale surveys of unemployment 


have been available for the past 20 years (see 
Horwitz 1984, for a review), as have 
. aggregate time-series analyses of the relation- 
'. Ship between economic downturns and changes 
in rates of mental hospitalization, suicide, 
homicide, and crime (e.g., Brenner 1973; 
Brenner and Mooney 1983). Yet, despite all 
this research on the relationship between 
unemployment and health, we lack a body of 
evidence that provides an unbiased quantita- 
tive assessment of unemployment and reem- 
ployment effects on the individual. 

The absence of such data is due to the fact 
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that previous reseàrch has neglected misspeci- 
fication problems that confound attempts to 
evaluate the true effects of joblessness on the 
individual. The most rigorous studies have’ 
documented two main results. First, cross- 
sectional analyses have shown that people 
who have lost a job recently are in signifi- 
cantly worse health than the stably employed 
(Kessler, House, and Turner 1987). Second, 
longitudinal analyses have shown that the 
health of the recently reemployed improves 
significantly after they find new jobs (Warr 
and Jackson 1985). These results could be 
interpreted to show that unemployment causes 
poor health and that reemployment reverses 
this effect. But this is not the only plausible 
interpretation. Cross-sectional and longitudi- 
nal analyses, when used apart, are subject to 


selection bias that can lead to overestimates of : 


both unemployment and reemployment ef- 
fects. The only way to avoid this bias is to 
combine cross-sectional and longitudinal de- 
signs in a single study, a.strategy that has not 
been used up to now. Results of such a study 
are reported in this paper, based on a survey 
of job loss and reemployment in a high 
unemployment area of southeastern Michi- 
gan. : 


BACKGROUND 


Our work on unemployment began in 1984, 
the latter part of the early 1980s recession. 
The unemployment rate in the largely blue- 
collar work force of the Detroit ee 
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area was over 13 percent. We wanted to 
estimate the effects òf unemployment, on 
emotional functioning but realized that a 
before-after design was impossible because 
most of the people we wanted to evaluate had 
become unemployed before the research 
began. 

Our solution was to use a compromise 
design in which subsamples of stably em- 
ployed, currently unemployed, and recently 
reemployed respondents were obtained in a 
probability survey of high unemployment 
census tracts. Respondents who involuntarily 
lost a job between 1980 and 1984 were 
questioned about the circumstances surround- 
ing their job loss and classified as either not at 
fault (e.g., plant-closing) or possibly at fault 
(e.g., fired). This classification allowed us to 
estimate job loss effects on psychological 
distress in a cross-sectional comparison be- 
tween stably employed people and people 
who lost.jobs (currently unemployed and 
recently. reemployed combined into a single 
comparison group) while correcting for selec- 
tion into unemployment. This analysis docu- 


mented that job loss was associated with: 


significant elevation "of depressed mood, 
anxious mood, somatization, and self- 
reported physical illness (Kessler, House, and 
Turner 1987). 

A cross-sectional analysis of this. type can 
document an effect of recent job loss, but not 
an effect of current unemployment, because 
the cross-sectional -comparison of stably 
employed people with people having recent 
unemployment experience combines currently 
unemployed and previously unemployed re- 
spondents into a single group. This yields an 
unbiased estimate of job loss effects (i.e., the 
effects of having lost a job, ignoring whether 
the respondent was still unemployed, at the 
time of interview) but. underestimates current 
unemployment effects.. - >- 

This bias cannot be removed with cross- 
sectional data because we cannot know low 
much of the difference in symptom levels 
between the currently unemployed and the 
recently reemployed is due to post- 
reemployment recovery and how much is due 
to selection out of unemployment on the basis 
of prior distress. However, this information 
can be obtained in a longitudinal study. A 
number of longitudinal studies have consis- 
tently found that both psychological well- 
being (Jackson, Stafford, Banks, and Warr 
1983; Payne and Jones 1987) and physical 
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health (Warr and Jackson 1985) improve ` 
substantially after reemployment. Unfortu- 
nately, since none of these investigations 
included a comparison group of employed 
workers, they were unable to estimate either 
the impact of unemployment on health or the 
completeness of recovery following reemploy- . 
ment. 

The research reported here corrects this 
limitation by combining a' cross-sectional 
analysis of employed and unemployed respon- 
dents with a longitudinal analysis of reemploy- 
ment. This strategy allows us to study the. 
effect of prior distress on reémployment, to 
adjust cross-sectional data for: this effect in 
estimating the impact of current unemploy- 
ment on distress, and to determine the 
completeness of recovery associated with 
reemployment. 


DATA AND METHODS 


Sample 


The baseline sample was selected eon: a 
multistage probability sample of 14 contigu- 
ous census tracts in southeastern Michigan. ` 
An attempt was made to choose the census 
tracts with the highest rates of unemployment 
within a target area known to have experi- . 
enced high unemployment, while maintaining 
geographic continuity among the tracts. The 
unemployment rates in the 14 tracts ranged 
from 7.5 to 31.5 percent. 
unemployment rate for the entire selected area 
was 13.3 percent in the fall of 1984 when the 
baseline survey was administered. 

The sample was stratified with dispropor- 
tionate weights to select roughly equal 
numbers of respondents in three strata: - 
currently unemployed people who had lost 


: their jobs during the recession; previously 


unemployed people who lost jobs during the 
same period but were reemployed at the time 
of interview; and stably employed people 
who had not lost a job during the recession. 
The final sample consisted of 492 respon- . 
dents— 146 currently unemployed, 162 previ- 
ously unemployed, and 184 stably employed— : 
who participated in face-to-face interviews in 
their homes. The response rate was 78 percent 
overall. The combined baseline sample was 
60 percent male, 20 percent black, 50 percent 
unmarried, and 82 percent blue/pink collar, 
and had a mean age of 35 years and a mean 
education of 12 years. More detailed informa- 


The overall - 
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Table 1. Employment Status Changes between Time 1 and 2 














Time 1 Employment Status 





Currently 








Time 2 Previously Stably 

Employment Status Unemployed | Unemployed Employed Total 

Unemployed 64 5 a7 76 

Employed 51 123 141 315 

Excluded* 5 5 i 13 23 

TOTAL 120 133° 161 414 
(8295) (82%) ' (8896) (8495) 





* These cases were excluded from subsequent analyses either because they were voluntarily working part-time (less 
than 30 hours per week) or because their unemployment was voluntary. - 
> Percentages refer to the percent of the PEN sample or subsample that was as successfully followed up at the time 


2 data collection. 


tion regarding sampling procedures has been 


reported previously (Kessler, Turner, and 
House 1987). 

One year after this initial data collection, 
an attempt was made to contact all original 
respondents and to administer an abbreviated 
follow-up questionnaire over the telephone. 
We sent an interviewer to the homes of 
respondents who could not be reached by 
telephone. We successfully interviewed 414, 
or 84 percent of the original 492 respondents 
(see Table 1). 

In Table 1 we also show the changes in 
employment status that occurred over the 
follow-up period. More than 40 percent of 
respondents unemployed at Time 1 were 
reemployed (working 30 hours or more a 
week) a year later. This reflects the dramatic 
improvement in the local economy that 
occurred starting at the end of 1984. Of the 
294 baseline employed respondents who were 
reinterviewed, only 12 were involuntarily 
unemployed a year later. 

To compare unemployed workers who 
regained full-time employment and those who 
remained unemployed with the full-time 
stably employed, we excluded from analysis 
those who were working part-time at time 2. 
This resulted in the loss of 23 cases and a 
final panel sample of 391. 


Data Analysis and Measures 


We report two analyses in this paper. In the 
first, we assessed the extent to which the 
probability of becoming reemployed was 
affected by baseline distress. This was done 
by estimating regression equations, in which 
time 1 measures of distress were used to 


predict reemployment, with age, sex, educa- 
tion, race, and marital status controlled.: 

In the second analysis, we assessed the 
impact of reemployment on distress. This was 
done by regressing time 2 distress measures 
on reemployment, controlling for distress at 
time 1 and the sociodemographic control 
variables listed above. To determine whether 
the emotional effects of unemployment can be 
completely reversed, we estimated regression 
models comparing the time 2 emotional 
functioning of reemployed people to that of 
the stably employed.? 

We included a range of distress dimensions 
in the interview schedule. Based on the 
outcomes typically found to be influenced by 


‘Unemployed people in this sample were 
younger, less educated, and more likely to be 
female, black, and never married than the stably 
employed (see Kessler, Turner, and House 1987). 
These variables usually have significant relation- 
ships with health and mental health in general 
population surveys (Kessler, Price, and Wortman . 
1985). The bias in the estimated effects’ of 
unemployment due to this nonrandom distribution 
in job loss was taken into consideration by using 
these variables as controls. Nonrandom effects of 
age and education were taken into consideration by 
controlling for both linear and squared terms. 

. ? The use of time 1 outcome measures as 
controls effectively converts the outcomes into 
change scores. This is appropriate in the analyses 
comparing the reemployed with the stably unem- . 
ployed because we want to determine whether the 
improved functioning associated with reemploy- 
ment was significantly greater than any improve- 
ment found among respondents still unemployed at 
follow-up. It is not appropriate, however, in the 
analyses that compare the reemployed with the 
stably employed because we want to determine 
whether time 2 distress levels of the reemployed 
were equivalent to those of the stably employed. 
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Table 2. Cross-Sectional Associations between Employment Status and Distress, Time 1" 





A. Continuous Measures of Distress’ 





Standardized Means 
CU? ' PU SE 
Depression 311 —.064  —.188 
Anxiety .293  —.042 -—.197 
Somatization .115 105 —.179- 
Physical illness .178 .000 —.131 
B. Dichotomous Measures of Extreme Distress 
Standard Risks 
CU PU SE 
Depression .149 .072 .074 
Anxiety .152 .065 .045 
Somatization .252 225 .130 
Physical illness .168 .101 .066 





651. 














_ Mean Differences 
CU vs. PU CU vs. SE PU vs. SE (CU + PU) vs. SE? 
.375* ..449* `  .124 .273* 
.335* .490* .155 .289* 
.010 .294* .284* .288* 
.178 .309* .131 .203* 
Relative Risks . 
CU vs. PU CU vs. SE PU vs. SE (CU 4- PU) vs. SE 
2.07* 2.01* .973 1.53 
2.33* ` 3.37* 1.44 2.57* 
1.12 1.94* 1.73* 1.82* 
1.66* 2.55* 1.53 2.00* 


* Samples vary in size from 481 to 489. Listwise deletion of missing values was used to compute results. All results 
are standardized by age, sex, race, education, and marital status. 

b CU = curently unemployed. PU = previously unemployed. SE = stably employed. 

* [n the models combining CU and PU, these groups are differentially weighted to reflect the different sampling 


fractions used to obtain them. ; 
* Significant at the .05 level, two-tailed test. 


stressful life events (see Kessler, Price, and 
Wortman 1985 for a review), we used the 
depression, anxiety, and somatization sub- 
scales of the SCL-90 (Derogatis 1977) as 
baseline indicators of distress. 

In an effort to evaluate physiological 
consequences of unemployment, we also 
included a baseline four-item index of self- 
evaluated physical ill health (Kessler, House, 
and Turner 1987). We acknowledge that this 
measure taps broad symptoms of physical 
illness rather than particular clinical syn- 
:dromes. The index is not independent of the 
psychological outcomes and probably re- 
flects, at least in part, psychophysiological 
symptoms associated with emotional distress. 

The four outcomes were all positively 
correlated in the baseline data. The correla- 
tions between anxiety and depression (.81) 
and between physical illness and somatization 
(.59) were particularly strong. The first scale 
in each of these two pairs was subsequently 
omitted from the time 2 interview schedule. 
As a result, depression and somatization were 
the only outcomes considered in the analyses 
assessing the impact of reemployment on time 
2 distress.? 


3 Time constraints involved in the follow-up 

_ telephone interview forced us to drop many 
‘baseline survey questions. We would have liked to 

retain the anxiety and physical illness measures. 

Obviously, the correlations of anxiety with depres- 


All four outcome measures im the baseline 
survey were scaled to a mean of zero and 
variance of one. The time 1 means and 
variances were also used to define the metric’ 
of the time 2 outcomes so that change could 
be meaningful assessed. 


RESULTS 


Cross-Sectional Relationships Between 
Unemployment and Distress 


Results in Part A of Table 2 are based on 
analysis of covariance models in which a` 
three-category measure of employment status 
was the independent variable, distress was the 
outcome, and age, sex, education, race, and 
marital status were covariates. Results in Part 
B are based on logistic regression equations. 
The coefficients in Part A are standardized 
means, while those in Part B are standardized 
probabilities of extreme distress, obtained by : 
reparameterizing the logistic regression coef- 


ficients. For depression, anxiety, and somati- ` 


zation, extreme distress was defined using 
cutpoints established for psychiatric outpa- 
tients by Derogatis (1977). For physical 





Sion and somatization with physical illness at 
baseline were not so high that we could exclude the 
possibility that different influences on them might 
have .been found in a more comprehensive 
follow-up study. 
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Table 3. Impact of Time 1 Distress on Probability of Reemployment* 














Time 1 Distress Measures 


Depression Anxiety 
b se b se 
040 .067 .035 


(n? (115) (115) 








Somatization Physical Health 
b se b se 
.054 .041 .016 ` .047 
(114) (115) 





* Coefficients represent incremental change in the probability of reemployment for each standard deviation increase 
in distress score, standardized on age, sex, race, education, and time 1 marital status. 
> The equations were estimated on those respondents in the panel sample who were unemployed at time 1, for whom 


complete data were available. 


illness it was defined as the top 10th 
percentile of the distribution for the total 
sample. The dichotomous outcomes were 
considered along with the more conventional 
continuous ones to evaluate the impact of 
unemployment on clinically significant ex- 
treme scores as well as on mean levels. The 
coefficients show that all distress measures 
were significantly elevated among the cur- 
rently unemployed compared to the stably 
employed at baseline. We found evidence of 
.residual elevated distress among the previ- 
ously unemployed only for somatization. 

As noted, past analyses documented that 
self-selection into unemployment cannot ex- 
plain these patterns (Kessler, House, and 
Turner 1987). Respondents who lost their 
jobs through no fault of their own (e.g., plant 
closings) were just as distressed as those 
whose own actions may have led to their job 
loss (e.g., fired because of inadequate work 
performance). Nonetheless, we should not 
interpret the high distress of the currently 
unemployed as evidence that current unem- 
ployment causes emotional reactions which 
are totally resolved with reemployment. It is 
equally plausible that poor emotional function- 
ing in the face of unemployment interfered 
with job search activities and that this 
selection process created a situation in which 
high distress was common among currently 
unemployed and rare among previously unem- 
ployed respondents. The only way to deter- 
mine whether selection of this sort was at 
work is to analyze panel data. 


Baseline Distress and Reemployment 


In Table 3 we present the results of panel 
analyses estimating the effects of Time 1 
distress on the probability of reemployment. 
These effects were estimated from four 
Separate OLS regression equations computed 
"on" the subsample of respondents who were 


0m 


unemployed at time 1. These equations are af 
the form 


P, = bg  bi1T, + Controls, (1) 


where P, is a dummy variable scored one far 
reemployed respondents, zero otherwise, and 
T; is the time 1 distress measure.* 

If the type of selection effect discussed 
above was at work, we would expect that 
time | distress would be negatively associated 
with reemployment. The results show, how- 
ever, that this was not the case. Indeed, 
distress was modestly but consistently associ- 
ated with an increased probability of reem- 
ployment by time 2. This relationship was 
strongest for depression, where it bordered on 
being statistically significant at the .05 level. 
We also considered nonlinear effects (results 
not presented) and found that time 1 extreme 
depression significantly increased the proba- 
bility of reemployment at the .05 level. 
Insignificant trends in the same direction were 
found for the other distress measures. 

A pattern such as this could occur if people 
who were highly distressed by job loss were 
willing to accept the next job that came along, 
even if that job was not a very good one. We 
considered this possibility by examining two 
measures of job quality—earnings and job 
security. No evidence could be found tha: 
time 1 distress was associated with accepting 
a bad job as defined by either of these criteria. 

The only other plausible explanation is that 
extreme distress in the face of unemployment 
was associated with more intense job searck 


* The dichotomous outcome variable has a 
distribution of 44 percent reemployed and 5€ 
percent still unemployed. This lack of extreme 
skew makes it possible to use OLS regression. The 
same significant predictors emerged in logistic 
regression, but OLS results are reported here 
because they are easier to interpret. 
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efforts. The survey lacked sensitive measures 
of job search intensity. We suspect, however, 
that accurate measures of job search intensity 
would have shown more persistent efforts 
among people who were extremely distressed 
by job loss. 

The reason for the weak positive associa- 
tion between baseline distress and reemploy- 
ment is of less importance for the present 
analysis than the fact that the hypothesized 
negative association was not found. This 
means that the selection bias which we feared 
in the cross-sectional comparison was not 
present. This, in turn, means that the 
cross-sectional documentation of very high 
levels of distress among the currently unem- 
ployed is, if anything, an underestimate of the 
emotional damage created by job loss rather 
than the overestimate we suspected. 


Reemployment and Improvement in 
Emotional Functioning 


Having assessed the effect of distress on 
reemployment, we next turned to the impact 
of reemployment on subsequent change in 
emotional functioning. To assess this impact 
we estimated models of the form 


T2 = bo + biT, 
+b2RE + Controls, (2) 


where 7T, and 75 were the time 1 and time 2 
measures of the distress scales (either depres- 
sion or somatization) and RE was a dummy 
variable scored one for respondents who 
were reemployed at time 2 and zero for those 
still unemployed. The b, coefficient repre- 
sents the relative change in the outcome 
variable, between time 1 and time 2, of 
reemployed compared to still unemployed 
respondents, controlling for the selection 
effects of baseline distress on reemployment. 
Both OLS models for continuous measures 
and logistic regression models for dichoto- 
mous measures were used. The latter models 
used the same cutpoints as the time 1 analyses 
in Table 2. 

In the first row of Table 4, reemployment 
is associated with improvements in both 
depression and somatization, although these 
improvements were significant only for de- 
pression.5 The second row presents estimates 


5 The small and nonsignificant improvement in 
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Table 4. Impact of Reemployment on Change in Psycho- 














logical Distress 
Distress Measures 
Somatization Depression 
b se b se 
Continuous scores* — —.245 .180, —.548* .190 
Relative risk of 
extreme distress .576 .559 .190* .080 


(n) (114) (114) 


* Mean differences between the reemployed and the 
still unemployed in time 2 continuous distress scores, 
controlling time 1 distress scores as well as age, sex, 
race, education, and time 1 marital status. 

> Probability of extreme distress (reemployed) Proba- 
bility of extreme distress (still unemployed) at time 2 
with the same variables controlled. 

* Significant at the .05 level, two-tailed test. 





of relative risk—the probability of extreme 
distress among the reemployed divided by the 
probability among the still unemployed, 
standardized on baseline differences in the 
control variables.6 The reemployed were less 
than 20 percent as likely to experience 
extreme depression and 60 percent as likely to 
experience extreme somatization as people 
who were still unemployed at time 2. Thus, 
reemployment reduced not only the average 
symptom level, but also the risk of experienc- 
ing symptoms potentially severe enough to 
warrant professional intervention. 

While it is perhaps not surprising that the 
functioning of those who found work im- 
proved more than that of those who remained 
unemployed, it still may be that the unemploy- 
ment experience had residual effects. In other 
words, while improving considerably, the 





somatic symptoms is consistent with the cross- 
sectional result that this was the only outcome for 
which previously unemployed individuals exhib- 
ited substantial residual effects (Table 2.) 

© Relative risk estimates were obtained by 
reparameterizing predicted values based on logistic 
regression equations to calculate predicted proba- 
bilities of extreme distress. The standard errors for 
the estimates of relative risk are equal to the 
estimates themselves divided by the absolute value 
of the z-score 


z= (P. — PVP, (17 P) [(/P,) + Q/P2"^, 


where P, is the probability of extreme distress 
among the reemployed, P, is the probability of 
extreme distress among the time 2 unemployed, 
and P, is the weighted average of Uiese two 


two 
probabilities (Kleinbaum, Küpfer, p: EN 
stern 1982). [or SW 
"oCCENTR 


AL}! 


b P 
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reemployed may not have returned com- 
pletely to their emotional state prior to job 
loss. We were unable to address this 
possibility directly because we did not 
measure the health of the time 1 unemployed 
respondents prior to when they lost their jobs. 
However, in previous analyses (Kessler, 
. House, and Turner 1987) we found no 
evidence that people had been selected into 
unemployment on the basis of prior levels of 
` distress. This suggests that we can indirectly 
determine the extent of recovery experienced 
by the reemployed by comparing them at time 
2 to the stably employed. 

To do this we estimated regression models 
similar to Equation 2, the difference being 
that the unemployed were excluded from 
analysis and the reemployed were compared 
to those who never lost their jobs.” In 
addition, unlike in Equation 2, time 1 distress 
was not controlled. The focus of this analysis 
was, therefore, on whether the functioning of 
the reemployed was distinguishable from that 
of the stably employed at time 2 rather than 

^on whether the two subsamples differed in 
change between time 1 and time 2 distress 
levels. 

The results, presented in Table 5, indicate 
that the reemployed experienced complete 
emotional recovery within a year of returning 
to work. Neither depression nor somatization 
was: significantly different at time 2 for the 
reemployed compared to stably employed 
workers. Indeed, the recently reemployed 
actually had somewhat lower risk of extreme 
distress at time 2 than the stably employed. 
This result suggests that the psychological 
impact of job loss in this sample was entirely 
an effect of being unemployed. 

Just as job loss is not the same event for all 
those who experience it, so is reemployment 
different depending on the new job and the 
circumstances surrounding it. People who 
take jobs that are substantially less desirable 
than their previous work are unlikely to be 
affected in the same way as people who are 
able to find secure employment, comparable 
in quality to their previous job. With this in 
mind, we carried out analyses to investigate 


7 Time 1 employed respondents who had been 
unemployed sometime in the recent past were 
excluded in order to make the comparison a purer 
contrast between the stably employed and the 
recently reemployed. 
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Table 5. a on ae M He 
ployed and the Stably Employed 








Time 2 Distress Measures 





Somatization Depression 
b se b se 
Continuous scores* — .034 .115 011  .144 
Relative risk of 
extreme distress? .850 — 3.17 .696 1.65 
(n) (188) (189) 


* Mean differences between the reemployed and the 
stably employed in time 2 continuous distress scores. 
Standardized on age, sex, race, education, and marital 


status. 

> Probability of extreme distress (reemployed) Proba- 
bility of extreme distress (stably employed) at time 2, 
standardized on the same five control variable. 


the specifying influences of job conditions on 
the healing effects of reemployment. 

Over a dozen different job conditions were 
considered in this analysis, including subjec- 
tive feelings about intangible job . qualities 
(e.g., feeling a sense of accomplishment), 
objective rewards (e.g., earnings), and com- 
parisons of job conditions at the old and new 
jobs. None of these measures was signifi- 
cantly associated with variation in the emo- 
tional recovery found among reemployed 
people. It is likely that this null finding 
reflects the fact that recently reemployed 
people are happy to have any job. Negative 
job conditions would almost certainly be 
more important correlates of emotional func- 
tioning in a follow-up over a longer time 
interval. 

One job condition did make a statistically 
significant difference to reemployment ef- 
fects. Fourteen of the reemployed respondents 
reported that it was at least somewhat likely 
that they would involuntarily lose their new 
job within a year. This kind of job insecurity 
was positively associated with time 2 depres- 
sion and somatization among the reemployed. 
Furthermore, reemployed people who held 
these insecure jobs were significantly more 
depressed at time 2 than stably employed 


respondents, suggesting that reemployment - 


does not fully relieve unemployment-related 
distress if future unemployment is antici- 
pated. 

DISCUSSION . 


We began the analysis of reemployment with 


.the suspicion that unemployed people experi- 


encing high levels of distress would find it 
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more difficult to obtain a new job. A pattern 
of this sort would be important for policy 
purposes because it would direct us to a 
subgroup of unemployed people who are 
doubly disadvantaged—experiencing more dis- 
tress because of unemployment and finding it 
more difficult to change their situation. The 
documentation of such a pattern would also 
be important methodologically because it 
would suggest the .existence of bias in 
cross-sectional analyses of unemployment 
effects. 

The longitudinal data disconfirmed our 
initial suspicion. Far from impeding job 
search efforts, high distress was actually 
associated with a slightly increased likelihood 
. of finding a new job over the one-year 
follow-up period. 

Although we found no evidence that 
distress was associated with a greater willing- 
ness to accept an inferior job, it is important 
to remember that the local economy improved 
greatly over this one-year period. In-such a 


Situation high distress might have long-term. 


effects in motivating active job search. A 
much more destructive cycle could occur, 
however, in a place or time in which the 
economy was not improving. 

We also found that undesirable job condi- 
tions were not associated with distress among 
the recently reemployed. It is well known that 
undesirable job conditions are important 
correlates of distress (Caplan, Cobb, French, 
Harrison, and Pinneau 1975; McLean 1979). 
The fact that this was not true of the recently 
reemployed in our sample suggests that they 
were, at least temporarily, satisfied to have 
any job. Two comments are important to 
make about this finding. First, because of the 
improving economy over the follow-up pe- 
riod, it was rare to find a respondent with a 
new job that was substantially worse than the 
job held at the beginning of the recession. 
- Thus, our failure to detect a negative effect of 
reemployment at an inferior job might hold 
only for small decrements. 

Second, even when otherwise stressful job 
conditions are not experienced as stressful by 
the recently reemployed, this is likely to be 
temporary. The fact that only one year had 
passed at the time of the follow-up prevents 
us from assessing whether the improvements 
resulting from reemployment dissipate for 
those in low-paying or otherwise unrewarding 
jobs. We currently know very little about the 
long-time effects of downward mobility. It 
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has been suggested that the resulting eco- 
nomic deprivation and commensurate anxiety 
about adequately meeting financial demands 
may have long-term effects on the worker’s 
self-concept (Ferman and Gardner 1979). 

‘Further research on unemployment and 
reemployment should consider the occupa- 
tional structure as a whole. If a consistent 
pattern of deskilling continues over the 
remainder of this century, as some have 
suggested (Braverman 1974), then a new kind 
of job “loss” will become increasingly . 
common, with workers being retrained to take 
on jobs that are less rewarding than their 
previous jobs. The evidence for a pervasive 
deskiling is far from conclusive (Form 
1987), but job loss needs to be studied in the 
context of the changes over time occurring 
within specific occupations. Longitudinal 
research in contracting economies can provide 
a good preliminary Model for future research 
and theorizing. 

A major finding of our research in some. 
ways contradicts . earlier work. We docu- 
mented that the health-damaging effects of 
job loss were reversible. Respondents who 
found new jobs showed dramatic decline in 
distress compared to those who remained | 
unemployed and were, in fact, functioning as 
well as the stably employed at the follow-up 
interview. This finding contrasts to those of a 
number of earlier qualitative studies that 
followed a-number of unemployed workers 
over time. These studies found residual 
effects of job loss after reemployment. 
Fineman (1983), in one of the most careful of 
these investigations, found psychological “leg- 
acy” effects in approximately half of the 
reemployed people in his study. Feelings of 
personal failure and doubts about abilities to 
perform adequately on their new jobs were 
particularly common among the recently 
reemployed. Kaufman (1982) reported that 
feelings of low self-esteem were particularly . 
persistent after reemployment and even sug- 
gested that some of these emotional residuals 


` may be permanent. Fagin (1979) suggested 


that personality changes as a result of 
prolonged unemployment could be permanent 
as well. 

Other researchers have found that the 
extent of emotional recovery following reem- 
ployment varied depending on tlie nature of 
the new job. Shamir (1985), for, example, 
found that emotional recovery after reemploy- 
ment required that the new job be seen as at 


656 - 
least as favorable as the oid one. Fineman 
(1983) reported that reemployed people who 
-felt inadequate to the tasks of the new job 
were more distressed than they had been 
when they were unemployed. Having identi- 
' fied a number of features of unemployed life 
that can result in poor mental health, Warr 
(1987) noted -that reemployment alleviates 
these different features to differing degrees, 
- depending on the nature of the new job. 

.- What. sets our study apart from these, other 
than the methodology, is the nature of our 
sample. The predominate industry in the area 

. We studied is automobile manufacturing and 
most of our respondents were blue-collar 
workers. "The studies reporting residual ef- 
fects all examined white-collar, middle-class 
individuals who lost their jobs. . 

In an earlier paper (Kessler, Turner, and 
House 1987), we showed that financial strain 
is the main mediating variable in the 
relationship of unemployment to mental 
health for our sample. Effects frequently 

. discussed in the context of job loss among 
middle-class individuals—reduced  self- 
esteem, -a sense of purposelessness, distress 

' associated with the absence of the day-to-day 
: challenges previously experienced on the job, 

` and the sense of fulfillment associated with a 
job.well done— were not important themes. 
Thus, it appears that blue-collar workers are 
particularly amenable to emotional recovery 
thróugh reemployment because a regular 
„paycheck removes the main stress associated 
with unemployment. In contrast, since middle- 
class professionals and other white-collar 
‘workers might be expected to see their jobs as 
more than a source of income (Kahn 1981), 
the effects of unemployment are not as likely 
to be. ameliorated simply by earnings from a 
new job. If, as Fineman (1983) suggests, a 
jeb loss experience alters the ‘entire way in 
whith middle-class workers approach their 
work, then emotional effects might persist 
well after reemployment. 

. The state of the economy at the time of our 

: study also calls into question the degree to 

which the finding of reversibility is generaliz- 
able to different times and places. Job 


' security was a major concern of reemployed: 


workers in the studies that documented 
residual psychological effects. In a bad 
économy, the reemployed are more likely to 
face future unemployment or underemploy- 
‘ment. This type of situation undoubtedly 
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increases their psychological: distress relative 
to the stably employed. 

Our evidence suggests that this is a valid 
concern. The reemployed in our survey dic 
not fully recover if they reported that it was 
likely they would have to leave their new 
jobs. They were significantly more depressed 
than the reemployed who perceived their jobs 
to be secure. If emotional recovery from 
reemployment is contingent upon job secu- 
rity, then it is reasonable to assume that. it is 
contingent upon the character of the job 
market as a whole. 
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COMMENTS 


CULTURAL MATERIALISM IN THE 
FUNCTIONALIST MODE 


(Comment on Namboodiri, ASR, August 1988) 


MICHAEL A. FAIA 
College of William and Mary 


. Namboodiri's (1988) defense of ecological demog- 
raphy.is highly compelling. However, I believe 
that he misses several opportunities to convince us 
even more strongly of the power and utility of this 
neglected paradigm. In this paper I wish to explore 
a few of these opportunities. 

Although merely implied by Namboodiri (e.g., 
p. 620), an impressive aspect of demography/ 
human ecology is that at least since the days of 
Hauser and Duncan's The Study of Population 
(1959) and the earliest promulgation of the POET 
paradigm, we have had the benefit of conventional 
definitions of: both fields. Most social science 
disciplines do not have this level of consensus as to 
the nature of their subject matter. Nor does 
Namboodiri mention explicitly the fact that 
demography provides us the opportunity to use 
powerful deterministic models, such as the deter- 
mination of size, distribution, and composition of 
population by fertility, mortality, migration, and 
social mobility, or the determination of fertility by 
the Davis-Blake (1956; Bongaarts and Potter 1983) 
“proximate determinants.” Again, most social 
science fields are constrained to work with 
' stochastic models most of the time. When we 
conteniplate the definitions of demography and 
human ecology, we discover, further, that funda- 
mental social science fields, such as social 
stratification or the study of norms and deviance, 
may be subsumed as aspects of .population 
composition and ecological organization. We 
should consistently point out that demography is 
the first leg of POET, and that POET is basically 
another name for the social sciences. 

-© I'm not sure that I understand Namboodiri's 
distinction (p. 621) between corporate and cate- 
goric units: He mentions households or villages as 
corporate units, but these entities strike me as 
unique categories, comparable to other categories. 

Why can’t they simply be categoric units? 
Recently I've gotten involved in a project designed 
to see whether the thesaurus that now accompanies 
Sociological Abstracts has any utility in providing 
a taxonomy for the discipline, and my answers so 
far, based on boolean searching through the online 
version of SA, are generally in the negative. 
However, I do believe that there are subdisciplines 
of the social sciences that are now ready to get 
themselves organized taxonomically, and I believe 
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that demography and human ecology are among 
the most likely candidates. We should probably 
begin building a general taxonomy by exploiting 
the notion that demography, with its strict 
definition and its determinisms which are them- 
selves by definition made.up of exhaustive and 
mutually exclusive factors,. is classified under 
ecology, and so forth. We would then have a start 
on the sorts of conceptual hierarchies, advocated 
by Lazarsfeld (1972, pp. 227-30), that Sociologi- 
cal Abstracts, Inc. claim (1986, p. vii; 1988, p. 2), 
but on which they cannot really deliver. Inciden- 
tally, taxonomic efforts seem to be expanding 
rapidly among specialists in organized response to 
disasters (Kreps 1989, forthcoming), and these 
efforts promise to make a significant contribution 
to the “O” leg of POET. 

I find it encouraging that questions of taxonomy ` 
are of sufficient importance to Namboodiri that he 
returns to them again on p. 631, in reference to 
Ronald Corwin’s work. On the same page, 
Namboodiri suggests that the abstract concept of a 
population clarifies the affinities between the study 
of human populations and the study of animal and 
plant populations, of populations of organizations, 
of cultural artifacts, statuses, and so on—a lovely 
taxonomic leap. In a recent work of my own (Faia 
1986) I argue, along with many others (e.g., Tuma 
and Hannan 1984), that the analysis of abstract 
populations should be a central concern of the 
social sciences and, further, that the functionalist 
or systemic perspective should emerge as the major 
theoretical context for this sort of work. 

Apropos of this last remark, I cannot agree with 
Namboodiri's claim (p. 623) that for demography 
"no theoretical orientation with a macrolevei 
emphasis can be of any help." It is not inevitable 
that we drown in Ryder's morass of subcranial 
exploration, and in avoiding this outcorhe it seems 
to me essential that we take advantage of the 
systemic or functionalist perspective, which histor- 
ically has been our most successful theoretical 
paradigm. Namboodiri's discussion of “power as a 
system property" relies heavily on the macrosocial 
traditions of functionalist/systemic theory. I be- 
lieve that most of the essential elements of the 
macrosociology of power and conflict are impticit 
in the following illustration (Schmidt-Nielsen 
1979, p. 481), which happens to be a descrip-ion 
of a bedroom microsocial process drawn from a 
book on animal ecology: 


Assume that a husband and wife have separate electric 
blankets, each thermostatically controlled through 
negative feedback. Say that the husband prefers a 
rather cool blanket, and the wife a higher temperamure. 
Let us now assume that the thermostats inadvertently 
get interchanged. The husband sets “his” thermos:at at 
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his preferred low temperature, and his wife, who now ' 
finds her blanket cooler than she likes, turns up “her” 
thermostat. The husband soon finds his blanket much 
too warm and turns “his” thermostat further down, 


whereupon the wife tums “hers” up even more. This > 


is positive feedback, in which a deviation leads to an 
ever-increasing augmentation of the deviation. 


Positive feedback explosions—what Marxists call 
contradictions —must eventually run up against 
limits defined either by the operation of negative 
feedback or by a qualitative transformation of the 
system. 

- In brief, the "atheoretical" label must be 
rejected. Demography and human ecology should 
stop worrying about their alleged theoretical 
shortcomings; Namboodiri exemplifies a self- 
flagellating tradition that cannot be justified. Along 
with transition theory, decision theory, and 
"institutional" theory—all of which Namboodiri 
enumerates and discusses— we have the grand 
Malthusian traditions, which still flourish from 
time to time (as in the Club of Rome world model), 
we have the countervailing Marxist traditions, we 
have the Marxist functionalism that I try to capture 
in the work mentioned above and that Harris (e.g., 
Harris and Ross 1987) and other ecological 
anthropologists capture elegantly in the concept of 
` “cultural materialism,” and finally we have the 
powerful stable population model of Alfred Lotka 
et al., which provided the mathematical basis for 
the abstract concept of a population. I don’t know 
of any other social science field that has this degree 
: of theoretical vitality. I believe that the stable 
population model (as Namboodiri alludes to it on 
p. 628), placed within a functionalist/systemic 
framework, is a uniquely powerful theoretical and 
methodological armamentarium. In fact, I would 
be willing to argue that the stable population model 
and the. general linear model (GLM) of statistics 
are the theoretical bedrock of the contemporary 
Social sciences; the rest is mere commentary. I 
' would also argue that it was Malthus, along with 
Adam Smith, who provided the first highly 
developed functionalist/systemic theory, a theory 
that was to serve as a major inspiration for both 
' Darwin and Wallace, the founders of functionalist/ 
systemic theory in biology, and for J.B.S. 
Haldane, the founder of Marxian functionalism in 
biology. For the record it should be noted that 
Haldane's work, emphasizing catastrophism, has 
been extended by studies of the interaction 
between the earth's biosphere and its atmosphere 
(United States Senate 1988). 

It may seem unusual, even offensive, to claim 
that a pair of elaborate deductive models, sans 
"content," are the theoretical bedrock of the 
discipline. I make the claim, however, for several 
interrelated reasons. First, the best logic textbooks 
to which I've been exposed since my undergradu- 
ate days consistently make the claim that scientists 
are people with expertise in semantics, syntactics, 
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and empirics, and the theoretical foundations of 
our science — such as they be— must therefore have 
semantical, syntactical, and empirical components. 
Second, stable population analysis and the GLM 
are obviously among the most highly developed 
deductive (syntactical) models available to social 
scientists, and yet one cannot use these models at 
all unless one is willing to define clearly the’ 
concepts and variables that provide the essential 
content. Third, it is always taken for granted that | 
the major reason why we run hundreds of ` 
experiments with these models, finally publishing 
only a handful oi equations,! is that. we have 
attempted to identify those formulations that have a 
reasonable "fit" to the empirical world. Fourth, it 
is taken for granted by advocates of POET—for ` 
example, Duncan—that reciprocal causal interac- : 
tion occurs among the four sets of variables that 
compose this paradigm, and when one conducts 
multivariate causal analysis with reciprocal interac- 
tion and feedback, and with an eye tóward social 
selection (i.e., stable population processes) as a 
fundamental variable, one is by definition conduct- 
ing functionalist/systemic analysis. — - 

A few years ago I had the delightful experience 
of arguing for a few days with Bruce Mayhew about 
what I saw as his overkill attitude toward social 
psychology. I think, however, that his argument 
was essentially correct, and I also think that Nam- ` 
boodiri is right when he says that we should not be , 
required to "pay attention to intraindividual psy- 
chological processes.”. On the other hand, when 
Harris and Ross (1987) argue that far more fertility 
control has existed historically than we ever envi- 
sioned within transition theory or Malthusian the- 
ory, and that much of this effort involved people 
who were practicing infanticide, who were bringing 
about excess female mortality, and so on, without 
knowing what they were doing, I should like very 
much to verify the existence of this pervasive ig- 
norance and the distorted perceptions that allegedly 
underlie it. In short, I have no quarrel against'the 
idea of subsuming social psychology. As Westoff 
and Rindfuss (1974) once pointed out, when a new 
technology such as.sex preselection of offspring | 
becomes available, one of the most important ques- 
tions is whether or not people want to use it; such 
questions, to Mayhew, were more or less oüt of 
bounds. 


MICHAEL A. FAIA is Professor. of Sociol- 
ogy at the College of William and Mary. His . 


! A generation ago Blau and Duncan (1967, p. 
18) were placed in the difficult situation, avoidable 
in these times of powerful computers, of having to 
guess a priori "just which tables were wanted, out 
of the virtually unlimited number that conceivably 
might have been produced”. for a major work on 
occupational stratification. 
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teaching, research, and writing delve into and 
span the following questions: (1) Where do 
babies come from? (2) How do machines, 
such as computers, form our minds and our 
behavior, and vice versa? (3) How can 
science and art conspire to transcend science 
and art? Faia believes that he is making 
inexorable progress. So do his babies. 
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REPLY TO FAIA 
(ASR, this issue) 


KRISHNAN NAMBOODIRI 
The Ohio State University 


As I see it, the main thrust of Professor Faia's 
comment on my article is that the case I outlined in 
behalf of ecological demography would have been 
stronger if I had emphasized certain features such 
as (a) the prevalence of a high degree of consensus 
among scholars regarding the scope and coverage 
of demography and ecology, (b) the presence of a 
number of deterministic models in demography, 
and (c) the stable population model, a partnership 
between which and the general linear model of 
statistics would provide, in Professor Faia's 
opinion, a theoretical bedrock of contemporarv 
social sciences. Professor Faia hints also to his 
conviction that what he calls the functionalist/ 
systemic tradition provides a powerful theoretical 
framework to tie many things together. 

I do not expect al! scholars to emphasize the 
same features when they make their case for 
ecological demography. If I were fo develop the 
case, I would emphasize point (a) mentioned 
above but not put too much emphasis on (5) or (c). 

To me, the presence of deterministic models in 
any social science discipline does not make the 
discipline superior to others with no deterministic 
models. As I see them, deterministic models are 
convenient abstractions, often extremely oversim- 
plified to permit mathematical manipulations (as 
illustrated by the population interaction model 
referred to in my article). In my view, determinis- 
tic models are imaginary experiments, high on 
internal validity but very low on external validity 
This does not mean that deterministic models are 
useless. Mathematical economics, for example. 
has contributed substantially to the development o£ 
economic theory. 

To avoid misunderstanding, here is the distinc- 
tion I emphasize between deterministic and 
stochastic models. Suppose the members of £ 
homogeneous population of size N are exposed tc 
risk, 4, of occurrence of a given event. A 
stochastic model for the number of events actually 
occurring in the population treats the number of 
events as a random variable with properties such as 
that its expected value is Na. A stochastic model 
for the outcome of an experiment in which a fair 
coin is tossed 10 times recognizes that the number 
of heads may be 0, 1,..., or 10, each with a 
specified probability. A deterministic model for the 
outcome of the experiment would focus on the 
expected number of heads, namely, 5. For this 
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reason, deterministic models are often referred to 
as expected-value models. In demography, most of 
the population projection models are of this latter 
kind. I consider it a healthy sign that recently 
demographers have begun to pay increasing 
attention to the stochastic counterparts of many of 
the deterministic models such as the component 
model for population projection. 

Professor Faia calls an exhaustive listing of the 
components of a change a deterministic model. I 
do not find any particular gain in calling the 
components of a change the determinants of the 
change. In the same vein, while the size of 
tomorrow’s world population will be exactly equal 
to the size of today’s world population plus the 
number of births minus the number of deaths in the 
interim, any attempt to model this relationship, 
giving due attention to questions such as why the 
numbers of births and deaths are what they are, 
would require dealing with measurement errors, 
equation errors, and errors due to excluded 
variables. It is in connection with the handling of 
such errors that I find models that have random 
elements in them more attractive than those 
containing no random elements, Similarly, while it 
is true that the number of births in a population is 
the combined results of sexual unions, gestation, 
and successful parturition, when developing a 
model that purports to explain the number of births 
during a period one has to pay attention to the 
determinants of sexual unions, gestation, and 
successful parturition, and I find stochastic models 
much more attractive than deterministic models in 
this connection. 

Professor Faia would forge a partnership be- 
tween the demographic stable population model 
and the statistical general linear model with a view 
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to building a theoretical foundation to contempo- 
rary social sciences. In his view, given this 
theoretical bedrock, “the rest is mere commen- 
tary.” Since Professor Faia does not develop this 
contention, it is not possible for me to give an - 
informed reaction. The following observations 
seem pertinent, however. 

(1) There is no denying that, although, strictly 

speaking, few real populations meet the assump- 
tions of stable populations, the stable population 
model, which is treated as a deterministic model in 
most of the literature, has shed light on many 
relationships (e.g., the relative impacts of fertility 
and mortality on age structure). Interestingly, a 
more general population model that has come to be 
known as the "variable r model" seems to have 
replaced the stable population theory in many of its 
uses. . 
(2) In social science disciplines, when develop- 
ing any stochastic model, it is a common practice 
to devise methods for fitting the model and for 
testing its adequacy. From this point of view, it is 
clear that statistical estimation theory plàys an 
important part in model building in social sciences. 
It is not clear to me, however, why professor Faia 
has chosen to emphasize only a part of statistical 
estimation theory, namely, the general linear 
model. i 

Turning to Professor Faia's preference for the 
functionalist/systemic theory, there is little I wish 
to add to what I said in my article. If Professor 
Faia's functionalist/systemic framework is differ- 
ent from ecological demography, I would like to 
see a comparison of the two frameworks on how 
they handle the core of sociology, assuming that 
some consensus exists regarding what constitutes 
that core. 
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EDITOR’S COMMENT 


On Late Reviews: A Desperate Appeal for 
Prompt Reviews 


The editors of ASA journals complain that 
manuscript reviews seem to be returned later 
and later. ASA policy asks reviewers to return 
manuscripts immediately or review them 
within three weeks of their receipt. Despite 
our obvious irritation with late reviewers, 
editors have produced little data on the 
problem. This editorial provides some prelim- 
inary data. I ran two analyses of the number 
of days reviewers took to return a manuscript 
unreviewed or reviewed. The first analysis of 
98 cases consisted of all articles accepted or 
conditionally accepted during 1988. The 
second analysis was a systematic sample of 
90 cases of refusals during the same time 
period. 

A late review was defined as one that took 
more than 28 days from the time the paper 
was sent to the date that ASR received a 
communication from the author. This period 
provides a week for mailing and three weeks 
for reviewing. For most items we examined, 
the data for accepted and rejected manuscripts 
are remarkably similar. Thus, almost half of 
the reviews were returned on time for both 
types of decisions. Put the other way around, 
slightly more than half of the reviews were 
late. How late? Slightly more than a tenth 
took more than twice the allotted time; i.e., 
more than 56 days. The record time for 
‘accepted manuscripts was 112 days and 119 
days for rejected ones. Combining both 
analyses, six reviewers never responded even 
after six reminders and appeals. Women were 
slightly more prompt than men in reviewing 
manuscripts. 

For accepted manuscripts, almost 8 percent 
of the reviewers returned them without 
review; the figure was twice as high for 
manuscripts that were rejected. Apparently, 
some reviewers take a quick look at manu- 
scripts and decide whether they are worth 


reviewing. Understandably, more reminders 
were required for manuscripts that were 
acceptable than were rejected; about 2:1.5. 

Almost one-quarter of the late reviewers 
had to be telephoned at least once to produce 
a response. HG kept a paper 119 days and 
then decided he did not have time to review 
it. AR did not bother to respond despite three 
reminders and three telephone messages. 
Ditto for MA. JG returned a manuscript 
without a review or explanation soon after 
ASR had published his paper! 

How much does late reviewing cost ASA? I 
estimated that, on the basis of 400 papers a 
year and three reviewer contacts per paper, 
ASR sent out about 800 reminders a year, 
exclusive of telephone calls. If each reminder 
cost a dollar in time, stamps, and paper, this 
sum, plus the cost of reproducing manuscripts 
and telephoning, ASR spends well ove: 
$1,000 a year for late reviews. This does not 
include the costs to authors who phone and 
write about the status of their reviews. Nor 
does it include the cost in time to the editor. 

Late reviews are the most irritating aspects 
of running a journal. Editors are very grateful 
to the reviewers who responsibly carry out 
their professional obligations. However, the 
erosion of professional norms has gone 
beyond reasonable bounds when over half of 
the reviewers need to be reminded to get their 
work done on time. It is time for self 
appraisal: we have identified the virus and it 
is us. Most people have a pretty good idea of 
their work habits. I suggest that when you 
receive a paper from ASR, confront yourself. 
Ask yourself, "Do I want to review this 
paper?" If no, retum it at once. If yes, place 
the date on your calender when you should sit 
down to do the review. Two weeks at the 
most. Then do it. Damn the excuses. Put 
yourself in the shoes of poor author whc 
expects a reasonablv prompt review, and act 
as a responsible colleague. You'll feel better 
for it. And so will we. 


AHISTORICISM IN TIME-SERIES ANALYSES OF HISTORICAL 


PROCESS: CRITIQUE, REDIRECTION, AND ILLUSTRATIONS 
FROM U.S. LABOR HISTORY* l 


LARRY J. GRIFFIN 
Indiana University 


Larry W. ISAAC 
Florida State University 


Most time-ordered analyses of historical processes are rendered “ahistorical” 

because the premises that direct quantitative inquiry obscure changing historical 

relations through time. These conventional practices stem from three problematic 

presuppositions: (1) the separation of theory and history, (2) ahistorical 

conceptions of time, and (3) methodological autonomy and the primacy of the 

theory of statistical technique. We illustrate how these premises shape 

sociological-historical investigation by examining several quantitative historical 

analyses of labor organization and conflict. Reanalysis of the same historical. 
series with “temporally moving covariance” analysis demonstrates theoretically 

important historical contingency as well as the error of presupposing temporal 

homogeneity in historical relationships. Our critique of conventional practice and 

our reanalysis enable us to pose alternative premises to ground quantitative , 
historical research in the interdependence of history and theory, in historical 

conceptions of time, and in the historicization of quantitative methodology. : Our . 
redirection aims at moving us closer to historical “actuality,” that is, to historical " 
relations and processes, when we use time-series data and analytic procedures. 


INTRODUCTION 


Time-ordered quantitative examinations of 
macrosociological processes and historical 
change have important explanatory advan- 
tages over the use of cross-sectional data and 
procedures. But many quantitative analyses of 
historical series fail to realize their potential 
because they are stamped by a series -of 
interrelated pre-theoretic and meta-methodo- 
logical presuppositions which are themselves 


* We share fully and equally in the production 
and authorization of this paper. Many colleagues 
commented on various versions of this paper, and 
for their labors we express our appreciation to G. 
Alter, R. Boylan, S. Carlson, J. Devine, C. 
Eavey, M.'Hardy, L. Hazelrigg, D. Heise, R. 
Hodson, G. Hooks, K. Leicht, P. O'Connell, J. 
Orcutt, J. Quadagno, L. Raffalovich, R. Rubin- 
son, K. Schuessler, S. Stryker, P. Thoits, D. 
Zaret, and all of the participants of the Political 
Economy Workshops at both Florida State and 
Indiana Universities, especially D. James, H. 
McCammon, and P. Walters. Finally, the paper 
benefited considerably from three reviewers and 
the editor of ASR. Griffin acknowledges support 
from the National Science Foundation (SES 
8609459) and the Indiana University Division of 
Research and Graduate Development. None of the 
above is in any way responsible for the opinions 
expressed by the authors. 
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“ahistorical.” These premises (1) separate 
theory from history, (2) use ahistorical 
conceptions of time, and (3) presuppose 
methodological autonomy and privilege statis- 
tical theory over historical process. Conse- 
quently, they severely denude most quantita- 
tive time-ordered inquiries of their historical 
grounding. No sociological account, however 


theoretically or technically sophisticated, is 


valid if it denies or masks important features 
of historical reality. 

Our claim that much conventional quantita- 
tive time-series research is “ahistorical” rests 
largely on the fact that critical contingencies 
of social change, understood as the sudden or 
gradual temporal conditioning of historical- 


structural relationships (Duncan 1975, p. 57), _ 


are generally ignored in quantitative explora- 
tions of historical processes (Hernes 1976, p. 
521).| This is true at least for -every 


! Our argument pertains to any research design 
where the data are primarily organized by time and 
where time is integral to the analysis, such as 
time-series regression and event-history analysis. It 
is directed not at technique per se but rather with 
its use, and here the communalities among all . 
time-ordered statistical techniques far outweigh 
their statistical and philosophical differences. We 
deliberately exclude quantitative historical analyses 
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time-series study we have examined in 
sociology, economics, and political science, 
in a wide variety of substantive areas (e.g., 
. fiscal policy, macroeconomic performance, 
educational output), and using several differ- 
ent statistical procedures, from regression to 
event-history analysis and spectral analysis. 
Considering the dozens of analyses of labor 
organization and conflict (the substantive area 
we use for illustrative purposes), we found 
only one that systematically assessed the 
temporal stability of time-series parameter 
estimates (Rubin, Griffin, and Wallace 1983), 
only a handful more that attempted serious 


historical periodization (e.g., Snyder 1975; 


Edwards 1981; Sheflin, Troy, ànd Koeller 
1981), and very few that explicitly offered a 
theorized rationale for choosing the starting 
and ending dates in the analysis (e.g.; 
Edwards 1981; Griffin, Wallace, and Rubin 
1986). We found no study which did all three. 
These and other seemingly innocuous lapses 
belie a much more profound problem: the 
temporal dimensions of concrete historical 
processes—their timing, periodicity, tempo, 
duration, and trajectory (Aminzade 1989)— 
are seriously undertheorized and underana- 
lyzed in conventional time-series research. ' 

The ahistoricism of many quantitative 
analyses of historical data is consistent with 
Tilly’s (1981, p. 7) conclusion that "in 
important ways sociological practice comes 
down to failed history" and is at the root of 
Skocpol's (1987, p. 20) observation that 
"most historical sociologists have been non- 


quantitative or even 'anti-quantitative' in their 


orientation." These judgments do not appear 
to be based simply on the omission of 
historical detail in order to increase theoreti- 
cal generality. While quantitative practice too 
often excludes the historical, it also necessar- 
ily produces a “history-as-analyzed,” an 
analytic mirror of the historical materials 
examined. Although distortions are inevita- 
ble, some reflections extend our understand- 
ing of historical process itself. But distortions 
induced by conventional quantitative (and 
qualitative) approaches to historical data can 
be so gross as to make history virtually 
unrecognizable and the resulting theoretical 
interpretations of it seriously misleading. That 
surely is both "failed history" and failed 
` sociology. 





and designs using cross-sectional results from two 
or more time points from this discussion. 
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Serious discussion of these (or similar) 
practices is both infrequent and generally 
limited to scholars who do not use quantita- 
tive procedures extensively (Thompson 1978; 
Zaret 1978) or to historiographers and philos- 
ophers of history (Fischer 1970; Knapp, 
1984; Mandelbaum 1977; Murphey 1973). 
Conventional practices and assumptions are 
thus seldom examined by quantitative ana- 
lysts in the context of actual research practice 
or in the development of time-series method- 
ology.? Thus, these strategies have become 
institutionalized, portraying and projecting 
"correct" quantitative research practice. Crit- 
ical assessment and revision of some of the 
basic presuppositions that shape time-ordered 
analysis are mandatory if quantitative inquiry 
is to generate better historical sociology and 
social theory. 


CONVENTIONAL PREMISES AND 
HISTORICAL TIME-SERIES ANALYSIS 


Separation of theory and history 


This stance argues that theory and its 
derivative hypotheses are intellectually sepa- 


.rate from and independent of history and 


historical data (cf. Carr 1961; Jones 1976; 
Zaret 1978). Lipset (1968) represents this 
view perfectly. After first stating that, “from 
an ideal-typical point of view, the task of the 
sociologist is to formulate general hypotheses 
. . . and to test them" (p. 22), he asserts that 
these issues "are clearly not the problems of 
the historian. . . . [who] must remain close to 
the actual happenings and avoid statements 
which . . . lead to a distortion in the description 
of what occurred" (p. 23; our insert and 
emphasis). Thus, analysts can adjudicate 
between or among conflicting theories, ac- 


? A partial exception to this claim is the 
econometric literature on models incorporating 
*time-varying parameter" structures. Briefly, these 
techniques are used both to ascertain the validity of 
the temporal homogeneity assumption required by 
classical regression and in the identification and 
modeling of "shift" and "switching" points 
inducing or reflecting different structural "re- 
gimes" governing causal processes (Quandt 1958; 
Brown, Durbin, and Evans 1975; Judge et al. 
1985). Even here, though, the emphasis is mostly 
on econometric theory, and the discussion gener- 
ally is far removed from historical process or 


inquiry. 


s 
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: TIME-SERIES ANALYSES OF HISTORICAL PROCESS 


PE and „explanations of the historical 
record, and/or use history to test or elaborate 
theory, because of two implicit “indepen- 
dence” assumptions: theory is uncontami- 
nated by history, and history is itself not 
“distorted” or mediated by theory (cf. McCul- 
lagh 1984). History therefore becomes some- 
thing "out there," needing no explicit theori- 
zation, ‘and, to the quantitatively inclined, is 

` often represented by time-series observations 
às the data necessary to test theory. This prac- 
tice begins with the abstract (the “theoreti- 
cal") and then confronts time-ordered data on 
the past (the “historical”) in order to validate 
or falsify theoretical statements. 

This premise is essential to those who, like 
Hannan and Freeman (1987, ` 1988) and 
Freemah and Brittain (1977) in quantitative 

: labor history, use historical sequences to test 

a deductive general theory of wide scope. The 

prime purpose of the engagement between 


theory and history in Hannan and Freeman’s 


event-history analysis of union foundings and 
mortality from 1836 to 1985, for example, 
explicitly is not to explain the substantive 
historical issue under investigation. Because 
unions share certain causal "regularities" 
with "other kinds of organizations," their 
analysis therefore is thought to "illuminate 
basic processes of organizational ecology" 
(Hannan and Freeman 1987, p. 914; see also 
Freeman and Brittain 1977, pp. 173, 184). 
` Hence, Hannan and. Freeman's analytic ques- 
tion and motivation are general-theoretic: 
“Can populatiori- -ecology arguments be ap- 
plied fruitfully to such organizations?" (1987, 
pp. 911-12), and “Analysis of unions is one 
way to explore the limits on the scope of 
ecological theories of organizations" (p. 
912), as is their answer: "analysis of founding 
rates is a useful way to explore the population 
ecology of organizations" (p. 940). Thus, 
this use of population-ecology theory is 


thought to need no historicization because its 


explanatory ‘framework is internal to the 
sath ! Not. surprisingly, therefore, their 





5 Indeed, historical explanation is riot possible 
with the exclusive use of deductive theories: 
“theories may be deductive, but explanations are 
not, for explanations are concerned with particular 
causes and particular effects” (Maclntyre 1976, p. 
: 152). Explanation requires “concrete description 
of [a] particular occurrence" before whatever 
theoretical generalizations that may: exist can be 
applied (Manes 1977, p. 103). 


` 
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research enterprise— from tnd operationaliza- 
tions of concepts through equation specifica- 


- tion and estimation and substantive inference— 


* 


` is primarily grounded in the general, ahistorical 


postulates of organizational demography. r: rather 
than in the concrete history of trade unions. 

Finally, while some union history i is provided 
by Hannan and Freeman, “the history of the 
whole population” (1988, p. 34) is largely 
presented as a quantitative count of the 
number of union foundings or disbandings 
over the last 150 or so years. As the most . 
important dimension of Lipset's "description 
of what occurred," these time series, rather 
than less quantitative historical information, 
are assumed the appropriate terrain of theory 
testing or elaboration. 

There are two crucial implications of the : 
first premise. First, history is reduced to 
“data”, for the testing or elaboration of a. 
theory ‘not at all constructed to explain the 
particular historical case under study (Tilly 
1981). ‘Theory therefore is not historicized, 
and history, particularly its “time-conditioned- 
ness": and temporal contingencies, is not. 
theorized (Spiethoff 1952). Second, and B 
relatedly, this premise permits an ahistorical- 
justification condition that is very widely used’ 
in all theoretical research, theoretical consis- 
tency, to validate what may be dubious or 
even false historical peeecalizations anc 
inferences.* 


44 
‘ 


Ahistorical time 


This practice constructs “time” to be a linear 
organizing device for a sequence of events or 
an incremental counter “marking time” in 
equal units as in clock or calendar time. 

^ Because history and theory are, a priori, 
conceived to be independent ünderstandings, then 
that part of history not pertinent to a particular 
theory, or tested statistically nd discarded, has no 
relevance for the theoretical account that is 


- offered. Some historical processes therefore are 


never analyzed, or if analyzed'are not theorized in 
a manner consistent with the relevant historical 
record, because those facts are not pertinent to the 
general ahistorical theory being tested. Thus, even 
if inferences about those ignored or dismissed 
aspects:of history are incorrect, and because of this 
much of the entire accouht is questionable 
(McCullagh 1984, p. 12), this ‘historical falsity has 
no bearing on justification by theoretical consis- 
tency as long as the theoretical explanation that is 
generated remains consistent. 


876 : uu 
Ahistorical time typically is qualitatively 
undifferentiated time, a smooth homogeneous 


magnitude viewed as. external to the events . 


and relationships .of history, except as an 
index of chronology (Bloch 1953). One of the 
: most consequential ways in which the premise 
. of ahistorical time is manifest is in the 
emphasis on the continuity of history or the 
history of continuity. (cf. Fischer 1970; Jones 
1972). Here history is conceived to be the 
continual unfolding of the same underlying 


historical communality. Such a preoccupation, 


with the continuity between past and present 

.tends to homogenize or average away the 
‘difference between “then and there” and 
“here and now.” History therefore is often 
premised in time-series designs to be a 
seamless sequencing and, as we will show, is 
quantitatively reproduced as homogeneous 
relations in homogeneous time. When com- 
bined with the premise of the independence of 
theory and history, the presupposition of 
historical continuity results in general theoret- 
ical accounts which stabilize or “domesti- 
_cate” the temporal contingencies of history by 
“constructing arguments with presumed appli- 
cability across different historical contexts 
contained within a single period of time 
(Moore 1958; Zaret 1978). The construction 
and estimation of such quantitative models 
thus understood lead to parsimonious causal 
generalizations implying that the coefficients 
attached to the exogenous or predetermined 
variables capture, for the time period sub- 
sumed by the estimation, ‘time-invariant 
relations which reflect some “historically 
general” process. 

Ahistorical time and historical generality 
characterize not only generál-theoretic analy- 
ses, but also some studies addressing a 
concrete: historical issue theoretically. Thus, 
for example, Ashenfelter and Pencavel's 
(1969) path-breaking time-series regression 
ahalysis of historical fluctuations in union 
membership explicitly is motivated by a 


historical particularity, “the determinants of .. 


the growth of American trade union member- 
ship in the period 1900-1960” (p. 434). But 
the authors also consciously reject histori- 
cized theoretical arguments for the waxing 
and waning of union membership, labeling 
them "ad hoc" (p. 434), and develop instead 
an implicitly transhistorical rational actor 
'theory of unionization. Thus, to the extent that 
historical periodization or disjuncture is at all 
recognized in these and similar studies, it is 
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typically modeled only as an additive influ- 
ence, as in Hannan and Freeman’s (1987, 
1988) “period effects” reflecting different 
organizational and legal arrangements, or it is 
used to test, as in Ashenfelter and Pencavel 
(1969, pp. 445-47) and Freeman and Brittain 
(1977, pp. 180-83), the retrodictive utility of . 
the “historically general” estimates assumed 
stable through time. : 
The parameter estimates produced ty 
Hannan and Freeman and Ashenfelter ard 
Pencavel, again among many others, are thus 
constructed to apply. to the entire sets of 100 
to 150 years and 60 years, respectively, 
through, that is, war and peace, depression 
and prosperity, the passage of the Wagner Act 
and its partial dismemberment by the Taft- 
Hartley Amendment, legal and employ repres- 
sion of unions and legislative and employer 
tolerance or even encouragement, waves of 
immigration and alien quotas, and times of 
labor insurgency and labor quiescence. Are 
there theoretically meaningful historical dis- 
continuities in the processes governing union 
founding and disbandings since 1836? Have 
workers attached identical weights to the 
same indictors of the utility of union 
membership and have they translated these 
calculations into action at identical rates 
throughout the differing historical contexts 
identified above? These questions remain 
unposed, untheorized and unanalyzed, as- 
sumed away by the seamless generality of 
continuous chronological time. The uncritical 
acceptance of this premise thus lends weight 
to the practice of treating time-ordered daca, 
and the estimated empirical relations from 
them, as temporally stable. and nondifferenti- 
ated. As a consequence, the forced homogene- 
ity of quantitative relations thus also sup- 
presses temporal “signals” in changes in 
historical processes. D 


Methodological autonomy and the primacy 
of the theory of technique 


The final premise, is another intellectual 
division of inquiry and also provides the basis 
for ahistorical justification. Just as theory and 
history are practiced as separate, and brought 
together only with clear understandings of 
their specialized function and the limits of 
their reciprocal engagement, so, too, with 
quantitative methodology and history. Thus, ` 
the use of time-series techniques typically is 
governed by statistical-theoretic criteria inter- 
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nal to. the technical procedures themselves 


rather than by the logic of the substantive 
theoretical and historical question under 
analysis. There are several indications of this. 


Statistical significance is, ceteris paribus, a: 


positive function of the number of observa- 
tions analyzed, and the general inclination in 
the social sciences to present only “signifi- 
cant" findings pushes the analyst to increase 
the number of time points analyzed. Some- 
times this occurs even at the expense of the 
motivation for the analysis. Edwards (1981), 
who generally displays admirable sensitivity 
to historical context, for example, delineates 
‘two periods of strike activity in the pre-World 
‘War II years, 1881-1910 and 1900-39, and 
then estimates parallel regression models in 
each of these sets of years in order to capture 
period-specific causal processes. In justifying 


the confounding decade-long overlap (1900— 


10) inducing “contaminated” period-specific 
coefficients of dubious utility, he states (fn. 
3, p. 54) that “the overlap . . 
obtain a period long enough for statistical 
analysis to give significant results." 

Long series are also thought advantageous 
for two additional reasons rooted in the 
statistical properties of time-series estimators. 
First, such coefficients are often thought to 
bave statistically desirable properties only if 
they are *asymptotic." Asymptotic estimators 
are "consistent" and "efficient" as the 
number of temporal observations in the series 
approaches infinity. Second, long series, 

` perhaps even the entire known series, may be 
thought necessary to avoid selectivity bias 
induced by the analytic omission of units of 
the relevant population. Hannan and Freeman 
(1988,. p. 34), for example, explicitly’ invoke 
this statistical rule as partial justification of 
their decision to estimate equations covering 
the entire 100- and .150-year periods at their 
disposal. And in McCleary and Hay's (1980, 
p. 20) exposition of ARIMA models, they 
caution readers that "No time series that we 
analyze in this volume is shorter than 50 
observations long" (emphasis in the original). 
Formalistic adherence to such methodological 


strictures can lead to the belief that one must: 


have either a certain (but often arbitrarily 
defined) minimum number. of observations 
(beyond the limiting case) or all of them in 
order to estimate an equation or obtain 


statistically significant estimators. Thus, the , 


theory of technique may limit or even 
preclude the examination and explanation of 


.'is needed to` 
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* historical conditioning requiring aralyeis over 


much: shorter periods of time. 
. A second implication of this premise is that _ 
the criteria of statistical theory, including - 
statistical significance, are very often used as a 
justification condition in historical inquiry.: 
We saw this in Edwards (1981)-and Hannan 


. and Freeman (1988), who validated particular 


historical findings and aspects of their designs 
by statistical criteria. One of the most 
Sophisticated statistical justifications of both 
historical design and generalization is found 
in Ashenfelter and Pencavel (1969, p. 445). 
They acknowledge that "traditional analyses" 
identify "special" periods in union history— 


the union membership growth years of ' l 


1917-20, 1933-37, and 1941-45, and the 
decline during 1920-24—but then go on to 
justify their decision to analyze in a single 
equation all of the observations in the entire 
60-year period by the accuracy, of the 
statistical fit between the predicted series 
produced by the historically general estimates 
and the actual patterns of union growth in 
these historically "special periods." Thus, we 
see again how a conventional premise— here, 
the privileged status of statistical tbeory— 
spawns the ahistorical. justification of what . 
may be dubious historical statements. 


THE COMBINED REINFORCEMENT 
AND CONSEQUENCES OF . 
CONVENTIONAL PRACTICE 


These premises tend to reinforce each other so 
that in practice their combined power to shape 
an analysis is great indeed. The a priori of the 
continuity of history, for example, is well 
served by the a priori that history and theory 
are independent (thus historical time is not” 
theorized) and by the practice of placing a 
premium on the: statistical significance and 
properties of the estimators obtained with 
very long series. Why assess the possibility of 
historical discontinuities if either gradual 
changes in time-series estimates or clearly. 
demarcated historical periods have not been ' 
theorized? Why, moreover, “carve up? his- 
torical series only to reduce the significance 
of already statistically “dubious” coefficients 
estimated from relatively short series? The a 
priori that history does not require theorizing 
is more readily preserved given the premises 
of temporal homogeneity and the privileged ` 
status of statistical theory. Why, for example, 


attempt to theorize the appropriate starting or 


! 
; à 
878. | 
ending dates of a time-series analysis, or why 
theorize a historical periodization, if all or a 
minimum number of observations are thought 
necessary for analysis or if history is indeed 
seamless. RE 
Virtually every’ ‘particular of conventional 
time-series research is shaped by these 
historically limiting premises: (1) the implicit 
assumption of equi-temporality of the elemen- 
tal time unit used to organize the observations 
in the historical series according to some 
homogeneous counter (months, quarters, years, 
etc.), thus possibly fragmenting historical 
processes spanning two or more observations 
and/or operating in two or more temporal 


units simultaneously; (2) the practice ‘of . 


estimating equations from a historical time- 
series with missing observations (as in 
Freeman and Brittain 1977), thus denying the 
(possibly disjunctive) connectedness of the 
historical process;* (3) the use as a statistical 
control of an atheoretical linear time trend on 
the right-hand side of the equation to be 
estimated (as ‘in Snyder 1975), thereby 
circumventing explicit theorization of the 
historical movement of the series; (4) the 
assumption that time-series: coefficients are 
constant for the entire (and deliberately long) 
time-period over. which the equation ! is 
estimated, thus a priori precluding the search 
for temporal conditioning of parameter esti- 
mates; (5) the practice, typically determined 
by data availability, of “slicing into” history 
8t virtually any convenient point and ending 
at another convenient point (as in Edwards 
. 1981, p. 54) even if, historically, the 
temporality of the process under examination 
differs from the happenstance of convenience; 
. (6) the measurement and scoring of indepen- 
dent variables so that both their increases and 


5 An observation in a time series is fundamen- 
tally unlike an observation in most cross-sectional 
designs. Omitting, say, France from a cross- 
sectional analysis of countries would limit the 
generalizability of one's results, but it would not 
necessarily destroy the integrity of the more 
limited results that were generated. The other 
nations in the study are not ordered in the design so 
E France's absence matters to analysis of the 

remaining nations. (The exception here is those 
. very rare cross-sectional designs exploring spatial 
: effects or spatial correlations.) But to omit, say, 
. the year 1922 and analyze a series containing 
1900-21, 1923-39 simply because data for 1922 
are missing is somehow to argue that 1923 could 
have existed without 1922, a historical absurdity. 
: l i i 
; 
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decréases are included in a single continuous 
variable, thereby forcing the relationship 
between the dependent variable and increases 
and decreases in the independent variable to 
be identical even if „this contradicts tne 
historical movement and behavior of series; 
(7) the. use of the coefficients from a 
time-series model estimated over one stretch 
of historical data to "forecast" or “predic:” 
values of the dependent variable ‘for subs2- 
quent points in time without first checking for 
structural change; (8) the use of theoretically 
general concepts and "timeless" operational- 
izations stripped of adequate grounding in the 
particular:historical process under examina- 
tion;® and (9) the exclusion. from further: 
consideration of variables deemed ;"insignifi- 
cant" by some statistical criterion even if the 
relevant historical record is (perhaps infre- 
quently) clear as to their substantive import. 

Through all of these practices, “historical 
time" and its contingencies. are rendered 
increasingly atheoretical or ahistorical in 
whatever analysis is performed, évidence 
adduced, or inference reached. 


DOCUMENTING HISTORICAL, |. , 
CONTINGENCY IN THE -— 
STRIKE-UNIONIZATION RELATION 


Our charge that tvpical presuppositions of 
time-series research reduce the “historical” to 
the "ahistorical" cannot rest on critique 
alone. We must also show empirically that 


, historical processes and series similar or 


identical to the ones analyzed: by those we 
cited as adhering to conventional practice do, 
in fact, contain important temporal disconti- 
nuities. We explore this possibility by using 
much of. the same data and measures as 
Ashenfelter and Pencavel (1969), Edwards 
(1981), and Hannan and Freeman (1987, 
1988) to examine the simple bivariate relation- 
ship between changes in strike frequency 
(CH.FREQ) and three indicators of uniom 
viability, GROWTH in membership and the 
numbers of union FOUNDINGS. and DIS- 


© “Beyond a certain point abstraction robs the- 
facts of all historical reality. What gives facts of 
history . . . their worth as evidence, and their value 
for causal analysis, is their existence, their tangible 
and verifiable reality. Only tangible and concrete 
phenomena can be fitted in a social setting and 
demonstrated as a link in a chain of causation” . 
(Postan 1971, p. 31). 
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BANDINGS, for almost 100 years, from 
1882 to 1980. Union GROWTH is operation- 
alized as the annual percent change in the 
number of union members. Strike frequency 
is defined as the number of strikes per 10,000 
employed nonagricultural labor force partici- 
pants.’ 

Our analysis is a correlational variant of 
“moving regressions” (Brown, Durbin, and 
Evans 1975), first-used (to our knowledge) by 
Quandt (1958). In “moving regressions” a 
time-series regression model is fitted to a 
portion of the entire series; the estimation is 
then “moved” along the series, so that the 
next regression is based on another segment 
of data differing from the preceding one only 
by a few data points, often those immediately 
preceding and/or following the original data 
segment. We follow a similar strategy in this 
example, using in this simple exposition 
correlations and covariations in lieu of 
estimates derived from formal causal models 
(the latter are examined in Griffin and Isaac 
1989). 

There are three basic ways to perform the 


“moving covariance” analysis. Each poses’ 


different historical and theoretical questions 
and presupposes distinct views of historical 
process. Though we performed each of the 


7 The number of union foundings are available 
to 1976 and are from Hannan.and Freeman (1987; 
Figure 2, p. 925); the number union disbandings 
are available for the entire 1882-1980 period and 
are from Hannan and Freeman (1988; Figure 3, p. 
35). Following Ashénfelter and Pencavel (1969), 
we use the Wolman-Troy series of union member- 
ship. There are no Wolman-Troy figures for the 
years 1963-69, so we computed an OLS regression 
estimate for these years from the U.S. Bureau of 
Labor Statistics (BLS) series of union members 
(U.S. Historical Statistics 1975 and various 
updates). We experimented with a variety of 
estimation procedures and found all generated very 
similar series and little prediction error. Systematic 
strike data are available for the entire period. 
Despite incompleteness and noncomparability in 
the raw data before 1880, the U.S. Commissioner 
of Labor concluded that strikes were of no essential 
aggregate economic importance until 1880 (1901, 
p. 721). We use change scores, which entail the 
loss of the first year in the series, where necessary 
because they remove some of the secular trends in 
union membership and strike activity and thus are 
less likely to result in spurious correlations induced 
by not controlling for omitted (and unknown) 
determinants of these trends (McCleary and Hay 
1980). 
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procedures, we present here only the results 
of the so-called "backward" strategy because 
(at least in this example) it illustrates the 
likelihood of sudden and dramatic turning 
points in historical relationships. The results 
of this particular "moving covariation" anal- 
ysis are presented in Figures 1-3. There we 
graphically present the bivariate correlations 
(left-hand axes) and covariances (right-hand 
axes) for 81 distinct time periods. The first 
point on the graph is that of the correlation or . 
covariance as estimated over the years 
1882-1980 (1882-76 for FOUNDINGS). We 
then sequentially drop the first year of the 
series, so that the’ second association (and 
point in the graph) pertains to 1883-1980, the 
third to 1884-1980, and so on until the last 
relationship (and observation graphed) is 
estimated over the years 1962-80. All of the 
relationships we present are thus “anchored” 
by: the year 1980 and, with every estimation, 
become exclusively more recent as we 


""move" toward the present, deleting the most 


distant year.? . ; 

This procedure, used to assess the temporal 
stability of time-series estimates and to 
establish "shift" or "break" points in series, 
persuasively shows the historically contingent 
nature of these relationships. Consider. first 
the correlations graphed in the left-hand axis 
in each figure and summarized in Table 1. 
The correlation between union GROWTH and 
changes in strike frequency (CH. FREQ.) 
ranges (see the appropriate "Max" and 
*Min" rows in panel A of Table 1) from .5 to 
— .15. The range of the union FOUNDINGS- 
CH.FREQ correlation, too, is large, varying 
across. historical time periods separated by 
only three years from .39 (1941-76) to — .14 


? The second approach, the "forward" proce- 
dure, “anchors” the series to be correlated at some 
“distant” point in history (here, the earliest 
possible date, 1882) and a more recent date is 
chosen to initiate the correlational analysis (say 
1896). The series are then incremented forward by ` 
a year (1882-97), and correlations (or other 
Statistics) calculated. This is repeated (1882-98, 
1882-99, . . .) until the entire temporal range of 
the series (here, 1882-1980) is analyzed. The third 
method, the "diagonal" procedure, holds constant 
the number of years analyzed (at 16 time points 
here, beginning with 1882-97) and simultaneously 
increments both the beginning and ending dates of 
the series analyzed (e.g., 1883-98, 1884-99, . . .) 
until the last year of the series is included in the 
16-year analysis (1965—1980). l 
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Fig. 1. Correlations (Left-Hand) Axis and Covariances 

(Right-Hand) Axis Between Union GROWTH 

, and Change in Strike Frequency for Periods from 
1882—1980 to 1962-1980 


Note to Figure 1: Entries on the X-axis refer to the year 
beginning the historical period encapsulating the relation- 
ship graphed. Thus, “90” refers to the period 1890— 
1980; “50” to the period 1950—1980, etc. These data are 
available in tabular form on request. 


(1938-76). The union DISBANDINGS- 
CH.FREQ correlation exhibits even greater, 
if less sudden, extremes, ranging from .39 to 
—.54. ` 

Occasionally, the alteration in the size and 
sign of the relationship is quite sudden and 
dramatic. For most of the time periods from 
1882-1980 to 1936-80 (e.g., 1900-80, 
1935-80) the correlation between GROWTH 
and CH. FREQ. (Figure 1), for example, is 
substantial, always positive, and relatively 
stable. All of these time periods obviously 
contain the year 1937. But when 1937 is 
removed from the analysis, the correlation 
between GROWTH and CH.FREQ suddenly 
drops from .41 for the years 1937-80 to .017 
for 1938-80, suggesting extreme historical 








90 00 10 a 3 40 5 e 


Fig. 2. Correlations (Left-Hand) Axis and Covariances 
(Right-Hand) Axis Between Union DISBAND- 
INGS and Change in Strike Frequency for 
Periods from 1882-1980 to 1962-1980 


See Note to Figure 1. 
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Fig. 3. Correlations (Left-Hand) Axis and Covariances 
(Right-Hand) Axis Between Union FOUND- 
INGS and Change in Strike Frequency “or 
Periods from 1882-1976 to 1962-1976 


See Note to Figure 1. 


contingency and discontinuity. The union 
DISBANDINGS-CH. FREQ. correlation is 
also heavily affected by the presence or 
absence of 1937 (Figure 2). The series 
indexing the relationship between union 
FOUNDINGS and changes in strike fre- 
quency (Figure 3) is equally sensitive to the 
inclusion or exclusion of 1939 and 1940. The 
mid- to late 1940s to the early 1950s and the 
mid-1950s appear to be other sudden “turning 
points” for at least two of the three series of 
correlations, as does the World War I years 
for the DISBANDINGS-CH.FREQ correla- 
tion. 

That these discontinuities, contingencies, 
and turning points are not limited to the 
correlations, which, as standardized measures 
of associations, are not ideal for comparison 
across time periods, is amply demonstrated 
by the graphs of the covariances, read from 
the right-hand axes in Figures 1—3. Not being 
bounded by an absolute value of 1.0, the 
covariations exhibit much greater absolute 
temporal variation, ranging (rows 4—5 in 
Panel B of Table 1) from 147.7 to —9.2 for 
GROWTH-CH.FREQ, 9.1 to -—3.8 for 
DISBANDINGS-CH.FREQ, and 10.7 to 
—6.1 for FOUNDINGS-CH.FREQ. One 
covariation changes 3800 percent across time 
periods differing only in a single year (e.g., 
in Figure 1, the GROWTH-CH.FREQ covari- 
ation for the periods 1937-80 and 1938-80). 
While not typical of the three series of 
covariations (or correlations) graphed, tempo- 
ral shifts and historical discontinuities of the 
magnitude observed in the late 1930s cannot 
simply be ignored. What happened to and. 
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Table 1. Summary Statistics Describing the "Moving Covariation” Analysis Graphed in Figures 1-3 
Series Related to CH.FREQ 
GROWTH . DISBANDINGS FOUNDINGS 
A. Series of correlations : : 
Mean . .289 .055 .063 
Standard deviation .142 -249 114 
Coefficient of variation .48 4.53 1.81 
Maximum .505 .385 .389 
Minimum —.148 —.536 —.137 
B. Series of covariances 
Mean 45.50 4.04 2.54 
Standard deviation 35.83 4.45 3.82 
Coefficient of variation 19 1.10 1.50 
Maximum 147.65 9.13 10.68 
Minimum —9.19 —3.75 —6.14 
C. Historically general relations 
Correlations i .352 .161 .038 
Covarations 141.84 7.22 3.90 
Years ^ 1882-1980 — 1882-1976 





through the labor movement in the late 1930s 
is a "historical fact" and its implications, for 
both time-series methodology and historically 
general theory, must be acknowledged. The 
covariations also unmask several other less 
dramatic shifts or break points in the series of 
relationships which are not observable in the 
three series of correlations. The early 1900s 
and 1922-23, in particular, stand out as 
additional potentially influential turning points 
in these data. 

Compare these historically contingent pat- 
terns, which are evident for all three relation- 
ships and for both correlations and covaria- 
tions, with the period-singular estimates that 
would typically be obtained from the analysis 
of the entire 1882-1980 (or 1882-1976 for 
FOUNDINGS) period (Panel C of Table 1). 
Whatever the utility of these "historically 
general” relationships for a “general” theory 
linking organization and collective action, 
they clearly have little utility for a historically- 
grounded theory of trade unionism and class, 
conflict in this country. There simply is no 
temporally invariant bivariate relationship in 
any of the series of correlations or covaria- 
tions under examination; the magnitude and 
even sign of the three relationships differ 
substantially from one historical period to the 
next. Moreover, we have also shown that the 
precise year at which one chooses to begin or 
end a historical analysis can have profound 
implications for the quantitative findings 
generated and thus necessarily for the theoret- 
ical inferences drawn from tbose findings. 
History and historical process cannot be 





adequately represented or analyzed if pre- 
mised on ahistorical time; they are not so 
“general” that they can always be arbitrarily 
“sliced into" and thus analyzed merely 
because data are available for a certain year 
and not before or after.? 


? The inferences we draw from this analysis do 
not unduly depend on (1) the precise measures 
used in the above analysis, (2) the use of bivariate 
relationships rather than more elaborated time- 
series regression models, or (3) the "backward" 
approach to the moving covariance analysis. On 
the fist point, we continue to find as much 
temporal heterogeneity and identify the very same 
critical turning points when (a) change scores 
rather than the level representations of union 
FOUNDINGS and DISBANDINGS are used; (b) 
directionalized measures of change are substituted 
for the single nondirectional change score used in 
the text; (c) an additional indicator of unionization, 
union density (i.e., the percentage of the labor 
force organized into unions), is correlated with 
change in strike frequency; (d) the logarithm of the 
series is used to reduce tbe consequences of 
“outliers”; and (e) the BLS union membership 
data are substituted for the Wolman-Troy series. 
On the second point, neither the severity of the 
historical contingency nor even the specific turning 
points (e.g., 1923, 1937) detected above are an 
artifactual function of bivariate analysis; we come 
to the very same inferences when a formal “moving 
regression” analysis is performed (Griffin and 
Isaac 1989). On the last point, both the “forward” 
and the "diagonal" approaches also indicated great 
temporal instability of the relationships. In the 
former analysis, the parameter estimates generally 
declined gradually but substantially throughout the 
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Are we to infer, then, that historically 
grounded sociology is impossible if infer- 
ences rest upon any form of time-ordered 
quantitative analysis? We believe not. If 
history is taken more seriously, "requiring 
systematic investigation in its own right" 
(Tilly 1984, p. 79), quantitative modeling can 
enable the researcher to detect, model, and 
explain social change and thus contribute to 
the goals of historical sociology more gener- 
ally: 

[To] ask questions about social structures or 

processes understood to be concretely situated in 

time and space . . . address processes over time, 
and take temporal sequences seriously in 
accounting for outcomes . . . attend to the 
interplay of meaningful actions and structural 
contexts . . . highlight the particular and 
varying features of specific kinds of social 
structures and patterns of change. (Skocpol 
1984a, p. 1; emphasis in the original) 
Understanding and quantitatively grasping 
historical time as context, contingency, and 
change, such as that documented by the 
analyses presented above, is precisely what is 
lacking in time-series analyses rooted in 
conventional presuppositions. A historically 
fruitful use of time-series techniques, there- 
fore, must be guided by premises "contrary" 
to those shaping typical practice. Below, we 
briefly discuss these alternatives, showing 
also how the results of the "moving covari- 
ance" analyses were both grounded in and 
interpreted or explained by these contrary 
presuppositions. 


PREMISES OF QUANTITATIVE 
HISTORICAL ANALYSIS 


Theoretical history 
Social scientists and historians have long 





analysis; in the latter analysis, the relationships 
changed quite abruptly, with those years identified 
above (especially 1937) again acting as critical 
shift or transition points in the relationships. The 
results of the "diagonal" analysis also demonstrate 
that the temporal discontinuity in the more 
"recent" periods in the "backward" analysis 
discussed above (e.g., the mid-1950s) is not 
simply a function of analyzing a small number of 
cases. The "diagonal" procedure uses only 16-year 
periods but also produces long stretches of relative 
temporal stability in historical relations. The 
greatest discontinuity in Figures 1-3, in fact, tends 
to occur when the number of time points analyzed 
is rather more than less. 
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recognized that theory and history are interde- 
pendent representations of the social world. 
Because the notion of a "theoretical history" 
(Abrams 1982; Zaret 1978) is hardly new, we 
discuss only two of its dimensions of 
particular importance for quantitative histori- 
cal analysis. 
* Historical facts" and "theoretical con- 
cepts." Historical facts and "evidence" are 
neither obvious nor predetermined. As con- 
ceptual and analytic constructions, they “say” 
only as much as analysts ask of them, and 
what analysts ask is a matter of theoretical 
purpose (Carr 1961; Murphey 1973; Thomp- 
son 1978). Quantitative historical data under- 
stood as “facts” and “evidence” thus are 
produced, sometimes quite literally, by the 
application of formal or informal theoretical 
models and assumptions to historical informe- 
tion (Marx [1873] 1967, p. 28; Weber [1905] 
1949, p. 173). Concepts and theories, in turn, 
are historically grounded: “concepts emerge 
from the analytic problem of history” (Zaret 
1978, p. 118), and general and abstract 
understanding of a phenomena is the result of 
prior empirical analysis (Postan 1971, p. 31; 
Stinchcombe 1978, p. 115).19' . 
* Historical explanation" and "theory test- 
ing." The very process allowing a "test" or 
"validation" of a theory, empirical research, 
also forces a theorized reconstruction of the 
historical materials. This  ''history-as- 
analyzed" (as we previously called it) mav 
take the form of a description, a generaliza- 
tion, or an explanation of or about history. In 
quantitative analysis, aspects of this theorized 
understanding are represented by the reconsti- 
tuted dependent variable, the structural pro- 
cess posited and not disconfirmed by the 


1° Snyder's (1975) research demonstrates most 
of these points. Starting with a particular concep- 
tualization (i.e., rational collective action) of the 
"facts" of the matter (i.e., historical strike 
patterns), he then constructed, from a time-series . 
analysis of the theorized determinants of strikes, 
empirical "evidence" (i.e., historically contingent 
regression coefficients) suggesting the explanatory 
significance of a temporally variable social struc- 
ture, "institutionalization," for the very historical 
materials analyzed. While the potential for “discov- 
ery” and use as evidence of these differential 
patterns was contained within the information he 
collected and analyzed, these coefficients obvi- 
ously did not directly exist in the raw historical 
data and were "sensible" only in light of his 
theorized history. 


i 
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statistical analysis, and the estimated parame- 
ter structure, which, with the process struc- 
‘ture, generates the predicted series. .:While 
analysts may be using history simply to test or 
"elaborate theory, they are also, intentionally 
or not, “explaining” the particular historical 
'events and processes used in theory testing. 
(All explanations. are necessarily particular 
[Macintyre 1976, p. 152].)'So history, or at 

, least what we observe and make of it, is both 
the vehicle for theory testing and the arbiter 
:of the empirical adequacy of theory (Bloch 
1953; Postan 1971; Stinchcombe 1978). 

t. Theory, therefore, cannot be rigidly sepa- 
tated from history. Indeed, in very important 
“ways the “historical” is “theoretical” and the 
' “theoretical” is “historical.” Some: direct 

implications of this premise are that (a) 

‘history cannot be reduced merely to “data” 
serving strictly to test a general theory 
independent of history; (b) thus the estimation 
of time-series parameters is not and can not 
-be a process above or outside history that then 
“naturally” confronts differing, and histori- 

,cally independent, theoretical arguments; and 

(c) the movement and temporal contingencies 
of an historical series should be theorized and 

‘analyzed with historically "determinate ab- 

-stractions” (Zaret 1978, p. 118), not timeless 
and idealized concepts. 


Historical time . 
; We reject the “fallacy of presumptive-conti- 
,nuity" (Fischer 1970, p. 154), the unexam- 
-ined “myth that all events are conjoined by 
mere fact of continuous succession in..time” 
' (Jones 1972, p. 115). A fully theorized 
“historical time may be qualitatively differen- 
' tiated time, potentially discontinuous, nonlin- 
ear, and heterogeneous in magnitude and 
consequence. Time is internal to the historical 
making and development of social structure, 
and it may have different tempos at different 
levels of social organization (Brandel, 1980; 
Zaret 1978). 
_ One of the more far-reaching impliéstioné 
‘of this redirection in understandings of time is 
‘that the possibility of both historical continu- 
ity and discontinuity in structural relations 
should be an analytic presupposition. Histori- 
cal processes, as we saw in Figures 1-3, 
therefore may contain distinct "turning" or 
"shift" points at specific historical moments 
or. periods creating or reflecting sudden 
structural discontinuity and change: -(e.g., 
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Edwards 1981; Rubin et al. 1983; Sheflin et ` 
al. 1981). Structural transformation may also . 
be seen through gradual changes in parameter 
estimates. Whatever, the specific theoretical 
and statistical mechanisms inducing histori- ' 
cally variable parameter estimates, temporal 
discontinuity in causal relationships points to 
the existence and efficacy of different struc- 
tural “regimes” or "periods" temporally 
contextualizing historical relations contained 
within them (Griffin and Isaac 1989; Hernes 
1976; Isaac and Leicht, 1989). 


Historicization of quantitative methodology 


Research technique and its underlying theo- 
ries “should be made both internal and 
subservient to the analytic logic of ‘the 
substantive theoretical and historical problem 
under analysis, not the reverse. This premise 
is especially necessary in the study of 
historical contingency and structural change. 
Social structures aré often theorized to be 
unobservable, periodization is messy and 
sometimes seemingly arbitrary, and the exis- ` 
tence, theoretical importance, and timing of 
social change contentious. These are not 
matters to be solved by statistical fiat. Thus, 
analysis and explanation of, and inference 
about, temporal context and conditioning 
must not be limited to processes that can be 
quantified as variables. Questions about 
structural change, moreover, should not be 
necessarily posed, nor the historical inquiry 
unduly molded, because the quantitative 
answers to them (the resulting time-series 
coefficients) have attractive statistical prop- 
erties (significance; efficiency, etc.). To 
proceed otherwise represents “voluntary” 
ignorance of anything but the numbers 
(Stinchcombe 1978, p. 5) and the statistical 
theories and procedures used to penile the 
numbers. 


1 


TWO GENERAL IMPLICATIONS 
OFHISTORICZED ` 
QUANTITATIVE ANALYSIS 


Historical Justification 
Historical descriptions, generalizations, and 


- inferences must always be justified as true or 


useful for the substantive historical problem 
under analysis (Bloch 1953; McCullagh 
1984; Murphey 1973). Our “contrary” pre- 
mises imply that the justification of the results 
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of quantitative historical sdciology must 
reside primarily with the “factual” historical 
record itself. This is neither as simple nor. as 
“empiricist” as it may initially sound because 
_, the adequacy of “historical facts” as the basis 
of the truth content of historical explanations 
. is itself problematic for a variety of reasons. 
' As our discussion of "facts" and evidence 
above suggested, the "historical record" is 
never completely independent of abstraction, 
* conceptualization, and theorization. Gaps and 
ambiguities, moreover, may exist in historical 
` knowledge. Partly for these reasons, histori- 
ans themselves often present contentious 
understandings of the historical record (Mur- 
phy 1973; Thompson 1978). Whose or which 
standard, therefore, should quantitative ana- 
lysts believe? 

Such questions go to the heart of our 
argument and are not to be lightly dismissed. 
But they are not generally intractable. Con- 
ceptual selection, classification, and analytic 
construction do not render 

"evidence" entirely capricious: "the determi- 
nate properties which [facts] present to 'the 
' practitioner" (Thompson 1978, p. 27; our 
insert) means historical data are not to be 
ignored or manipulated without limit (see also 
Mandelbaum 1977, pp. 145-95). Historical 
facts which are ignored or dismissed by a 
particular theory, moreover, remain as data to 
be used in alternative or supplementary 
theoretical and quantitative accounts. "Pre- 
factual” observations or as yet unknown 
empirical pattems, moreover, are subject 
always to historical/theoretical “discovery,” 
identification and interpretation as “fact” and 
“evidence,” often through quantitative anal- 
ysis (see note 10). Additionally, “the domain 
of evidence relevant to a theory is not closed 
once the theory has been elaborated,” and 


given “the searching function of theory": 


(Murphey 1973, p. 127), this characteristic of 
the fact-evidence-theory linkage reduces pure 
factual or constructionist subjectivity. in his- 
torical inquiry and mandates attempts at 
reconciliation or adjudication of the historical 
record. Quantitative analysts may not always 


know “whose history” to believe, but their 
- analytic tools can clarify the historical record’ 


ever as the use of those tools is grounded by 
and in the historical process studied. 

The .“factual” if ever-mutating historical 
record, even with its epistemological and 
practical limitations, is the sole truth condi- 


' tion both deliberately created to account for: 


"facts" and 
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historical reality and directly but inadvertently 
displaying the tracks of history. The same 
simply cannot be said of -either of the 
justification conditions conventionally used in 
quantitative historical studies, statistical crite- 
ria and theoretical “consistency. In very 
different ways, both justify questionable 


* theorized or statistical accounts about external 


historical processes by ahistorical criteria 
internal to themselves. For example, some 
“justified” aspects of Hannan and Freeman’s 
(1988) “histories-as-analyzed” — especially 
their generalizations and inferences about 
strikes, employer offensives, and the Taft- 
Hartley Act having little or no effect on the 
labor movement—are completely inconsistent 
with trade union history. So, tod, is Ashen- 
felter and Pencavel’s (1969) a priori neglect 
of worker insurgency and employer and legal 
repression as stimulants and retardants, respec- 
tively, of growth in union membership. Had 
these (and most other) analysts relied less on - 
theoretical consistency and statistical infer- 
ence in their attempts to verify their historical 
generalizations and more. on what is widely 
known in U.S. labor history, these, flawed 
"histories-as-analyzed" might not have mate- 
rialized.!! The created labor history we see 
from these particular studies is a labor history 
largely reduced to ahistorical population- 
ecology processes.and the timeless rationality 
of workers shrewdly calculating the costs and 
benefits of social action without regard to 
historical context. We-do not know from 
these histories-as-analyzed that “what actu- 
ally happened" (Hobsbawm 1974, p. 9) to 
trade unionists and their unions was the 
making and remaking, consciously and other- 
wise, of "labor" as a historical subject. 
Recourse to historical justification does nct - 


! On the temporal conditioning of the fre- 
quency and efficacy of state repression, see 
Bernstein (1960) and Goldstein (1978). On the 
historicity of employer repression, see, among 
many others, Bernstein (1960) and Montgomery 
(1979). On the historicity of workers' struggles to 
self-organize, see Edwards (1981) and Montgom- 
ery (1979). Several time-series studies, moreover, 
have documented one or more of these very points 
with period-specific and historically: contingent 
time-series regression coefficients (Griffin and 
Isaac 1989; Griffin et al. 1986; Rubin et al. 1983). 
Finally, these studies. and that by Sheflin et al. 
(1981), show that Ashenfelter and Pencavel's 
(1969) assumption of temporal invariance in union 
growth processes is incorrect. ; 
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imply that statistical or theoretical, justifica- 
tion has no role in quantitative historical 


inquiry. (In a subsequent paper on historical ' 


. contingency we use both extensively [Griffin 
: ad Isaac 1989].) It is to say, to quote Bloch 
(1953, p. 120), that historical inquiry. "de- 
mands" nothing less than rigorous attention to 
evidence independent of any particular theory 
or statistical procedure to insure that the 
proferred explanation of “what/why in his- 
tory" is as valid as possible. All quantitative 
historical statements, including those pro- 
duced through theory testing and elaboration, 
must be justified, and the primary "criteria of 
significance should' not be methodological, 
: but substantive in nature" (eer 1970, p. 
í 91). 12 


Periodization 


Historically contingent analysis and explana- 
tion generate several logically related proce- 
dural implications. First, the temporal stabil- 
` ity of time-series coefficients must be 
ascertained. Analytic techniques, such as 
“moving regression" analysis, are available 
"that allow assessment of the stability assump- 
tion without a priori positing the structural 
shift point in the series (Brown et al. 1975; 
Quandt 1958), which is necessary for the 
more familiar analysis of covariance method 
of evaluating the presence of statistical 
interaction. If the relations are found to be 
systematically variable when estimated across 
all of the temporal observations, then period- 
specific analysis is mandated. Second, sound 
historical periodization is essential and cari 
‘not be adequately determined by data avail- 
ability, impressionistic accounts, or the "great 
events" of history. Periodization should be 


defined and justified on the basis of both 


change in the structural relationships under 
examination (Duncan 1975, p. 57; Judge, 


‘1? «Historians certainly seek a coherent account 
of the: subject they. are studying. But they also 
require that its elements are strongly supported by 
the evidence, and more strongly supported than 
any other, incompatible, account of the same 
subject” (McCullagh 1984, p. 12). The pronounce- 
ments of "historical authorities” are not the only or 
even final criterion for assessing the historical 
accuracy of statements, but, as Bloch (1953, p. 
122) notes, "individual originality has its limits," 
and "authorities" must be confronted Sere 
before they can be dismissed. 
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Griffiths, Hill, Luthepohl, and Lee 1985; 
Quandt 1958) and the theorized explanation 
of that change.!? Finally, the specification 
and estimation of time-series models should 


be limited.to those historical contexts or 


periods having more or less the: same 
historical-structural regularity. Thus, theoretic- 


“ historical generalization must of necessity be 


restricted initially to those particular historical 
contexts with near temporal invariance. The 
specific content of sociological explanation 
therefore will likely vary from one such 
context to another (Griffin and Isaac 1989; 
Skocpol 1984b). 

The use of quantitative procedures to help 
detect periodization and in periodized analysis 
generally raises unavoidable, and important 
statistical issues: There is little doubt that 
analysts’ performing periodized analyses will 
have to develop a tolerance for a relatively. 
small number of degrees of freedom, large. 
sampling variances of the estimators, reduced 


t 


: P Using “moving regression” analysis of tem- 
poral changes in the ‘determination of union 
growth, Sheflin et al. (1981) found that the critical 


-turning point was not in 1935, when the Wagner 


Act (a “great event” in labor history) was passed, 
but rather in 1937, when the Act was adjudicated 


- constitutional by the Supreme Court. Hage, 


Gargan, and Hanneman (1980) also stress the 
importance of shifts in: structural’ relationships 
rather than “great events” for correct periodiza- 
tion. The particular procedure they recommend, 


‘inspection of residual plots’ from a time-series 


regression equation estimated over all of the time 
points available, however, is unsound if autocorre- 
lation is present. Autocorrelation itself produces 
patterned residuals which can easily be mistaken 
for historically distinct periods. The behavior. of 
residuals is a function of the underlying statistical 
process driving the disturbance. Hage et al. are 


therefore periodizing on the basis not of histori- 


cally "correct" estimated relations, but rather of 
autocorrelation of one form or another, or, in what 


' | is the same thing but expressed substantively, of 


specification bias induced by both the omission of 

*period-specific" causal factors having additive 
effects on the dependent variable and the initial 
use, to residualize, of the biased "historically. 
general" estimates of the equation. Some of Brown 
et al.'s (1975) procedures for detecting causal 
discontinuity, although more technically, sophisti- 
cated, also depend on “global” residualization and 


. therefore suffer from the same limitation. They: 
: acknowledge this and, 


in fact, assume correct 
specification and the absence of autocorrelation in 
developing their techniques. e 
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statistical significance of either the time-series 
estimates themselves or the differences be- 


‘tween coefficients across historical periods, 


unknown small-sample properties of estima- 


: tors, and possibly “censored” (Tuma and 


Hannan 1984) series.!^ Statistical inference, 
perhaps limited by these unresolvable difficul- 


. ties and certainly not of the textbook variety, 
` will nonetheless remain useful in assessments 
. of temporal stability and periodization. But 
' historical contingency and, periodization are 


not predominately statistical or even purely 


"empirical" matters. They are 'issues of. 


theoretical history and must ultimately be 


' justified by their utility in advancing theoretic- 


historical knowledge. . 


THEORIZING HISTORICAL 
CONTINGENCY IN THE 


_ STRIKE-UNIONIZATION RELATION 


. gation "was, ‘grounded not in 


Important aspects of these alternative pre- 
mises, and their implications for historical 
justification and periodization, can be illus- 
trated by reference to’ the empirical analyses 
summarized in the figures and table above. 
We uncovered critical shift points and tempo- 
ral conditioning in the correlations and 
covariances between unionization and strike 
activity only because our quantitative investi- 
"timeless" 
general theories of collective . action, but 
rather in a theorized understanding of the 
historical particülarities and contingencies of 


the series and relationships under analysis.' 


Our reading of the historical record pointed to 
the likely: possibility that the linkage between 
labor's collective action and its organizational 
strength varied through the years of history 
encapsulated ‘by’ our analysis and even 
suggested approximately when (and through 
what mechanisms) we could expect to see 
structural change. Also empowered generally 
by our second and third alternative premises, 
we then cast about for procedures to histori- 
cize more fully our time-series methodology 
and analysis by quantitatively ' reflecting 
historical time as both context and turning 
point in these series. “Moving covariance" 
analysis, while not unique among possible 


- ^ Both Quandt (1958, p. 878) and Brown et al. 


(1975, p. 150), in developing their procedures for 
detecting “switching” points, employed bivariate 
and multivariate regression analysis on periods 
having as few as one degree of freedom. 


: " 
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methodological advances, allowed assessmen 
of the temporal invariance assumption an 
could be used as a tool to model the extent 
intensity, and contour of historical discontinu 
ity. The statistical significance of the differ 
ences in coefficients was ignored -here (fo 
purposes of exposition only) in favor o 
historical justification emphasizing theoretica 
interpretation and consistency with importan 
aspects of U.S. labor history. ; 
. This last observation forces us to confron 
directly the question of the interpretability 0: 
our quantitative results. Are the ‘particula 
historical contingencies we generated simpl 
background noise, random deviations from 
some "true" invariant covariation induced by 
"sampling variability," or are they substan 
tively interpretable? It is not possible . Kk 
answer definitively that question on the basi: 
of any set of quantitative estimates, including 
those we presented above. When couplec 
with the use of theories pointing towarc 
historical discontinuities and periodizaticr 
(Hernes 1976), however, evidence from labo: 
history strongly suggests that profound alter. 
ations in “regimes” or “structures” o 
relations both within the ranks of the working 
class and among labor, capital, and the state 
occurred during the years we have identifiec 
as potentially crucial turning points. 
Capital’s first highly organized, national, 
and effective assault on the trade unio: 
movement, for example, began in 190: 
(Griffin et al. 1986). What about the earl 
1920s? Montgomery (1979) argues that : 
strong syndicalist tendency in U.S. labor, 
evident since the turn of the century, wa: 
finally smothered in 1922-23 by legal repres- 
sion (Bernstein 1960) and the “Americar 
Plan," thus altering the contours of clas: 
formation and conflict. The mid- to late 
1930s? Both unprecedented labor militancy 
then and the passage of the Wagner Act ir 
1935, its adjudicated constitutionality ir 
1937, and its resulting organizational anc 
ideological consequences in the late 1930: 
and early 1940s quickly ushered in a new ere 
of industrial relations and political struggle 
(Davis 1986; Rubin et al. 1983) The 
mid-1940s? Davis (1986) argues that by 194€ 
labor's "second upsurge" of militancy hac 
ended and “bureaucratic hegemony” in the 
unions reestablished, creating realignment: 
between the rank-and-file and union leaders 
Jaccoby, (1985) concludes that in the year 
immediately following World War II interna; 


* 
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labor markets for union employees had: produce “histories-as-analyzed” which are 


become widely diffused and institutionalized, 
thus changing the basis of job security and 
rewards. The Taft-Hartley Act, amending the 


Wagner Act, was passed during this same - 


time period, again altering modes of labor 
control and’ regulation. The mid-1950s? 
Zieger (1986) contends that the merger of the 
AFL and CIO in 1955 largely ended the 
internecine warfare between the two and 
further entrenched the unions into the Demo- 
` cratic party's “New Deal coalition.” Orga- 
nized labor’s struggle, and struggles of labor 
more broadly, thereafter began to take-a new 
course. Thus, the historically specific theoret- 
ical explanation for the strike-union correla- 
tion differs as the temporal context of the 
historical relation also changes through time. 
More analysis, both qualitative and quanti- 
tative, is needed before this particular period- 
ization could be employed in substantive 
research. Other quantitative procedures, such 
as event-history and spectral analysis, also 
could be employed with these or similar data 
'to obtain alternative or supplementary infor- 
mation about shift points, periodization, and 
structural change. Acknowledging, then, that 
our analyses are at best only suggestive, .we 
also believe they are nonetheless broadly 
consistent with theorized understandings of 
the qualitative historical interpretations dis- 
cussed above. While the historical accounts 
of Montgomery, Davis, and others do not 
necessarily explain the patterns observed in 
‘the graphs we presented, and in some cases 
` are inconsistent among themselves, they (and 
those of others) theorize not statistical noise 
but the real activity, terms, and conditions of 
the labor movement, the contingencies of 
historical context, structure, and agency. And 
in so doing they, and historicized quantitative 
analyses such as those presented here, do 
offer the possibility for the historical- 
theoretical identification and explanation of 
contingent yet "patterned" social change.: 


` CONCLUSIONS AND CAVEATS 


Plausible theories of large-scale social pro- 
„cesses are intrinsically processual, and quan- 
titative accounts of change and constancy 
should be historically grounded. All too often 


they are not. The merger of time-series © 


procedures of great technical sophistication 
but excessive historical selectivity and ahistor- 
ical theories of wide scope can and do 
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unrecognizable as "histories-as-lived." But 
these limitations and errors should not be 
attributed solely either to theoretical general- 
ity or to quantitative analysis per se (as do, 
among many others, Moore 1958 and Thomp- 
son 1978): : 
On the first point, theory of any generality 
or scope will unavoidably omit nonessential - 
details of the historical record. But incomplete- 
ness is understandable, and even highly 
"general" theories having quite diverse sub- 
stantive content can be used in a historically 
rich and grounded fashion (as shown in 
Hernes 1976 and Thompson 1978). Part of 
the problem, then, is with the premises, 
guiding the use of social theory. On the 
second point, the quantitative analyst often 
must use historically unknown or limited 
information. The “moving covariation” anal- 
ysis we performed, for example, clearly is 
historically compromised. Historical data are 
most often organized as annual observations 
(as in our case), almost always immutable, 
and never devised or chosen by those 
collecting or reporting the series for reasons 
of historical explanation, Assumptions about 
measurement, categorization, and aggrega- 
tion, such as those we made in our own 
analysis, moreover, are essential if the study 
is to proceed quantitatively. But qualitative 
analysis also simplifies and distorts and for 
the very same reasons. Even historically 
limited quantitative examinations may be 
better than none at all. Eschewing ideal 
analytic conditions, we therefore assert that 
the real questions to be confronted are ones of 
degree and consequence: How much distor- 
tion is acceptable? and to what effect? 
Without invoking an impossible standard of 
"historical purity," we nonetheless believe 
there is "too much" that is avoidably false in 
typical time-series analyses. There is nothing , 
in the logic of quantitative procedures per se 
necessarily negating solid historical research. 
Many of the general flaws or practices we 
identified, in fact, boil down to no more than 


improper use of time-series technique; basic- 


assumptions of conventional statistica] theory 
(e.g., adequate equation specification, tempo- 
ral homogeneity) were violated. But these 
violations were first permitted, even encour- 
aged, and then justified by conventional 
premises. Part of the problem here as well, 
then, is with the premises guiding the use of 
quantitative methodology. 
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Historical contingency and change in tem- 
: poral relations are not limited to the examples 
from U.S. labor history presented in this 
paper. Interpretable contingency in the causal 
processes driving welfare rolls and military 
spending in the postwar years, for example, 
have also been found (see Isaac and Leicht 
1989, who also criticize the time-series 
studies conducted by the present authors). We 
suspect that so few studies have adduced 
similar evidence (Hernes 1976) because 
structural change and temporally contingent 
relations have not been adequately theorized 
and quantitatively analyzed. Historically 
grounded applications of time-series method- 
ology pose core challenges to how we 
conventionally integrate history, theory, and 
quantitative analysis. These will not be 
resolved by greater factual accuracy or 
historical detail, though the former obviously 
is essential and the latter is often necessary. 
Rather, quantitative analysts should consider 
approaching history anew. Thirty years ago, 
Moore (1958, p. 150-1) wrote that “a 
particular historical event can do more than 
confirm or disconfirm a hypothesis. It can 
change the situation so that one hypothesis or 
another becomes correct. There would seem 
to be an irreducible element of indeterminacy 
here” (emphasis in the original). He is 
correct: historical contingency and structural 
change are facts of historical process and thus 
must become facts in historical social science, 
including that which is quantitative. 

Our attempts to formulate and use “histor- 
ical" premises and techniques do not solve (or 
even address) some of the most serious 
problems plaguing quantitative analysis of 
historical series. “Moving covariance" analy- 
sis, moreover, is not so mechanical that it will 
somehow  magically spit out historically 
"correct" and unambiguous periodization. 
Indeed, as we noted, the historical record 
itself may be internally inconsistent about the 
temporal dimensions and dynamics of the 
process under analysis. But presupposing the 
interdependence of history and theory, taking 
historical time more seriously, and generally 
historicizing quantitative methodology will at 
least reduce the avoidable dehistoricization 
associated with conventional premises. A 
more fully "historical" quantitative approach 
to temporal processes, such as the one 
presented here, or others more fruitful still, 
therefore should be attractive to those now 
suspicious of or even hostile to quantification 
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(cf. Skocpol 1987, p. 20; Moore 1958). It 
simultaneously compels and authorizes us to 
look for structural change in ways that go 
beyond both the movement through time of 
univariate series and the ahistorical inferences 
from or about temporally invariant time-series 
coefficients. Because this strategy facilitates 
the detection, dating, analysis, and explana- 
tion of historical contingency, and thereby 
illuminates the contours of social change, it 
helps produce both better historical explana- 
tion and better social theory. And given our 
understanding of theoretical history, each will 
be better because the other is better. That is 
our purpose: to encourage and empower a 
quantitative historical sociology which self- 
consciously seeks to identify and explain 
qualitative change in and around quantitative 
relations, thus furthering the interpenetration 
of sociology and history. 


LARRY W. ISAAC is Associate Professor of 
Sociology at Florida State University. He is 
continuing work on alternative strategies for 
analyzing historical processes of social change 
as well as theoretical-historical analyses of 
labor, value, and the wage relation. 


LARRY J. GRIFFIN is Professor of Sociol- 
ogy and Adjunct Professor of Political 
Science at Indiana University-Bloomington. 
His research interests center around the 
relationship between capitalism and democ- 
racy and the uses of quantitative methodology 
in comparative and historical inquiry. He will 
spend the 1990—91 academic year as a Fellow 
at the Center for Advanced Study in the 
Behavioral Sciences. 


REFERENCES 


Abrams, Philip. 1982. Historical Sociology. Ith- 
aca, NY: Cornell University Press. 

Aminzade, Ron. 1989. “What is Historical About 
Historical Sociology?" Paper presented at the 
American Sociological Association. 

Ashenfelter, Orley and John Pencavel. 1969. 
“American Trade Union Growth, 1900-1960." 
Quarterly Journal cf Economics 83:434—48. 

Bernstein, Irving. 1960. The Lean Years: A 
History of the American Worker, 1920-1933. 
Boston: Houghton Mifflin. 

Bloch, Marc. 1953. The Historian's Craft. New 
York: Alfred Knopf. 

Braudel, Fernand. 1980. On History. Chicago: The 
University of Chicago Press. 

Brown, R. J. Durbin, and J. Evans.’ 1975. 


TIME-SERIES ANALYSES OF HISTORICAL PROCESS 


“Techniques for Testing the Constancy of 
Regression Relations Over Time.” Journal of 
the Royal Statistical Society 37:149-92. 

Carr, Edward. 1961. What is History? New York: 
Vintage. 

Davis, Mike. 1986. Prisoners of the American 
Dream. London: Verso. 

Duncan, Otis. 1975. An Introduction to Structural 
Equation Models. New York: Academic Press. 

Edwards, Paul. 1981. Strikes in the United States, 
1881-1974. New York: St. Martin's. 

Fischer, David. 1970. Historians' Fallacies: To- 
ward a Logic of Historical Thought. New York: 

: Harper Torchbooks. 

Freeman, John and Jack Brittain. 1977. “Union 
Merger Process and Industrial Environment." 
Industrial Relations 16:173—85. 

Goldstein, Robert. 1978. Political Repression in 
Modern America. Boston: Hall. 

Griffin, Larry and Larry Isaac. 1989. “Analyzing 
Historical Contingency and Structural Change.” 
Unpublished, Indiana University. 

Griffin, Larry, Michael Wallace, and Beth Rubin. 
1986. “Capitalist Resistance to the Organization 
of Labor Before the New Deal: Why? How? 
Success?” American Sociological Review 51: 
147-67. : 

Hage, Jerald, Edward Gargan, and Robert Hanne- 
man. 1980. "Procedures for Periodizing History: 
Determining Distinct Eras in the Histories of 
Britain, France, Germany and Italy." Pp. 
267-83 in Historical Social Research, edited by 
Jerome Clubb and Erwin Scheuch. Stuttgart: 
Klett-Cotta. 

Hannan, Michael and John Freeman. 1987. "The 
Ecology of Organizational Founding: American 
Labor Unions, 1836—1985." American Journal 
of Sociology 92:910—43. 

Hanann, Michael and John Freeman. 1988. "The 
Ecology of Organizational Mortality: American 
Labor Unions, 1836-1985." American Journal 
of Sociology 94:25-52. 

Hernes, Gudmund. 1976. “Structural Change. in 
Social Process.” American Journal of Sociology 
82:513-47. : 

Hobsbawm, Eric. 1974. "From Social History to 
the History of Society." Pp. 1—22 in Essays in 
Social History, edited by M. Flinn and T.C. 
Smout. Oxford: Clarendon Press. 

Isaac, Larry and Kevin Leicht. 1989. "Historical 
Contingency, Türning Points and Regimes of 
Struggle in State Policy History: Examples from 
the U.S. Welfare-Warfare State.” Unpublished, 
Florida State University. 

Jaccoby, Sanford. 1985. Employing Bureaucracy. 
New York: Columbia University Press. 

Jones, Gareth. 1972. “History: the Poverty of 
Empiricism.” Pp. 96-115 in Ideology in Social 
Science, edited by Robin Blackburn. London: 
Fontana. 

Jones, Gareth. 1976. “From Historical Sociology 


889 


to Theoretical History.” British Journal of 
Sociology 27:295-305. 

Judge, George, W.E. Griffiths, R. Carter Hill, 
Helmut Luthepohl, and Tsoung-Chao Lee. 
1985. The Theory and Practice of Econometrics. 
(2d ed.). New York: Wiley. 

Knapp, Peter. 1984. “Can Social Theory Escape 
from History? Views of History in Social 
Science.” History and Theory 23:34—52. 


' Lipset, Seymour. 1968. “History and Sociology: 


Some Methodological Considerations." Pp. 20- 
58 in Sociology and History: Methods, edited by 
Seymour Lipset and Richard Hofstader. New 
York: Basic. 

MacIntyre, Alasdair. 1976. “Causality and .His- 
tory.” Pp. 137-58 in Essays in Explanation and 
Understanding, edited by Juha Manninen and 
Raimo Tuomela. Boston: D. Reidel, Dordrecht- 
Holland. 

Mandelbaum, Maurice. 1977. The Anatomy of 
History Knowledge. Baltimore: The Johns Hop- 
kins University Press. 


Marx, Karl. 1873/1967. Capital, vol. 1. New 
York: International Publishers. i 
McCleary, Richard and Richard Hay. 1980. 


Applied Time Series for the Social Sciences. 
Beverly Hills: Sage. ; 

McCullagh, C. Behan. 1984. Justifying Historical 
Description. Cambridge: Cambridge University 
Press. 

Montgomery, David. 1979. Worker’s Control in 
America. Cambridge: Cambridge University 
Press. 

Moore, Barrington. 1958. Political Power and 
Social Theory. Cambridge: Harvard University 
Press. 

Murphey, Murray. 1973. Our Knowledge of the 
Historical Past: Indianapolis: Bobbs-Merrill. 
Postan, Michael. 1971. Fact and Relevance: 
Essays on Historical Method. Cambridge: Cam- 

bridge University Press. 

Quandt, Richard. 1958. “The Estimation of the 
Parameters of a Linear Regression System 
Obeying Two Separate Regimes.” Journal of 
the American Statistical Association 53:873—80. 

Rubin, Beth, Larry Griffin, and Michael Wallace. 
1983. “ ‘Provided Only That Their Voice was 
Strong': Insurgency and Organization of Ameri- 
can Labor from NRA to Taft-Hartley." Work 
and Occupations 10:325—47. ` 

Sheflin, Neil, Leo Troy, and C. Timothy Koeller. 
1981. “Structural Stability in Models of Ameri- 
can Trade Union Growth.” Quarterly Journal of 
Economics 96:77—88. 

Skocpol, Theda. 1984a. “Sociology’s Historical 
Imagination.” Pp. 1-21 in Vision and Method in 
Historical Sociology, edited by Theda Skocpol. 
Cambridge: Cambridge University Press. 

Skocpol, Theda. 1984b. “Emerging Agendas and 
Recurrent Strategies in Historical Sociology.” 
Pp. 356-91 Vision and Method in Historical 


890 


‘Sociology, edited by Theda Skocpol. Cambridge: 
Cambridge University Press. — ' 

Skocpol, Theda. 1987. "Social History and 
Historical Sociology: Contrasts and Complemen- 
tarities." Social Science History 11:17-30. 

Snyder, David. 1975. "Institutional Setting and 
Industrial Conflict: A Comparative Analysis of 
France, Italy, and the United States." American 
Sociological Review 40:259—78. 

Spiethoff, Arthur. 1952. “The ‘Historical’ Charac- 
ter of Economic Theory." Journal of Economic 
History 12:131-39. 

Stinchcombe, Arthur. 1978. Theoretical. Methods 

` . of Social History. New York: Academic Press. 

Thompson, Edward. 1978. The Poverty of Theory 
and Other Essays. London: Merlin. 

Tilly, Charles. 1981. As Sociology Meets History. 
New York: Academic Press. 

Tilly, Charles. 1984. Big Structures, Large 


AMERICAN SOCIOLOGICAL REVIEW 


Processes, Huge Comparisons. New York: 
Russell Sage Foundation. 

Tuma, Nancy and Michael Hannan. 1984. Social 
Dynamics. New York: Academic Press. 

U.S. Bureau of the Census. 1975. Historical 
Statistics of the United States: Colonial Times to 
1970. Washingtoa, D.C.: U.S. Government 
Printing Office. 

U.S. Commissioner of Labor. 1901. Sixteenth 
Annual Report: Strikes and Lockouts, Washing- 
ton, D.C.: U.S. Government Printing Office. 

Weber, Max. 1905/1949. The Methodology of the 
Social Sciences. New York: Free Press. 

Zaret, David. 1978. “Sociological Theory and 
Historical Scholarship.” American Sociologist 
13:114-21. . 

Zieger, Ron. 1986. American Workers, American 
Unions, 1920-1985. Baltimore: Johns Hopkins 
University Press. 


E XE E S 


ORIGINS OF THE AMERICAN WELFARE STATE* - 


BARBARA G. BRENTS i 
University of Nevada-Las Vegas’ 


J. Cd JENKINS 
Ohio State University 


SOCIAL PROTEST, HEGEMONIC COMPETITION, AND SOCIAL 
REFORM: A POLITICAL STRUGGLE INTERPRETATION OF THE 


Recent neo-Marxian and state-centric .analyses of the origins of the American . 
welfare state have misspecified the autonomy of the state, thereby conflating policy ' 


formulation with policy-making and missing the complex political struggles that 


shaped the formulation of the Social Security Act of 1935. Synthesizing ` 


Poulantzas's "class struggle" theory of. state with social protest theory and 
Domhoff’s analysis of capitalist dominance, we advance a political struggle theory 
that identifies two major processes leading to social reforms: (1) sustained protest 
waves by excluded groups and threatened polity members that create a sense of 
political crisis among elites; and (2) hegemonic competition between capitalist 
blocs that use policy-planning and electoral investments to promote alternative 
political programs. This model is then applied to the formulation of the Social 
Security Act. Unemployed protests, industrial strikes, and middle-class reform 
movements, interacting with electoral instability, created án elite sense of political 
crisis. Simultaneously, rival capitalist blocs centered in bank groups and industrial 
segments competed for political dominance, creating opportunities for protest and 


placing major ied on the national ead agenda. 


The political origins of the American welfare 
state remain a puzzle. Its peculiar develop- 
ment—the lack of national standards, the 
sharp split between insurance and relief, the 
weak coordination between social and macro- 
economic policies, and its late arrival—have 
received considerable attention (see Quad- 
agno 1987, pp. 120-24; Weir, Orloff, and 
Skocpol 1988). Yet the formulation of policy 
proposals for unemployment insurance, old 
age pensions and old age relief, and,. most 
important, the political: struggles to place 
_these proposals on the agenda of national 
“| 
* Direct all corréspondence to J. Craig Jenkins, 
Department of Sociology, 300; Bricker Hall, 190 
N. Oval Mall, Ohio State University, Columbus, 
OH 43210. 

An earlier version of this paper was presented at 
..the American Sociological Association meetings, 
; August 1987, Chicago, Illinois. Harold Kerbo and 
Richard Shaffer generously provided us with 
counts of their data -on -unemployed protest, 
Michael Wallace helped with the strike data, and 


Phillip Burch provided the 1935 NAM Executive - 


Committee. Their comments as well ás those of 
William Domhoff, Jill Quadagno, Kevin Leicht, 
Beth Mintz, and anonymous ASR referees sharp- 
ened our arguments, for which we alone ‘are : 
responsible, | 4 


. class fractions 


ing class: played "almost no role," 
coming 


| American 3ociolosica; Review, 1989, Vol. potes 


, 


political eaten in iie 1930s remain relatively 
unexplored. 

Neo-Marxists and state-centric theorists 
have recently offered interpretations of the 
formulation and passage of the Social Secu- 
rity Act of 1935. Quadagno’s (1984, 1985) 
neo-Marxian interpretation sees the: Social 
Security Act as a compromise between 
monopoly and competitive sector capitalists 


that “incorporated working class demands 


into legislation on capitalist terms” (1984, p. 
646). The American state was “relatively 
autonomous” in that political leaders! con- 


, structed this compromise, ° reconciling ` the 


economic and political interests of these two 
“operating at different levels 
within a hierarchical state structure" - (1984, 
p. 646). Yet, contrary to most neo-Marxian | 
interpretations, ‘she concludes that the work- 
pressures 
instead from the middle-class 
Townsend movement. In contrast, state- . 
centric theorists have advanced an institutional/ l 


‘political process interpretation that: empha- 
- sizes the interests of political leaders and the 


constraints of past policies. The initiative for 


‘Social Security came from a coalition for 


*bureaucratic autonomy" composed: of pro- 
gressivist political officials, policy intellectu- 
als,, middle-class political activists and a 
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"handful" of liberal capitalists who re- 
sponded to a general "democratic upsurge" 
and worked against the concerted opposition 
of "virtually all politically active business 
leaders and organizations" (Skocpol and 
Amenta 1985, p. 572; see also Skocpol and 
Ikenberry 1983; Orloff and Skocpol 1984; 
Amenta, Clemens, Olsen, Parikh, and Skocpol 
1988; Orloff 1988). 

These analyses have moved beyond earlier 
interpretations by framing specific hypotheses 
in terms of explicit theories of the state.! 
However, by misspecifying the autonomy of 
the state, these analyses have failed to 
distinguish policy formulation from policy- 
making, underestimated. social protest as a 
: force in placing social welfare proposals on 
the political agenda, and misanalyzed the 
sources and significance of intracapitalist 
conflicts over the passage of the Social 
Security Act and related “Second New Deal” 
reforms.? 

We draw on Poulantzas (1973, 1980). to 
argue that the liberal democratic state consti- 
tutes a "relatively autonomous" political 
arena. which simultaneously mediates intra- 
capitalist hegemonic competition and inter- 
class struggles. This “relative autonomy" 
frames our three major arguments. First, the 
liberal state creates the arena or political 
space within which lower classes (and other 
politically excluded groups) mobilize and 
make political demands. We argue that a 
wave of social protests (including strikes, 
demonstrations, ‘movement campaigns, and 
resulting electoral instability) created a sense 
of political crisis among elites, thereby 
placing major reforms on the political agenda. 
Second, capitalist fractions use this arena to 
organize political coalitions that compete for 
political dominance, those that prevail becom- 


ing the dominant power bloc. We argue that a 





‘ We focus on these recent efforts because of 
their attention to linking general theoretical 


frameworks to specific hypotheses alongside their 


_ empirical evaluation of specific hypotheses. While 
much of the empirical evidence we will discuss is 
not new, we disagree with Domhoff's (1986/87) 
conclusion that recent state theories have added 
nothing to the analysis of policy formulation. 

? By the “Second New Deal" we refer to the 
reform program of social insurance, emergency 
work relief, progressive taxation, trust-busting, 
unionization, and wage and hours regulation 
launched in 1935. 
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rising "corporate liberal" coalition guided the 
formulation of the Social Security Act, 
thereby placing policy proposals onto the 


` agenda of political leaders that reflectec a 


new global strategy favoring that coalition's 
economic and political interests. Yet, end 
third, this “autonomy” is “relative” in that 
political actors operate within a capitalist 
context that defines the major contending 
interests and sets limits on the range of likely 
solutions. Thus, the economic crisis of the 
Great ‘Depression generated a wave of pro- 
tests and, in response to convergent interests 
in political stability and: economic growth, 
capitalist coalitions and ` political leaders 
developed alternative programs. that re- 
sponded to these underlying problems within 
a capitalist framework. 

Poulantzas’ theory, however, has three 
major limitations. First, it does not account 
for the complexities of "class: struggles,” 
especially, the multigroup bases of social 
protests and the links between extra- 
institutional and conventional politics. Draw- 
ing on theories of social protest (Piven and 
Cloward 1977;. Tilly 1978), we argue that 
protest is rooted in political exclusion and, 
attempts'to renegotiate the terms of political 
access. Ín other words, the conflicts are 
largely over political access rules and the 
protesters are frequently drawn from diverse 
groups whose primary interest lies in altering 
these rules. Moreover, these "struggles" 
frequently combine unruliness with institution- 
alized pressure activities. Second, the theory 
lacks an analysis of political opportunities, 
implicitly treating them as constant. We argue 
that new opportunities are primarily created 


'by upper-class splits which facilitate party 


realignments, thereby creating incentives for 
elected officials to support bids for political 
access (Piven and Cloward 1977; McAdam 
1982; Jenkins 1985). Third, the theory does 
not analyze the processes involved in hege- 
monic competition. We draw on Domhcff 
(1978): to argue that the dominance of 
particular capitalist coalitions depends on 
policy-planning efforts and campaign invest- 
ments. Hegemonic competition, then, is a 
process by which rival capitalist blocs use 
policy, planning and electoral investments to 
guide responses to emerging political crises. 
Political dominance depends on victory in 
both arenas. Before turning to an examination 
of this ‘political.’struggle theory, we look 
briefly at how these arguments depart from 
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recent analyses of the formulation of Social 
Security. : 


` RECENT NEO-MARXIST AND 
. STATE-CENTRIC ANALYSES OF THE 
SOCIAL SECURITY ACT $ 


Dirawi on Althusser’ s structuralism and 
Gramsci’s theory of hegemony, Poulantzas 
(1973, 1980) argued that the modern liberal 
democratic state cannot be understood. as an 
' instrument. of a dominant class or class 
fraction but as an arena of class struggle: 
"The State is not a monolithic bloc but a 
strategic battlefield" (1980, p..152). The 
“relative autonomy" of the liberal state 
means that it performs two mediating political 
functions: (1) “It represents and organizes the 
‘dominant class or classes; or more precisely it 
represents and organizes the long-term politi- 
cal interests of a power bloc, which is 
composed of several bourgeois class frac- 
tions" (1980, p. 127); and (2) "by bringing 
the power bloc and certain dominated classes 
into a variable game of provisional compro- 
mises . . . [which] often involves imposing 
material compromises on the various fractions 
of the power bloc," it recreates thé conditions 
for capital accumulation and political stability 
(1980, p. 140). In short, the liberal demo- 
cratic state is sufficiently independent from 
particular capitalist interests that it can serve 
as an arena for mediating intracapitalist 
. conflicts while simultaneously framing poli- 
cies that incorporate lower-class demands in 
ways that reinforce capitalist development. 
This double mediation process has largely 
been missed in recent analyses. Quadagno 
(1984, 1985) focused on the intracapitalist 
rivalry, arguing that the Social Security Act 
represented a compromise between monopoly 
and competitive capitalists. The former initi- 


ated policy planning through the American: 


' Association for Labor Legislation (AAI), the 
Business Advisory Committee (BAC) and the 
Presidential Advisory Committee on Eco- 
nomic Security (CES), while the latter (small 
manufacturers and Southern plantation own- 
ers) used Congressional power to block 
features threatening their labor controls (esp. 
federal controls over unemployment insur- 
ance and emergency relief). The state. was 
"relatively autonomous" in providing an 
arena for constructing this compromise. Yet 

‘working-class protests .were ‘largely. irrele- 
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vant, the only deion pressure coming 
from the middle-class Townsend Clubs. 

This analysis has four problems. First, it 
misanalyzes the intracapitalist split. In. the 
early New Deal, both small and large 
business generally supported the Administra- 
tion, hoping for economic recovery under the 
NRA and other emergency legislation. When 
concerted business opposition did emerge 


-after 1934, we will show that it followed the 


cleavage between two rival coalitions: (a) a 
conservative bloc led by representatives of the 
House of Morgan and the DuPont-General 
Motors complex -with support centered in 
labor-intensive domestic industries; and (5) a 
rising "corporate liberal": coalition led by 
representatives of the Rockefeller and "our 
crowd" bankers supported by advanced export- 
ofiented industrialists (Burch 1980, pp. 25- 
52; Ferguson 1983, 1984). At least on the 
Social Security issue, the split was not simply 
competitive versus monopoly sector. 

Second, Quadagno neglected the , links 
between the policy planning and electoral 
activities of these coalitions. With the 1928 
elections, several leading bankers and indus- 
trialists founded the Democratic National 
Committee to contro] the selection of Presi- 
dential candidates (Lundberg 1937, p. 452- 
57; Ferguson 1983, 1984). Throughout the 
New Deal; several of these progressive 
capitalists provided policy advice, accepted 
administrative positions, and mobilized busi- 
ness support for the Roosevelt Administration 
(Domhoff 1970, pp. 207-18; Burch 1980, pp. 
25-52; Collins 1978; McQuaid 1976, 1978). 
In the mid-1930s, business conservatives 
formed rival policy organizations such as the 
Sentinels for the Republic and the Liberty 
League to advance their "American Plan" 
alternative and heavily funded Alf Landon's , 
campaign against Roosevelt in the 1936 
elections. Hegemonic competition rested on 
both policy planning and electoral efforts. 

Third, the analysis misses the significance 
of working-class protest. Setting aside for the 
moment the contradictory claims that “work- 
ing-class demands" were “incorporat[ed] on 
capitalist terms" and. yet working-class "in- 
put" played "almost no role" (1984, pp. 
645-46), the key problem is looking for 
working-class “input” in the policymaking 
activities of the White House and Congress. _ 
Since the industrial working. class was: - still y P. 
politically excluded (Griffin, Wallàce;: f and 
Rubin 1986), its pay leve: gé.- should 
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instead have been in extra-institutional politics: 
illegal strikes, protest rallies, mass marches, 
and related forms of social protest. We will 
argue that working-class protests interacted 
with other challenges and the resulting 
electoral instability to create an elite sense of 
‘an impending political crisis. 


Finally, the analysis and Sbe an ex- 


change with Skocpol and Amenta (1985) 
‘deals primarily with policy-making, not policy 


formulation. Policy formulation is the process 


of defining the goals for major policy 
innovations and inserting them in the political 
agenda, while policy-making is the process of 
specifying the statutory and administrative 
means to realize these goals (Domhoff 1978, 
p. 10). Instead of focusing on Congressional 
debates over statutory details (e.g., Ohio vs. 
Wisconsin plans; national vs. state administra- 
tion), the study of policy formulation should 
focus on the development of basic policy 
ideas and their insertion on' the agenda of 
national political leaders. By’ the time the 


‘Social Security bill had reached Congress,’ 


“the basic agenda for policy consideration 

[should have] already been set by other elites, 
‘chiefly, the large capitalists, foundation 

directors, and university-based researchers of 

the policy-planning network" (Dye 1979, p. 

226) and, we will argue, by broader political 

processes, including protests and intracapital- 
, ist competition. 

What of the state-centric approach? Begin- 
ning :with the premise of “the potentially 
autonomous role of all kinds of political 
officials" (Skocpol and Amenta 1985, p. 
574), these analysts have argued that the 
initiative for Social Security came from 
progressive political officials allied with 
middle-class reformers and policy intellectu- 
als, labor officials, and a "handful" of liberal 
capitalists who responded to a general “dem- 
ocratic upsurge” 
1983; Amenta et al. 1988;, Orloff 1988). 
Previously, this “coalition for bureaucratic 
autonomy” had failed to carry through its 
social policy agenda because of middle-class 
opposition to` patronage-based parties, the 
isolation and weakness of organized labor, 
and the administrative weakness of the U.S. 
state. These barriers, in turn, were rooted in 
the development of the state, especially its 
early democratization (Orloff and Skocpol 
1984). In constructing the welfare state, the 


architects drew. on institutional traditions, - 


devising a contributory federal-state system 


(Skocpol and Ikenberry 
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based on private insurance ‘ideas that distin- 


guished between and “emer- 
gency relief.” . 

The chief problem with this analysis is that 
it better explains the lateness of the American 
welfare state and its peculiar form rather than 
the political forces that produced it.5 More- 
over, by defining the;"state" as governmental 
institutions, the analysis collapses the distinc- 
tion between policy formulation and policy- 
making and implies that the autonomy of the 
state is potentially absolute, that is, the state 
is a self-regulating and self-contained process 
which is unrelated to class interests and 
conflicts. 

This analysis also rests on several empirical 
errors. Skocpol and Amenta assert that 
“virtually all politically active business lead- 
ers and organizations strongly opposed na- 
tional and state-level pensions and social 
insurance" and that business support was 


"insurance" 


'limited to a "handful of liberal-reformist 


businessmen who remained in contact with ` 
the New Deal after mid-1934” (italics added; : 
1985, p. 572; see also Weiriet al. 1988). We 
will demonstrate, however, that liberal capi- 
talists controlled the formulátion of the major 
proposals, that capitalist leaders generally 
supported them, and that concerted opposition 
emerged late in :he policy-making battles 
because of the formation of a rival conserva- 
tive bloc that used social security taxes as a 
secondary theme in its general attack on the 
New Deal. 

Finally, state-centric analysts have inter- . 
preted a social protest thesis too narrowly 
and, despite. referring to a “democratic 
upsurge," left unclear the precise links 
between protests, electoral instability and 
popular reform pressures. A protest thesis 
does not mean that protestors secured their 
demands or that the major policy moves were 
coordinated by protest peaks. Rather a series 
of sustained protesis combined with growing 
electoral instability to create a sense of 
political.crisis among elites, thereby placing 
major reform proposals on the political 
agenda. 

What, then, accounts for the formulation of 
major social reforms? We suggest that two 

| 





? Actually, many of the supposedly "excep- 
tional" features of the American welfare state (its 
contributory base,.the lack of permanent invest- 
ment funds) are quite common (see Stinchcombe 
1985). . 
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major processes were at work: (1) a sustained 
wave of protests by excluded groups and 
threatened polity members that created elec- 
toral instability and an elite sense of crisis; 
and (2) a capitalist split created by the 
competition between ‘a “corporate liberal” 
bloc and a conservative coalition, thereby 
creating opportunities for outsiders and a 
series of reform initiatives by the liberal bloc. 
Both, in turn, stemmed from the economic 
crisis of the Great Depression. Figure 1 charts 
the basic hypothetical links between crisis 
events and political realignments that we 
think divided capitalists and stirred protests. 


WHAT ABOUT SOCIAL PROTEST? 


As the Great Depression spread, it created 
persistent mass unemployment, devastated 
whole communities, and generated a massive 
and sustained upsurge of protest. In past 
recessions, economic hardship had stirred 
isolated and small-scale protests, but in the 
Great Depression the response was massive, 
almost immediate, and, most important, 
persistent. Veterans marched en masse on 
Washington to demand early pensions; angry 
crowds attacked bakeries, hotels, and grocery 
stores, demanding food; threatening mobs 
blocked housing evictions and held rent 
strikes; the unemployed created councils and 
organized mass marches on city hall demand- 
ing (and receiving) outdoor relief (Piven and 
Cloward 1977, pp. 48-60). Figure 2 charts 
the unemployment rate, the number of 
unemployed protests, and estimated protest- 
ors as derived from a content, analysis of The 
New York Times (Kerbo and Shaffer 1986).* 
This politicization of unemployment emerged 


as a result of the scale and persistence of | 


unemployment which rose rapidly from just 
over 3 percent in 1929 to a peak of 25 percent 
_in 1933 and then only gradually declined. The 
effects spread throughout the class structure 
so that, by the mid-1930s, virtually all 
households were affected. In industrial cities 
like Toledo and Cleveland, unemployment 
reached rates of 50 and 80 percent, devastat- 





* Unemployed protests were defined as collec- 
. tive actions (marches, rallies, protests) with more 
than IO participants by unemployed individuals. 
Stories were identified through The New' York 
Times Index and then coded from the actual news 
story. For coding details, see Kerbo and Shaffer 
(1986). 
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ing small Ó— and the traditional 
security of white-collar jobs (Katz 1986, pp. 
206—9). This politicization of unemployment 
was also rooted in increasing elite recognition 
of unemployment as a social, not an individ- 
ual, problem. Starting with President Hard- 
ings Commission on Unemployment in 
1921, political leaders and progressive capi- 
talists launched discussions of the unemploy- 
ment problem, developing private “welfare 
capitalist” solutions (Brandes 1976; Kerbo. 
and Shaffer 1986). Some even advocated , 
public programs güided by a "preventive" 
approach to unemployment ‘insurance and 
extending protection to old age ànd health 
(Lubove 1968). When the Great Depression 
hit, the fiscal crisis of welfare capitalism 
immediately politicized the issue and legiti- © 
mized public initiatives in the eyes of many, 
including progressive business leaders. 
Wage cuts and threatened layoffs also set 
off a sustained strike wave, first with the . 
craft-based economic strikes of 1929-30 and, 
then, following the passage of Section 7a of 
the NRA in 1933, an upsurge of organiza- . 
tional strikes in the mass production indus- 
tries (Bernstein 1970; . Rubin, Griffin, and 
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Wallace 1983). In the face of determined 
employer attempts to destroy the union 
movement and defy the NRA, the strike wave 
became more militant and politicized, giving 
rise to the bloody “Battle of Toledo” in April 
1934, the general strikes in Minneapolis and 
San Francisco in the spring and summer of 
1934, and the bitter Kohler and regionwide 
southern textile strikes that summer (Brecher 
1972, pp. 150-77). As Figure 3 shows, the 
number of strikes took off in 1931 and rose to 
a peak in 1937. With the shift into mass 
production industry and the increasingly 
political character of the upsurge, the strikes 
become larger and more disruptive. Mass 
strikes of 10,000 or more rose from a single 
event in 1930 to 17 and 18 in 1933-34 and 26 
in 1937 while the number of strikers rose 
from 183,000 in 1930, to 1,470,000 in 1934 
and 1,860,000 in 1937 (U.S. Bureau of Labor 
Statistics 1936-55). Most important, the 
strike wave developed an expansive momen- 
tum that threatened further disruptions. Figure 


3 charts this dynamic, using Cronin's (1979, - 


p. 11) wave frequency index. Peaking in 
1932-35 and again in 1937, the strike wave 
refused to subside and temporizing conces- 
sions (especially Section 7a of the NRA) only 
worked to reinforce it. 

The Depression also threatened the urban 
middle class and small farmers, setting off 
protests by polity members. In the spring of 
1934, the Townsend movement launched an 
intense petition and letter-writing campaign, 
_ establishing Townsend Clubs in most Congres- 
sional districts and gathering over 2.5 million 
signatures for the Townsend Plan of a $200 
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monthly old age pension to be financed from 
a 2 percent financial transaction tax (Holtz- 
man 1963, pp. 48—55, 87-89). When New 
Deal agricultural and credit policies damp- 
ened the small farmer protests in the corn and 
wheat belts, the Farmer's Holiday Associa- 
tions turned to electoral politics, supporting 
third party challenges, including Senator 
Huey P. Long's third party Presidential bid 
(Shover 1965, pp. 166-67, 187-99). In the 
South, the Southern Tenant Farmers Union 
organized a flurry of strikes in 1934—35, 
contesting the AAA policies and the landlord/ 
Bourbon Democrat alliance (Grubbs 1971; 
McConnell 1953, pp. 8-10). 

These protests fed off one another, creating 
a volatile electorate that increasingly favored 
the New Deal Democrats and encouraged 
third-party challengers. The strengthened New 
Deal coalition, in turn, created opportunities 
which reinforced these processes. The pro- 
tests accelerated the voter realignment, giving 
the Democrats a series of victories, most 
critically the 1934 by-elections in which, for 
the first (and only) time in U.S. electoral 
history, the Presidential party increased its 
Congressional margin in an off-year election. 
Significantly, voter volatility in the form of 
party switching and increased turnout was 
greater in urban working class districts 
(Sundquist 1983, pp. 229—39), the same tyne 
of districts in which the protests had been 
concentrated. Meanwhile, the Townsend and 
small farmer movements pressured Congres- 
sional representatives (including many Repub- 
licans) to support reforms. The protest wave 
and volatile electorate also encouraged third- 
party challengers, from Upton Sinclair's End 
Poverty In California campaign, the Non- 
Partisan League and Farmer-Democrat efforts 
in the midwestern farm belt, and, finally and 
most importantly, Senator Long's Presidential 
bid based on an alliance between his Share 
the Wealth Society, Father Coughlin's Na- 
tional Union for Social Justice, the Townsend 
movement, and the Utopian Society (Harris 
1975, pp. 294—318; Shover 1965; Brinkley 
1982). The threatened Long campaign finally 
goaded the Roosevelt Administration into 
action. In the spring of 1935, the Democratic 
National Committee commissioned a secret 
opinion poll to gauge the third-party threet, 
concluding that “Long might have the 
balance of power in the 1936 election” 
(James Farley, cited in Brinkley 1982, >. 
208). Outmaneuvering the challengers, Presi- 
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dent Roosevelt called on Secretary of Labor 
Frances Perkins to head the CES, instructing 
her >- : 


We must not delay.. I am convinced. We must 
have a program by next winter and it must be in 
operation before many more months have 
passed. (cited in Perkins 1946, p. 281) 


What, then, was the protest impact? As 
Skocpol (1980, pp. 187-89) has shown, New 
Deal policy moves were not consistently 
timed by strike (or other protest) peaks. Nor 
did the protestors (especially the unemployed 
and the Townsend-Long-Coughlinites) di- 
rectly secure their demands. Rather, multiple 
protests became sustained and threatened 
continued economic disruptions (including 
the fiscal strains of growing emergency 
relief) and sufficient electoral instability in 
form of voter volatility and third-party 
challenges that political and business leaders 
became convinced of an impending political 
crisis. In place of emergency measures (e.g., 
emergency relief, bank holidays, ad hoc 
government-business planning, the Patman 
Bonus bill, the Wagner-Lewis and Dill- 
Connery bills), President Roosevelt turned to 
permanent and comprehensive solutions, pro- 
posing an Economic (later Social) Security 
Act alongside "Second New Deal" proposals 
for public works spending, rural electrifica- 
tion, progressive taxation, and renewed agri- 
cultural production controls (Perkins 1946, 
pp. 284-85; Martin 1976, pp. 345-46). 

The sense of impending political crisis and 
"dangerous reforms" was clearly uppermost 
in the minds of the social security architects. 
Secretary Perkins later explained that the 
Townsend plan "both drove us and confused 
the issues" (Perkins 1963, p. vi). Writing to 
his staff in the midst of drafting the 
legislation, Edwin Witte, Director of the CES 
Technical Board, warned that "the Townsend 
movement has become a terrific menace 
which is likely to engulf our entire economic 
system" (cited in McKinley and Frase 1970, 
p. 11). With Townsend and Lundeen propos- 
als before Congress and the House having 
already passed the noncontributory Dill- 
Connery old-age pension bill the previous 
session, Rep. Robert Doughton (D.-N.C.), 
cosponsor of the Social Security bill, intro- 
duced it to the House of Representatives: 


The existence of such a large relief problem, the 
presence of insecurity on such a vast scale is a 
serious threat to our economic order. . . . The 
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fact that several of these (alternative) proposals 

have attracted a wide-spread following implies a 

threat to our existing institutions which should , 

not be regarded lightly. (U.S. Congress 1935, p. : 

5468) 
In sum, an expanding protest wave created - 
sufficient economic disruptions and political 
volatility that political leaders became con- 
vinced of an emerging political crisis, thereby 
placing social reforms on the national politi- 
cal agenda. 


THE RISE OF A "CORPORATE 
LIBERAL" POWER BLOC 


At critical points, the New Deal depended 
heavily on the support of prominent business 
leaders as well as a "brain trust" of policy 
intellectuals. Liberal business leaders funded 
and directed major policy organizations, such 
as the Twentieth Century Fund and the 
AALL, provided campaign funds, served on. 
the BAC and other governmental advisory - 
panels, held governmental offices, and pub- 
licly endorsed many. of the New Deal reforms 
(for a summary, see Burch 1980, pp. 13-68). 
On Social Security, virtually every account of 
the formulation of the proposals shows 
extensive involvement of these "corporate 
liberals" and academic experts (Altmeyer 
1966, pp. 32-33; Eakins 1966, pp. 208—10; 
Nelson 1969, pp. 129, 200—204; Domhoff 
1970, pp. 207-18; Brown 1972; Berkowitz 
and McQuaid 1980, pp. 101—4; Burch 1980, 
pp. 35-45; Quadagno 1984; Domhoff 1986/ 
87). Yet, contesting a capitalist dominance 
interpretation of this involvement, Skocpol 
and Amenta contend that “virtually all : 
politically active business leaders and organi- 
zations strongly opposed national and state- 
level pensions and social insurance" and, at 
least after 1934, the rest of the New Deal 
initiatives (1985, p. 572). 

This dispute hinges on three issues: Did 
liberal capitalists control the formulation of 
the social insurance proposals or were these 
independently developed by policy experts 
and Administration officials? Second, did the 
capitalist class or particular fractions and their 
political leaders sport or oppose these propos- 
als and, if so, at what point and why? Finally, 
what does this tell us about the broader 
political alignments of the capitalist class? 


Formulating Social Security Proposals 
We think the evidence is quite clear that 
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prominent business leaders controlled the 
formulation of the two major planks of the 
Social Security Act: unemployment insurance 
and old age pensions. Unemployment insur- 
. ance proposals were largely developed by the 
AALL, which brought together progressive 
‘employers and academic experts in the 1920s 
to work on various private insurance propos- 
als. In 1930, the AALL turned from 
disability and health insurance to unemploy- 
ment, developing a model bill for state-level 
insurance based on a “preventive” approach. 
In 1932, the state of Wisconsin adopted the 
plan (hence the “Wisconsin plan”) and, under 
AALL lobbying, the Democratic Party incor- 
porated it into its platform. Simultaneously, 
the rival American Association for Old Age 
Security (AAOAS) developed an alternative 
“Ohio plan” focusing on need rather than 
prevention by pooling funds and universaliz- 
ing coverage. The AALL won this policy 
dispute. In February 1934, Secretary Perkins 
asked Paul Rauschenbush, the AALL’s Wash- 
ington lobbyist, to draft the Wagner-Lewis 
bill, which served as the basis for subsequent 
: discussions inside the CES. AALL represen- 
tatives occupied the key staff positions within 
the CES Technical Board, drafted the final 
bill and, despite secondary disagreements 
with business representatives in the BAC and 
the CES Advisory Council (several of whom 
were AALL members), wrote their major 
planks into the final legislative proposals 
(Witte 1962, pp. 29-30; Berkowitz and 
McQuaid 1988, pp. 109-14). 

. Was the AALL an association of “reform- 
minded scientific -experts" (Skocpol and 
Ikenberry 1983, p. 100) or representative of 
the "moderate forces within the power elite" 
(Domhoff 1970, p. 174)? Although academ- 
ics and middle-class reformers (such as John 
Commons, Raushenbush, and Director John 
Andrews) were formal leaders, composed 
most of the membership and carried out the 
basic research, reform-minded business lead- 
ers such as Louis Brandeis, Bernard Baruch, 
Lincoln Filene, Morris Leeds, Henry Denni- 
son, and Ernest Draper served on the 
governing board, guided policy discussions 
and provided funds. Wealthy benefactors and 
private foundations, 
Fund, John D. Rockefeller, Jr., Elbert Gary 
(U.S. Steel), Charles M. Cabot (Cabot 
Corporation), C. Temple Emmet, Madeline 
Astor (daughter of John Jacob Astor), and 
Anne Morgan (daughter of J.P. Morgan) 


such as the Milbank: 
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provided general funding as well as support 
for special projects such as the 1932 promo- 
tional campaign (Domhoff 1970, pp. 172- 
73). This gave “corporate liberals" the ability 
to control the AALL agenda, define the major 
problems and modify proposals devised by 
the experts. Significantly, the AAOAS was 


: an association of “scientific experts" and lost 


out in the policy discussions partially because 
it lacked capitalist support. 

The business liberals controlled the devel- 
opment of the old age insurance proposals by 
staffing of CES Technical Board and partici- 
pating in the CES Advisory Council. In the 
early 1920s, John D. Rockefeller ("Junior") 
created the Industrial Relations Council (IRC) 
as. a general labor relations institute to 
develop private welfare systems (Collier and 
Horowitz 1976, pp. 240-41). Because the: 
Federal government lacked the research 
capacity to evaluate social insurance propos- 
als, CES Director Edwin Witte hired Bryce 
Stewart, Director of the IRC, to direct the 


- Technical Board and employed his staff to 


handle the actual studies. As Witte later 
explained, "the arrangement in effect 
amounted to employing the Industrial Rela- 
tions Counselors, Inc. to make this study" 
(1962, p. 29). As for the final proposals, 
three IRC affiliates— Murray Latimer, then 
serving as Chairman of the Railroad Retire- 
ment Board, J. Douglas Brown of the 
Princeton Industrial Relations Department, 
and Barbara M. Armstrong, a law professor at 
the University of California (also active in the 
AALL)— joined Otto Richter, an actuary for 


` the Metropolitan Life Insurance Company. to 


devise the final proposals adopted by the. 
Advisory Council and the CES Cabinet 
Committee (Witte 1962, pp. 29-30; Brown, 
cited in Berkowitz and McQuaid 1988, p. 
142). 

Finally, the business liberals were directly 
involved in drafting the final legislative 
proposals by the BAC and the CES Advisory 
Council. Although they did not secure their 
first priorities, they defined the major planks 
and supported the resulting bill. Organized in 
January 1933 to provide policy advice to the 
White House, the BAC had developed the 
NRA, drawing on Chairman Gerard Swope's 
earlier proposals for government-sponsored 
production controls (McQuaid 1976, 1978). 
On March 8, 1934, President Roosevelt 
turned again to Swope for advice on social 
insurance. Swope urged a comprehensive 
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public system covering life, old age, disabil- 
ity, and unemployment insurance based on 
joint employer-employee contributions with 
the employers paying two-thirds (Loth 1958, 
pp. 234-38). The BAC then organized an 
Industrial Relations Committee to develop 
proposals for unemployment insurance and, 
on April 3, 1934, Walter Teagle, Chairman of 
the Committee, circulated a memo urging 
federal legislation requiring state unemploy- 
ment insurance with uniform practices. The 
Committee soon split, however, with Fred 
Kent (Bankers Trust) circulating a meme 


blasting unemployment insurance because’ 


“buying power without work would be 
harmful to men and destroy their contentment 
in life.” On June 28, 1934, the General 
Council met on the issue. Eight members 
(Karl Compton, President of MIT; Ernest 
Draper; Lincoln Filene; Ralph Flanders, Jones 
and Lawson: Machine; Morris Leeds, Leeds 
and Northrup; Gerard Swope; Thomas J. 
Watson, I.B.M.; Robert E. Wood, Sears- 
Roebuck) voted favorably and four opposed 
(Robert Lund, Lambert Pharmaceuticals and 
Executive Vice President of the. NAM; Gano 
Dunn, Grace Bank; Walter Gifford, AT and 
T; and E.C. Van Diest, Utah. Ice and 
Storage). Despite.. the split, Secretary of 
Commerce Roper conveyed a positive recom- 
mendation to the White House, and the next 
day, June 29, President Roosevelt announced 
the creation of the CES to develop legislative 
proposals (Nelson 1969, p. 205). E 
Roosevelt placed his key cabinet members 


‘(Secretaries Perkins, Morgenthau, and Wal-. 


lace, and Emergency Relief Commissioner 
Harry Hopkins) in charge of the CES, with an 
Advisory Council of business, labor, and 
academic representatives and a Technical 


Board. Perkins recruited three pro-insurance 


members from the BAC Labor Committee 
(Teagle, ‘Swope, and Leeds) along’ with 
Marion Folsom (Eastman Kodak) and Sam 
Lewisohn. (Miami Copper, Lewisohn’ and 


= 


5 This and following quotes in the following 
paragraphs come from the BAC “Memorandum on 
Unemployment Insurance—Comments by Mem- 
bers of the BAC,” 26 October, 1935, located in 
Record Group 40, Histories of the BAC, General 
Records Department, Dept. of Commerce, Office 
of the Secretary, Histories of the Business 
Advisory Council, Box NN3-40-79-3, File 94765, 
Social Legislation Committee #2, National Ar- 
chives, Warner, DC eder BAC Arch); 
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Sons, and! ' Equitable Life) as business repre- 
sentatives.. With the CES. scheduled to 
assemble in November 1934, the BAC again 
debated the insurance issue. In July, the BAC 
split again and, in early August, Alfred Sloan 
(DuPont, G.M., Johns-Manville, and Pull- 
man Car Co.) resigned in protest and joined 
other BAC conservatives (who temporarily 
stayed on the BAC)—Pierre DuPont, John J. 
Raskob (DuPont, G.M., and Bankers Trust), 
Francis Davis, Jr. (U. S. Rubber)—to launch 
the Liberty League attack against the New 
Deal (Berkowitz and McQuaid 1980, pp: 
90-91; Burch 1980, .p..19; Zilg :1974;. p. 
324).5 The major dispute, however, was, not 
unemployment insurance but the NRA labor : 
provisions (McQuaid' 1976, pp.’ 176-77; 
Skocpol and Ikenberry 1983; p. 174-77). On 
September. 19, the BAC endorsed the Com- 
mittee’s unemployment insurance recommen- 
dations modeled on-the AALL proposals and 
charged with “drafting a. T for Federal 
enactment"? 

; When the CES- assembled in Noveinber 
1934, there were three available unemploy- 
ment insurance proposals: a national contrib- 
utory system with Federal administration and 
provisions] for private -industrial plans . (a 
synthesis of the "Ohio" plan and Swope's 
BAC proposal); a subsidy plan in which the ' 
Federal government collected the taxes, and 
held the reserves, granting funds to the states 
which administered the programs; and the 
AALL’s tax-offset plan allowing state's their 
own programs. The business representatives 
on' the Advisory ‘Council pressed for' the . 
subsidy plah and the Technical Board the 
national plan but, confronted with the Su- 
preme Court overturning of the Railroad 
Retirement Act as violating state government 
powers, thé CES concluded that the tax-offset 
plan was the only version that would survive 

t E 


i 
ô This conflicts with Berkowitz and McQuaid's 
(1980, pp. 90-91) assertion that DuPont, Raskob, 
and Davis resigned from the BAC at this point. 
According to Zilg (1974, p. 324), Pierre DuPont 
remained the BAC through the summer of 
1935. Raskob was later appointed to the BAC . 
Social Legislation Committee on September 20, 
1934 (BAC Archive, Document RG 40, Box 
NN3-40-79-8) and Francis Davis attended the May 
2, 1935, BAC meeting endorsing the Social 
Security Act (New York Times, 3 May 1935, p. 1). 
7 BAC Archive, Document RG 40, Box NN3- 

40-19-3. o 
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inevitable Supreme Court tests.? On old age 


-insurance, the CES adopted the proposal: 


developed by the Technical Board and 
endorsed by the Advisory Council for a 
national contributory plan as the only admin- 
: istratively feasible and fiscally sound program 
(Brown, cited in Berkowitz and McQuaid 
1988, pp. 120-21). Adding minor provisions 
for mother’s pensions and children's relief, 
the CES then submitted the bill to Congress. 
In sum, liberal capitalists did not agree with 
the specifics of the proposals (especially the 
administration of unemployment insurance), 
but all of the proposals that received serious 
consideration by the CES had been developed 
by business policy organizations and their 
representatives. The controversy was, in 
effect, over administrative issues, not basic 
policy goals. 


Capitalist Support and Opposition 


No perfect method exists to estimate capitalist 

.support for the Social Security proposals at 
the time of their formulation. The only 
reliable general survey of business attitudes 
was conducted by Fortune four years after- 
wards. Interestingly, Social Security proved 
the most popular of the Second New Deal 
measures, fully endorsed by 24 percent of the 
corporate executives, 58 percent desiring 
modifications and 17 percent asking for 
repeal (cited in Krooss 1970, p. 193). 

More relevant to the policy process were 
the stances of capitalist political leaders. In 
February 1935, the National Publishers Asso- 
ciation surveyed trade association representa- 
tives and magazine editors who were presum- 
ably in direct contact with business leaders. A 
majority claimed ignorance or no opinion on 
the proposed bill, and, of those expressing an 
opinion, most offered a qualified favorable 
evaluation. As L.C. Morrow summarized the 
survey results in his Congressional testimony: 


3 While it is true that the experts were split and 
Roosevelt preferred a state plan (Altmeyer 1966, 
pp. 21—22; Witte 1962, p. 111), the constitutional- 
ity issue was uppermost in the minds of the 


` > Cabinet Committee (Perkins 1946, pp. 286-87; 


Witte 1962, pp: 47-62; Nelson 1969, pp. 298- 
311; Folsom testimony, U.S. Senate Finance 
Committee 1935, pp. 555—59). The state plan also 
allowed states to adopt at their own discretion the 
"Ohio" alternative, thereby satisfying employer 
demands for pooled reserves. : 
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“Industry, as a whole, is favorable toward the 
aims of social security and believes that some 
legislatiori must be in effect some day" (L.C. 
Morrow, U.S. Senate Finance Committee 
1935, p. 787). - 

Another gauge of business leaders is belt 


, Congressional testimony. Of 23 business 


representatives who appeared, 9 supported 
(despite reservations about minor features); 
11 endorsed the principles but opposed major 
features (especially employer unemployment 
insurance taxes), and 3 condemned the entire 
effort (U.S. House Ways and Means Commit- 
tee 1935; U.S. Senate Finance Committee 
1935). Business leaders were roughly equally 


divided with a majority either opposed or 


favoring major revisions; yet a majority also . 
endorsed the basic principles of the Act. In 
sum, business leaders took a strongly quali- 
fied but supportive stance. 


Capitalist Political Alignments 


. The Social Security debates occurred in the 


midst of a sea change in capitalist alignments. 
On the eve of the White House transmission 
of the final proposals to Congress, the two 
major business peak associations —the NAM 
and the Chamber of Commerce— endorsed 
the proposals. After a vigorous floor debate, 
the NAM convention provided “an overwhzim- 
ing vote for cooperation with the Washington 
administration in every decisive vote” (Busi- 
ness Week, 15 December 1934, p. 7). At the 
Joint Industry Conference of the NAM and 
the Chamber, delegates offered a more tepid 
endorsement, concluding (over the opposition 
of a few “sturdy belligerents who a short 
while back wanted to draw a battleline and 
defend it”) that the legislation should be 
passed pending further study (Business Week, 
22 December 1934, p. 5). But by the time of 
the Congressional hearings in the sprirg of 
1935, the stances had become mixed. Cham- 
ber of Commerce President Harriman urged 


. passage with allowances for private plans, the 


NAM concluded that the bill was going to 
pass and so attacked only the most objection- 
able features, and the Sentinals for the 
Republic contended it was “un-American” 
(U.S. Senate Finance Committee 1935, pp. 
679, 921, 940-61, 1103). 

This'split became sharper after the Congres- 
sional hearings. In May 1935, the Chamber of 
Commerce convention revolted against the 
leadership, members outshouting outgoing 
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President Harriman and applauding speakers 
charging that “businessmen are tired of 
hearing promises to do constructive things 
which turn out to be only attempts to sovietize 
America” (New York Times, 3 May 1935, p. 
1). Delegates elected an anti-New Deal 
` President and passed motions calling on 
President Roosevelt (who was notably absent) 
to suspend. work relief, delay the Social 
Security Act, abolish the NRA, and make 
_ major modifications in the recently enacted 
Securities Exchange Act. To counter the 
negative publicity, 20 BAC members held an 
impromptu press conference to reassert busi- 
ness support for the New Deal, calling for a 
two-year extension of the NRA and passage 
of Social Security (New York Times, 3 May 
. 1935, p. 1; 12 May 1935, p. 10; 18 May 
1935, p. 16; 19 May 1935, p. 2; McQuaid 
1976, p. 178). Over the summer and fall, 
: White House-BAC relations worsened, with 
threatened en masse _ resignations, . heated 
debates about failures to consult the BAC; 
and calls for a “breathing spell” in -the 
reforms (New York Times, 5 December 1935, 
p. 14). Then, two weeks later and four 
months after the passage of the Act, the NAM 
formally announced its opposition to Social 
Security, calling for a wholesale attack on the 
New Deal (NAM 1935). 

What explains these divisions? Three 
interpretations of capitalist splits over the 
New Deal have been advanced: a big versus 
small thesis (Quadagno 1984; Domhoff 1978; 
Burch 1983); a bank interest group theory 
(Sweezy 1939; Burch 1980, pp. 16-22; 
Ferguson 1984); an international product- 
cycle theory (Ferguson 1984; Gourevitch 
1984); and a consumer versus producer 
market interpretation (Lundberg 1937, pp. 
478-9]). The big/small thesis holds that 


.competitive sector capitalists opposed Social | 


Security because they could not transfer the 
new taxes to consumers, had less working 
capital, and were threatened by rising labor 
costs (Quadagno 1984, pp. 642-43). The 
larger monopoly sector.corporations could 
absorb these costs and, because of broader 
social and political links, could more readily 
perceive long-range benefits. Another version 
points to the distinction between an “upper 
. tier" and a "second tier" of the corporate elite 
has been advanced by Domhoff (1978, pp. 
85-86, 117-18) and Burch (1983) to explain 
the moderate/ultraconservative split in the 
post-World War II period. The bank group 


from Marquis’ 


d _ 901 i 
thesis points to the intense competition 
between the previously powerful House of 
Morgan and a loose alliance between the “our 


- crowd” investment bankers, the Rockefellers, 


and various regional financiers, such as A.P. 
Giannini (Bank of America) and Joseph P. 
Kennedy. Moreover, bank ties created the 
major intercorporate networks in that era 
(Dooley 1969), making them a key axis in 
capitalist political mobilization.? The interna- - 
tional product cycle theory asserts that 
technologically advanced “new products” ` 
industries favored the stimulative economic 
policies of the New Deal (including export 
promotion) because of lower labor costs and 
elastic demand curves. Less advanced “old 
product” industries opposed because of higher 
labor costs and hence payroll taxes, and less 
to gain from increased demand. Finally, the 
consumer versus producer market: thesis 
argues that, where the NRA had favored 
heavy industry by restricting production, the 
Second New Deal favored consumer goods 
and retail trade by stimulating consumption. 
To evaluate these theses, we examine the 
economic ties of the business leaders who 
took clear and consistent stances on Social Se- 
curity: the BAC members (including those at-. 
tending the press conference); the CES Advi- ' 
soty Committee members (Witte 1962, pp. 49— 
50); business representatives testifying at the . 
Congressional hearings; the officers of the.Sen- 
tinals for the Republic (Woofskill 1962, pp. 
231434); and the NAM Executive Committee 
for 1935 (NAM 1935).!? This produced a list of 
32 supporters and 29 opponents. Our primary 
aim is to explain the stances of capitalist leaders 
on Social Security and, to capture views of the 


` New Deal generally, the 1936 Presidential elec- 


tion. 

We collected data on the size of the 
primary businesses and their primary products 
Who's Who in America 
(1936-38) and Moody's Manuals (1935-36). 
In the absence of firm-level data, we 
estimated labor-intensiveness and . exports 
from industrial statistics on wages as a 
percent, of value-added and exports as a 
percent.of production for the primary prod- 
ucts of these firms (U.S. pee of the 





? Bankers’ major concern about Social Security 
was the buildup in reserves, but this apparently did 
not Mosi a major dispute. 

Provided by Philip Burch from the NAM 
Archives. : 


Census 1938, pp. 473-52, 752-75). Data on . 


bank group | ties were derived from the U.S. 
Senate National Resources Committee (1939, 
p. 169; Sweezy 1939, pp. 306-17) and data 
on the “our crowd”/Rockefeller complex 
from Birmingham (1967) and Burch (1980, 
| pp. 17-20, 50-52). Consumer and producer 


industries were coded by using their primary 


| products to distinguish extractive and heavy 
manufacturing industries from light industry 
and services. Following Lundberg (1937, p. 
479), banks were classified with. heavy 
[industry unless they .were extensively in- 
;volved in financing real estate and retail 
` jtrade. Trade’ association representatives were 
, classified with their respective industries. 

: First, we used crosstabs and Pearson 
correlations ,to identify the basic patterns; 


jreating stances on Social Security as a 


‘dummy variable (opposition = 0; support = 1).! 
‘The evidence clearly fails to support either 
version of the big/small thesis, supports the 
bank group' argument and provides 'some 
‘support for the product-cycle and consumer 
market theses (Table 1) Firm size was 
‘negatively related to. pro-Social Security 
lr — .093; p 7.4779). In fact, if anything, 
,,the very largest capitalists were more opposed 
. ‘with such ' 
iPierre DuPont and Howard Pew (Sun Oil) 
leading the opposition. Support and opposi- 
tion were roughly evenly distributed across 
the businesses by size. As for the product- 
'cycle theory, the export thesis worked 
(r= .348; p=.006) while the labor-intensity 





proposition was very weakly supported (r= ' 


i — 122; p=.3493). The strongest factor was 


the. bank ties. The Rockefeller and "Our. 
“Crowd” figures were all supportive while. 


lover two-thirds of the Morgan, Mellon, and 


DuPont affiliates were opposed and the. 


‘nonaffiliated were evenly divided. Using 
‘dummy variables and treating the Rockefeller 
{and “Our Crowd” banks as 4 liberal bloc, the 
‘Mellon, Morgan, and DuPonts as conserva- 
‘tives, and the unaffiliated as neutrals, the 
bank groups. emerged as the strongest factor 
‘(Rockefeller/“Our Crowd" r = .471, p = 
` .0001; Morgan/Mellon/DuPont r= — .36, p 


11 Statistical analysis was performed with SAS, 
Pearson CORR, REG, and GLS procedures. While 
iprobit could have been used, linear regression is 
more familiar and the results would have been the 
{same given the roughly POPE number of pro and 
«con groups. 


: weak support (r = .19, p = 


'superich" figures as Irenee and 
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=" 004; unaffiliated r = —.052, p = .692). 
The consumer markef thesis also received 
137): 

Was this due to a single underlying 
dimension or a complex of factors? /To assess 
this, we turned to multiple regression: analy- 
sis, using dummy variables for the banks and 
consumer market (Table 2). Bank ties (espe- 
cially Rockefeller/“Our Crowd") were the 
only significant. factors to emerge from the 


various equations (using a significance level, : 


of an unstandardized coefficient at least twice 
the standard error). The. industrial characteris- 
tics were largely explained away by the bank 


networks, indicating that these were the- 
. product of selective networks of the banks. 


The Rockefeller/“Our Crowd" banks were 
closely tied to the exporters (r — .522), less 
labor-intensive firms (—.433), and those 


` involved. in consumer markets (.175), while 


the Morgan/Mellon/DuPont cliques were 
linked to domestic (— 240), labor-intensive 
firms (.240) operating. outside of consumer 
markets (—.221). Except for smaller assets, 
the unaffiliated either fell near the mean ‘of 
the sample or resembled the Morgan/ 
Mellon/DuPont clique (—.200 for exports, 
—.129 labor-intensity; ‘and .059 consumer 
markets). In general, the bank-tied firms were 
latger, the Rockefeller and Morgan groups 
being the largest (mean assets of $1334 and 
$1237 millions), followed by the DuPont 
($746 million), *Our Crowd" ($168 million), 
and Mellon firms ($88 million) with the 
unaffiliated firms the smallest ($36 million as 
opposed to a sample mean of $286.3 million). 

Overall, it seems that the bank groups 
served as the centers for political mobiliza- 
tion, their selective networking with particu- 
lar industries working to structure support and 
opposition. While industrial differences may 
have reinforced these alignments, the bank 
networks played the primary role. Overlap- 
ping bank and industrial ties flowed from the 
investments, loans, and interlocking director- 
ates of the banks. The Rockefellers had only 
recently moved into banking, acquiring. the 
Equitable and Chase banks. Both were 
commercial banks, lacking major industrial 
ties. The “Our Crowd” bankers had heavily 
financed the expansion:of retail trade and 


related consumer industries (Birmingham , 


1967). The Morgan banks, by contrast, were 
extensively involved in ‘public. utilities and 
heavy manufacturing and thus overlapped 
with the steel, rubber, and automobiles of the 
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Table 1. Capitalist Support and Opposition to the Social Security Act of 1935 








. Pro Con Totals 
Size of firms: % N % N % N 
(Millions of assets) 
$500 + 33 (3) 67 (6) 100 (9) 
10-500 75 (6) 25 (2) | 100 (8) 
50-100 44 (4) 56 (5) ; 100 (9) 
10-50 59 (10) 41 (7) | 100 (17) 
1-10 56 (5) 44 (4) | 100 (9) 
Not available 50 (3) 56 3). 100 (6) 
Trade associaton representatrives 33 (1) 67 a) | 100 (3) 
Labor-intensiveness: 
Wages as a % value-added: l 
Less than 20% É 73 (8) 27 (3) 100 a1) 
20-29% 46 (12) 54 (14) 100 (26) 
30-39% 60 (6) 40 (4) 100 (10) 
40-49% 50 (4) 50 (4) 100 (8) 
5096 + 33 (2) 67 (4) 100 (6) 
Export orientation: 
Exports as a % of production: 
15%+ 78 aD 27 (4) 100 (15) 
10-1496 29 (2) 71 6) 100 (7) 
5-9% 55 (6) 46 (5) 100 (11) 
Less than 5% 41 (9) 59 (13) 100 (21) 
Not available 67 (4) 33 (2) 100 + (6) 
Bank group affiliations: l 
“Our Crowd” 100 (9) 0 (0) 100 (9) 
Rockefeller 100 (3) 0 (0) 100 (3) 
Morgan 33 (2) 67 (4) 100 (6) 
Mellon 25 (1) 75 (3) 100 (4) 
DuPont 25 (1) 75 (3) 100 (4) 
Not affiliated 46 (11) 54 (13) 100 (24) 
27 23 
Not available 45 (5) 55 (6) (11) 
Consumer/producer markets: 
Consumer 65 (17) 35 (9) 100 (26) 
Producer 43 (15) 57 (20) 100 (35) 


Mellon and DuPont cliques (Burch 1980). 
Yet the industrial bases of these bank groups 
were quite diverse, the latter including 
chemical ‘and petrochemical as well as other 
advanced firms (Zilg 1974). 

The effects of labor-intensity were complex. 
When entered into the multiple regression 
equations, labor-intensity reversed directions 
from the: Pearson correlations. While never 
independently significant, this suggests a 
possible interaction effect. To explore interac- 
tion effects, we constructed all possible 
.two-variable interaction terms and respecified 
the bank groups to examine the unaffiliated. 
Using forward stepwise regression, the only 
interaction term that emerged as possibly 
significant was that between the unaffiliated 
and labor intensity (Equation 7, Table 2). 


Among the bank-tied, it made no difference. 
But among the unaffiliated, the greater the 
labor-intensity,. the greater the support for 
Social Security. The. coefficient, however, 
was less than twice the standard error. 

The interpretation is unclear. Almost 40 
percent of the sample (24 out of 61) were 
unaffiliated with wages representing 32.4 
percent of value-added as compared to 30.8 
percent for the entire sample. Several of the 
unaffiliated supporters: owned construction 
firms which had presumably benefited by the 
public works spending of the Roosevelt 
administration. Since they could pass along 
the new employer payroll taxes in new public 
works contracts, this cost was insignificant. 
Several unaffiliated opponents were light 
industry manufacturers, such as S.B. Colgate 
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Table 2. Regressions an Sources of Support for Social Security (V = 61) 





1 “2 3 
Rockefeller 5000***T — ,spQpe«T — 5625e48T 
Our Crowd# —— (.3980) (.3980) (4477) 
Morgan/ . : 
Mellon/ —.2647**«T — y7]]***T — 2310* 
Dupont# (—.2377)  (—.2434)  (—.2074) 
Labor .0058 .0061 
intensity (1211) (.1576) 
Exports .0042 .0034 
(.0969) (.0792) 
Size — .00008 
(—.1160) 
Consumer 
market . 
Unaffiliated@ 
Unaffiliated* . 
labor intensity 
Adj: R? = .248 .244 242 
f= 10.89 5.835 4.820 
P = .0001 .0005 0011 


* p<.15, one-tailed test. 
** n<.10, one-tailed test. 
*** 1<.05, one-tailed test. 


4° 5 6 T 
.5183***T — .sg4ge*sT — gi7Qe»«T ` TQ4s*x«T 
. (4126 (.4728) (.7299) (.5608) 
—.2559** . —.2066 .2197 
(-.2297  (-.1855) (.1972) 
.0062 .0081 0001 —:0037 
(.1608) (.2101) (.0039)  (—.0955) 
.0031 . .0021 .0008 
(.0730) (.0483) (:0193) 
(— .0009) — .00005 
(—.13500  (—.0820) 
. 1129 .1320 .1334 
(.1112) '(.13001 (.1313) i 
.0131 — ,2468 
(.4815)  (—.2468) 
.0154* — 
(.5679) 
241 243 .2501 2701 
4.812 4.202 3.858 6.551 
.0011 .0015 .0019 .0002 


T = unstandardized coefficient is twice standard error; 


unstandardized coefficient is upper figure; standard- 
ized coefficient is in brackets below. 


@ = reference equals bank-tied. 


(Colgate Palmolive) and H.A. Bullis (General 
Mills) whose labor costs were under 15 
percent of’ value-added. Possibly the indus- 
trial averages misspecify firm characteristics. 
In any case, it would take clearer evidence 
from a larger sample to be certain about the 
industrial factors. Overall, they appear secon- 
dary to the bank networks. 

Overall, Social Security does not appear to 
have been sufficiently divisive to have 
provoked sharp industrial divisions. Of course, 
we are looking at elite politics and a single 
policy debate. Local and state politics and 
other New Deal policies might reveal a 
different picture. But Social Security appears 


to have been an ideological test in which. 


network ties were more salient than direct 
economic interests. In this sense, the intra- 
capitalist struggle seems to have been over the 
general direction of the New Deal. 

To see if these alignments reflected a 
broader contention between a "corporate 
liberal" and a conservative anti-New Deal 
. coalition, we examined campaign contribu- 
tions to the Presidential candidates in 1936. 


Using the campaign contribution data assem- 


# = reference equals unafziliated. 


bled by the U.S. Senate Special Committee to 
Investigate Campaign Expenditures (1937), 
we found the same political alignments. 
Because family groups play a strong role in 
political donations, we coded our lists accord- 
ing to individual and immediate family 
donations, using Who's Who to identify 
kinship. Of the 61, a third (20) were donors 
with comparable percentages for the pro- and 
anti-Social Security camps (11 of the 32, pro, 
and 9 of the 29 anti). Of the 11 supporters, 
two-thirds (64 percent) gave only to, the 
Roosevelt campaign, less than a third to the 
Republicans, and one supported both parties. 
The opponents were strictly Republican 
donors. Reflecting their larger wealth and 
possible political commitments, the oppo- 
nents were also more generous, with a mean 
contribution of $6,857, the smallest being . 
$500 and the largest $121,000. By compari- 
son, the liberals invested only $3,068 on 
average, $1,200 being the smallest and 
$8,500 the top donor. In view of the intense 
class rhetoric and vitriolic attacks on the 
Roosevelt administration by prominent busi- 
ness leaders in the 1936 election, these data 
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probably underreport both actual financial 
support for Roosevelt’s reelection (there were 
other less public methods of contributing, 
such as purchasing advertising) and possibly 
capitalist support for the New Deal. 

These alignments suggest that the Social 
Security splits were rooted in a broader 
political battle between rival capitalist blocs. 
Each bloc was based on specific networks, 
` class fractions, ideological outlooks, and 
programmatic political visions (Poulantzas 
1973, pp. 229-45). Judging from our Social 
Security data, the rising “corporate liberal” 
coalition was centered in less powerful banks 
and advanced industries. Leadership came 
from “Our Crowd” bankers who had financed 
the retail trade expansion in the 1920s, the 
owners of advanced firms, such as IBM and 
Kodak, and retail merchants, such as Lincoln 
Filene, Jesse Straus (Macy’s), and Samuel 
Reyburn (Lord and Taylor). Although the 
Rockefellers remained loyal Republicans, 
they supported many of the New Deal 
reforms, including the banking and securities 
legislation that dismembered the House of 
Morgan. By linking their campaign invest- 
ments to their stances on specific reform 
legislation, several of these “corporate liber- 
als” put their reform agenda to the test of 
popular elections, thereby testing their hege- 
monic capacities. 

The formation of the conservative coalition 
was more complex. Although several conser- 
vative leaders had been earlier involved in 
social reforms (especially the Morgan bank 
representatives), by the mid-1930s they had 
turned against the New Deal. Direct policy 
threats, such as the New Deal banking and 
securities legislation and the Senate munitions 
hearings on the DuPont’s World War I 
profiteering, were critical. In addition the 
failure of the NRA to restore prosperity and 
labor peace set off intracapitalist disputes 
over production controls and labor relations. 
Drawing on a wide range of support, these 
business leaders mounted a general attack on 
the New Deal, creating new policy organiza- 
tions such as the Liberty League, the 
Crusaders, the Sentinels, and the Southern 
Committee to Uphold the Constitution. They 
seized control of the NAM and U.S. Chamber 
of Commerce and invested heavily in the 
Republican campaign (Lundberg 1937, pp. 
480-83; Burch 1973, 1980, pp. 45, 66; 
Krooss 1970, pp. 180-81; Zilg 1974, pp. 
309-26). As the 1936 elections demonstrated, 
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however, this conservative bloc lacked a 
political formula that could simultaneously 
promise economic recovery, create labor 
peace, and mobilize popular electoral sup- 


port. 


CONCLUSIONS 


We have synthesized Poulantzas’ (1973, 
1980) neo-Marxian theory of the state with 
Social protest theory (Piven and Cloward 
1977; Tilly 1978) and Domhoff's (1978) 
theory of capitalist dominance to advance a 
political struggle theory of social reform that 
accounts for the origins of the American 
welfare state. The formulation and passage of 
the Social Security Act was critically shaped 
by two political processes: a sustained wave 
of social protest, interacting with electoral 
instability, that created an elite sense of 
political crisis; and the hegemonic competi- 
tion of rival capitalist blocs that responded to 
this crisis with opposing policy planning and 
electoral efforts. 

This analysis has been framed in terms of 
Poulantzas' (1973, 1980) theory of the 
“relative autonomy” of the liberal democratic 
state. On the one hand, the liberal state 
creates an arena in which various groups 
(including capitalists) mobilize and contend 
for political power. The state is, in this sense, 
"autonomous" from the interests and influ- 
ence of particular classes and class fractions. 
It is "not an instrument" but "a strategic 
battlefield" (Poulantzas 1980, p. 152). Ex- 
cluded groups and tlie lower classes use the 
liberal state as a forum for making political 
demands and organizing protests, movement 
campaigns, and voting for candidates who 
support measures to protect against the 
marketplace. Demands for citizenship rights 
and social reforms are, in this sense, rooted in 
the economic interests of excluded groups and 
the lower classes. Social reforms typically 
require concessions by upper-class groups. 
Simultaneously, the upper class finds it 
necessary to contend for political power, 
forming rival political coalitions that compete 
for’ hegemonic leadership by working with 
policy intellectuals and political officials to 
formulate reform proposals and guiding mass 
electoral outcomes by making campaign 
investments. In short, the upper class is often 
politically divided and has to organize and 
compete for political leadership. It cannot 
unilaterally impose its interests or rely on the 
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state to automatically represent its interests. 
Capitalist blocs also have to “struggle” for 
political power, especially in the midst of 
political crises in which the formula for 
economic prosperity and political stability 
have become problematic. In this sense, the 
liberal state simultaneously mediates the 
interclass struggles and intracapitalist hege- 
monic competition set off by political crises. 
Mass elections constitute an “early warning” 
system, signaling problematic legitimacy and 
affording the upper class an opportunity to 
advance alternative solutions. These political 
struggles often result in major social reforms. 

On the other hand, this liberal state is not 
entirely independent from the capitalist econ- 
omy. This is the central insight of Poulantzas’ 
(1973, 1980) “stracturalist” argument. Not 
only does the liberal framework legitimize 
private property and individual citizenship 
rights, but the capitalist context defines the 
underlying problems that generate political 
crises and the major interests in political 
conflicts. Moreover, the state’s structural 
dependence on the capitalist economy for tax 
revenue and political stability restricts the 
maneuvering room of political elites. If only 
to insure a stable fiscal environment and a 
predictable electorate, political leaders have 
to attend to the conditions that insure capital 
accumulation. In this sense, the liberal state is 
structurally constrained to represent the eco- 
nomic and political interests of the capitalist 
class. . 

Yet this “class struggle” theory needs 
major revision. It fails to explain the nature of 
social protests, the mainsprings of political 
opportunities, and the methods by which 
capitalist coalitions compete for political 
dominance. Poulantzas (1973, 1980) dis- 
cussed the "new social movements” and 
“class struggles” but took a reductionist 
stance on the origins of social protest and 
ignored the diverse forms of “struggle.” 
Similarly, he discussed competition between 
potential power blocs but did not analyze the 
processes by which capitalist blocs are 
formed and become dominant. In short, his 
theory lacked an analysis of. political pro- 
cesses. We have turned to theories of social 
protest (Piven and Cloward 1977; Tilly 1978) 
and Domhoff’s (1978) analysis of capitalist 
dominance to explain these processes. 

. We have argued that the protest wave was 
rooted in attempts by excluded groups and 
economically threatened polity members to 
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renegotiate political access, thereby institut- 
ing programs that would protect the lower and 
middle classes against the hazards of the 
marketplace. These protests were multiclass 
based, drawing on the unemployed, industrial 
workers, small farmers, and the middle class. 
While these struggles rested on extra- 
institutional measures, they also entailed 
institutional pressures, especially electoral 
volatility and support for third-party chal- 
lenges. While emergency measures addressed 
the grievances of particular protestors, the 
introduction of proposals for permanent 
solutions such as the Social Security Act 
stemmed from widespread protests and their 
electoral spillover. 

This political upsurge was also rooted in 
new political opportunities. Officials re- 
frained from repressing protests and, in some 
cases, worked to support these challenges. 
Capitalist splits, mediated through the elec- 
toral arena, helped create these opportunities. 
Confronted with an impending political crisis, 
capitalist groups divided, supporting alterna- 
tive political programs and investing in 
competing electoral candidates. A “corporate 
liberal" bloc promoted reforms that would 
politically incorporate industrial workers, the 
unemployed, small farmers, and the middle 
class while a conservative bloc promoted a 
return to the "American way." The protest 
wave, in turn, reinforced these processes, 
creating electoral instability that strengthened 
the Democratic Party and provided electoral 
incentives for supporting reforms. 

This intracapitalist contention was orga- 
nized through two processes: policy-planning 
efforts to develop proposals to address these 
problems; and campaign investments in com- 
peting candidacies. The liberal bloc supported 
a range of social insurance initiatives from 
"welfare capitalist" to emergency relief, 
unemployment insurance, and old age pen- 
sions. Simultaneously, many of these "corpo- 
rate liberals" financed the 1936 Roosevelt 


.campaign, effectively putting their reform 


ideas to an electoral test. Meanwhile, conser- 
vatives promoted an "American plan" return 
to laissez faire and "rugged individualism" 


‘by launching organizations such as the 


Liberty League and the Sentinels for Republic 
and investing heavily in the Landon cam- 
paign. As the 1936 election demonstrated, the 
conservatives lacked the political ideas that 
would simultaneously promise economic re- 
covery and mobilize popular support. Hege- 
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monic leadership depended upon victory in 
both arenas. — . 

This political struggle theory of the state 
frames our criticisms of recent analyses of the 
Social Security Act. Drawing on the mislead- 
ing contrast between an “instrumentalist” and 
a “structuralist” théory of the state (Gold, Lo, 
and Wright 1975), these analysts have treated 
the state .as either a neutral arena for 
reconciling upper class interests (Quadagno 
1984, 1985) or a: self-contained institutional 
system ¿guided by the interests of political 
officials and organizations (Skocpol and 
Amenta 1986). In either case, these analysts 
have collapséd the distinction between the 
state and government, thereby conflating 
policy formulation and policy-making, under- 
estimating the power of social protests, and 
oversimplifying intracapitalist political com- 
petition: In contrast, we have emphasized thé 
distinctive logic of the political struggles 
defined by liberal democratic capitalism. 

There are; several remaining questions 
opened up by, this research. We do riot know 
how significant bank networks are in structur- 
ing capitalist political. -alignments. These may 
have been limited to a single issue’ or a 
particular period. Since banks constituted the 
major intracapitalist . network during this 
period (Dooley 1969), their significance may 
be historically limited. Nor, do we know how 
other networks, such as kinship, schools, and 
social clubs, jmight be relevant. Nor do we 
know much about what creates the upper- 
class splits that make for political opportuni- 
ties. Economic crises, major economic restruc- 
turing, and wars presumably play a major 
role. The civil rights movement benefited by 
such changes (Piven and Cloward 1977; 
McAdam 1982) and such splits are common 
in revolutionary situations (Tilly: 1978, pp. 
190-93). Yet we know little about what 
creates such divisions, how-they translate into 
opportunities or what channels, them toward 
various political outcomes. Finally, we ‘still 
need to integrate this analysis of the “excep- 
tional" American case into general "struggle" 
theories of the welfare state (Shalev 1983). 
The features: that made the United States 
distinctive— weak unions, a party system 
based in patronage and sectionalism, federal- 
ism and a weak bureaucracy, a strong 
capitalist class—may well ‘account for the 
importance of social protest and hegemonic 
competition in the American case. Once we 
have an analysis Har. treats the U.S. case as 


| 


part of a general pattern rather than an 
“exception,” we will have a general theory of 
the politics of social reform. 
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LIFE CHANCES AND THE CONTINUITY OF RANK: 
AN ALTERNATIVE INTERPRETATION OF MOBILITY ` 
MAGNITUDES OVER THE LIFE CYCLE*  . i 


STEVE RYTINA E . i 
McGill University ; 1 


This paper presents an alternative to status attainment modeling of life cycles, 
originally proposed by Blau and Duncari (1967). Statistically, the paper describes 
a new way of representing causal chains, which is less complex, and more easily 
comprehended and communicated. Conceptually, it proposes a focus on life 
chances, defined as the probability distribution over hierarchical outcomes. Life 
chances order a population or cohort from most advantaged to least. For each life 
cycle stage there is a distinct rank. The transitions between stages reshuffle ranks. 
The correlation of successive rankings measures mobility/immobility, which was 
not directly assessed in the classical treatment. The "life chances perspective" 


me A — C 


recasts the original data as a sequence of correlations describing the degree of 
change in social rank. This leads to a substantive reinterpretation, that mobility is 
much less prevalent than it previously appeared: . 


The most extensively developed research 
agenda in sociology is probably status attain- 
ment. It was initiated by Blau and Duncan's 
(1967) research exemplar. That rich volume 
has many facets, but this paper centers on a 
seminal feature: the path analytic representa- 
tion of the socioeconomic life cycle. 

I will propose three related novelties that 


constitute an alternative approach. The first is, 


a statistical technique for extracting indices of 
status continuity from a correlation matrix. 
The new tool condenses the information into a 
minimalist pattern, such that the rules of path 
analysis generate a causal chain where each 
stage depends solely on its immediate prede- 
cessor. This structure should aid comprehen- 
sion and communication. l 


Second, I will propose a reconceptualiza: 


tion of the central problem in terms of “the 
life chance perspective.” The life cycle stages 
of status attainment are taken as sites for the 
measurement of life chances. Life chances are 
thus conceptualized as a probability distribu- 
tion over possible outcomes. These have a 
prospective component, namely expected 
outcomes, and a probabilistic component 
capturing the uncertainty that separates expec- 
tation and final fate. f 





* I would like to acknowledge helpful comments 
from Peter Blau, Christopher Jencks, Peter Mars- 
den, Trond Petersen, and Aage Sørensen on earlier 
drafts of this paper. The errors or misinterpreta- 
tions that remain are mine alone. 
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Life chances order a population or cohort 
from most advantaged to least. At each life 
cycle stage there is à distinct ranking. Passage 
between life cycle stages reshuffles ranks. 
Change in life chances is social mobility. The 
amount of change is assessed -by correlating 


: life chances before and after the reshuffling 


process. (Strictly, the correlation measurés 
immobility.) The numerical core of the “life 
chance perspective" is a sequence of correlà- 
tions describing the continuity of rank. Each 
succession is associated with a definite 
quantity of reshuffling of. rank, or social 
mobility. 

Third, application of the life ‘chance 
perspective to Blau and Duncan’s data, and 
other data from that seminal period, will 
reveal striking immobility across the key 
transitions of educational completion and 
labor market entry. This reinterpretation is 
partly subjective, insofar as there is no widely 
shared standard for how large a correlation 
must be to merit an adjective like “striking.” 

It is therefore helpful to, ground the 
interpretive issue in an apparent contradiction 
in findings. On the one side is "status 
attainment," for which Blau and Duncan's 
work is the paradigmatic exemplar. On the 
other is Paul Willis (1977) insightful 
ethnography, Learning to Labor. ' 

Willis’s work does not directly confront the 
findings or interpretations found under the 
Status attainment rubric. But it appears to be 
inconsistent with those results.. His ethnogra- 
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phy appears ito suffer from a characteristic 
vice, namely, the acceptance (and uncritical 
transmission) of a small sample of members’ 
accounts as if they faithfully reflected large- 
scale structural forces. 

Willis’s subjects, “the lads,” were British 
working class youth disenchanted with aca- 
demic striving. They believed that effort at 
school was useless, because their early adult 
life chances were largely fixed. They believed 
that people like themselves were confined to a 


narrow range of outcomes and that what they. 


did (or did not do) in school would make little 
difference. 

These beliefs seem to deny or contradict 
status attainment results. First, completed 
education is substantially decoupled from 
social background, so that working-class 
origins would be only a modest barrier. 
Second, labor market entry is further de- 
coupled. Eventual occupational attainment 
was even more so. Therefore, mobility was 
far from impossible. Even if one grants the 
accuracy of Willis’s report of “the lads” 
views, their folk beliefs appear to be unduly 
cynical. And if “the lads” are mistaken in 
viewing their early adult life chances, and 
those of their classmates, as already fixed by 
midadolescence, then Willis’s larger interpre- 
tation based on the validity of those beliefs 
must be called into question. 

And thus I will pose the issue. Should “the 
lads” (or at least Professor Willis) be required 
to read Blau and Duncan (or Inequality [1972] 
or Hope’s [1984] British, Scottish, American 
comparison As Others See Us) to restore their 
faith in the enormous mobility chances of 
industrial society? Who was closer to the 
mark, Blau and Duncan or“the lads"?! 

I will outline a new approach to such issues 
in five sections. First, I will characterize and 
critique the image of mobility chances that 
was implicit in Blau and Duncan. Second, I 
will motivate the “life chance perspective" as 
an alternatiye way to interrogate the same 


! Folk observers generally have access only to 
small samples of people similar to themselves. An 
implicit issue motivating this paper is whether such 
samples should show reliable traces of the 
intersection of structure and biography, or whether 
lay interpretations are almost certain to depart from 
large sample findings. Ultimately this bears on a 
central motivating assumption of mobility re- 
search, that patterns and perceptions of opportunity 
shape sentiments toward inequality. 
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data. The third section will present empirical 
examples to make the contrasts of the two 
approaches concrete. The fourth section will 
suggest speculative applications, or thought 
experiments that are facilitated by the simplic- 
ity of the new approach. The fifth section will 
explore differences in interpretation between 
the two approaches. 


THE MANIFOLD MOBILITY OF 
MULTIPLE CAUSES 


The heart of the opposition between Willis 
and Blau and Duncan is the issue of mobility 
versus constraint or fluidity versus determin- 
ism. Blau and Duncan did not directly address 
the issue but it is closely intertwined with two 
of their major objectives. One goal was to 
assess the relative impact of achievement 
versus ascription. They further sought to 
untangle the magnitude of constraint opera- 
tive at different life cycle stages. As they put 
it, . 


The questions we are continually raising in one 
form or another are: how and to what degree do 
the circumstances of birth condition subsequent 
status? and, how does the status attained 
(whether by ascription or achievement) at one 
stage of the life cycle affect the prospects for a 
subsequent stage? (1967, p. 164) 


A concealed choice anchors this agenda. 
Blau and Duncan’s summary suggests that a 
key question is the amount of change in status 
as life cycle stages succeed one another. The 
amount of status change for a population or 
cohort is the amount of social mobility. 
Therefore, it might seem that a measure of the 
amount of mobility that accompanies each 
new stage would be central. But they 
undertook a different line of attack. ‘ 

Without explicitly marking the shift, Blau 
and Duncan replaced the singular concept of 
“status” with a plural concept of the 
succession of “statuses.” Successive ranks 
are along distinct, different dimensions. But 
transitions like education to first job combine 
quantitative change in rank with a confound- 
ing qualitative chance to a different criterion 
of rank. Consequently, “mobility” becomes a 
compound of vertical movement and align- 
ment along a different metric of differentia- 
tion. In effect, the starting concern with 
mobility was abandoned, and in its place the 
causal relations among qualitatively distinct 
rankings were analyzed. This choice has a 
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critical consequence. The Blau and Duncan 
approach does not provide a direct quantita- 
tive measure of mobility (or immobility) for 
each succession between life cycle stages. 

Closely bound up with this shift is Blau and 
Duncan’s key innovation: the introduction of 
path analysis, and the characterization of 
stratification as a succession of causal pat- 
terns among distinct dimensions of rank.? 
This innovation contributed to a new picture 
of the phenomenon. 

In a central demonstration of the novelty of 
their results, Blau and Duncan demolished the 
‘previously prevalent imagery of cumulative 
disadvantage leading to “vicious circles” 
(1967, pp. 199—205). First, the modest value 
of the intergenerational correlation of occupa- 
tion (.405) was cited as evidence against 
Lipset and Bendix's claim that "many factors 
. . . make it difficult for individuals to modify 
their status" (quoted on pp. 199-200). 
Second, they called attention to the tiny 
(.061) contribution of prior causes (i.e., 
‘family background) in the decomposition of 
the correlation of education with occupation. 


This is the entire part of the effect of education 
that has to do with "perpetuating" the "family's 
position." . . . Far from serving in the main as a 
factor perpetuating initial status, education 
operates primarily to induce variation in occupa- 
tional status that is independent of initial status. 
(p. 201; italics and quotes in the original) 


This statement is balanced by the qualifica- 
tion that “This is not to gainsay the equally 
cogent point that the degree of ‘perpetuation’ 
(as measured by ray) that does occur is 
mediated in large part by education” (p. 201). 
Since the preceding page belittles ray as an 
indication of major barriers to mobility, the 
balance of this argument is against the view 
that inheritance is as important as thought by 
previous analysts. By an indirect path, it 
suggests that parental status makes a very 
modest difference. The text nowhere suggests 
that it doesn’t matter. But the impression is 
left that it doesn’t matter very much. 
Although this work is arguably the classic 


? Later analyses adopted the approach of struc- 
tural equations where “cause” is quantified in the 
metric of the several dimensions. This is parallel 
with directing attention to relations among qualita- 
tively different dimensions, and it highlights the 
qualitative differences that mark the succession of 
stages. 
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exposition of a causal analysis, the translation 
of numbers into words involves multiply 
faceted nuance that approaches ambiguity. 
There are different numbers of different kinds 
associated with each pair of variables and 
with larger blocks of variables. No single 
quantity captures "how much a factor mat- 
ters." The amount of difference due to a 
factor is presented, indeed must be presented, 
as a set of magnitudes describing different 
facets of a factor's impact. And many of the 
qualified quantities are portions and partials, 
so that the more sophisticated angles of view 
reveal modest quantities. 

Applied repeatedly, this tends to a general- 
ization about every factor discussed. In a 
phrase, many factors matter, but each matters 
modestly. Conversely, nothing stands out 
sufficiently to justify any ringing generaliza- 
tion about constraint.) This is illustrated by 
the conclusion of the quoted passage. Rela- 
tive to what they saw as an important received 
view, they sound a contrarian note, that the 
key factor of education is mainly a motor of 
mobility, not a locus of status continuity. 

That many factors matter, but only mod- 
estly is what I call the manifold mobility of 
multiple causes. It describes the global pattern 
that emerges when multiple, qualitatively dis- 
tinct, variables are statistically integrated into 
an expanding model representing causal links. 
The paths are many, and the weights are small. 
Complexity grows, effects diminish, and re- 
siduals proliferate as the pattern is refined by 
the addition of more variables. The resulting 
picture sustains themes of the weakness of bar- 
riers and the fluidity of rank. 

The message accumulates from several 
empirical circumstances that are ubiquitous 
across this research genre. First, the zero- 
order correlations are not very large, even 
while they are far from negligible. Second, 
the more sophisticated and refined “direct 
effects” are still smaller, as are the enlarging 


? Of course, race was prominently featured as an 
ascriptive defect in The American Occupational 
Structure. But the standard of comparison was the 
more favorable pattern for whites, so that even this 
exception tended to sustain the opposite implica- 
tion for the bulk of the population. 

* Of course, this pattern depends on measured 
magnitudes and is not logically inevitable. At the 
same time it is commonly observed and therefore 
somewhat independent of the empirical problem at 
hand. 
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tangle of "indirect effects." Quite often these 
are absolutely quite small. Third, few things 
matter in themselves. Most are entangled with 
-other variables. Fourth, intervening variables 


are both non-negligible as causes, and as. 


transmitters, but are substantially decoupled 
from (statistically "unexplained" by). prior 
factors. 

The conceptual status of the different 
variables contributes to the impression of 
fluidity. The multiple dimensions can be 
categorized into ascriptive factors and individ- 
ual achievements. The direct effects that 
unambiguously reflect ascription are among 
the coefficients that fade toward nonsignifi- 
cance as impact is divided among more 
variables. The compensating indirect paths 


are not merely fractional, but multiple. Status, 
continuation is fragmented into a plethora of 


channels. It is still there, in some sense, but 
. ‘subdivision into numerical trivia lowers 
visibility. 

' . Each of the various dimensions of individ- 
ual achievement is accompanied by a large 
residual. This suggests that generic Achieve- 
ment is substantially orthogonal to measured 
social background. The many modest -paths 
describing status transmission appear to be 
overlaid with. a substantial residuum of 
‘accomplishment independent of background. 

. Two conclusions follow.: Many modest 
effects along with large residuals imply that 
no factor delimits a major constraint. And 
modest effects for ascriptive factors combine 
with large residuals for individual achieve- 
ments to suggest that “Rank” is achieved, in 
large part, independent of background. As 
Blau and Duncan summarize it, the United 


In the Blau-Duncan matrix, the correlations lie 
between .322 and .596, with a mean of .456. Thus 
the observed correlations are clustered in a small 
range of the logical possibilities, [-1 .. . +41]. 
An idealization of this (common) situation is a set 
of variables that are equally correlated. Under this 
condition, incrementing the set of independent 
variables decrements the direct effects in the 
preceding equations at a diminishing rate with zero 
as a limit. More exactly, p; r/[1-- (i— 1)r] where 
r is the common correlation, and p, is the 
(common) value of the path coefficient(s) when i 
independent variables are in the equation. 

This idealization is the formal basis for the claim 
that adding more variables tends to. decrease all 
coefficients. At the same time, zero-order correla- 
tions will be resolved into ever-enlarging tangles of 
.  ever-smaller indirect paths. , 
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States approaches a type of society that 
“perpetuates a structure of differentiated 


positions but not their mentes (98h. p. 
441). 

The cumulative impression of massive 
mobility is only loosely grounded in the 
quantitative results. Since there is no direct 
numerical assessment of quantities of mobil- 
ity, the relative magnitudes of mobility and 
constraint are not directly available. But the 
coefficients that describe constraint are mod- 
est. They are accompanied by substantial 
residuals, or "unexplained variation" in all 
critical factors. This means that constraint, in 
any satisfying sense of determination of 
individual fate by structural circumstance, is 
weak. It is not absent, nor negligible, but it is 
a highly qualified and complex consideration 
of many factors, overshadowed by substantial 
doses of indeterminacy. 

Critics of the fluid, voluntaristic image 
are thereby disarmed. The patent injustice of 
fate imposed by external circumstance is 
nowhere clearly visible. Any complaint di- 
rected to. any specific barrier is therefore 
much ado about next to nothing. The large 
residuals, which proliferate as more interven- 
ing stages are incorporated, buttress the 
impression that movement is prevalent. 
Strictly, the residuals are an index of 
ignorance. But the absence ‘of constraint 
entails a veiled arena where winners are 
somehow sorted from losers without contam- 
ination by structural injustice. . 

Presumably, this applies to Willis’s respon- 


‘dents. They are, on the standard interpreta- 


tion, mistaken ‘if. they believe that their 
fathers’ bottom-tier jobs will strongly deter- 
mine their. educational outcomes. Their labor 
market entry points are still further de- 
coupled, and their ultimate outcomes are yet 
further removed. On the standard interpreta- 
tion it follows that they cannot accurately 
anticipate immobility that would empirically 
justify their fatalism. But this judgment is 
premature. It must be tempered by attaching 
quantities to the amounts of mobility implied 
by the sample survey results of which "the 
lads" were presumably ignorent 


THE LIFE CHANCE PERSPECTIVE. 


Mobility can be measured. by assessing 
changes in life chances. This possibility 
inheres in the "life chance perspective" 
which is a different agenda applicable to the 
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empirical materials of the status attainment 
perspective. 

The life chance perspective, like status 
attainment, draws on the notion of a socioeco- 
nomic life cycle. The life cycle is operation- 
alized as a sequence, arranged from birth to 
adult fate, of heterogeneous measurements of 
rank and/or restratifying processes. But one 
can abstract from this cacophony of distinct 
measures to populations, or to cohorts, 
differentiated along an abstract dimension of 
rank. As time passes, the contingents perco- 
late up, sideways, and down in the social 
hierarchy. Percolation forward in time results 
in flows fanning out from initial locations. Or 
in retrospect, a given outcome is peopled by 
some pattern of flows fanning in from initial 
locations. The total collection of trajectories 
of individuals can be identified with my 
central conception of the unfolding of life 
chances. l 

The successive measurements over the life 
cycle are a record of the development of life 
chances. Life chances is not a simple idea, 
but one core meaning is “opportunities for 
individual development provided by social 
structure” (Dahrendorf 1979, p. 61). Health, 
power, autonomy, leisure, cultural variety, 
secure family life, and money are compo- 
nents. One also must somehow distinguish 
early life chances, or the range of possibilities 
that could have been reached, and later life 
chances which are the possibilities within 
immediate reach. A crude but serviceable 
index of the latter is provided by occupational 
rank as conventionally measured. By and 
large, higher occupational rank means greater 
access to the goodies, both vulgar and 
sublime.5 

Chances also has a probabilistic, prospec- 
tive connotation which corresponds to unfold- 
ing over the life cycle, as individuals fan out 
from initial ranks. Early in life, many 
possibilities or different fates are open. 


5 This distributional concept of life chances is 
less ambitious than Dahrendorf's broader concep- 
tion that included collective goods, such as civil 
liberties or an orderly social environment, in 
addition to private goods. For present purposes, I 
restrict the concept to relative position or ranks, 
which endows differential life chances with 
overtones of zero-sum competition among individ- 
uals. The anti-utopian bias that results is a common 
and probably unavoidable consequence of a focus 
on individual mobility. 
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However, individuals! chances are different. 
Different prospects mean differences in prob- . 
ability distributions over outcomes, and do 
not refer to certainties. The wellborn are more 
likely to attain the desirable positions. The 
less wellborn are, in most industrial societies, 
not formally excluded but have lesser proba- 
bilities or reduced chances. As contingents 
advance through the life cycle, these early 
prospects gradually harden into adult certain- 
tes. ; 

Movements up, down, and sideways corre- 
spond to changes in prospects marking the 
convergence toward certainty. In principle, 
one could identify various choices by individ- 
uals or decisions by gate-keeping power 
holders with such movements. Attaching 
grades to ninth-grade schoolwork ratifies or 
continues some persons' ranks and produces 
(tiny) shifts for others. Such minor events 
may be accumulating into larger summaries, 
like high school GPA, that will influence such 
major reshufflings or rank as the decisions of 
college admissions committees. Further up. 
the scale of abstraction, coarse operational 
measures, like years of completed education, 
summarize many tinier fragments of informa- 
tion about choices and decisions. 

Every incomplete biography is like a 
growing dossier of information that bears on 
social rank. The prospects or average chances 
associated with each dossier is the average or 
predicted outcome for that record. To capture 
predictability, one can imagine an array of 
persons in rank order of ultimate outcomes. 
Empirically, this ranking can be taken as 
eventual occupational attainment, although in 
principle one could employ any measure of 
differential life satisfactions that are socially 
structured.5 This measure of final results I call 
the individuals' fates. 

Regression provides a tool for measuring 
prospects as they unfold toward fate. Fate, as 
a vector of scores or ranks, can be regressed 
on a set of measures, for example, the various 
ranks characterizing family of origin. Appli- 
cation of the regression weights or coeffi- 
cients to the background variables yields a 
score for each youngster that is the best 
predictor of eventual fate. I call this score a 
measure of the youngster's life chances. 


$ The strict requirement is that one could use 
any one dimensional summary. Occupational SEI 
is adopted here for the sake of comparison with the 
vast body of work based on this measure. 


. through restratifying processes, like educa- 


bi “af i i iz u : ; : | - * 
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"This ranking score is the predicted value be 


the individual’s fate. ‘Arithm 
weighted linear combination’ of the individu- 
al’s status attributes. The weights are the 
regression coefficients wien fate is regressed 
on the set of predictors. Conceptually, ! it 
measures differential expectations. t 
Differential expectations: åre both present 
advantages and “mere” chances subject to 
revision. Those with high expectations will, 
on average, get better outcomes. But expecta- 
tions are merely the predictable part of 
personal choices and. decisions by power 
holders. Some with lesser expectations will 
pull ahead of some who had greater, as 
deviant decisions and unpredicted outcomes 


_accumulate into changes in rank. Life chances 


are subject to modification as probabilities 
give way to certainties. f 

Most: strictly, the aggregate of persons 
sharing a given rank have ‘shared chances. 
Life chances refers to their total distribution 
of probabilities over outcomes. This is not 
directly measurable (at reasonable cost) but 
can be summarized.by the mean and variance 
of the outcomes they will experience. The 
mean is the average or expected outcome, 
which corresponds to the “predicted” value 
of the regression summary. The variance 
corresponds to the error or residual variance 
of the regression. Life chances are a mixture 
of expectation and variance. People with 
identical lifé chances do not enjoy identical 
outcomes, 

The dispersion of subpopulations from 
common expectations to divergent fates. is the 
unfolding of life chances. As cohorts advance 


tion, some move up and some mové down. 
But each new ranking is’ predictable in 
definite degree from its predecessor. Insofar 
as the new rank is predictable, no mobility 
has occurred. And each new ranking can be 
calculated in parallel by regressing faté on 
current facts. So.the new ranking after, say, 


reshuffling due to education is based on a new 


regression with education added to family of 
origins measures. 

: Unfolding can also be. viewed by looking 
back from adult rank. In the'immediate past, 
last year's occupational rank must be a nearly 


perfect predictor of this year's. Stepping back. employed. (One could examine the metric variant 


in time, the prediction weakens steadily. Far 
enough back; at labor market entry, occupa- 
tional rank winks out of existence, but a 
foreshadow of adult rank based on education 


1 


etically, it is & 
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and other factors can be calculated. Ulti- 
mately; ione could unfold the process all the 
way back to rank based solely on family o: 
origin. Each ranking could be resolved int 
little categories like percentiles. And then the 
movement of a cohort through the life cyck 


could be viewed as myriad trajectories’ o: 
‘flows where occupants of each percentile 


disperse upward, sideways, and downwar 
into the succeeding set of boxes.” And, o 
'course, associated with such transitions am 
the. various choices by individuals and by 
gatekeepers that move some ahead and other: 
behind. f 

A measure of the immobility associate: 
with the change in the ranking of persons i: 
the correlation between successive rankings 
No mobility has occurred if prospects or lift 
chances:are predictable. from previous pros 
pects. This i is equivalent to saying that one": 
rank was unaltered by the new information o 
Score on the most recent dimension. Con 
versely, mobility or changes in life chances i: 
measured by the residual that is a comple 
‘ment to the correlation. 

: Thus ; the perspective leads, first, to i 
comparative ranking across the sample fo 
each life cycle stage. Infants, teens, youn 
adults, and so forth can each be stratified oi 
the basis of the applicable variables. li 
principlé, one can regard successive studie: 
as refinements converging toward an uppe 
bound of predictability based on all informa 
tion ‘about persons of a given maturity. Li 
practice, attention is confined to the variable: 
available within a study. 8 Second, the transi 
tion between stages is identified with : 
definite degree of stability and reshuffling-o: 
immobility and mobility. Some transitions 


? In continuous form, this can be identified witl 
a Gaussian random walk. The present paper is ar 
approximate operationalization of this concept 
Which wás my original starting point, 
8 There is no restriction against categorica 
. predictors. And there is no requirement that ever) 
samplé member have values on all variables. The 
possibility mentioned in the text of employing ful 
p rape is not entirely theoretical. 
? In general, scores used for ranks are "contin 
uous" linear combinations of "continuous" mea 
sures. Ordinary product moment correlations am 


. but the resulting coefficients are identically equa 
to 1.0 and carry no information.) So the scores ar 
ranks only insofar as they refer to placement along 
an abstract dimension of expectations. 


r 
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like college admission against a backdrop of 
high school performance, presumably would 
show high predictability. Predictability means 
that the new ‘positions „are ratifications or 
continuations of previous rank. More signifi- 
cant watersheds, like the reshuffling imposed 
' by gatekeepers at lebor market entry, should 
lead to lower correlations. In the latter case, 
the empirical analysis will soon provide a 
definite magnitude for inspection. 

Associating a definite ranking with each 
stage of the life cycle makes it possible to 
assess the degree of change of rank, or social 
mobility, associated with each life cycle 
transition. “How much mobility” is not 
directly apparent from status attainment 
statistics. Instead, these address a variety. of 
questions of dependence/independence among 
qualitatively varied factors. In effect, the 
dependence of any factor, like education, 
reflects immobility with respect to that axis of 
differentiation. However, the importance of 
the particular axis in the larger picture is 
empirically contingent. The result is some- 
thing akin to “how much mobility” that on 
close analysis must be doubly qualified, once 
because the mobility is with respect to a 
particular dimension; and again to incorporate 
bow much that dimension matters for other 
things. 

The life chance perspective addresses a dif- 
ferent question: how much any factor like ed- 
ücation reshuffles the rank order of predictable 
access to the goodies. Such ranks summarize 





Note, however, that the comparison is specific 
for sample members and one could single out, say, 
‘the 97th percentile based on family of origin. The 
normal (Gaussian) distribution could be used to 
interpolate details of the probabilities implicit in a 
sequence of percentiles. Conversely, given suffi- 
cient biographical detail to replicate the coding 
procedures, one could examine any life, like one's 
own, as a sequence of percentiles and jumps of 
definite length. Then one could see if one's 
personal intersection of history with biography 
contained statistically significant departures from 
the model. 

Such applications rest on the approximation that 
the pattern from a given empirical cross section can 
be applied to other times. Fortunately, there is 
considerable evidence (cf. Featherman and Hauser 
1978) that change over the 
within rather narrow bounds. The weighted sums 
of components called for by the present approach 
“further involve a damping that will help insure 
reasonable accuracy for such extrapolation. 
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the chances associated with the cumulative sta- 
tuses and outcomes of one's life to date. As 
the empirical material will show, the predict- 
ability of global rank generally exceeds, often 
by a wide margin, the predictability of the 
contributing components. 

Thus the life chance perspective is an 
alternative to examination of the causal 
relations among variables. Attention is di- 
rected to the degree to which accumulating 


information (about stratifying outcomes and 


decisions) alters the ranking of individuals as 
expectations converge toward fate. With these | 
tools, one can directly: examine the amounts 


‘of mobility associated with the transitions 


captured by any sequence of empirical 
measures along the life cycle. 


EMPIRICAL RESULTS 


The relation of rankings at different stages in 
the life cycle is summarized by the correla- 
tions of the successive scorings predictive of 
fate. I call this summary a prediction table 
since it records the predictability of rank 
among the several stages of the empirical life 
cycle under examination. Panel D of Table 1 
presents such a table for the correlations 
reported in Blau and Duncan (1967). Panels 
1A to 1C record the correlations and other 
statistical summaries and decompositions. !° 
The calculations needed for panels A to C 

are widely understood. Panel D, the predic- 
tion table, is the novelty. The entries are 
correlations, but the variables are the rankings 
of life chances. for each life cycle stage. The 
new variables are “life chances after factor” 
denoted as LCA (factor). These are calcu- 
lated by regressing fate on the variables 
realized by that stage. For example, LCA 
(education) is the linear combination — .0068 
(PA. ed) + .1808(Pa. occ) + .5120(Ed). (The 
weights are the path coefficients for occupa- 
tion on the three variables.) Thus the rows 
and columns in Panel 1D refer to collections 
of variables, associated with life cycle stages 
linearly combined into rankings with respect 
to expected fate. The left stub of Panel D lists 
the variables recoded into each LCA measure 


10 Numbers calculated by author are set off from 
numbers taken directly from published sources by 
reporting four digits in the text and six in the 
tables. The extra digits are to aid efforts to verify 
comprehension by calculation but are not intended 
to convey any empirical significance. 
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Table 1. Correlations, Path Analyses, and Life Chance Summaries for the United States, from: Blau and Duncan 








(1967) 
A. Correlations and path analyses 
pa. ed pa.occ 
pa_ed 1 0.516 
pa_oce 0.516 0.733744 ` 
educ 0.453 0.438 
Ist job 0.332 0.417 
'62 occ 0.322 0.405 


educ lst job '62 occ 

0.309361 0.025399 —.013940 
0.278369 0.214425 0.120526 
0.737933 ~ 0.432575 0.398303 
0.538 0.669426 0.281082 
0.556 0.541 0.566222 


Note: Path coefficients for the column variable as dependent are in the upper right, residual variances are on the 


diagonal, and correlations are in the lower left. 
B. RÊ, multiple R, and the residual e 


paed pa occ 
R ; na 0.266256 
R na 0.516 
e na 0.856588 





Note: Entries are for the column variable regressed on variables to the left. 


C. Multiple R for '62 occupation on sets of independent variables 


pa_occ& educ& 
0.425950 0.617161 


Note: educ& designates the set { pa. ed, pa occ, educ). 
D. Prediction table 


pa_ed pa_occ& 
pa ed 1.0 0.779434 
pa.ed, pa occ 1.0 


paed, pa occ, educ 
paed, pa. occ, educ, Ist job 
'62 occ 


educ Ist job '62 occ 
0.262066 0.330573 0.433777 
0.511924 0.574955 0.658618 
0.859030 0.818184 0.752477 
Ist job& 
0.658618 
educ& Ist job& ' '62 occ 
0.537947 - 0.504085 0.332 
0.690176 0.646732 0.425950 
1.0 0.937052 0.617161 
1.0 0.658618 
1.0 ` 


Note: Correlations of life chances after the variable in the top label, which are based on the set given in the left 


label. 


while the top labels abbreviate LCA (factor) 
as factor&. 

It is straightforward, but tedious, to 
calculate the correlations among the LCA 
variables using the actual weights that de- 
scribe the successive measures of life chances. 
Appendix A justifies a shortcut. The life 
chance correlations are shown to equal the 
ratios of the multiple correlations for fate on 
the variables included in each stage. Thus 
.6902, the correlation between LCA(back- 
ground) and LCA(education), is the ratio of 
.4260 and .6172 which are the multiple R for 
fate on father's occupation and education, and 
the multiple R for fate on education, and 
father's occupation and education, respec- 
tively. Accordingly, life chances results can 
often be generated from published reports 
without recourse to microdata. !! 


! Tf the multiple R after including an interven- 
ing variable were zero, then the formula for the life 
chances correlation is a ratio of two zeros. It is 


Correlations among nonadjacent columns 
are multiplicatively related, for example, 
.6467 = .6902*.9370 where the three 
numbers are the correlations of LCA(back- 
ground) with LCA(first job), LCA(back- 
ground) with LCA(education), and LCA(ed- 
ucation) with LCA(first job), respectively.. 
Appendix A contains the proof that this 
property generalizes. If A,B,C,D denote 
successive stages of the life cycle character- 


undefined, because it refers to the correlation of 
two zero vectors. This means that an implicit 
assumption for this perspective is that fate is 
somewhat predictable from at least the first 
intermediate variable. Thus differentiation of life 
chances would be undefined in societies like 
Marx's utopia where the division of labor has been 
abolished. In more practical terms, the perspective 
implicitly assumes that previous research has 
identified measures that show some continuity or 
correlation over the life cycle. I would like to 
thank Christopher Jencks for calling attention to 
this limiting case. 
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ized by the addition of one. or more measured 
variables, then the life chances correlation of 
A with D, denoted R(A,D), is equal to the 
product R(A,B)*R(B,C)*R(C,D). 
' This property entails a simple structure for 
path analysis of a life chances correlation 
matrix. Figure 1 shows the result for the Blau 
and Duncan data. The path model forms a 
perfect causal chain, with A-B,B—C, and 
C-—»D. Bypassing direct: effects, such as 
AC, are identically equal to zero. There- 
fore, a consequence of the life chances 
reconceptualization is an image of minimum 
complexity. An incidental consequence of the 
absence of bypassing effects is that the paths 
in the life chances model are identically equal 
to the life chances correlations. Accordingly, 
the model can be understood and communi- 
cated in terms of correlations and without 
recourse to’ part correlations, or other higher- 
order quantities. — ' 

This means that the life chances approach 
can be viewed as a strategy for recoding 
variables into a pattern that gives a perfect 
causal chain. It therefore provides a useful 
summary or baseline for statistically oriented 
workers. The reduction to simple correlations 
may prove useful for teaching and other 
communication. Even sophisticated workers 
may find this useful for grasping the individ- 
‘ual ‘level implications of a multivariate 
picture. f 

The results in Table 1E summarize the data 
- in a new way. For example, LCA(education) 
correlates .6902 with LCA(background). This 
is larger than the .596 correlation of education 
with adult occupation that is the largest 
correlation in the Blau and Duncan matrix. 
"Thus one could say that background is more 
closely related to life chances before labor 
market entry than education is to occupation. 
This is a simple, alternative description of the 
same evidence that depicted education as a 
prime motor of mobility. 


Perhaps the most apt contrast is of the. 


.6902 life chance ccrrelation with the multi- 
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The estimates apply to life chances after the variable 
appearing in the box. They are based on the correlations 
in Panel D of Table 1. , 


Fig. 1. Path Diagram describing life chances based on 
Blau and Duncan (1967) 
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ple correlation of education with background 
of .5119.!? The latter says that, on average, 
white male offspring from two standard 
deviations above the mean retain an educa- 
tional advantage of only one standard devia- 
tion. In terms of life chances, about 1.4 
standard deviations of advantage will be 
retained at the end of schooling. This still 
entails substantial regression to the mean, but 
the 35 percent increase in apparent status 
retention is hardly trivial. And while 
advantaged white males might be dismayed to 
realize that their white male offspring will, on 
average, retain only 69 percent of parental 
advantage after completing education, those 


12 There is some ambiguity in selecting compar- 
isons. The life “chances approach describes the 
continuity of rank before and after education by a 
correlation of .6902. This directly measures 
immobility, and the complementary residual de- 
scribes movement. But mobility and immobility 
are not complements within the standard perspec- 
tive. - 

In the conventional analyses, mobility is an 
untidy combination of unexplained variance and 
variance explained bv intervening processes. Im- 
mobility, if visible at all, is not a single number but 
a hodgepodge of inheritance measures across 
various contributing factors. One source of this 
ambiguity is the concern for direct and indirect 
effects. In the influential language of Lazarsfeld, 
an intervening variable which reduces a causal 
measure (e.g., a direct path) to zero, "explains" 
the original relationship. In the first instance, 
inheritance or immobility is thereby "explained." 
But the explanatory factors, like education, are 
frequently measures of individual achievement. 
Since these must "explain" final outcome, in the 
sense of incrementing R-square, to perform as 
intervening variables, these factors also "explain" 
mobility. The ambiguity arises because effective 
intervening variables ^explain" both mobility and 
immobility, but in different senses of the verb 
"explain." As a result, the two complementary 
concepts become confounded. 

13 One could also ccmpare the logically comple- 
mentary residuals of the correlations, although this 
requires suitable negations of propositions. The 
residual variance in education (.738) is 41 percent 
greater than the residual variance in life chances 
after education (.524). The percent describes the 
ratio of the areas of the ellipses that circumscribe 
"most" of the observations in the corresponding 
scatter plots. But, in general, it can be misleading 
to mix comparisons of linear metrics such as 
correlation with square metrics like variance, just 
as it can be awkward to compare lengths and areas. 
In the text, I will restrict attention to correlations. 
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without advantage might be less than over- 


whelmed by this fluidity. 


Of course, life chances after education is - 


not the end of the story. At each succeeding 
stage, there are conditionally independent 
lotteries that will further reshuffle the rank 
order. In one sense, those who have retained 


disadvantage might envy the advantaged less | 


because the race toward fate is slightly less 
than half over. Within the life chance 
perspective, the partition of the metaphoric 
race is numerically definite. LCA(education) 
correlates .6172 with fate, which means that 
slightly more reshuffling will occur from 
educational completion to fate than occurred 
from background to LCA (education). 

But conditionally independent displace- 
ments from current rank offer limited solace 
to individuals. There is no built-in tendency 
to cancel extremes of advantage or disadvan- 


tage. Future positive and negative displace- : 


ments around current position will exactly 
balance at every position.!^ So the provisional 
or preliminary advantage retained after educa- 
tion is not yet fate; but relative to age peers it 
is advantage calculated over the only factors 
that yet exist. In the larger picture, it is a 
foreshadow subject to modification. In the 
short term, it is the sum total of the life story 
to the present. 
Labor market entry is a minor lottery, and 


14 This claim assumes that the conditional 
means estimated by the regressions are unbiased, 
which is'similar to assuming that the model is 
correctly specified. Under this assumption, most 
individuals lack any objective basis for expecting 
their future to be more advantaged or disadvan- 
taged than their current rank, even though 
collectively there will be many modifications. 
Strictly, this refers only to the information 
incorporated into the model. If one believed 
oneself to have a preponderance of unmeasured 
positive (or regative) qualities, then one could 
rationally expect favorable (or unfavorable) modi- 
fications. 

My suspicion is that subjective interpretations of 
this will turn where one stands in the process. 
Young persons, whose total rank can be based only 
on early factors, may be less inclined to discount 
the extant retention of advantage, because there is 
no reason to believe that they personally have any 
prospect pf benefiting from later modifications. 
Older persons, especially those looking down from 
positions -of success, may be considerably more 
inclined. to regard the earlier ranking as permeable 
to those who, like themselves, possess the requisite 
unmeasured qualities. : 

| 


[ 


i 


presents the sharpest contrast between the two 
ways of representing the pattern of status 
continuity. The life chances correlation is 
.9370. This;close coupling with background 
plus education is quite different from the 
decoupling suggested by the multiple correla- 
tion for first job of only .5749. Although first 
job is not very predictable, life chances at 
first job are. On the other side of the coin, the 
modest reshuffling entailed by first job 
contrasts with the substantial direct effect of 
first job on final attainment of .2811.!5 

The contrast between the high life chances 
correlation from educational completion to 
first job with the much weaker dependence of, 
first job on the prior variables of background 
and education corresponds to a central 
novelty of the life chances approach. Appen- 
dix B shows that in the limiting case where 
direct effects of the prior variables on fate are 
zero, the life chances correlation for any stage 
with its predecessor will equal the depen- 
dence of the variable added at that stage on 
preceding variables. (Similarly LCA [factor] 
will be a vector of scores perfectly correlated 
with factor.) The excess of the life chances 
correlation over its path analytic counterpart 
increases as "bypassing" direct effects from 
prior variables are greater. 

What this means, in the current context, is 
that relative standing or rank at first job 
depends on more than first job alone. It also 
depends on education and background. Those 
with identically ranked first jobs are unequal 
with respect to final outcome, in the degree 
that there are differences on prior factors. 
This means’ that such differences on prior 
variables need to be taken into account in 
assessing mobility. Conversely, the residual 
variance for first job overstates the fluctuation 
in rank that accompanies succession into first 
job.!5 

If there were no other direct effects. to be 


; accumulated into total rank, one could assess 


15 Algebraically, the squared direct effect ap- 
pears in the denominator of the life chance 
correlation. Numerically .9370? = (.3809)/(.3809 
+ (1—.5799?) (.28117)). All things equal, larger 
direct effects mean smaller life chance correla- 
tions. The example illustrations, however, that a 
relatively large direct effect can accompany a life 
charce correlation that is by no means small. 

!6 Compare Goldthorpe (1980) for the substan- 
tial rebound from low-ranking first jobs by persons 
with other advantages. 
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dependence/independence from prior factors 


by attending to first job alone. The life. 


chances result would converge to the standard 
resült for this special case. More generally, 
the life chances correlation reflects the weight 
of direct effects, and shows that the summary 
ranking after first job is quite similar to the 
ranking that described expectations for per- 
sons completing education. 

Tables 2 and 3, derived from Hope's 
(1984) comparative study of meritelection 
(his coinage for meritocracy) show interesting 
similarities and differences with the preceding 
analysis. Table 2 presents Scottish data. 
Table 3 is American data, derived by Hope 
from Jencks et al. (1972) who assembled it 
from various sources. For identification 
purposes, I will refer to it as Jencks's data. 
Proceeding directly to the prediction tables 
(panels D), we see that education reshuffles 
the rank somewhat more. The correlation of 
background life chances with post education 
life chances is .6117 for Scotland versus 
.6485 for the American data (.6902 was the 
most comparable Blau-Duncan result). 

The reason for this is worth exploring since 
it affords some insight into the present 
perspective. Comparison of panel C of Tables 
1, 2, and 3 shows that the correlation of 
background with fate is quite similar for the 
three sets of data (.4259 [Blau and Duncan], 
.44 [Jencks], and .423 [Hope]). The correla- 
tion of post education with fate, respectively 
.6171 (Blau and Duncan), .6784 (Jencks), 
and .6915 (Hope), differs more. Since the life 
chances correlation is the ratio of the first to 
the second for each country, the stronger 
relation of education plus background to 
outcome in Scotland is the difference that 
leads to the lower status retention in the 
Scottish case. Over the longer haul, status 
retention is at similar levels in the three data 
sets. But in Scotland, more of the shuffling 
takes place by educational completion, while 
the American data suggests that labor markets 
(or in any event, post education processes) do 
more of the decoupling. 

The preceding comparison is somewhat 
unscrupulous. Blau and Duncan's data in- 
cluded father's education, but not IQ, while 
the reverse holds for the Jencks and the Hope 
data sets. In the latter analyses, IQ is 
incorporated in postschooling rank. Yet this 
makes surprisingly little difference. If one 
simply uses the common variables of father's 
occupation, education, and own occupation, 


N 
\ 
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life chance correlations would be .6563 
(7.405/.6171, Blau-Duncan), .6629 (= .44/ 
.6637, Jencks), and .6206 (=.423/.6816, 
Hope). This sort of stability with respect to 
measurement differences is common among 
the life chance calculations (most not reported 
here) I have carried out. The source of this 
robustness is an interesting topic for future 
investigation. 17 

The most interesting comparison concerns 
IQ in the American versus Scottish data 
(panel D, Tables 2 and 3). Information about 
measured IQ leads to a new ranking for the 
Scots that correlates only .6825 with father’s 
occupation. Rank is more impervious to IQ ` 
results in the Jencks data, since r is .7660. 
This result parallels Hope’s (1984) finding 
that IQ results are more rigorously heeded in 
the Scottish system. (However, comparison 
with the Scottish [.3] and. American [.357] 
correlations of IQ with background shows 
that the reshuffling in both systems is far less 
than would be needed to bring life chances 
into alignment with measured ability.) Most 
dramatically, in the Scottish data, background 
together with test results produce a ranking 
that çorrelates .8962 with the rank after 
school completion. The comparable figure, 
from the Jencks data, is a smaller but still 
quite large .8467. In both systems, early adult 
life chances differ but slightly from the 
ranking based on the information available in 
early adolescence. 


INTERPOLATIONS AND 
GUESSTIMATES 


Magnitudes aside, the conceptual and numer- 
ical simplicity of life chances offers addi- 
tional advantages. The succession of stages is 
placed in a common framework, and each 
succession is described by a single, simple 
correlation. The resulting simple pattern is 
both rich in implications and easily extended 
to incorporate new features. 

The life chances summary results in a 
perfect causal chain when subjected to path 
analysis. As a result, the several paths 





17 But it is not completely mysterious. First, 
many kinds of measurement error cancel (or nearly 
cancel) when life chance correlations are calcu- 
lated. Second, adding new indicators to the 
measures incorporated to a stage increases both the 
numerator and the denominator of the life chance 
correlation, leaving the result fairly stable. 
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Table 2. Correlations, Path Analyses, and Life Chance Summaries for Scotland, Derived from Hope (1984) 























A. Correlations and path analyses 


pa.occ IQ 
pa.occ 1 0.3 
IQ 0.3 0.91 
educ 0.427 0.723 
occ 0.423 0.559 


educ occ 

0.230879 0.172369 
0.653736 0.168523 
0.428763 0.468555 
0.664 0.521762 





Note: Path coefficients for the column variable as dependent are in the upper right, residual variances are on the 


diagonal, and correlations are in the lower left. 


B. R*, multiple R, and the residual, e 

















pa.occ ] IQ educ Occ 
R na -0.09 0.571236 0.478237 
R na 0.3 0.755802 0.691547 
e na 0.953940 0.654800 0.722330 
Note: Entries are for the column variable regressed on variables to the left. 
C. Multiple R, for '62 occupation on sets of independent variables 
pa_oce IQ& educ& 
0.423 0.619762 0.691547 
Note: educ& designates the set (pa. occ, IQ, educ]. 
D. Prediction table 
pá.occ IQ& educ& occ 
pa_oce 1.0 0.682520 0.611671 0.423 
pa.occ, IQ 1.0 0.896196 0.619762 
pa. occ, IQ, educ 1.0 0.691547 
Occ 1.0 


Note: Correlations of life chances after the variable in the top label, which are based on the set given in the left 


label. . 


connecting background to fate are factors 
whose product is the multiple correlation 
from background to fate. Thus the paths are a 
partition. 

Interpretively and conceptually, this offers 
major advantages. Sequences of stages can be 
combined by simple multiplication. Both the 
components and the products are correlations, 
and thus the familiar model of the bivariate 
normal (Gaussian) distribution can be used to 
interpolate details for any transition. 

This is potentially useful in several regards. 
The correlations linking life cycle stages can 
be approximately identified with years of age 
for cohorts. This means that the model places 
limits, and often very narrow limits, on the 
expected amount of change over particular 
intervals. One can quite readily translate these 
into percentiles, and into other descriptions 
that can be interpreted as a contest among 
persons. 

Narrow limits can produce sharp implica- 
tions. If mobility is contextualized in this 
manner, productive thought experiments be- 
come possible. For example, consider the 


fictitious headline "Harlem youth graduates 
from Harvard." Can this be regarded as a 
change from the 10th percentile to the 99th? 
If the frame is taken as youth from age 16 to 
age 22, then the governing correlation is 
higher than .85. The corresponding probabil- 
ity is less than 1 in 20,000,000,000 (z — 6.49, 
p=4.29 xX e—11). At those odds, it has 
probably not happened in the history of the 
republic. !8 

In a somewhat similar fashion, it should 


'8 The correlation of .85 is below the estimates 
in Tables 2 and 3 for rank after testing to rank after 
educational completion. Alternatively, the estimate 
in Table 1 for family of origin to school 
completion is .69, and age 16 to 22 must be a 
small fraction of that. The 10th percentile amounts 
to a standard deviate of — 1.28. The conditional 
distribution has a mean of — 1.09 and a standard 
deviation of .5267 (= sqrt(1—.852?). Thus an 
outcome in the 99th percentile, or unconditional 
Standard deviate of 2.33, corresponds to a standard 
deviate of 6.49 (=(2.33—1.09)/.5267) in the 
conditional distribution. 
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Table 3. Correlations, Path Analyses, and Life Chance Summaries for the United States, Taken from Hope (1984), 


Derived from Jencks et al. (1972) 





A. Correlations and path analyses 








pa.occ IQ educ occ 
pa.occ 1 0.357 0.318537 0.147232 
IQ 0.357 0.872551 0.466282 0.173319 
educ 0.485 0.58 0.575065 0.476067 
occ 0.44 0.502 0.648 0.539720 





Note: Path coefficients for the column variable as dependent are in the upper right, residual variances are on the 


diagonal, and correlations are in the lower left. 


B. R?, multiple R, and the residual, e 











pa.occ IQ educ Occ 
Rm na 0.127449 0.424934 0.640279 
R na 0.357 0.651869 0.678439 
e na 0.934104 0.758330 0.734656 
Note: Entries are for the column variable regressed on variables to the left. f 
C. Multiple R for '62 occupation on sets of independent variables 
| pa-occ IQ& educ& 
0.44 0.574410 0.678439 
Note: educ& designates the set (pa occ, IQ, educ]. 
D. Prediction table 
pa.occ IQ& educ& occ 
pa_oce 1.0 0.766003 0.648547 0.44 
pa.occ, IQ 1.0 0.846664 0.574410 
pa.oce, IQ, educ 1.0 0.678439 
occ 1.0 


Note: Correlations of life chances after the variable in the top label, which are based on the set given in the left 


label. 


prove possible to contextualize other local 
investigations of small and nonrandom sam- 
ples, for example, ethnographies. With nar- 
row limits, even small samples can defy the 
laws of chance and could suggest where the 
larger model needs modifications. Thus life 
chances provides the beginnings of tools for 
constructing a framework of expectations 
based on expensive large samples that can be 
applied to smaller local samples. 

Another kind of interpolation involves 
variables and social processes. The multipli- 
cative property becomes additive in loga- 
rithms. Therefore, letting 1 stand for back- 
ground, 7 for fate, 2, 3 . I-1 for 


intermediate stages, and R(4,5) for the life - 


chances correlation among stages 4 and 5, 


Ln(R(1,D)— X Ln(R(1,2) 
+ Lr(R(2,3) . 
+Ln(R(I~1,D). 


An implication is that interpolation of addi- 
tional stages is a zero sum analysis. That is, 
to stick a stage between two extant stages is 


one-to-one with dividing the corresponding 
log of the correlation into two pieces that sum 
to the old total. 

An additional intuitive tool is provided by 
the fact that for correlations that are large 
—Ln(r) approximately equals 1-r (where Ln 
refers to the natural logarithm). Thus the 
numbers to be subdivided are decrements of 
correlations from unity. So a .9. can be 
divided approximately into a .99 and a .91, or 
a .98 and a .92, or into any other combination 
of numbers whose difference from 1.0 sums 
to 1.0 —.9. , 

I have used this property to guesstimate the 
values in Figure 2, drawing mainly on the 





The estimates apply to life chances after the variable 
appearing in the box. They are based on the interpola- 
tions discussed in Section IV. 


Fig. 2. Path Diagram describing life chances based on 
Blau and Duncan (1967) with interpolations 
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data used above! One additional feature i is that 
measurement error in’ occupational rank, 


estimated by Bielby, Hauser and Featherman ` 


(1977) to be .8, appears at the penultimate 
stage,'as it were, between this year's job and 
last year’s, since in fact it was derived from a 
test retest over an eight-month interval. 

I would not defend the estimates in detail, 
but merely suggest that such interpolations 
within’ the zero sum context of the life 
chances approach are far from arbitrary. All 
such interpolations are governed, first, by 
making sure that products correspond to any 
actual, estimated paths, and second, by 
imposing a zero sum split into portions 'as 
new s(ages are added. It is a considerable aid 
that the guesstimates are accessible às decre- 
ments.of correlations from unity. It is hard. to 
imagine that any interpolations are off by as 
much ‘as .05, because most of them cannot 
logically vary across much more of à range. 

Together these tools point toward rough 
and réady estimates of the typical kinds of 
changes associated with the full range of 


ranks across the full range of life cycle'stages. ' 


Life chances thus provides a simple summary 
for synthesizing results. 
summary, thought experiments relating local 


quite narrow bounds on outcomes. It offers a 
new way to attach new meaning to patterns of 
social ‘mobility. D 
i 


i PE 
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So what should “the lads” make of it? Since 
“the lads” had been tested and placed in 
academic tracks by the time of Willis's 
observations, they were somewhere: midway 
in the mobility from IQ results tò- school 
leaving. Hope’s claim that the British system 


is intermediate between Scotland and the 


U.S. means that their movement is: circum- 
scribed by a correlation falling between the 
Scottish result of .8962 and the U.S. result of 
.8467. Since the least of these is “high” by 
almost any standard, fatalism on the part of 
"the lads" seems amply warranted. The 


typical result of academic: striving versus . 


alienation can amount to only a very few. 
rungs on the social ladder. For example, a 
person starting from the 99th percentile under 
Scottish conditions would land between the 


98.4th and 99.4th percentiles two-thirds of . 


the time. The looser American system would 
give a range of 96.5th to 99.8th. There is rank 
Le i , : i t ` 
: i 


Within :such a. 


to be won or lost at Shaol, but it is id to 
argue that the stakes are very great. 

- Should “the lads” discount this because 
earlier and later processes also decouple 
people from origins? The only earlier process 
recordéd in the data is IQ measurement. 
Willis reports that "the lads" were not overly 
impressed by the validity of such testing. In 
any case, their appreciation of their personal 
chances incorporated the fact that they had, 
already been tested and found wanting. On 
the other side, they might take comfort in the 
possibility of later good fortune. But such 
luck as there might be is independent of 
schooling outcomes, and thus irrelevant to 
orientations toward 'Schooling. Furthermore, 
there are no grounds for anticipating more ` 
good luck than bad. So if the "the lads" were 
attending to their immediate future, or more 
precisely to the prospects associated with 
their future 10 years: hence, they were. 
essentially correct in their assessment that 
effort at school would avail them very little. 

.In some strict sense, such conclusions were 
implicit in earlier analyses. The prediction 


"table is based on the same empirical base and 


summarizes the same facts. But the interpre- 


' tive or subjective difference is substantial. 
observations to large sample results generate . 


And it is useful to outline the sources ‘of the 
greater continuity of rank that emerges from 
the alternative analysis. 

The greatest contrast is that the correlations 
in the prediction tables are large. This, in 
turn, is connected with the simplicity of the 
life chances path representation. At the limit, 
if the standard representation yielded a simple 
causal chain, then the diagrams from the two 
approaches would be identical. More gener- 
ally. the standard representation disperses 
continuity across a complex tangle of bypass- 
ing direct paths. The greater life chances 
magnitudes correspond to condensing the 
tangle into, simpler, and more Sin. indices 
of status continuity. 

Greater simplicity and larger magnitudes 
recording continuity are therefore empirically 
contingent; The contingency will usually 
hold, since perfect “explanation” by interven- 
ing variables is more the exception than the 
rule. The contrast between life chances and 
conventional causal analysis will be greater as 
bypassing direct effects are more numerous 
and are weightier. And since the greater 
continuity , brought out by life chances is 
contingention the data, the contrast- between 
the approaches is not one of fact or validity. 
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The life chances pattern may appear novel, 
but in another sense it merely highlights what 
was already (or always) there. 

The central difference is in the question(s) 
addressed. Adoption of the path analytic 
representation impels attention to the family 
of questions “how do each of the earlier 
measures of status contribute to standing on 
each of the later measures?” But the same 
material can be used to address the more 
. pointed question “how much mobility?" The 
latter question leads to a simpler family of 
answers because it abstracts from the qualita- 
tive distinction among variables to a singular 
, characterization of abstract rank that is 


+ 
mn 


comparable across the several life cycle. 


stages.. 

The abstraction io rank brings into focus 
' the slowly shifting relative positions of 
individuals. The more pointed question leads 
to a wholistic summary of total position 
across the several contributing factors. The 
result is directly analogous to a contest among 
persons; it describes standing vis-à-vis age 
peers as the metaphoric race for advantage 
unfolds. The close analogy between popular 
metaphor and technical summary suggests 
that the results lend themselves to direct 
comparisons with lay perceptions. 

Standard causal analysis forces attention 
onto an intervening conceptual layer of 
qualitatively’ distinct variables. Interpretation 
‘requires contextualization, for controls on 
prior variables and by degrees of effect on 
later variables. This intermediate complexity 
does not refer to persons with scores on the 
several factors. Causal results are separated 
from the ordinary language -of wholistic 
anecdotes and stories that are a primary 
means of communicating understanding of 
social life. The less abstract causal pattern is 
not false-and should be examined when 
substantive concerns parallel the possibilities 
implicit in relative causal weights. But the 
relevance for subjective appreciations, by 
. individuals concatenating their various ranks 
into a total position, is obscure. 

First job illustrates the contrast. The Blau 
and Duncan result assigns a complex role as a 
. mediator for both education and background. 
First job is also a moderately potent precursor 
for adult occupation. But it is also substan- 
tially orthogonal to prior factors. In one 
sense, this indeterminacy is a species of 
mobility. But it is misleading. The life chance 
result brings out the very substantial continu- 
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ity of rank across labor market entry. This 
damping of apparent motion occurs because 
family background and education exert con- 
siderable weight on fate over and above first 
job. For persons, with scores on all dimen- 
sions simultaneously, advantage/disadvantage 
at labor market entry turns on more than first 
job status, and therefore the volatility of this 
status overstates the fluidity that would enter 
a realistic assessment of where one stands. 

In sum, life chances does not represent an 
empirical “refutation” of path analytic results. 
Such refutation as there may be is conceptual. 
The difference is one of focus and of motivat- 
ing question. The greater simplicity and greater 
continuity of rank is one-to-one with changing 
the focus to shifts in the relative positions of 
individuals and away from the interdependen- 
cies among variables. Thus a simpler pattern, 
expressed in the lower-order concept of cor- 
relations, emerges from a more central ques- 
tion, to yield results more directly relevant to. 
individuals as wholes. This holds promise for 
more direct comparison with lay perceptions’ 
of changing relative standing. The striking con- 
tinuity of rank that is revealed may even alter 
the subjective appreciation of some sophisti- 
cated observers. 

In a strict sense, the greater continuity of 
rank revealed by life chances is not novel. It 
may appear novel because framing the issue 
differently leads to different answers. But the 
empirical facts are the same. In principle, one 
could apprehend the slowly changing ranks of 
individuals from the various small effects dis- 
persed across paths in the more baroque causal 
representation. In another sense, the novelty is 
substantial. By condensing the continuity that 
was previously splintered among causal effects 
into singular indices of the continuity of rank, 
the life chance perspective reveals a picture of 
considerably reduced fluidity. 
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APPENDIX 


Formal results 


The first section will state and prove the theorem 
that justifies the calculation of correlations among 


ranks. The multiplicative rule relating trios of : 


stages follows a correlary. Together these justify 
the construction of the prediction tables. The 
second section will outline the relation of correla- 
tions among ranks with direct and indirect effects. 


The correlation of life chances from different life 


_cycle stages 


The correlation of life chances for two different life 
cycle stages, i and j, can be denoted R; ;. It is the 
correlation between the rankings of individual 
expected outcomes at the two stages. Each ranking 
is a vector of predicted values, or ? for a given set 
of independent variables. Call the first set X, and 
the second X2, where X, and X? are matrices of 1 
or more independent variables. X,» stands for the 
matrix of the two sets taken together. 


In this setup, . 
?, = XiQGX) XY 
= AY ; , 
Pia = X12(X),2X1,2)7'Xi 2Y 
= Hir, 


where Y, and Y, are the predicted values from 
the regression of Y on X, and X,,, respectively. 
H, and Hj are notation for the corresponding 
symmetric, idempotent matrices X, (X,X,) ^ !X, and 
X2(X) 2X1.2)7!X).2. They are called H matrices 
because they “ put a hat” over Y. 

The covariance of the predicted values is given 


by 
l cov = ffi; a) 


-YHH.Y. 
This can be simplified by noting that 
Hi2 = Hey + H, 


(compare Searle 1982, p. 269), where H» , is the 
idempotent form associated with the regression of 
Y on X21, and X2, stands for X» residualized on 
X,. 


Substitution yields 
COV = = Y'H (H2: + Hy)Y (2) 
‘= YH iY 
since H, X H5, = 0 and H; is idempotent. 
The correlation is the covariance divided by 


the standard deviations of the vectors f; and 
Y,2 which are /Y'H,Y and VY H, 2Y. There- ` 
fore, the correlation of the paces values is 
given by 


YHY 


—————— 3 
Re,p, = VYH;Y VY'H, 2Y 9 


= VYH,Y 
VY H, 2Y 
Since the R? for Y on X, and for Y on X, and X; 
jointly are 


YH,Y Y'H,,Y 
e ERT and RM ru m 


Var(Y) Var(Y) 
the correlation may also be given as 
Ri = R? on X, 
E (4) 
Ry Xa. 


Thus the correlation of life chances is the square 
root of the ratio of the R? for Y on X, to the R? for 
Y on X, and X?. This describes the calculation of 
the entries in the prediction tables. 

The multiplicative relation among: more than 
two life cycle stages follows from the theorem. If 
Ri, R$, and Rs are the square roots of the 
proportion of variance explained (i.e., the multiple 
correlation coefficients) after stages 1, 2, and 3 
respectively, then the correlation of expected 
outcomes of stage 1 with stage 2, or Ri, 2, is equal 
to R,/Rz. Then 


Ri3 =R,/R3 , . (5) 
=(Ry/Rz) X (RolRs) 
=Ri2 XR. 


Of course, this extends to any number of stages. 


The relation of direct and indirect effects to 
corretations of life chances 


This section will connect the correlation of 
life chances with direct and indirect effects. 
Matrice notation is employed to achieve full 
multivariate generality. More accessible results are 
presented for the special case of one intervening 
variable and in the qualitative summary in the final 
section. 

Let X, stand for an n by k, matrix of n 
observations of k; independent variables. Let X, 
stand for an n by k, matrix of k, intervening 
variables, and Y for the dependent variable. In 
concrete terms, Y might refer to culminating 
occupation, X, to measures of the educational ' 
process, including years completed, and X; to 
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assorted ranking criteria applicable to the family of 


origin. 
The model is two matrix equations, 


Y = XiBi + X2 Bo + & (6. 


(X) = XiBs + ex, (7) 


“where B and e refer to unobserved popula- 
tion values. I will use Roman letters (Ê and e) 
to refer to sample values. Since the follow- 
ing argument rests on the algebra of OLS 
estimators, assumptions about the errors would be 
superfluous. f 

‘Assume that all columns are mean centered 
so that intercepts can be ignored. Assume fur- 
ther that the independent variables are scored so 
that their zero-order correlation (and hence covari- 
ance) with Y is = 0. There is no restriction on the 
correlations among independent variables or the Bs 
or fs, so that these can be negative. This 
directional convention does not sacrifice general- 
ity. 

The (squared) correlation of life chances at stage 
1 and stage 2 is 


i 








YHgaY a 
i Riz cs = 
YH, Y | 
s Vf 
fish 
Since . 
fi = Xi + X,B,B, 


Ê, a= Xıbı + XB, 


the (squared) correlation can be written in terms of 
the sample estimates as 


Ri = ' (8) 


E: QXXGSB, + 28x UB, ore 
BYX',X,B,  2B',B' X' XB, tB X, 


"When Ê, is zero. there is no “direct” effect, 
+, and the (squared) correlation of life chances 
reduces to : 


Fia = y re 


—ow—— — An 


BB X BB, (9) 


i 
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The substitution B’,x’, = x ud Xp) X. 
X'4Hy, leads to 


BX’ H., XB, 

B’,X’, XB, 

XB, on X, ; (10) 
which reveals that this is 'a squared multiple 
correlation with X25, taking on the role. of the 
dependent variable and X, as independent variable. 
Since XB; is the vector of values of Y predicted 
on X5, this multiple correlation indexes the degree 
that X; contributes to Y through X3, i.e., indirectly. 

In the context of the model, this makes good 
sense. X, can refer to multiple variables. But the 
dependency that matters is of the particular linear 
combination‘of the columns of X» that carry Y. As 
it were, the construction brings out the optimal 
path of indirect dependence of Y on X, through the 
variables making up X,. When B, = 0 life chances 
at the latter stage depend on the former stage only 
insofar as the prior stage variables "explain" the 
part of the latter stage variables that cause (or 
index) final fate. 

For the case when X; is a single variable, a 


2 1 
R2 = 


. particularly simple result holds. The ‘interior 


portion of the numerator X'2H,,; X; is the ESS for 
X; regressed on X. This ESS is equal to X2X2R? y; 
on xı. Then 


D'R, on xX’ X282 
B'X'X,B, 


X, on X, 


Rs ES 
(11) 


Hence, no direct effects (B, = 0) implies that the 
(squared) correlation of life chances is equal to the 
R? for the intervening variable X; on the preceding 
variables, Xj. In this happy circumstance, the de- 
pendencé of the intervening variable on the prior 
variable is an index of the preservation of rank (or 
immobility) while the residual (or “unexplained”) 
variance is an index of change of rank or social 


. mobility. Thus, when direct effects are absent, the 


dependence of intervening statuses and the depen- 
dence of life chances are numerically indistinguish- 
able. : 

In general there are direct effects. To assess their 


' impact, derivatives can be calculated to see how 


increases in B, affect the (squared) correlation of 
life chances. 

To describe the Cerivative of the correlation of 
life chances with respect to changes in direct 
effects, it is helpful to substitute simple symbols 
for complex ones. Let 


f) = = B'x'x,B, + 28',B',x', xb, 
» = Bx GB, 


= OK xb, on X, 


W 
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| RC  f(B) 
1,2 C m fÊ 


Since 


af(B;) 


u 


= 2(X',X DÊ, + AXX )bb, = 2X' LY, 


aR’, » 


àB;, 
(C fBpypx,Y- (RC + fB,))2X" Y 
(C + RBD? 





_2e(1 - R’)X' Y (12) 
(C + fB)Y 


describes the implications for the (squared) corre- 
lation of a change in the i-th -element of By, The 


. numerator of this derivative is.always = 0 since 1 


— R? = 0, C is a positive semidefinite quadratic 
form and always = 0 and X';;Y is non-negative by 
the convention of the coding of variables so that 
X'jY = 0. The denominator is a square and pr 


: positive. Thus any increase in an element of. 


implies an increase in the life chances correlation. 

One special qualification is in order. Any 
increase in a negative element of B, toward zero is 
here regarded as an increase in direct effect. This 
is verbally straightforward, .and substantively 
sensible. By convention, all zero-order relations 
with the dependent variable are coded to positive. 
Thus a negative element denotes a partial relation 


‘of opposite sign from the zero-order relation, or an 


instance of suppression. Of course, such configu- 
rations are rare empirically, and almost always are 
modest in magnitude. : 
Although it is conventional to regard negative 
partials as diminishing toward zero, such usage 
improperly collapses two different changes away 
from zero direct effects. Congruent changes, where 
direct effects increase, are accompanied by in- 
creases in the zero-order relation and in explained 
variance. Incongruent changes, where direct ef- 
fects become more negative, have the unusual 
implication of diminishing zero-order relations. 
(Of course, parallel considerations govern negative 
zero-order relations and partials of the same or of 
opposite sign.) Thus conventional usage leads to 
the awkward possibility that a relationship strength- 
ens and weakens at the same. time. Í prefer to 
regard any change in a direct effect that strengthens 
a zero-order relation as an increase, although that 
does include the possibility of an increase toward 
zero that is a decrease in absolute magnitude. 
Those who wish can add the appropriate qualifica- 


tion, which amounts to substituting "direct effects 
changing so as to increase the magnitude of the 
(non-negative) zero-order relations, that is dimin- 
ishing toward zero or increasing. above zero" for 
“direct effects increasing.” 

Thus any increase in any element of B, will in- 
crease the correlation of life chances. With the qual- 
ification on sign given above, any increase in “direct 
effects” will increase the life chance correlation. 


The qualitative relation of direct effects, indirect 
effects, and life chances correlations 

“Direct effects” refers to the circumstance where 
an intervening variable fails to mediate all of the 
relation -between a prior and a posterior variable. 
The preceding justifies two generalizations. At a 
limiting point where direct effects are zero, the life 
chances perspective and the path analytic results 
converge. The correlation of life chances is equal 
to the correlation of intermediate status rank with 
preceding rank. The residual (“unexplained”) 
variance in the intervening variable corresponds to 


. the change in rank, that is, to the mobility as 


measured in terms of life chances. The path models 
for the two variants would be numerically 
identical. 

In contrast, where direct effects persist when 
intervening variables are introduced, the life 
chances correlation will be greater than the 
corresponding explained component of the interven- 
ing stages. As direct effects are greater, the excess 
of the life chances correlation will be greater. 

A useful concept ‘for understanding the contrast 
is. “sufficiency.” If an intervening factor reduces 
all residual direct effects to zero, then prior 
variables carry no additional predictive capacity. 
Such a factor is sufficient for later rank insofar as 
other factors give no additional predictive power. 
In general, intervening variables like IQ, educa- 
tion, and first job are not sufficient. 

If a variable were “sufficient” the associated 
LCA variable would be perfectly correlated. 


. Correspondingly, the weights for the prior vari- 


ables would be zero. When “sufficiency” fails, 
these weights are greater. And accordingly, the 
correlation of the LCA variable with prior 
variables increases from the lower limit where the 
weights were zero, X 
Substantively, this reflects the circumstances 
that prior variables must be weighted in along with 
any variable that is insufficient. First job is an 
example. Education and background carry substan- 
tial weight, alongside first job, in the prediction of 


fate from the point of labor market entry. 


Accordingly, rank at first job is more predictable 
than first job alone, because the factors employed 
as predictors are also components of rank at labor 
market entry. 

At the opposite extreme, if first job had no net 
impact on fate, then rank at first job would be a 
linear combination of education and background 


! 
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alone. Indeed, it would be the same linear 
combination as LCA (education) and the life 
chances correlation would be unity. Of course, 
empirical configurations will generally fall some- 
where between, as illustrated by the Blau and 
Duncan data. 

The analysis revealed an exception that is more 
apparent than real. If a factor has a negative direct 
effect, even though it is positively correlated with 
fate, and produces "status inversion," it reduces 
the predictability of life chances. This makes 
substantive sense since "the first become last and 
the last become first." As such a pattern is weaker, 
the life chance correlation is greater, and measured 
mobility is reduced. Although reduction in "status 
inversion" is absolute diminution of a direct effect, 
it is also an increase in the coupling of a factor 
with fate, and it makes substantive sense to 
assimilate it to the notion of “increase of direct 
effect." 

In sum, zero direct effects, which means that 
some variable is a suzficient predictor for eventual 
outcome, is a limiting case where life chances and 
conventional path results coincide. But greater 
direct effects, and more numerous direct effects, 
increase the life chances correlation in comparison 
with the degree of determination of the relevant 
intervening variable. In this sense, the more 
parsimonious life chances representation is a 
condensation of the effects which causal analysis 
disperses across the parhs bypassing the interven- 
ing factor. The complexity which comes with 
intervening variables that leave residual direct 
effects masks the continuity of individual rank. 


_Jencks, Christopher, 
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Simplicity and larger magnitudes indexing continu- 
ity go hand in hand. 
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THE ENCLAVE AND THE ENTRANTS: PATTERNS OF ETHNIC 
ENTERPRISE IN MIAMI BEFORE AND AFTER MARIEL* 


ALEJANDRO PORTES 
The Johns Hopkins University 


LEIF JENSEN 
Pennsylvania State University 


Four hypotheses concerning the character and consequences of participation in an 
ethnic enclave economy are examined. The first concerns the justifiability of. 
operationalizing enclave participation on the basis of place of residence, as done 
in recently published research. The second and third pertain to the effects of ethnic 
economy participation on entrepreneurs and workers, respectively. The fourth 
deals with determinants of self-employment among ethnic minorities. Two data sets 
are employed in this analysis: the 1980 Census individual sample for Cuban-born 
adult immigrants in South Florida and a 1983—86 longitudinal survey of Mariel 
entrants in the same area. These two independent data sets permit a more 
authoritative evaluation of the hypotheses. They also allow an assessment of the 
extent to which earlier findings about the pre-1980 Cuban-born population apply 
to the more recent entrants. Results consistently contradict depictions of ethnic 
enterprise as vehicles for exploitation and of enclaves as mere residential 
agglomerations. These structures are defined by physical proximity of firms, not 
dwellings. Ascriptive factors —primarily gender and marital status—play a decisive 
role in the emergence of enclave enterprise net of human capital endowments. 
Theoretical implications of these findings, in particular the relationship between 
intact nuclear families and the rise of an entrepreneurial minority, are discussed. 


This study has two purposes. The first is to 
reexamine empirically the concept of ethnic 
enclave and the consequences and determi- 
nants of enclave employment in the context of 
competing arguments in the recent literature. 
The second is to delineate how these 
processes have affected the economic condi- 
tion of a recent wave of immigrants—the 
Cuban Mariel refugees — concentrated in the 
Miami metropolitan area. 

Events in Miami are part of a broader trend 


which has literally transformed the character 


of ethnicity in America and compelled 
parallel changes in our theories of racial and 
ethnic relations. This trend has two causes. 
The first is the increasing volume and national 
diversity of immigration in the wake of the 
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1965 Immigration Act. Since that year, 


inflows from Asia, the Caribbean, and 
elsewhere have grown considerably. In 1980, 
the foreign-born already numbered 14.1 
million—the largest figure since 1920. In- 
cluded among them were groups, like Asian 
Indians, Vietnamese, and Dominicans, that 
had no real presence in the country two 
decades earlier (U.S. Bureau of the Census 
1984). By the mid-1980s, the United States 
was receiving over half a million legal 
permanent residents per year from more than 
150 different foreign nations and dependen- 
cies (U.S. Immigration and Naturalization 
Service 1987). 

The second cause of the changing character 
of ethnicity is the parallel diversity in patterns 
of incorporation documented in the research 
literature. No longer tenable, for example, is 
the image of an unilinear assimilation path in 
which immigrants of modest origins enter at 
the bottom of the socioeconomic hierarchy to 
begin a gradual process of acculturation and 
mobility lasting several generations. In 1986, 
slightly over 63,000 immigrants were admit- 
ted for permanent residence as professionals 
and managers, a figure representative of those 
during the preceding decade. Although not all 
foreign professionals resume their respective 
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careers in the United States, many do so and 
they tend. to follow a pattern of adaptation 


quite different from the conventional assimi- 
lation image (U.S. Immigration and Natural- 


ization Service 1987).! 


Even more than professionals, immigrant 
entrepreneurs have received growing attention 
in the research literature as their presence has 
been felt in major metropolitan areas. Ivan 
Light's Ethnic Enterprise in America (1972) 
pioneered this line of inquiry which was given 
subsequent impetus by the performance of 
groups who—like the Koreans, Taiwanese, 


and Cubans — proceeded to create entrepreneur- 


ial communities in their principal areas of 
settlement (Kim 1981; Boswell and Curtis 
1984; Cheng 1984). Research on these and 
other groups seems to have shifted the 
emphasis of race and ethnic studies from the 
plight of ethnic minorities to the conditions 
which may lead to alternative social and 
economic outcomes (Forment 1987). To this 
more diversified literature belong such con- 


cepts as “middleman minorities" (Bonacich . 


1973), "entrepreneurial resources" (Light 
1984), and "ethnic enclaves" (Wilson and 
Martin 1982). 

Research on patterns of economic adapta- 
tion among recent immigrants has not been 
free of controversy, however, about either the 
origins or consequences of alternative paths. 
In particular, ethnic enclaves—highly visible 


and complex social phenomena— have been . 


painted in very different colors by various 
authors. Their arguments give rise to the 
divergent set of hypotheses which this study 
seeks to investigate. 


CONCEPTS AND HYPOTHESES 


Several researchers have portrayed ethnic 
enterprise and entrepreneurial communities in 


.positive terms as an unorthodox but effective 


vehicle for upward mobility among immigrant 
minorities (Portes and Bach 1985; Wilson and 
Martin 1982; Nelson and Tienda 1985). In 





- | In 1980, for example, Asian Indians, a group 
noted for its high proportion of university-trained 
professionals, received median household incomes 
of approximately $25,000, a figure which ex- 


ceeded by $3000 the national median despite their , : DE : 
y$ ien : P ‘ entrepreneurial communities live and work in 


relative recency of arrival. Over three-fourths of 
Indian immigrants have been in the United States 
for less than a decade (U.S. Bureau of the Census 
1984). 
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their view, the ethnic community offers 
access to resources (e.g., credit, markets, and 
labor) which enable small enterprises to 
survive and prosper. Other authors have 
portrayed these small units as functional 


adaptations to the absence of opportunities in 


regular employment because of outside dis- 
crimination. The emphasis of this second 
School has shifted gradually, however, from 
the "survival" function of early immigrant 
enterprise to a more positive characterization 
of such initiatives as means for economic 
advancement (Light 1984; Waldinger 1986). 

A final and more pessimistic viewpoint 
paints immigrant enterprise as a disguised 
vehicle for capitalist exploitation. Bonacich 
(1987) argues, for example, that large capital- 
ist firms use middleman entrepreneurs to 


'exploit a captive immigrant labor force and 


reduce their own production costs. Labor 
conditions in ethnic firms are portrayed as 
similar to those in the broader "secondary" 
labor market, offering mostly menial, dead- 
end jobs. Immigrant workers can find them- 
selves in an even worse situation, as owners 
invoke ethnic bonds and take advantage of the 
immigrants' ignorance of the language and 
labor laws to impose harsh work conditions 
(Bonacich 1987; Sanders and Nee 1987). 
Several clarifications are in order before we 
can empirically test these competing asser- 
tions. First, what is meant by ethnic enter- 
prise and, in particular, by ethnic enclave? 
The first may be defined as a firm of any size 
which is owned and managed by members of 
an identifiable cultural or national minority. 
The second concept has been defined as a 


: concentration of such firms in physical 


space— generally a metropolitan area— which 
employ a significant proportion of workers 
from the same minority (Wilson and Portes 
1980). This sociological definition of “en- 
claves" differs from the everyday. use of the 
term as simple residential agglomerations. 

A recent article which tested effects of 
enclave participation among two immigrant 
minorities defines the concept operationally 
as areas where members of each group reside, 


- rather than places where they work (Sanders 


and Nee 1987). The study asserts implicitly 


- the identity of the everyday and sociological 


definitions by assuming that participants in 


the same areas. This assertion has not been 
demonstrated empirically, however. Prior to 


. testing competing views about the economic 
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effects of ethnic enclaves, it ‘is € to 
examine the extent to which this assumed 


identity holds. Following the above authors, a . 


preliminary hypothesis can be formulated as 
follows: 


I. There is no difference in the economic 
- effects of enclave participation, whether 
measured by where members of an 
ethnic minority live or where they work. 


A second problem is whether predictions 
about the outcome of ethnic enclave participa- 
tion apply equally to different categories of 
individuals. The most negative descriptions 
regard these activities as alternatives to 


unemployment or minimally remunerated 


employment in the open labor market. In this 
view, all participants — whether employers or 
employées—are disadvantaged because their 
poorly capitalized ventures represent basically 

“invented employment” at the margin of the 
teal economy. Bonacich (1987) has rein- 


forced this image by depicting the precarious 


condition of ethnic “middleman” contractors 
squeezed between the power of large firms 
and the; need to preserve the loyalty of a 
coethnic labor force. This argument, which 
depicts the negative effects of ethnic enter- 
prise in the broadest terms possible, may be 
phrased: as follows: 


II. There is no advantage to participation 
in an ethnic enclave economy, regard- 
less of one's position in it. In particu- 
lar, enclave entrepreneurship does not 

- lead to a significant FRDFOMOHENI S in 
. earnings. 


A' second alternative position views ethnic 
enterprise as a vehicle for‘ the economic 
progress of entrepreneurs, but at the expense 
of their workers. Owners rely on the 
"competitive edge" provided by a cheap and 
diligenti coethnic labor force in order tó 
expand their enterprises. The economic gains 
can be considerable, but the direct producers 
receive : little in return. Sariders ‘and Nee 
(1987) ‘report, for example, 
entrepreneurs do as well as those who live 
outside ‘areas designated. as “enclaves,” but 
that workers do worse. The latter's plight is 


attributed to exploitation of their i ignorance of ` 
English and of legal covenants by employers." 


These and similar arguments lead to the 
following alternative formulation: 


HI. Participation in an. ethnic economy 


that ethnic . 
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may yield significant economic gains 
for entrepreneurs, but not for workers. 

. Relative to individuals of similar 
qualifications in outside employment, 
‘enclave wárkers receive lower earn- 
ings. 


Finally, there is the question of determi- 
nants of participation in ethnic enclaves and, 
in particular, entrepreneurship as opposed to 
wage employment. A number of studies have 
analyzed differential propensities for self- 
employment between ethnic minorities (Fra- 
toe 1984; Pedraza-Bailey 1985a; Aldrich and 
Zimmer 1985; Light 1984); the complemen- 
tary question of. employment differences 
within entrepreneurial groups themselves has 
received less attention. There are at present 
no general theories of determinants of immi- _ 
grant and ethnic enterprise. Economists :who 
have dealt with the issue have modeled 
self-employment as a human capital outcome, 
employing the same set of predictors as in i 
earnings equations. Their rationale is that 
such variables as education and work ' experi- 
ence should increase the skills necessary to 
open a.new business. Among the foreign- 
born, years since immigration and knowledge: 
of English are expected to operate in the same 
direction (Borjas 1986; Stevens and Chen 
1984). f 

Less attention has been paid to ascriptive 
variables such as gender, marital status, and 
household composition. Borjas (1986) notes 
that marriage should help solve the “shirking” 
problem. involved in hiring strangers as 
employees in small enterprises and success- 
fully incorporates the variable into his model 
of self-employment. With this exception, the 
empirical literature has, paid scant attention to 
historical and ethnographic evidence suggest- 
ing. that, such factors as the sexual division of 
labor and household composition can play an 


. important role in the emergence and consoli- 


dation of small enterprise. Contrary to this 
trend, we propose the IUIONing: foufth 
hypothesis: 

IV. Human capital PETE play only a 
limited role as determinants of enclave: . 
entrepreneurship. . Ascriptive vari- 
ables, in particular gender and mari- 

' tal status, have a significant causal 
,role in the process. a PE: 


SETTING AND DATA’ . | ae 
Cubans are the fourth largest and the most 
l 


, I 
i i 


bc 
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spatially concentrated foreign minority arriv- 
ing in the United States since 1965. While 
most other large immigrant groups have 
chosen New York or Los Angeles as their 
destinations, Cubans have clustered in the 
Miami metropolitan area. In 1987, 77 percent 
of new immigrants of Cuban origin declared 


Miami as their area of destination, a figure far . 


exceeding the level of concentration of all 
other major groups (U.S. Immigration and 
Naturalization Service 1988). As a result; the 
majority of that city’s current population is of 
Cuban origin and about half of the population 
of Dade County (the Miami SMSA) is 
“classified as Hispanic (Allen and Turner 
1988, pp. 163-65; U.S. Bureau of the Census 
1984). 
The Cuban inflow was originally one of 
dissidents of .-Fidel Castro's revolutionary 
. regime who planned to return after its demise 
' (Boswell and Curtis 1984;. Pedraza-Bailey 
1985b). The return option became blocked, 
however, in the aftermath of the Bay of Pigs 
defeat in 1961 and after the resolution of the 
Missile. Crisis in 1962. Beginning in the 
mid-1960s, a process of entrepreneurial 
: development took place in Miami which saw 
Cuban-owned firms increase from about 900 
in 1967 to approximately 25,000 today (U.S. 
Bureau of the Census 1985; Botifoll 1984). In 
1982, Dade County: ranked second in the 
absolute number cf Hispanic-owned firms in 


the country and first in gross receipts of such ` 


firms. The 2.2 billion dollars in sales reported 
by Dade’s Hispanic firms exceeded the 1.7 
billion of Los Angeles County, despite the 
latter exceeding Dade’s Hispanic population 
by a five-to-one margin (U.S. Bureau of the 
, Census 1985). 

. Studies of the Miami enclave economy 
have been based, for the most part, on survey 
data and aggregate information collected 
during the 1970s (Portes and Bach 1985; 
Wilson and Martin 1982). From a contempo- 
rary perspective, these data have two limita- 
tions. First, the available surveys were not 
representative of the entire Cuban-born popu- 
lation of the area because they were drawn 
: from immigrant cohorts arriving in a particu- 
lar year. Second, inferences from these data 
became even more questionable in the 
aftermath of the 1980 Mariel exodus which 
brought another 125,000 new Cuban exiles to 
the United States (Bach 1981; Clark, Lasaga, 
and Reque 1981). . 

. The manner of arrival of this latest wave 
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and its mode of reception in the United States 
departed significantly from the experience of 
early Cuban cohorts.. The U.S. governmerit 
refused to grant political asylum to Mariel 
refugees, assigning to them instead the label 
of "entrants, status pending." As entrants, 
the new arrivals had no access to benefits 
under the 1980 Refugee Act and faced a very 
uncertain future. For example, those detained 
even for minor crimes could be held indefi- 
nitely in federal prison pending deportation to 
Cuba. Adding to these unfavorable circum- 
stances were the widely diffused negative 


‘images of the Mariel exodus. Mariels were 


clearly “unwelcome immigrants” and their 
prospects for successful adaptation were 
correspondingly dim (Bach 1981; Portes and 
Stepick 1985). 

Nineteen-eighty was also the year of the 
decadal Census and its results may be used to 
correct the first limitation of earlier analyses 
of the Miami enclave, namely their lack of 
representativeness of the Cuban-born popula- 


. tion. Census data collection ended just prior 


to the start of Mariel and, hence, results 
cannot be used to correct for the second 
problem, that is the exclusion of this latest 
wave. The analysis below attempts to correct 
for both shortcomings by making use of two 


independent data sets: (1) the 1980 5/100 


Census public-use microdata sample -A 
(PUMS); (2) a longitudinal survey. of Mariel 
entrants in the Miami metropolitan area, 
conducted between 1983 and 1986. The 
relative strengths and weaknesses of these 
data Sources complement each other as 
follows: 

(a) Census results are representative of the 


. Cuban-born population in the Miami metro- 


politan area and Florida in 1980 but exclude 
the 1980 Mariel entrants and also omit several 
significant variables. Information is missing, 
partially or totally, on places of work and 
ethnicity of firm owners and coworkers. 
These omissions give: rise to problems of 
validity and selectivity, which are discussed 


' below. 


(b) The longitudinal. survey of Mariel 


„entrants is based on a two-stage area sample 


of respondents, aged 18 to 60, and living in 
the principal areas of Cuban concentration in 
the Miami SMSA.? The original sample of 





? Census tracts with 50 percent or higher 
concentration of Cuban-origin population formed 
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500 was interviewed in 1983-84 and reinter- 
viewed two years later. The follow-up survey 
retrieved 402 cases or 80.4 percent of the 
original sample.? These data contain all 
relevant variables, including those not avail- 
able from the Census. However, although the 
follow-up retrieval rate is acceptable, it poses 
problems of selectivity which are com- 
pounded by high unemployment rates among 
1986 respondents. 


RESULTS 


Ethnic Enclaves: Places of Residence or 
Places of Work? 


The sociological literature on enclave econo- 
mies. has emphasized the commercial and 
social networks among ethnic enterprises and 
their employment of a coethnic labor force. 
Popular usage of the term continues to stress, 
however, the residential concentration of a 
particular minority, a definition which makes 
“enclaves” identical with ethnic neighbor- 
hoods. Sanders and Nee (1987) examined the 
economic effects of enclave participation 
among Cubans by defining. as participants 
those who lived in the principal areas of 
ethnic concentration—the cities of Miami and 
Hialeah—and as nonparticipants those who 
lived elsewhere in Florida. These authors 
analyzed the 1980 PUMS comparing Cuban 
employees and the self-employed in and out 
of the Miami/Hialeah area. They conclude 
that there are no significant differences 
among the self-employed, but that enclave 
employees are at a significant disadvantage. 
This conclusion is questionable, however, 
because of the operationalization on which it 


is based. Although most Cuban-owned firms : 


in Dade County concentrate in the cities of 
Miami and Hialeah, this fact does not 


the sampling universe. Most are in the cities of 
Miami and Hialeah. Within tracts, blocks were 
assigned unique four-digit identifiers and selected 
through a simple random sample procedure. The 
probability of selection within blocks was fixed at 
1, making all eligible households in the block fall 
into the sample. Eligible households were those 
containing at least one Mariel entrant of either sex 
within the above age brackets. 

3 All retrieved cases were still living in Dade 
County at the time of the follow-up survey. There 
is no way to determine what proportion of the 
missing 20 percent left the area. 
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guarantee that all Cubans in these cities work 
in enclave firms or, conversely, that all those 
who live elsewhere work in nonethnic enter- 
prises. Sanders and Nee’s analysis is weak- 
ened to the extent that there is an imperfect 
overlap between places of residence and 
places of work in the Census sample. 

Census data lack information on the most 
appropriate variable for testing hypothesis I, 
namely ethnicity of firms where respondents 
are employed. In its absence, it is still 
possible to assume that Cuban immigrants in 
Florida who work in the cities of Miami and 
Hialeah are more likely to be enclave 
participants because these are the sites where 
most Cuban-owned firms concentrate. The 
PUMS contains information on places of 
work for 4720 Cuban-born individuals aged 
18 to 64 in the State of Florida. This large 
sample can be divided into four mutually 
exclusive categories: (1) those who live and 
work in the enclave area, defined as the 
municipalities of Miami and Hialeah and 
adjacent locations;^ (2) those who work in this 
area, but live elsewhere; (3) those who live 
there, but work outside; (4) those who live 
and work outside of the enclave. Although 
still an approximation, this typology allows 
us to document the extent to which places of 
residence and places of work overlap. 

Results in the first row of Table 1 begin to 
cast doubt on the appropriateness of defining 
enclaves on a purely residential basis. Respon- 
dents for whom places of residence and work 
do not overlap represent a sizable group: 40 
percent of the total, as well as of the 
employee and self-employed subsamples. In 
particular, Cuban immigrants who work in 
the enclave but live outside represent almost 
20 percent. Thus, the assumption of a 
one-to-one overlap between both definitions 
of ethnic enclave is not warranted. However, 
it may be argued that place of residence is still 
an appropriate proxy because enclave workers 
possess similar characteristics, regardless of 
where they live. 

Unfortunately, results in the following 
rows of Table 1 do not support that argument 
either. Cubans who work in the enclave but 
who live outside exhibit consistently superior 
characteristics to all enclave residents , and 
exceed educational and income levels even of 


* Areas known as Gladeview, Westview, Golden 
Glades, and others included in Dade County group 
048 adjacent to the two large cities. 
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Table 1. Socioeconomic Characteristics of Cuban Immigrants in Florida by Place of Residence and Place of Work, 











1979* 
Total 
I. 
Lives H- m. IV. 
In In/ Our’ 
Enclave Works Works Works Expert RENI ea siRNA 

Variable Ars? — Out In Out L Il. m. IV. I. IL. Ii. IV. 
N _ 1477 938 886 1369 1305 892 760 1195 172 96 126 174 

in years 10.8 10.5 12.5** 11.9 10.8 10.5 12.4** 11.8 11.3 10.9 13.1 12.5 
Percent college 1 

graduate 14 94 18.4** 13,103 8.7 16.5** 15.6 19.8 15.6 30.2" 29.3 
Percent in f 

professional be d i 

speciality S 

occupations 4.5 3.0 7.8** 7.2 3.6 2.5 6.2** 6.2 11.6 8.3 17.5* 14.4 
Percent in j 

executive- 

occupations 90 92 16.4** 124 79 80 15.8** 11.0 124 20.8 19.8 22.4 
Percent speaks : i 

English well 2.1 775 46.7** 45.8 '280 27.6 48.8** 469 20.4 271 34.1* 38.5 
Percent U.S. : i i 

citizen 41.2 39.7 62.2 55.2 140.0 59.2 61.5 53.7 50.6 43.8 66.7 65.5 
Percent immi- ; : 

grated before i 

1970 73.3 725 84.5** 844 '720 72.4 84.3% 83.7 79.6 72.9 85.7* 89.1 
Average annual ^ ` : 

earnings 10208 9683  14708** 12650 9430 9225 13424** 11412 16110 13942 22451**^ 21155 


* Cuban-bom men and women aged 18-64 in thé State of Florida who worked at leest 20 hours per week and earned 
$500 or more in 1979. Data source: U.S. Bureau of the Census, Five Percent Public Use Microdata Sample. 
> Cities of Miami, including West Little River, and Hialeah. 
* Two-tail probability of differences between cols. I and III: <.05. 
** Two-tail probability of differences between' cols. I and III: <.01. 


those who have moved entirely out of the 
area. Nonresident enclave workers, for exam- 
ple, have the highest educational attainment 
and earnings of all groups, exceeding the 
figures for resident enclave workers by 1.7 
years „of education and $4500 of annual 
income respectively. These and most other 
differences between the two groups are 
significant at the .05 level. The subsamples of 
employees and self-employed are quite dis- 
tinct—with differences consistently favoring 
the latter. Nonetheless, within each employ- 
ment category, those who live outside the 
enclave but work in it exhibit Pupinesuty 
superior cháracteristics. 

Results in Table 1 suggest that the principal 
cleavage in education, occupation, and earn- 
ings is by place of residence rather than by 
place of work. For example, respondents who 
live in Miami/Hialeah but who work outside 
are no better off economically than enclave 
resident workers, and both groups have 
considerably lower earnings than everyone 
who lives outside. This pattern is not 
surprising once one becomes familiar with the 
socioeconomic characteristics of this metro- 


politan area: Hialeah is an industrial city and 
Miami is increasingly a service center, both 
cities having high concentrations of low- 
income, dilapidated housing around their 
core. Cuban employers, professionals, and 
skilled employees who work in these cities do 
not generally live there. Instead, they have 
increasingly moved toward middle-class sub- 
urbs in Southwest Dade County, North Miami 
Beach, and Broward County (Ft. Lauder- 
dale). The low-income housing surrounding 
business areas in the first two cities have 
become increasingly places of settlement for 
older refugees, the unskilled, and the newly 
arrived Mariel entrants (Boswell and Curtis 
1984; Sherrill 1987; Rieff 1987). 

‘This residential cleavage is captured in 
Sanders and Nee’s analysis and is erroneously 
attributed to differences in enclave economy 
participation. Contrary to the first hypothesis, 
living in an ethnic neighborhood is not the 
same as working in an enclave economy, and 
confusing both can lead to major errors of 
interpretation. The assumed overlap between 
residential and sociological definitions of 
enclaves must therefore be rejected as the 

| 
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basis for serious analyses: of causes and 
effects of these phenomena. 


Ethnic Enterprise: Means of Survival or 
Means of Ascent? 


As seen above, ethnic business has been 
painted in very different colors, ranging from 
: instruments of self-exploitation by owners 
and their families to vehicles for rapid 
economic progress. Results in Table 1 begin 
to suggest that the latter interpretation is more 
appropriate since the self-employed, regard- 
less of their place of residence, are consis- 
tently better off than salaried employees. This 
preliminary evidence is insufficient to test the 
-second hypothesis because differences in 
earnings may be due entirely to differences in 
human capital endowments between self- 
‘employed and employees. To demonstrate 
that entrepreneurship has a positive effect on 
immigrants’ economic attainment, we must 
control for other relevant factors. 

The ‘following analysis is based on vari- 
ables available in both data sets described 
above.. Although Cuban-owned enterprises 
concentrate in the municipalities of Miami 
and Hialeah, they are also: found throughout 
. Dade County (Miami SMSA). Networks 
tying these enterprises together are certainly 
not limited to those in the two central cities 
(Wilson and Martin 1982; Botifoll 1984). For 
this reason, 
analysis on immigrant entrepreneurs in the 
' entire, Miami metropolitan area. Conclusions 
identical to those reported below are reached, 
however, on the basis of analyses (not shown) 
restricting the definition of enclave to the two 
core cities. 

"The following analysis is based on samples 
of Cuban-born males aged 18 to 64 who had 
positive earnings at the time of the respective 
surveys— 1979 for the Census and. 1986 for 
the Mariel study. The reason for restricting 
the analysis to males is the very low numbers 
. of female entrepreneurs —none in the Mariel 
survey. This finding bears directly on the 
fourth hypothesis and will be discussed in 
connection with the latter. 

- In addition, there are problems of sample 
selectivity in both data sets. In the PUMS 


data, inclusion is limited to workers for whom ` 


there exist valid place-of-work codes. In the 


Mariel sample, the problem is more compli- . 


' cated because there is selectivity determined 
by inclusion/exclusion into the follow-up 


it is appropriate to base. the - 
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sample and then into the sample of working 
persons. The potential for selectivity bias in 
both instances is handled in the following: 
analysis through the indirect estimation proce- 
dures proposed by Heckman (1979) and. 
elaborated subsequently by Lee (1983), Greene 
(1981), and others. The procedure requires 
specification of a selection equation which 
predicts exclusion. from the sample on which 
the substantive equations of interest are, 
based. Selection equation results aré then 
used to compute an instrumental variable or 
*hazard rate" (Allison 1984) inserted as a 
predictor in the final equation. 

In the present instance, procedures em- 
ployed to estimate selectivity require some . 
discussion since the latter turns out to have 
nonnegligible effects in the substantive results : 
below. Determinants of labor force participa- 
tion vary by sex and it was therefore 
necessary to compute the estimating equations 
separately for males and .females. Inclusion 
into the 1980 Census workplace sample falls 
into the category labeled. by Berk and Ray 
(1982) "incidental" rather than deliberate 
sample selection. Because it is not apparent a, 
priori what criteria distinguish respondents 
for whom workplace information is available, . 
we decided to model the process on a broad 
set of individual characteristics which include 
respondent's age, education, knowledge of 
English, occupational status, year of arrival, 
family characteristics, and place of residence. 
Table 2 presents descriptions. of variable 
measurement for those in this equation and all 
subsequent analysis. Notice, that occupational 
variables— which refer to customary occupa- 
tion—are available even for.those for whom 
workplace data are absent. The model peer ; 
fication is as follows: 


WS1 = f (AGE, ED1215, ED16+, 
ENGLISH, PROF, EXECMGR, 
IMM7074, IMM75 + , MARSTAT, 
CHILDS, ENCL) . (D 


` The Beckman (1979) estimator for selai 
ity is based on probit regression. In a. 
subsequent article, L.F. Lee (1983) describes . 
a reformulation of this. general model which 
allows more general specifications of the 
criterion equations. In particular, logistic 
regression is commonly used instead of probit 
when it can be assumed that the disturbances . 
are bivariate. logistic. We assume that this is. 
the case in our census sample and make use of 


936 AMERICAN SOCIOLOGICAL REVIEW 
Table 2. Names and Measurement of Variables Used in the Analysis 





Census Mariel Sample 
Label Description Sample 1979 1983-86 
WSI 1 if included in workplace sample; 0 otherwise. x 
AGE Years. X X 
ED1215 and Dummy variables representing years of education completed; X 
ED16+ less than 12 years is the reference category. 
ENGLISH Respondent's subjective estimate of her/his speaking ability x 
coded J if well or moderately well: 0 otherwise. 
PROF and Dummy variables representing professional specialty and x 
EXECMGR executive-managerial occupations, respectively; lower-status 
occupstions are the excluded category. F 
IMM7074 and Dummy variables representing year of immigration; X 
IMM75 + immigration prior to 1970 is the reference category. 
MARSTAT 1 if married; 0 otherwise. X X 
CHILD5 Number of children aged 0-5 present. X 
ENCL 1 ìf respondent lives in the core enclave area (cities of Miami X 
and Hialeah); 0 otherwise. 
WS2 1 if included in the 1986 working sample; 0 otherwise. X 
EDUC Years of education completed. X X 
LASTRES Size-place of last residence in Cüba coded on a 5-point scale; X 
rural residence coded 0. 
KNOWENG Objective test of English comprehension administered in X 
1983 and ranging from 0 to 8 (highest). Scale reliability is 
= ,045. 
USSEI U.S. occupation in 1983, coded in Duncan's SEI scores. X 
UNEMP J if unemployed in 1983; 0 otherwise. X 
CHILDR Number of children aged 0—18 present. X X 
EARNINGS Logarithm of prior year earnings. X X 
PROFMGR; Dummy variables indicating professional-managerial, X 
TECHSALE; technical and sales, and craft occupations, respectively; semi- 
and CRAFT , and unskilled operatives are the omitted category. 
IMM6569 and Dummy variables representing year of immigration; X 
IMM70 + immigration prior to 1965 is the omitted category. 
WORKEXP Work experience defined as AGE — (EDUC + 6). X X 
WORK2 Quadra-ic term of WORKEXP. X X 
HOURS Logarithm of average number of weekly hours of X X 
employment during the prior year. 
CITIZEN J if U.S. citizen; 0 otherwise. X 
SELFEMP 1 if self-employed; 0 otherwise. X x 
OCCUP Occupaticn in 1986, coded in Duncan’s SEI scores. x 
ENCLAVE 1 if employed in the Cuban enclave; 0 otherwise. x X 
IMM65 1 if arrived after 1964; 0 otherwise. X 
CUBSEI Last occupation in Cuba, coded in Duncan's SEI scores. X 
URBAN 1 if born :n Havana; 0 otherwise. x 
ADULTS Number of persons 18 and over living in the same dwelling x 
in 1983. 





Note: Variables are arranged in order of their presentation in the text. Subsequent mention of the same variable in 
either sample is omitted. X indicates inclusion in tbe data set. 


logistic regression to compute the hazard rate illustrated by coefficients presented in Table 
for sample exclusion. In practice, it makes 3. 

little difference in this case which model is In both models, the probability of inclusion 
selected because results of probit and logistic in the 1980 workplace male sample is 
regressions are practically identical. This is significantly greater for older and married 
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Table 3. Maximum Likelihood Estimates of Logistic and Probit Equations Predicting Probability of Inclusion into 
Workplace Samples, Cuban-born Males and Females, 1979 








Selection mto Census Workplace Sample 

















Males Females 
Logistic Standard Probit Logistic Probit 

Predictor* Coefficient Error Coeff. S.E Coeff. S.E.  . Coeff. S.E 
AGE .002* .001 .003 002 . —.003* .001 —.004*  .001 
ED1215 .051 .030 .064 .038 .171* .031 .206* 037 
ED16+ 065- 045 .079 056 .119* ,048 .143* 058 
ENGLISH —-.007 .034 — .009 042  .—.056 .034 — .069 .041 
PROF — 042 .058 —.051 .072 | —.145* .067 —.169*  .078 
EXECMGR .059 .040 .073 .050 237* .056 292*  .069 
IMM7074 .096* .036 .119* .045 .077* .035 092" :.042 
IMM75 + —.301* .060 — .364* 071  — 269* .070 —.313*  .079 
MARSTAT .083* .035 .102* .043 .034 .029 041 .035 
CHILDS 041 .030 .052 037  —.157* .033 —.189* . .039 
ENCL .017 .028 .021 034 .038 028 047  ..033 
INTERCEPT 387 i .480 425 .520 
n 6182 7165 

* See table 2 for definition of variables. 

* p<.05. 


respondents and significantly less for the 
more recent arrivals. Among females, re- 
cency of immigration also has a strong 
negative effect on selectivity, but otherwise 
the pattern is different. Younger, educated, 
and executive-managerial-level women are 
more likely to be included in the sample, 
while those in professional occupations or 
with children at home are significantly less 


` so. The computation of the final instrumental 


variable obviously varies with the type of 
model selected. In the case of logistic 
regression, it consists of a straightforward 
transformation of the logit into an actual 
probability, deflected to signify exclusion 
from the final sample.* The resulting variable 
À represents the instantaneous probability of 


exclusion, conditional on being in the pool at ' 


risk: the larger the value, the greater the 

probability that the observation would be 

absent (Reimers 1985; Berk 1983, p. 391). 
The Mariel data set presents a more 


5 The formula, in this case, is: 


k 
X xi 
=e , 
k 
Z xB; 
ite 


where xD, is a vector of logistic coefficients times 
the relevant predictors (Berk and Ray 1982, p. 
388). i 


difficult problem because inclusion into the 
1986 working sample is contingent on the 
case having been retrieved in the wave-II 
survey. In the past, this double selectivity 
problem has been handled in the sociological 
literature by computation of a logistic equa- 
tion to estimate inclusion/exclusion into the 
broader sample, followed by insertion of the 
corresponding instruinental variable into the 
final predictive model (cf. Massey 1987). 
Statisticians regard this procedure as inappro- 
priate, however, because the introduction of a 
logistic correction factor can play havoc with 
the disturbance terms. 

An alternative is the estimation of a 
bivariate probit model which makes use of the 
simple probit equations to provide starting 
values for the computation of parameters. 
This model can be estimated with the 
LIMDEP statistical package which provides 
maximum likelihood estimates of parameters 
in the joint selectivity equations as well as 
those in the final least squares model (Greene 
1988). As in the census data, sample selection 
equations were computed separately for males 
and females. Selection into.the final working 
sample is viewed as an outcome of a set of 
ascriptive and achieved status criteria, with an 
emphasis on those brought from, the country 
of origin and on occupational characteristics 
at the time of the first survey. The predictive 
model is as follows: ' 


938 
WS2 = f (AGE, EDUC, MARSTAT, 


LASTRES, KNOWENG, USSEI, 
UNEMP, CHILDR).. (2) 


: Table 4 presents final parameter estimates 
for the male and female subsamples.® Results 
are similar across both sexes, indicating that 
older persons, those' coming from less urban 
backgrounds, and more fluent in English, 
were significantly more likely to be included 
in the sample. A tendency in the same 
direction is also apparent for married respon- 


` dents, although the corresponding coefficients 
: do not reach significance. The computation of 


X on the basis of the above procedure is a 


. complicated matter. Fortunately, LIMDEP 


l hypothesis, 


calculates the instrumental variable internally 
and inserts it into the appropriate substantive 
equations (Greene 1988, p. 21.10). 
Returning to the .test of .the second 
Table 5 presents results of 
regressing the naturat log of yearly earnings 
on human capital predictors plus self- 


„employment and A. The first and third 


columns of the table estimate effects for 
Cuban males who worked in the Miami 
SMSA in 1979 and 1986, respectively, and 
are thus relevant to assess the influence of 
enclave entrepreneurship on earnings. The 
second column estimates the same effects for 
Cubans who worked in the State of Florida 
but out of the Miami area; they allow us to 
establish whether any effect of self- 
employment is due to entrepreneurship per 
se, regardless of location, or whether it is 


- restricted to business ownership within the 


enclave. This comparison is restricted to the 
1979 PUMS data because, as noted above, all 


` Mariel entrants in the sample worked within 


the. Miami SMSA. The model specification 
also- varies somewhat because of the different 
availability and measurement of variables. 
The 1979 OLS earnings equation includes 
education, reported ability to speak English, 
time since immigration, marital and citizen- 
ship status, number of children, and the linear 
and quadratic terms of work experience. 


-Occupation is measured by dummy variables 


indicating different types of skills with semi- 


'and unskilled occupations being the reference 


category. The 1986 earnings equation in- 





$ The estimation procedure in this case devolves 
in practice into the probit model above because of 
a correlation of .96 between both endogenous 
variables. 
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cludes the same predictors with: the exception 
of time since immigration and citizenship 
status, which are constant in the Mariel 
sample, and knowledge of English which is 
measured by an objective scale rather than a 
self-report. Occupational dummies are also 
substituted in this equation by Duncan’s SEI 
scores. Table 2 describes the measurement of 
all variables. 

Regression coefficients in Table 5 are taken 
to the fifth significant digit and multiplied by 
100 to facilitate interpretation; their signifi- 
cance levels are in parentheses. Human 
capital predictors do a good job in accounting - 
for earnings in the census samples (columns I 
and ID) where practically all relevant coeffi- 
cients are statistically significant. Marital 
status and number of children also increase 
earnings significantly, while recency of arriv- 
al—relative to the earlier exile cohort— 
decreases them. Human capital effects be- 
come attenuated in the smaller Mariel sample. 
In this instance, it is occupational status plus . 
number of hours worked which have the 
determining effects on earnings. Statistically 
significant lambda coefficients are negative in 
both instances, indicating positive selectivity 
of those included in the respective samples. 
For our purposes, the most important coeffi- 
cients are those associated with self- 
employment. After controlling for selectivity 
and human capital variables, self-employment 
increases earnings among Cubans in Dade 
County by 8 percent in 1979 and by 24. 
percent in 1986, both effects being statisti- 
cally significant. Among those employed 
outside of the enclave area, however, the net 
effect of self-employment is negative and 
insignificant. 

These results lead to the rejection of 
hypothesis II which predicts the absence of a 
positive effect of enclave entrepreneurship on 
earnings. The evidence from these two 
independent samples leads to the opposite 
conclusion. There is nothing in the theoretical 
literature on enclaves which predicts absence 
of self-employment effects on earnings among 
isolated entrepreneurs. The economic advan- 
tage for proprietors in the general economy, 
documented by past research, would actually . 
suggest positive effects for the self-employed 
wherever located (Form 1985). For Cubans in 
Florida, however, the beneficial influence of 
entrepreneurship on earnings appears associ- 
ated exclusively with participation in the. 


Miami enclave economy. This result may be: 
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Table 4. Maximum Likelihood Estimates of Probit Equations Predicting Probability of Inclusion into Final Working 
Samples, Mariel Entrants, 1986 








a : Selectian into Working Sample 
Males A Females 
Predictor* Coefficient Standard Error Coeff. S.E. 
AGE .018* .006 .024* O11 
EDUC ~ 011 .019 .015 .034 
MARSTAT 203 .129 .335 214 
LASTRES — .078* .035 — .128* .058 
KNOWENG .072* .253 .070* .032 
USSEI .009 .006 .012 011 
UNEMP — .252 .189 —.285 326° 
CHILDR ~ 021 .067 — .026 .097 
INTERCEPT — 3.345 1.388 —-4.351 2.415 
Log-Likelihood — 313.23 — 106.78 
Chi Square . 79.985* 31.75* 
n 331 183 
` a See Table 2. 
* p«.05. 


interpreted as reinforcing the conclusion of than that for self-employed Cuban males in 


the positive economic effect of concentrated 
ethnic enterprise (Wilson and Martin 1982). 
Having shown the presence of this effect, it 
is useful to illustrate its magnitude. In 1979, 
according to the Census, average individual 
earnings for the employed male population of 


the area. Individual earnings for the latter 
group exceeded by a sizable margin average 
household incomes in Dade in the same year 
(U.S. Bureau of the Census 1983a). Among 
Mariel respondents, firms created by the 
self-employed are predictably very small, 92 


Dade County were $15,709 or $5,250 less percent of them having as sole employees the 


D 


Table 5. OLS Regressions Predicting Natural Log of Yearly Earnings, Cuban-born Men in Florida, 1979 and 1986 


D 








I. I. ni. 
Employed Men in Employed Men out Employed Mariel 
Predictors* ! Miami, 1979^ of Miami, 1979” Entrants, 1986° 
EDUC 2.395 . (.00)4 2.393 (.03) 568 (72) 
WORKEXP 2.430 (.00) 4.288 (.00) 1.067 (.50) 
WORK2 — .034 (.00) —.058 (.00) .040 (.16) 
HOURS 54.684 (.00) 58.647 (.00) 64.136 (.00) 
PROFMGR 38.791 (.00) 43.900 (.00) =i 5 
TECHSALE 21.223 (.00) 13.350 (.16) ef = 
CRAFT 11,559 (.01) 18.448 (.01) - — 
OCCUP — — — _ 685 (.03) 
ENGLISH 21.888 (.00) 12.438 (.13) = = 
KNOWENG - Ex — - —.417 (.81) 
MARSTAT 16.639 (.00) 23.975 (.00) 11.554 (.37) 
"CHILDR 3.705 (.03) 6.071 (.09) —2.125 (.76) 
IMM6569 —7.374 (.06) —1.167 (.34) — € 
IMM70 + — 14.076 (.00) —11.425 (.18) - — 
CITIZEN 8.870 (.01) 9.164 (17) = — 
SELFEMP 8.538 (.04) —2.430 (77) 23.950 (.05) 
À — 86.487 (.00) 171.363 (.31) — 89.306 (.02) 
` Intercept 425.362 234.963 335.276 i 
R? .353 .493 .379 
n 1969 817 213 
* See Table 2. 


> Miami SMSA. See text for explanation. 

* All Mariel respondents employed in the Miami SMSA. 

4 Unstandardized regression coefficients taken to the fifth significant digit and multiplied by 100. Alpha levels in 
parentheses. ' 
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owner and his spouse. Despite their unimpres- 
sive size, the annual average gross income of 
firm-owners in 1985-86 was $12,967 or 
. $2,724 more than for the total employed 
Mariel sample (p < .01). This gross 
difference is very close to the net percentual 
gain associated with self-employment in 
Table 4 which translates into $2,365 per year. 
These results indicate that enclave entrepre- 
neurship does not function as an alternative to 
destitution or as an instrument of self- 
exploitation, as suggested by Bonacich and 
others. The positive effect of self-employ- 
ment even among such a disadvantaged group 
as Mariel refugees reinforces this alternative 
conclusion. 


Enclave Employment: A Dead-end Path? 


This section examines the second version of 
the proposition about negative consequences 
of enclave participation. According to this 
view, ethnic entrepreneurs may obtain signif- 
icant economic rewards, but this gain is at the 
expense of their workers. An oft-repeated 
remark is that ethnic enterprises manage to 
survive and prosper because of the vulnerabil- 
ity of their labor force and its semipermanent 
. confinement in dead-end occupations (Sand- 
-ers and Nee 1987; Bonacich 1987). 
_A difficulty in testing this hypothesis with 
census data is the absence of the key variable 


defining enclave participation: ethnicity of : 


employers. Having established that ethnic 
place. of residence is not equivalent to enclave 
participation, residential data must be dis- 
carded as a proxy. Also unlike the Cuban 


self-employed in the Miami SMSA, who can. 


be confidently labeled enclave entrepreneurs, 
employees cannot be so easily classified 
because there are many more non-Cuban than 
Cuban firms in this metropolitan area. 

A partial way of circumventing this diffi- 
culty relies on knowledge about places of 
concentration of ethnic firms. As noted 
above, Cuban enterprises are located through- 
out Dade County but they tend to cluster in 
the cities of Miami and Hialeah. The 
probability that Cuban employees are enclave 
participants is thus much greater among those 
whose places of work are in these cities. To 
the extent that ethnic employment has a 
negative effect on earnings, we would expect 
it to be reflected in significant differences 
between those who work in the core enclave 
area— Miami and Hialeah—and those whose 
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places of employment are elsewhere in the 
State. In the 1986 Mariel sample, this 
difficulty does nct exist because enclave 
employment is directly operationalized as 
work in Cuban-owned firms. 

Results presented in Table 1 indicate that 
there is no gross disadvantage for working in 
the enclave since Cuban immigrants who are 
employed there but live elsewhere display 
above-average socioeconomic characteristics 
and earn higher wages. Additional results (not 
shown) also indicate no gross differences over 
the same set of variables when all enclave 
area employees are compared with those 
employed outside. However, a strict test of 
hypothesis III requires examining effects. of 
enclave participation after taking into account 
other predictors. Hypothesis III may be 
interpreted as predicting one of two effects: 
(1) enclave employment decreases earnings, 
after controlling ror other variables; (2) 
returns on human capital such as education; 
knowledge of English, and work experience 
are significantly diminished by participation ` 
in the enclave. The first prediction refers to 
the main effect of enclave employment; th 
second, to its interaction effects. ' 

These alternative predictions are handled 
by adding a dummy variable representing 
place of work to the OLS earnings regression 
for all Cuban-born adults in the State of 
Florida within sample specifications. Next, -a ` 
separate regression for the subsample of 
enclave employees is computed and compared 
with that for emplcyees outside of the area. 
Unlike self-employment, wage work is quite 
common for women both in and out of the 
enclave. It is thus possible to test effects of 
enclave employment among workers of both 
sexes. Because of well-known differences in 
patterns of earnings attainment, results are 
presented separately for men and women. 
Corrections for selectivity in this instance are 
based on computation of instrumental vari- 
ables on the basis of the model specifications 
discussed above, but estimated separately for 
each sex. . 

Table 6 presents results of adding the main 
effect of ENCLAVE employment (coded 1; 
others 0) to earnings equations for men and 
women employees in both data sets. The 
census sample includes all Cuban-born em- 
ployees aged 18 to 64 with positive earnings 
and valid workplace codes in Florida; the 
Mariel sample includes employees with the 
same characteristics in Dade County. The 
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Table 6. OLS Regressions Predicting Natural Log of Yearly Eamings, Cuban-bor Employees, 1979 and 1986 


I. State of Florida, 1979 





Predictors* 1. Women , 2. Men 
EDUC 761 (.10)* 2.554 
WORKEXP : 2.985 (.00) 2.955 
WORK2 -—.057 (.00) — 043 
HOURS 58.644 ` (.00) 56.836 
PROFMGR 15.188 (.00) 36.730 
TECHSALE — 8.105 (.06) 21.538 
CRAFT 8.515 (.19) 15.900 
OCCUP — — ~ 
ENGLISH 18.904 (.00) 11,230 
KNOWENG — — — 
MARSTAT 6.700 (.03) 15.972 
CHILDR _ —3.553 (.04) 4.422 
IMM6569 —2.159 (.39) —6.552 
IMM70 + — 6.793 (.08) — 13.063 
CITIZEN 9.859 (.00) 8.819 
ENCLAVE 10.868 (.00) —3.835 
À — 21.263 (.21) — 61.963 
Intercept 403.635 435.537 
m .266 .331 
n 2006 2169 

* Sec Table 2. 








II. Mariel Entrants, 1986 





3. Women 4. Men 
(.00)' 1.134 (.69) .499 (.76) 
(.0C) 1.281 (.58) 1.026 (.51) 
(.00) — .001 (.97) —.004 (.16) 
(0C) 35.312 (.14) 61.799 — (.00) 
(00) - - = = 
(.00) - - — = 
(.00) — — — — 
— 1.399 (.02) .646 (.04) 
(.00) = == = = 
— — 4.019 (.18) —.218 (.90) 
(.00) — 6.518 (.70) 14.193 (.26) 
(.00) 3.941 (.59) —.919 (.89) 
(.05) — — — — 
(.00) = = = = 
(.00) — -— = — 
(.15) 24.259 (.12) .341 (.97) 
(.03) — 18.086 (.49) — 86.897 (.02) 
561.527 318.053 
.259 .363 
72 213 





b Unstandardized regression coefficients taken to the fifth significant digit and multiplied by 100. Alpha levels i in 


parentheses. 


- significant and negative coefficients associ- 
ated with lambda in both male equations 
indicate again positive sample selectivity with 
respect to earnings. After controlling for this 
bias, all human capital predictors - have 
reliable effects on 1979 male earnings, as do 


most other exogenous variables. Recency of ' 


arrival again decreases earnings relative to the 
original exile cohort; while marriage, number 
of children, and U.S. citizenship significantly 

“increase them. The single nonreliable effect 
in this regression is precisely that associated 
with enclave employment. Although the sign 
of the coefficient is negative, as predicted, it 
is not statistically significant. Results for 
Mariel entrants in 1986 confirm this finding. 
In this instance, male earnings are determined 
by the strong positive effects of occupational 
status and hours worked. The effect of 
enclave employment is insignificant. 

Among women, however, participation in 
the enclave does have a significant effect. In 
the 1979 Census sample, enclave employment 
increases female earnings by a net 11 percent 
or approximately $691 annually after control- 
ling for sample selectivity, human capital, 


and family variables. Results for Mariel. 


women must be interpreted with caution 
_ given the small size of this subsample. 
Because of this, most coefficients do not 


reach statistical significance. It is important to 


- note, however, that the coefficient associated 


with enclave employment is one of the 
strongest and is again positive. Added to 
results from the much larger census sample, 
this result suggests that employment in 
enclave firms does not reduce earnings among 
Cuban women, but actually increases them. 
The principal conclusion to be derived from 
results in Table 6 is that the additive interpre- 
tation of hypothesis III does not hold. It is pos- 
sible, however, that while enclave employment 
does not have a negative main effect on earn- 
ings, it still affects their determination through 
interactions with other variables. To test this 
second version of the hypothesis, we regressed 
logged earnings on the same set of predictors - 
separately for enclave and nonenclave employ- ` 
ees. Table 7 presents results for both pooled 
samples; separate regressions for men and 
women in and out of the enclave are omitted 
because they reproduce, in all its essentials, the 
findings reported below." Sex (FEMALE) is: 
included as a control in the pooled regressions, 


7 The Mariel female sample is much too small to 
allow separate estimations for enclave and nonen- 
clave employees. Pooled results in this case reflect 
most closely those for males. 
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Table 7. OLS Regressions Predicting Natural Log of Yearly Earnings, Cuban Enclave and Nonenclave Workers, 


1979 and 1986 





I. Cuban-born in Florida, 1979 II. Mariel Entrants, 1986 


Predictors* 





1. Enclave Workers 2. Nonenclave Workers 3. Enclave Workers 4. Nonenclave Workers 

EDUC 1.263 (01) 2.657 (.00)* .461 (74) :718 (.62) 
WORKEXP 2.923 (.00) 2.841 (.00) .631 (.64) .224 . (.88) 
WORK2 — 469 (.00) — 485 (.00) —.020 (.38) — .025 (37) . 
HOURS : 53.056 (.00) 64.898 (.00)* 93.134 (.00) 49.339 (.00)* 
PROFMGR 26.184 (.00) 25.997 (.00) — — — — 
TECHSALE 10.239 (.01) 4.725 (.29) — — — — 
CRAFT 10.886 (.01) 12.650 (.00) — — — — 
OCCUP — — — — 635 (.03) 984 (.01) 
ENGLISH 16.932 (:00) 12.114 (.00) — — — — 
KNOWENG — — — — — 1.602 (.35) — 324 (.85) 
MARSTAT 13.296 (.00) 14.312 (.00) 8.636 (.38) 10.730 (.32) 
CHILDR 2.436 (.12) .710 (.66) 5.027 (.27) 3.345 (.54) 
IMM6569 —5.873 (:08) — 3.686 (.27) — — = — 
IMM70 + — 13.148 (.00) — 5,357 (.17) — — 7 — 
CITIZEN 11.502 (.00) 7.969 (.01) — — — 
FEMALE = — 30.593 (.00) — 40.816 (.00)* —4.019 (.76) — 26.990 (.04)* 
A 24.843 (.16) — 29.908 (.34) — 23.593 (31) —55.909 (.02)# 
Intercept 431.840 372.12 211.822 322.991 
R .327 .396 .465 370 
n 2074 2101 88 197 

* See Table 2. 


> Unstandardized regression coefficients taken to the fifth significant digit and multiplied by 100. Alpha levels in 


` parentheses. 


* Difference between enclave and nonenclave employees significant at .05 level. 


* Difference significant at .10 level. 


which allows us to evaluate its effect in both 
employment sectors. 

The first two columns of the table yield 
several significant differences in the census 
sample.? The positive effects of education and 
hours worked on earnings are higher among 
those employed outside of the enclave area. 
Women in the enclave, on the other hand, 
tend to receive significantly higher net 
earnings than those working outside. The 
lower payoff for hours worked and, espe- 
cially, for education among enclave employ- 
ees is consistent with the interactive interpre- 
tation of hypothesis IIL Each additional year 
of education yields a 3 percent increase in 
earnings in outside employment, but only 1 


5 Differences between regression coefficients 
are evaluated by t-tests which assume homogeneity 
of the residual variances. The estimate of the 
common residual is: 


2 


|n - Gay)? + Iy? - xy? | 
i Xx; d 2 
Syx= ee ei ieee 


n +m — 4 


See Edwards (1976, p. 108). 


percent in the enclave. The latter effect is also 
significant. The effects of other human capital 
predictors—occupation, work experience, 
knowledge of English, and citizenship acqui- 
sition—do not differ significantly across 
subsamples. The higher R? among nonenclave 
employees is due mainly to the much higher 
earnings disadvantage for women workers in, 
this sector. 

The longitudinal survey of Mariel entrants 
is superior to census data in its unambiguous 
identification of type of employment. In this 
case, it is not necessary to approximate 
enclave work indirectly through spatial loca- 
tion of firms since specific information exists 
on ethnicity of employers. As in past runs 
with this sample, occupation and hours 
worked are the significant determinants of 
earnings. Effects of occupation are. similar 
across employment sectors while those of 
hours reverse the pattern found in the 1979 
sample: hours of employment yield almost 
twice the payoff among Mariel entrants 
employed in the enclave than among those 
working outside. Finally, there is a large . 
difference in the influence of sex on earnings: — 
there is no significant gender gap among 
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enclave employees, but Cuban women in 
outside employment receive net earnings 
which are about 27 percent lower than 
comparable males. This difference confirms 
that found among the pre-1980 Cuban popu- 
lation. The meaning of this effect will be 
. examined in the last section. 


In sum, despite the volatility of effects due 


to small sample sizes, results for Mariel 
refugees support the overall pattern encoun- 
tered in the census data. There is little 
evidence of a consistent earnings handicap 
against employees of enclave firms either 
direct or indirect. With the exception of a 
partial attenuation in the effect of education in 


1979, there is no indication of a pattern of 
overall disadvantage or lower human capital : 


returns among employees in enclave firms. 
On the basis of these results, bypotbenis Ill is 
rejected. 


Entrepreneurship: An Achieved Status? 


Despite the absence of wage discrimination 
against ethnic firm employees, it is clear that 
‘the main advantages for participation in this 
' sector accrue to entrepreneurs. Among both 
pre- and post-1980 Cubans, the self-employed 
receive significantly higher incomes than 
salaried workers and the difference persists 
after equalizing statistically for individual 
- human capital. The pattern is similar to that 
found for manual proprietors in the general 
economy (Form 1985) although, in the 
present case, net effects of entrepreneurship 
are limited to enclave firms. The obvious 
final question is: what determines differential 
access to self-employment in Miami. Since 


'thé question refers to within-group differ-. 


ences, contextual variables associated with 
the broader society are held constant for 
immigrants of the same nationality and 
< cohort. In this situation, the most common 
explanation advanced by economists who 
` have dealt with the matter is the differential 
skills of individual immigrants, as reflected in 
background variables such as education, work 


experience, and knowledge of English. To the : 


contrary, historical and ethnographic accounts 
' have emphasized the role of ascriptive 
variables — gender, kinship and family compo- 
sition—in the emergence of ethnic enterprise 


(Rischin 1962; Bonacich and Modell 1980; 


‘Botifoll 1984). 
.' Statistical results presented below are 
limited to Cuban-born men. As seen in the 
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analvsis of hypothesis I, there were no 
women entrepreneurs in the Mariel sample 
and too few in the 1979*Census sample to 
allow for a-test of self-employment effects. 
This situation, of course, bears directly on 
hypothesis IV since it indicates that thie most 
ascriptive of.factors—gender—is the most . 
powerful predictor of ethnic entrepreneurship. 
For example, when this variable is added as a 
predictor of self-employment for all Cuban- 


‘born workers in the census sample, its effect 


eclipses those of all other variables tripling, +. 
the size of the next-largest coefficient. For 
this reason—and after noting the decisive 
effect of sex differences—we focus on the 
variation in entrepreneurship among males. 
The first column of Table 8 presents ` 
maximum likelihood estimates of logistic 
regressions ‘in which the probability of 
self-employment among Cuban men working 
in Dade County in 1979: is modeled as a: 
function of human capital predictors and 
family variables. Occupation is not exoge- 
nous in this instance because, of high. 
collinearity between the executive-managerial: 


- category and self-employment. Immigration 


is dichotomized into: pre- and ‘post-1965 .: 
(IMM65 + ) arrivals to capture the distinction 
between the initial upper- and upper- 
middle-class exodus from Cuba and subse- 
quent arrivals. The literature on Cuban 
immigration. has consistently identified the 
edrlier exiles as the most entrepreneurial 
group (Pedraza-Bailey 1985b; Boswell and 
Curtis 1984). Other variables pertinent to 
hypothesis IV include marital status, coded as 
above, and the respondent's number of. 
children, ages 0 to 18.9 

Logistic coefficients represent the incremen- 
tal net effect of each predictor on the 
logarithm of the odds-ratio. of self- - 
employment. Significant coefficients are trans- . 
formed. into net probabilities following Pe- . 
tersen (1985). Figures in the second column, 
labelled AP, represent net changes in the 
probability of self-employment associated 
with a unit change of reliable predictors. As 
seen in the table, there are only four of these.. 
Education and. the linear and quadratic 

° Selectivity corrections are excluded in this’ 
instance because of the model (logistic) and the 
possible distortions that their introduction would 
create. On a purely empirical basis, the instrumen- 
tal variables employed aboye lack a significant- 
effect when introduced in either equation. ' 
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Table 8. Regressions Describing Effects on the Log Odds and Proportional Hazard Rates of Self-Employment, 
Cuban-born Men in the Miami SMSA, 1979 and 1986 


Employed Cuban Males, 1979 


LL II. AP* 
EDUC 048 (2.68)* .007 
WORKEXP .070 (3.05)* .010 
WORK2 — .001 (2.02)* — 000 
ENGLISH —.166 . (1.03) 
KNOWENG — — 
CITIZEN . 267 (1.89) 
IMM65 + —.164 (1.20) 
CUBSEI — — 
URBAN = — 
MARSTAT 572 (2.71)* .096 
CHILDR .156 (2.72)* .023 
ADULTS — = 
CEPT — 4,056 

% Self-employed 17.5 
n 1969 
Model chi-square 

* See Table 2. 








Employed Male Entrants 








Follow-up, 1986 Full Sample, 1983-86 
IU. L IV. APS v. gt 
—.027' (67) — .066 (1.42) 
006 (11) .073 (1.53) 
.000 — (.05) —.002  . (2.03) 
.033  (.66) .040 (.46) 
—.004 (50 —.008 (.72) 
231 (2.74)* .02 .046 (.35) 
.769 (2.10)* 14 642 (3.98)* 
312 (2.14)* .03 .359 (3.05)* 
—.352 (2.08* | —.08 —.355 (6.02)* 
—1.314 es E 
10.8 18.3 
213 331 
15.40, 9 d f.* 


> Logistic regression coefficients. Coefficient/standard error ratio in parentheses. 


* Net change in probability of self-employment per unit change of independent variable. 
d Partial likelihood coefficients. Chi-square values in parentheses. 


* p<.05. 


indicators of work experience do have 
significant effects on self-employment in a 
manner similar to their effects on earnings. 
However, neither knowledge of English nor 
citizenship status has much influence on 
‘enclave entrepreneurship. The presumed cen- 
tral role of year of immigration is also absent 
from, these results since Cubans coming 
during the “golden exile" period (1959-64) 
are not significantly more likely to be 
self-employed than subsequent arrivals. 

The remaining reliable coefficients are 
associated with the two family variables. 
Regardless of human capital endowment, 


married Cubans are significantly more likely | 


to be entrepreneurs, and the probability is 
further enhanced if they have children. Based 
on figures in column II, a Cuban husband and 
father of four is about 20 percent more likely 
to be self-employed than a single male of 
equal education and work experience. These 
findings support hypothesis IV insofar as the 
latter predicts a significant effect of family 
and kinship factors among determinants of 
entrepreneurship. 

Further support for this hypothesis is 
provided by the last columns of Table 8 
which present results for the Mariel entrant 
sample. The availability of more detailed 
information from this survey allows modeling 


the probability of self-employment on a fuller 
set of predictors. In addition to those 
described above, occupation and urban origin 
in Cuba and number of adult coresidents in 
the United States are included. The rationale 
for inclusion of Cuban occupation is that 
higher occupational skills in the country of 
origin may reflect additional human capital 
translatable into higher potential for entrepre- 
neurship. Place of origin is included on the 
assumption that refugees who come from 
Havana — Cuba's capital and only large city — 
may bring qualities distinct from those 
coming from smaller towns and rural areas. 
Finally, adult coresidents is included as a 
rough indicator of household size which may 
also affect individual economic strategies. !9 
The Mariel data contain sufficient informa- 
tion to estimate the time in months from 
arrival to self-employment in both sample 
waves (1983 and 1986). This available 


10 This possibility was suggested by results of 
the 1980 Census which indicated that average 
individual earnings of Cubans were not much 
higher than those of other Spanish-origin groups, 
but that household incomes were. The average 
Cuban household contained larger proportions of 
adult earners leading to sizable ponies incomes. 
See Perez (1986). 
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information enables us to make use of event 
history methods which utilize the full infor- 
mation for every case in the sample. The 
dependent variable in this analysis is the 
hazard rate ~ —, — — of self-employment over 
time. The proportional hazards model defines 
individual outcomes as: 


log à = a(t) + B, Xi. (3) 


Where a(t) can be any function of time and is 
not estimated, and B;X; is a vector of 
regression parameters and corresponding ex- 
ogenous variables. This model is estimated 
using Cox’s (1972) partial likelihood method 
(Allison 1984; Kalbfleisch and Prentice 1980) 
available in the SAS PHGLM routine. In 
addition, logistic regression coefficients mod- 
eling the probability of male self-employment 
in the follow-up sample were estimated for 
comparison. ` 

The third and fourth columns of Table 8 
present results of logistic regressions of 
self-employment for the male follow-up 
sample and associated probability levels. The 
last column presents partial likelihood esti- 
mates of equation 3 above. Results of the 

analysis are similar regardless of the method 
and sample specification. Among human 
capital predictors, only urban origin has a 
reliable effect on self-employment, increasing 
the probability of the latter by about 2 percent 
in the follow-up sample; its effect in the 
proportional hazards model is, however, 
insignificant. Neither education, work experi- 
ence, nor knowledge of English affects 
chances for entrepreneurship in either case. In 
both regressions, major effects are associated 
instead with the family variables. As in the 
1979 sample, marital status and number .of 
children have strong positive effects on male 
entrepreneurship. Based on column IV, a 
Mariel entrant husband and father of four is 
about 25 percent more likely to be self- 
employed than a single male with the same 
human capital endowment. 

Number of adult coresidents significantly 
decreases the probability of entrepreneurship. 
This result runs contrary to the notion that 
extended households facilitate ethnic enter- 
prise by providing a more abundant labor 
supply. In the Miami enclave, it is the nuclear 
family which seems better poised to take 
advantage of independent business opportuni- 
ties; households with large numbers of adults 
are actually inimical to them. In the follow-up 
sample, for instance, each additional adult 
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coresident "costs" 3 percent in the probability 
of self-employment, net of other factors. 

Results of this analysis lend strong support 
to hypothesis IV. Clearly, it is not human 
capital variables, but ascriptive and family- 
related factors — gender, marriage, and parent- 
hood— which represent the prime determi- 
nants of enclave entrepreneurship. The 
négative effect of adult coresidents supports 
this conclusion indirectly by indicating the 
importance of different types of household 
composition in the process. Enclave business 
is not just the business of males, but primarily 
of married males. However, it is not the sheer 
abundance of hands, but the dynamics 
associated with intact nuclear family units 
which appears to provide the main thrust for 
the emergence of ethnic firms. 


CONCLUSION: THE INTERNAL 
DYNAMICS OF ENCLAVES 


The preceding analysis has revealed some. 
important features of enclave economies as 
they manifest themselves among Cubans in 
South Florida. First, results indicate that there 
is no necessary overlap between where 
members of an entrepreneurial minority live 
and where they work and, hence, operational- 
izing the concept of enclave on the basis of 
residential location may lead to serious error. 
Second, they show that there is a significant 
payoff for ethnic enterprise; enclaves are no 
mere ghettos where foreigners huddle to- 
gether eking out a living from marginal 
activities. Instead, ethnic enterprise can be an 
effective avenue for economic mobility al- 
though, in the case of Cubans, this effect is 
limited to males. Females do not receive any 
direct gain from entrepreneurship because so 
few of them are self-employed. However, 
additional results indicate that there is a gain 
for women enclave workers in terms of higher 
net earnings. More generally, enclave enter- 
prises do-not treat their employees any worse . 
than do outside firms, in terms of either gross 
earnings or-net payoff on the workers' human 
capital. 

These conclusions apply to both pre-1980 
Cubans and the more recent Mariel entrants 
and are thus generalizable to the entire 
minority in its principal area of concentration. 
Concerning differences between both groups, 
results point to the more precarious economic 
condition of Mariel refugees and the relative 
lack of returns so far on their education and 
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other skills. In 1983, 26 percent of Mariel 
households were living in poverty and the 
unemployment rate was 27 percent or three 
. times the figure among the total Cuban 
- population. By 1986, these and other indica- 
tors had improved considerably but they still 
lagged significantly behind. Despite these 
handicaps, which. continue to reflect the 
difficult process of incorporation of Mariel 
refugees into their new environment, they 
have joined the ethnic economy in large 
numbers and with consequences similar to 
those experienced by earlier arrivals. In 1986, 
34 percent of gainfully employed Mariel 
respondents (excluding the self-employed) 
were working for Cuban-owned firms. Self- 
employment rates among the entrants were 
not very different from those achieved by the 
pre-1980 Cuban population and had a similar 
: economic payoff. 

: The last question posed to these data 
, concerns determinants of enclave entrepreneur- 
Ship. The results for both pre- and post-1980 
samples are again quite similar. They indicate 
that, while education and work experience 
may contribute to the process, ethnic enter- 
prise is, to a large extent, a family affair. Men 
claim ownership of the businesses, but it is 
frequently couples that start and consolidate 
them. The advantage that family men have for 
business start-ups reflects the collaboration of 
- their wives and, possibly, their children, 


which is clearly superior to the contribution of | 


other coresidents. - 

Other quantitative and ethnographic re- 
search suggests that women played a central 
. role in the development of the Miami enclave 
through the provision of labor and of savings 
with which to capitalize fledgling family 
enterprises. In a separate analysis of 1979 
Census data, Perez (1986) reports that relative 
to the U.S. and Spanish-origin female popu- 
lations, Cuban women were 7 percent more 
likely.to be employed full-time year-around. 
..- Cuban women, 16 years of age or older with 
spouse present, had a labor force participation 
rate of 59 percent, 10 points higher than the 
corresponding Spanish-origin and total U.S. 
populations. High labor force participation 
among female refugees created a ready pool 
. of labor linked to enclave employers by bonds 
of kinship and social solidarity. This pattern 
' helps explain why, in Perez’s (1986, p. 17) 
words, 


there are many firms within the enclave with 
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workforces that are predominantly female and 

ethnically homogenous . . . there is primacy in 
: Miami's economy, and especially within the 

enclave, of employment sectors that usually 
employ relatively large numbers of women: 
retail sales, services, and non-durable manufac- 
turing. 

Anthropological studies conducted in Mi- 
ami also point toward the contribution made 
by Cuban women to family enterprise. 
Among early exile cohorts, a common pattern 


' involved formerly middle-class and occupa- 


tionally inactive wives who entered the labor 
market to protect their families' living 
standards. In some instances, female earnings 
supported. their husbands while they learned 
English and took the first steps . toward 
entrepreneurship; in a second variant, those 
earnings capitalized family enterprises; in a 
third, wives and other kin constituted the 
first, unpaid labor force of these fledgling 
businesses (Fernández-Kelly and García 1989; ` 
Stepick 1989). Experiences of an informant in 
one of these studies offer a glimpse of the 
process: 
Something had to be done to keep the family 
together. So I looked around and finally founda _ 
job in a shirt factory in Hialeah. My husband 
joined a childhood friend and got a loan to start 
an export-import business. All the time they 
were building the firm, I was sewing. . . . We 
wouldn't have been able to pay the bills without 
the money I brought in. (Fernández-Kelly and 
García 1988, p. 24) 


The close association of women with the 
development of enclave enterprise and the 
kinship ties linking them to entrepreneurs : 
may help explain why net earnings of Cuban 
women in enclave employment tend to be 
higher than in outside firms. A second pattern 


. suggested by the Fernández-Kelly and García 


study is the withdrawal of Cuban women 
from the labor force once the family enter- 
prise has attained sufficient profitability. The 
economic advancement of the family on the 
basis of the businesses which husbands 
formally claim as "theirs" constitute, for 
many women, the payoff for earlier stints of - 


. wage-work and the basis for continued 


adherence to traditional values. In the opinion 
of another informant: 


There’s no reason for women not to earn a living 
when it's necessary. . . . But I also tell my 
daughters that the strength of a family rest on the 
intelligence and work of women. It is foolish to 
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give up your place as a mother and wife only to 

go take orders from men who aren't even a part 


of your family. (Fernández- Kelly and .García 
1989, p. 26) 


Although additional research is required, 
the basic contours of the éthnic economy in 
Miami seem by now clear. Cubans relied on 
the kinship bonds and on the traditional 
sexual division of labor to accumulate re- 
sources for launching independent businesses. 
As heads of households and of firms, men 
probably benefited more from the process; but 
women have also gained from relatively 
higher earnings in the enclave and from 
sharing in the collective economic advance- 
ment of their families. While future research 
may qualify these conclusions, enclave enter- 
prise is not the product of isolated individual 
achievement, but rather one deeply embedded 
in the social networks of the ethnic commu- 
nity and the family. 
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IMMIGRANT ENTREPRENEURSHIP: 
EFFECTS OF ETHNIC MARKET SIZE AND ISOLATED 
LABOR POOL* 


M.D.R. EVANS 
Australian National University 


Why is business ownership common in some immigrant groups, but rare in others? 
Building on organizational ecology, it, may be argued that the immigrant group 
acts as an economic niche for small business. Large groups provide a more 
favorable niche for ethnic entrepreneurs than small groups, because large groups 
provide large specialized ethnic markets. Groups whose members are not fluent in 
the dominant language provide more favorable niches for immigrant entrepre- 
neurs, because they provide a linguistically isolated labor poo: whose skills can be 
more efficiently tapped by coethnic rather than majority group entrepreneurs. 
Because of its large and: heterogeneous immigrant population, Australia is a 
particularly appropriate site for testing these hypotheses. With a one-percent 
Public Use Sample of the 1981 Australian Census, a multilevel logistic regression 
model predicts individuals" probabilities of business ownership as a function of the 
size of the group (ethnic market size), the percent of the group who speak English 
badly (the linguistically isolated labor pool), and the individual's own language ' 


skills and human capital characteristics. The results support both hypotheses. 


INTRODUCTION 


. Great diversity in the socioeconomic fates of 
immigrant groups poses a persistent puzzle: 
why do some gain good jobs and high 
incomes while others do not? Much of this 
diversity reflects differences in education and 
work experience, findings that hold in many 
countries including Australia, Canada, Israel; 
the United Kingdom, and the United States 

"(e.g., Boyd, Featherman, and Matras 1980; 
Evans and Kelley 1986; Neidert and Tienda 
1984). Yet human capital does not fully 
account for immigrants' socioeconomic diver- 
sity. For example, fluency in the host 
country's language matters much more to 
occupational status and income in some 
groups than in others (Evans 1987; Kossoudji 
1986; McManus, Gould, and Welch 1983). 


; * Earlier versions of portions of this paper were 
presented to the August 1987 meeting of the ISA's 
Research Committee on Social Stratification and 
Mobility, and to the August 1989 meeting of the 
American Sociological Association. This research 
was supported by internal funds of the Australian 
National University and by NICHD Grant no, 
T32-HD07064—08. For insightful comments and 
`- remorseless critiques, I thank Howard Aldrich, 
Glenn Carroll, Rober W. Hodge, Dennis P: 
Hogan, Jonathan Kelley, Donald Benin and 
Roger iom 
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. The development of partially self-contained 
markets for goods. services, and labor is 
another potentially important source of socio- 
economic diversity. For example, Cuban 
immigrants in Miami established a thriving 
Spanish-speaking enclave economy that of- 
fers entrepreneurs substantial profits . and 
offers employees a relatively benign "work 
environment with the security and prospects 
for advancement typical of primary labor 
markets (Wilson and Portes 1980). This 
suggests that a strong small business commu- 
nity can be a valuable resource for immi- 
grants. Yet some immigrant groups, such as 
West Indians in Britain, have few entrepre- 


.neurs and no signs of an emerging ethnic 


economy. In this paper, I develop a theory 
about some ecological factors that encourage 
(or hinder) immigrants' setting up their own 
businesses. . 
Small business ownership formed an impor- 
tant strategy in immigrants’ adaptation to 
industrializing societies (e.g., Glazer and 
Moynihan 1963), but for a time was thought 
to be outmoded in mature industrial and 
postindustrial societies, researchers viewing 
the petit bourgeoisie as a class on the road to 
extinction. As a result, research concentrated 
on the prewar period (e.g., Bonacich and 
Modell 1980; Light 1972). But more recently, 
it has become clear that small business is not 
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vanishing (Granovetter 1984), and research 
on immigrant small business has blossomed 
(e.g., Min 1988; Waldinger, Ward, and 
Aldrich 1985; Werbner 1984). 


Why, then, are entrepreneurs much more : 


common in some immigrant groups than 
others? Prior research points to potential 


business owners’ human capital (Reeves and ' 


Ward 1984, p. 134; Wilson and Portes 1980; 
Portes and Stepick 1985). This paper shows 
that human capital matters but is not a 
complete explanation. Instead, I show’ that 
characteristics of the immigrant group itself 
are also important: the size of the ethnic 
market and the size of the linguistically 
isolated labor pool. These are key resources 
defining an ecological niche in which the 
small businesses crucial to a partially indepen: 


` dent ethnic economy can flourish. 


The Organizational Ecology of 
Ethnic Markets 


The new population ecology of organizations 
can help explain differences among immi- 
grant groups in business ownership. An 
organizational environment consists of differ- 
entiated niches, each with a carrying capacity 
that limits how many organizations of a 
particular kind it can support (e.g., Freeman 
and Hannan 1983; Hawley 1984). A niche’s 
carrying capacity is set by the amount of 
resources available (e.g., Aldrich and Pfeffer 
1976; Nielsen and Hannan 1977). For exam- 
ple, the number of potential customers with 
preferences for distinctive ethnic goods de- 
fines the carrying capacity of the niche for 
sellers of such goods. Competition is crucial: 
organizations better able to exploit the 
resources in a given niche will proliferate 


there at the expense of other types of: 


organizations (e.g., Carroll 1985; Hannan and 
Freeman 1977). 

Linguistically bounded groups -of immi- 
grants can form niches for potential entrepre- 
neurs, since language is one of -the most 
common bases of ethnic distinctiveness (e.g., 
Edwards 1984; Robinson 1985; Stevens ánd 
Swicegood 1987). Two important’ resources 
in these niches are the size of the group (the 
potential market) and the linguistic isolation 


of the labor force. The key idea is that ethnic - 


entrepreneurs enjoy an advantage over poten- 
tial competitors outside the group both (1) in 


. tapping the preferences of consumers in the 


groups and (2) in tapping the -skills of 
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codiinis workers who are not fluent in the 
host country’s language. In other words, 


- ethnicity can carve out-economic niches that 
„foster immigrant entrepreneurship. 


Group :size and potential market.. The 
ethnic entrepreneur has an advantage in 
awareness of his or her coethnics’ consumer 
preferences, for example in food, foreign 
language newspapers, or approaches to medi- 
cal treatment (e.g., Aldrich, Cater, Jones, 
McEvoy, and Velleman 1985; Light 1972; 
Pescasolido 1986). Thus, the ethnic entrepre- . 
neur's information costs in ascertaining con-' 
sumer preferences are lower than the costs to 
potential competitors from . “outside.” This 


‘does not make it impossible for majority 


group entrepreneurs to compete in markets 
with distinctive ethnic tastes, but does make it ' 
more costly for them than for "insiders"^ (and 
hence, not usually worth their while unless 
the immigrant group is outstandingly afflu- 
ent). Further, immigrants are often at a. 


disadvantage in transactions with outsiders. 


because meanings are not fully shared (Lad- 
bury 1984; Werbner 1984), but at no extra 
cost, ethnic businesses can offer immigrants ` 


.the comfort and -security of conducting . 


transactions in their own language and 
culture. Indeed, ethnic shops often form the 
focus of. immigrant community life (e.g., 
Burnley 1982; Huber 1977; Tamis 1985). 
Thus, immigrants have am incentive to shop 
within the ethnic market, and coethnic 
entrepreneurs have a competitive advantage i in 


_ seeking the group's custom. 


For.these reasons, the larger the immigrant . 
group, the more favorable-the niche for ethnic 
entrepreneurs since their potential market is. 
larger. Indeed, immigrant business owners 


. perceive their ethnic markets as important to 


success (Nowikowski 1984) while, con- 


_versely, the small size of some immigrant. 


groups has been’ seen as inhibiting business 
ownership (Reeves and Ward 1984, p. 138). 
The effects of group size on socioeconomic 
outcomes are more controversial, but at least’. 
some prior research suggests that it can be, 
advantageous to belong to a group large 
enough to support a somewhat independent 
market (e.g., Borjas 1986; Lieberson 1390), 

In sum: i 


Group size hypothesis: The larger an TR 
group, the more likely are its’ members: to 
establish their own businesses. 


Isolated labor pool. The linguistically 
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isolated labor pool of workers whose skills 
can be tapped more profitably by a coethnic 
entrepreneur is another key resource of an 
ethnic niche. Linguistically isolated workers 
(those not fluent in the dominant language) 
get worse jobs and lower incomes than do 
other workers who are fluent but otherwise 
have similar human capital characteristics 
(e.g., Evans 1987; Kossoudji 1986; Mc- 
Manus et al. 1983). If that impairment comes 
about because workers' lack of fluency 
imposes a cost on majority group employers 
(e.g., because it takes longer to convey 


orders, or because more mistakes are made' 


because of faulty communication), then a 
coethnic entrepreneur who can readily com- 
municate with them bears no such cost. As a 
result, the ethnic capitalist can profit by hiring 
coethnic workers, paying them more than the 
broader market offers but — at least initially — 
less than majority group workers with equiv- 
alent skills earn in the broader market. 
Workers with language difficulties can thus 
earn more working for a coethnic; in some 
instances their pay may be a pittance, but it 
will be a bigger pittance than they could get in 
the broader market. Moreover, ethnic entre- 
preneurs profit by paying slightly less than the 
broader market price for their workers' skills. 
In addition, ethnic employers can rely on 
continuity in their labor force because work- 
ers who immigrate as adults acquire skills in 
the new language only slowly (Evans 1986) 
and so have a continuing incentive to work 
for a coethnic. Thus, workers' lack of fluency 
in the host country's language creates a 
favorable niche for ethnic entrepreneurs. In 
sum: 


Isolated labor pool hypothesis: The greater the 
percent of adults in an immigrant group who are 
not fluent in the host country's language, the 
more likely are group member to be entrepre- 
neurs. 


The group size hypothesis and isolated labor 
pool hypothesis focus on different aspects of 
immigrant groups. There are some large 
groups in which a very small fraction are not 
fluent in the host country’s language, and 
other large groups in which many are not 
fluent (e.g., respectively, the Dutch and 
Greeks in Australia). At the other end of the 
scale, there are tiny groups with few members 
not fluent in the host country’s language an 
other tiny groups with most members not 
fluent (e.g., respectively, Finns and Cambo- 
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dians in Australia). As will be seen, both 
group size and the percent of adults not fluent 
in the host country's language have indepen- 
dent effects on business ownership. 


Other Factors- 


The discussion thus far focuses on sources of 
group differences, but in assessing them it is 
important to control for individual character- 
istics as well. Individuals’ language skills 
might also influence who within the group 
becomes an entrepreneur. Immigrants fluent 
in the host country’s language can deal more 
readily with organizations such as suppliers 
and banks outside the ethnic market, so it is 
plausible that fluent people should be more 
likely to own businesses. More tentatively, it 
may also be that immigrants who continue to 
use their native tongue at home have closer 
ties to their ethnic community and so are 
better positioned to take advantage of ethnic 
markets and ethnic labor. So they too should 
have a higher probability of business owner- 
ship. I control for both of these in the 
analysis. 

In addition, immigrants’ human capital 
should affect their chances for business 
ownership (e.g., Reeves and Ward 1984), 
and so I also control for human capital in the 
analysis. The literature tying business owner- 
ship into the status attainment framework 
suggests that education should depress the 
likelihood of business ownership, whereas 
occupational status and labor force experience 
should raise it (e.g., Fligstein, Hicks, and 
Morgan 1983; Robinson and Kelley 1979; 
Wright and Perrone 1977). Occupational 
status reflects the content and skill demands 
of jobs, and people often train for and work in 
occupations as employees for some years 
before establishing their own businesses 
(e.g., automechanics, bricklayers, carpenters, 
lawyers). In analyzing immigrants, experi- 
ence in the "host" country (the country to 
which immigrants have come) should be 
distinguished from foreign labor force experi- 
ence. The former is likely to be more 
important both for career reasons— notably 
the opportunity to build up personal savings 
and local business contacts (Werbner 1984) — 
and because time in the host country matters 
to many adaptations (e.g., Kelley and McAI- 
lister 1982; Massey 1986; Pescasolido 1986). 
Immigrants who work longer in their native 
countries before migrating tend to have worse 
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socioeconomic outcomes than one would 
otherwise expect (e.g., Evans and Kelley 
1986), and so it seems probable that they will 
be somewhat less likely to own businesses. 


AUSTRALIA AS A RESEARCH SITE 


It is desirable to test these hypotheses in a 
country containing a wide variety of immi- 
grant groups, in part to reduce the risk of 
spurious correlation. For example, if a group 
with many entrepreneurs differ both in 
language skills and in size, then it is 
impossible to compare the impact of size and 
language fluency; unless a variety of immi- 
grants groups are present, one cannot sort out 
these effects. Australia has a large and 
heterogeneous immigrant population: suitable 
data for this analysis are available on more 
than 40 immigrant groups. 

Australia's immigrant population is now 
very diverse. Until World War I, the 
overwhelming majority of immigrants came 
from the British Isles. They shared language, 
culture, and many social and political institu- 
tions with the Australian-born, so they had no 
separate ethnic market or linguistically iso- 
lated labor pool. Australian immigration 
policy strongly favored them, in part because 
of ethnic hostility toward other groups and in 
part because of working class fears of wage 
competition, particularly from Asians (Yar- 


wood and Knowling 1982; Yong 1977). 


But then elite opinion began to see Australia's 
population as too small and slowly growing to 
sustain rapid economic growth, and saw 
immigration as fueling economic expansion. 
Moreover, the war frightened people badly, and 
Australia’s small population came to be seen as 
a grave military weakness. Politicians decided 
that Australia must "populate or perish" and so 
launched a vigorous campaign recruiting set- 
tlers, initially from Britain, but later from a 
broad range of countries (e.g., Martin 1978; 
Yarwood and Knowling 1982). 

This new non-Anglophone migration began 
with a small flow from Northwestern Europe 
and-Eastern Europe and much larger influx 
from the Mediterranean region (e.g., Jupp 
1966). A few Mediterranean immigrants 
arrived even before the war and many of them 
established small businesses (Price 1963). 
After the racist "White Australia" immigra- 
tion policy was rescinded in 1966, highly 
skilled Asians began a migration stream that 


grew steadily (Martin 1978), then expanded - 
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suddenly as many Indochinese refugees were 
taken in the late 1970s and early 1980s (Price 
1986; Viviani 1984). Thus, contemporary 
Australia rather: suddenly finds itself a 
polyethnic society encompassing a variety of 
linguistic minorities. 


DATA 


The one-percent public use sample from the 
1981 Australian census provides the individual- 
level data for this study (Australian Bureau of 
the Census 1983a). Population coverage in 
this census is nearly complete, with an 
underenumeration rate of 1.9 percent (Austra- 
lian Bureau of Statistics 1983b), and stan- 
dards of data preparation in the Australian 
Bureau of Statistics are high. Among other 
measures to protect confidentiality, the sam- 
ple file omits nearly all geographical informa- 
tion. The data set is a large (N= 146,088) 
random sample containing very accurately 
measured variables. 

My analysis focuses on foreign-born men, 
age 16 to 64, who have a job (N- 9,505). 
The low starting age makes sense for 
Australia because many people leave school 
by that age. Choosing a higher starting age 
would pose selectivity bias problems for the 
effects of labor force experience, because 
everyone with little labor force experience 
would also have more education than aver- 
age. This would not matter if all the effects 
were exactly linear with no interactions, but 
that is too strong an assumption to make in a 
relatively new research area. I have chosen 64 
as the upper age limit, because the old-age 
pension becomes available to men at 65 and 
many jobs have compulsory retirement at 65. 

I focus on men because labor force 
experience affects one's probability of start- 
ing a business, but these data (like most other 
census data) lack information on actual 
experience. Í estimate experience from cur- 
rent age, age at the end of education, and age 
at arrival in Australia. This is a reasonable 
approximation for men, but not for women. 
Australian women's labor force participation 
rates are lower than Americans', are more 
closely tied to the family life cycle, and vary 
widely among immigrant groups (Evans 
1984, 1988), thereby posing complex prob- 
lems of unmeasured heterogeneity. Nonethe- 
less, more in hopes of stimulating future 
research than of drawing firm conclusions, I 
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present parallel results for women (Appen- 
dix). 

The variables and details of their scoring 
are given in Table 1. 

Details of the full-time equivalent years of 
education for particular degrees and certifi- 
cates, of the country-specific estimates for 
. average age at start of education, and of the 
translation between the Australian Bureau of 
Statistics three-digit occupational code (with 
over 350 occupations) and the ANU-2 status 
code are in Evans and Kelley (1988, 
Appendices A and B). 


'METHODS AND 
MODEL SPECIFICATION 


I attached information about each individual's 
ethnic group (its size and its percentage of 
adults not fluent in English) to the individual 
records in the data set. I estimated a 
multilevel model of entrepreneurship predict- 
ing the individual's business ownership 
(0 — employee, or self-employed without em- 
ployees, 1 — business owner with employees) 
as a function of three sets of variables: (1) the 
group characteristics, (2) the individual's own 
language skills, and (3) the individual's own 
human capital. 

I used logistic regression, which is techni- 
..cally preferable to OLS for a dichotomous 
dependent variable, estimating it with maxi- 
mum likelihood procedures in the economet- 
rics package LIMDEP. 

The effects of size-related variables are 
often oddly shaped, so I also reestimated the 
core model with the group size variable 
transformed (1) by the natural log and (2) by a 
quadratic. Neither transformation improved 
the model (results not shown, available upon 
request from the author), suggesting that the 
logistic relationship adequately captures the 


curvatures in the link between group size and 


' probability of ownership. Because many 
researchers in the Marxist tradition object to 
occupational status on theoretical grounds, to 
facilitate comparisons I also estimated a 
model omitting occupational status. This 
leaves the effects of group size and of the 
group’s percent not fluent in English virtually 
indistinguishable from the core model that 
includes occupational status (see Appendix). 
The effects of group characteristics and of 
human capital are quite robust across some- 


what differently specified models, but the 
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effects of individual language skill are 
unstable, as shown in the Appendix. 

Based on the core model, I also computed 
predicted values of business ownership for 
various combinations of group size and 
group's percent not fluent in English. To 
compute these predicted values, I first set all 
other variables to their means, multiplying 
each variable's mean (from Table 1) by its 
logistic regression coefficient (from Table 2) 
and summing the results. I then chose levels 
of group size and group's percent not fluent in 
English, multiplied these by their logistic 
regression coefficients, and added these 
results to the previous sum. 


DESCRIPTION 


Group Characteristics 


The sizes of linguistically distinct ethnic 
groups vary widely with the men in my. 
sample belonging to groups that average 
63,158 people with a' standard deviation of 
88,935 (see Table 1). They range from zero 
for immigrants from Anglophone countries 
(who share language and many aspects of | 
culture with the native population) to tiny 
groups such as Nauruans (436) and the 
Bulgarians (1,678), on up to bigger groups 
such as the Maltese (57,001), the Dutch 
(96,044), the German-speaking ‘group— 
Austrians, Germans, .and Swiss—(141,289), 
with the largest group being the Italians at 
275,883. . 

The percent of adults not fluent in English . 
also varies greatly, with a mean of about 9 
percent and a standard deviation of 12 percent 
(Table 1). It is zero for the immigrants from 
Anglophone countries, a mere 2 percent for 
immigrants from the Netherlands and the 
Philippines; about 10 percent for the French, 
Hungarians, and Indonesians, about 15 per- 
cent for the Maltese, the Poles, and the 
Vanuatuans; about 30 percent for Brazilians, 
Greeks, and Koreans, about 45 percent for 
Turks and Cocos Keeling islanders, and 
ranges up to 64 percent for Cambodian. 
immigrants (who were then mostly recently 
arrived refugees). 


Individual Characteristics 


Most immigrant men speak only English at 
home (60 percent), and another 17 percent 
use a foreign language at home, but speak 
English very well. Immigrant men have, on 
average, 10.8 years of education. Only about 


IMMIGRANT ENTREPRENEURSHIP , l . 955 
Table 1. Variables and Scoring ` Y 








GROUP ATTRIBUTES (aggregate data from the full Census attached to individual records in the Public Use Sample): 


Ethnic group. Respondent's native country, unless it is part of a larger set of countries that share a language. If'the 
latter, then all countries in the set form the ethnic group. (Used only in further definitions.) à 
Ethnic group size. Number of persons of all ages in the respondent's "ethnic group." Set to zero for immigrants from 
English-speaking countries on the grounds that they ‘have no distinctive market. A few immigrants (0.7 percent of all 
immigrants) come from countries that have sent so few migrants that the Australian Bureau of Statistics does not 
identify the country of origin. I have assigned them a protected market size of 100 on the grounds that they may have 
tiny protected markets. The data for this variable are drawn from the complete enumeration of the 1981 Census 
(Australian Bureau of Statistics, 1983c, Table 8). Mean = 63,158. 


„ Percent not fluent in English. Percent (self-reports) who speak English "not well" or not at all among adults age 15 and 
over in the respondent's "ethnic group". A filter question asks whether one speaks a language other than English at 
home. Those answering “yes” are then asked how well they speak English: “very well," “well”, “not well,” or "not 
at all." The Census form is available in a broad array of languages and seems to be well understood. The data are from 
tbe full enumeration of the 1981 Census and were compiled by Charles Price (1984, Table 4). Mean = 9.01 percent. 


INDIVIDUAL ATTRIBUTES (unit record data in the Public Use Sample) 
English language proficiency. Self-reported individual-level language proficiency for the respondent. This is based on 
the same question as the group attribute "percent not fluent in English" and is scored as a set of dummy variables: 
*speaks only English at home" (omitted category in the logistic regression analysis, 60 percent of sample); those who 
speak a foreign language at home are divided among those who report themselves as speaking English “very well" (18 
percent), "well" (15 percent), 'and "not well or not at all" (8 percent). 
Education. Years of education. Estimated in two stages, as it is not asked directly. First, years of primary and, 
secondary school are estimated as the difference of "age left school" (which was asked directly) and usual age to start 
, school in respondent's country of origin. This is about 5.5 for those who started school in Australia; estimates for other 
countries are drawn from Evans and Kelley (1988, Appendix A). The only available information on tertiary education 
is the highest certificate or degree respondent has. Thus, there is no information on additional years of education for 
respondents who "dropped out" of tertiary courses; fortunately for my analysis such persons are relatively rare (Broom 
et al. 1980). For respondents who have completed tertiary courses, there is detailed information on the type of degree 
or certificate they received. I use Evans and Kelley's (1988, Appendix A) estimates of the usual number of years of 
full-time study required to obtain the certificate. The final estimate of years of education is the sum of the estimates of 
pretertiary education and of tertiary education. Mean = 10.82. 


Educated in Australia. A dummy variable scored one for immigrants who arrived in Australia before completing their 
education, and zero for those who completed their education abroad. Age at completion of education is available from 
a direct question for primary and secondary education, and age at completion of tertiary education is estimated from 
years of education (see above). Age at arrival is estimated. by the difference of current age (in single years) and years 
- since arrival in Australia (in single years). If age at completion of education is less than age at arrival, then the 
respondent is scored zero. Of our respondents, 22.4 percent were educated in Australia. 
Australian labor force experience. Years of Australian labor force experience. For immigrants educated in Australia, 
this is estimated as the difference of current age and age at the end of education (see “Educated in Australia"). For 
those educated abroad, this is the number of years since arrival in Australia (in single years). Mean — 14.82. 
Foreign labor force experience. Years of foreign labor force experience. Scored zero for immigrants educated in 
Australia. For immigrants educated abroad, estimated by the difference of age at arrival in Australia and age ended 
education (see “Educated in Australia"). Mean = 7.48. 


Occupational status. Socioeconomic status of respondent's occupation. For this variable, the Australian Bureau of 
Statistics' detailed occupational classification is recoded into the ANU-2 occupational status scale (Broom et al. 1980). 
In practice, it is similar to the widely used Duncan SEI scale for the United States and compares well with other 
occupational status scales in cross-national analyses (Jones and McDonnell 1977). 1 have used Evans and Kelley's 
(1986) linear transformation of the scale, which ranges approximately from zero to 100. Mean = 34.65. i 


Ethnic origin. Respondents come from five origins: the Anglophone countries (45 percent), Northwestern Eutope 
except the British Isles (10 percent), Eastern Europe (6 percent), the Mediterranean region including the Middle East 
(29 percent), and Asia plus the relatively few immigrants from Third-World countries not elsewhere classified (10 
percent). This grouping captures major differences pertinent to the labor market (e.g., Evans and Kelley 1986) but is 
obviously insufficiently detailed for fine-grained analysis of particular cultural groupings. Dummy variables based on 
it are used in some of models in the Appendix. 


DEPENDENT VARIABLE 


Entrepreneur. Based on a direct question asked of all people who had worked for wages, salary, payment, or profit in 
the week prior to Census date and also of people who usually do so but were temporarily absent from work because of 
holidays, illness, strike, or temporary layoff. The question asks whether in the main job the respondent was “a wage 
or salary earner," "conducting own business but not employing others," "conducting own business and employing 
` others,” or "a helper not receiving wages or salary." Wage and salary earners and those “conducting own business but 
not employing others" are coded zero (— not employer) on entrepreneurship. The 6 percent of respondents who were 
"conducting own business and employing others" are coded 1 ( employer). 
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one-fifth were educated in Australia (most of 
these came to Australia as children with their 
families), and the rest migrated after complet- 
ing their education abroad. On average, they 
have about 15 years of Australian labor force 
experience and about 7 years of foreign 
experience. Differences among the immigrant 
groups in "human capital" type character- 
istics have been treated in detail elsewhere 
(e.g., Broom, F. Jones, P. Jones, and 
Williams 1980; Kelley and McAllister 1984; 
Evans and Kelley 1986). Average occupa- 
tional status is 35, about the level of a 
‘plumber, a tailor, or an electrician. Six 
percent are employers. 


RESULTS 
Effects of group characteristics 


Group size hypothesis. Men are more likely to 
be entrepreneurs if they belong to big ethnic 
groups. This, I have argued, reflects the size 
of ethnic markets. It is noteworthy that group 
size raises the individual-level probability of 
business ownership, even controlling for a 
broad array of individual-level human capital 
characteristics: education, labor force experi- 
ence, occupational status, and respondent's 
own language use and skill (Table 2). 
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Evaluated at the mean of all other variables, 
men in very large groups (of 250,000 
people—somewhar smaller than the Italian 
group) are 1.5 times more likely to be 
entrepreneurs than are men from very small 
groups (of 100 people). 

Isolated labor pool hypothesis. Men are 
more likely to be entrepreneurs if they belong 
to ethnic groups vith a large percentage not 
fluent in English. I have argued that this 
effect reflects coethnic entrepreneurs' easier 
access to the skills of workers not fluent in 
English —the linguistically isolated labor pool. 
Note that this effect holds even controlling for 
the respondent's own language skills (as well 
as other human capital characteristics). Eval- 
uated at the mean of all other variables, men 
from groups with very large isolated labor 
pools (45 percent of adults speaking English 
poorly) are 2.5 times more likely to be 
entrepreneurs than men from groups with no 
isolated labor force. 

These results strongly support my popula- 
tion ecology modei of ethnic groups as niches 
that contain resources important to the growth 
of small business. 

Magnitude of the effects. The direction and 
significance of the effects shown in Table 2 
are clear, but their size is somewhat less 
obvious. To give a clearer view of the 


Table 2. Multilevel logistic regression analyses predicting individuals' probability of being an entrepreneur 
(conducting own business and employing others) as a function of both grcup and individual characteristics. 
Sample is foreign born men age 16 to 64 who are working (N = 9,505). 


. Causal Variable (units) 


Group characteristics 
Ethnic group size (10,000s) 
Group percent speaking English poorly (96) 
Individual characteristics 
Education 
Years of education 
. . Educated in Australia (0 or 1) 
Occupational status (0 to 100) 
Labor force experience 
In Australia (years) 
Abroad (years) 
English-language Fluency 
English monolingual 
English very good (0 or 1) 
English good (0 or 1) 
English poor (0 or 1) 
Constant 


Coefficient Standard error 
,0223** (.0059) 
.0188** (.0052) 

— .1089** (.0186) 
— 0428 (.1271) 
.0318** (.0020) 
.0259** (.0051) 
— .0218** (.0076) 
{omitted category) 
0891 (.1262) 
— .2168 (.1631) 
— .2850 (.2304) 
—3.4744 


Log-Likelihood = — 1963.7 and restricted (slopes =0) log-likelihood = — 2178.5, chi-squared = 429.5, significance 


level « .001. 
Note: ** indicates p <.01. 


Source: For individual characteristics: Australian Census of 1981, one percent public use sample. For group 


characteristics, see Table 1. 
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implications of the logistic regression coeffi- 
cients, Figure 1 shows the expected level of 
business ownership according to group size 
and the group percentage speaking English 
badly (with all other variables evaluated at 
their means). . 
Consider first a group that is not linguisti- 
cally distinct (i.e., has linguistic group size 
zero) and in which all speak English fluently. 
Anglophone immigrants would be an exam- 
ple. If they otherwise, had characteristics 
typical of immigrant men, then the regression 
model suggests that, on average, about 3 
percent would own businesses (bottom line, 
far left, in Figure 1). Consider next an 
intermediate group of 150,000 people in 
which 30 percent of the adults speak English 
poorly and so constitute an isolated labor 
pool—these are, for example, very close to 
the situation of Greek immigrants. Such a 
group would be expected to have a business 


_ ownership rate of about 7 percent, on average 


and all else equal (next to bottom line, center, 
in Figure 1). Finally, consider a linguistic 
group of 250,000 people in which 45 percent 
of the adults speak English poorly. Such a 
group has an expected level of business 
ownership of 12 percent (top line, far right, in 
Figure 1). These examples are all within the 
ranges of empirically observed sizes of ethnic 
groups and isolated labor pools in Australia, 
and give a sense of the scale of the effects: 
they are substantial, with entrepreneurs being 
four times more common at one extreme than 
at the other. 

Their implications for the involvement of 
the rest of the ethnic group in the ethnic 
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Fig. 1. The probability of being an employer: effects of 
group size and the group’s percent speaking 
English poorly. Predicted values from logistic 
regression, controlling for individual characteris- 
tics. 
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economy are also substantial. Recall that, by 
definition, each entrepreneur must have at 
least one employee. Even’ if each business 
owner employed only one coethnic, then a 
business ownership rate of 7 percent would 
implv an involvement of 14 percent of the 
group's male labor force in the ethnic 
economy (7 percent owners and 7 percent 
their employees), and a business ownership 
rate of 12 percent would imply 24 percent of 
this group's male labor force working in the 
ethnic economy. If businesses are even 
slightly larger, the ethnic economy is even 
more extensive. For example, if each entre- 
preneur employs three coethnics, then a 
business ownership rate of 7 percent implies 
28 percent of the groups' male labor force 
would be working in the ethnic economy, and 
an ownership rate of 12 percent implies 48 
percent of the groups' male labor force would 
be working in the ethnic economy. Unfortu- 
nately, the Census does not provide data on 
firm size, but these examples are well within 
the realms of possibility. 

The model does not fit terribly well, which 
is not surprising for a dependent variable with 
very high turnover (small businesses have 
very high failure rates and very high start-up 
rates). It is also true that these data do not 
allow me to capture many possible influences 
on small business ownersbip, for example, 
savings patterns and family traditions of 
entrepreneurship. 

Nonetheless, the significance of the effects 
of group size and of the group percentage not 
fluent in English is striking and the effects are 
quite robust across alternative formulations of 
the model. They change little if occupational 
status is omitted from the model (see 
Appendix). Even more strikingly, in a model 
omitting group size and group's percent not 
fluent in English, region of origin has 
significant effects on individuals' probabili- 
ties of business ownership. But when group 
size and group's percent not fluent in English 
are introduced into this model, they have 
significant effects (quite similar to those in 
the core model), and the effects of region of 
origin dwindle to insignificance (see Appen- 
dix). 

I also estimated a model of - business 


ownership for women; admittedly the measure- | 


ment difficulties are serious. This model 
supports the isolated labor pool hypothesis 
but does not support the group size hypothe- 
sis. Whether the lack of support for the group 
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size hypothesis is an artifact of measurement 
difficulties or represents a genuine difference 
between men’s and women’s business owner- 
ship opportunities is a matter that can be 
decided only by future research on data better 
suited to assessment of women's labor market 
situation, especially their fund of work 
experience. 

In sum, group size and isolated labor pools 
are among the forces that contribute to the 
development of immigrant small business. 


Individual characteristics 


Language use and skill. These effects are not 
significant (Table 2) and they are quite 
unstable across different model specifica- 
tions. This instability is unlikely to be an 
artifact of measurement error (although a 
multiple-item scale would be preferable), 
because effects of these variables on other 
Socioeconomic outcomes, such as occupa- 
tional status, are quite robust (Evans 1987). 


The results do not support the contention that - 


English-language skill and native language 
loyalty facilitate business ownership, but 
more research on other data sets will be 
needed before firmly dismissing these possi- 
bilities. 

Education. More highly educated men have 
lower levels of entrepreneurship,: all else 
equal (Table 2), a result consistent with prior 
research. Being educated in Australia rather 
‘than overseas is neither an advantage nor a 
disadvantage. 

Labor force experience. Men who have 
more Australian labor force experience have 
higher levels of entrepreneurship, a result 
consistent with prior research. By contrast, 
men with more foreign labor force experience 
have lower levels of entrepreneurship. It 
seems likely that this reflects a selection 
effect. 

Occupational status has a positive effect on 
entrepreneurship. These effects are robust 
across slightly different model specifications, 
but foreign experience less so than the others. 


DISCUSSION 


I have argued that linguistically distinct 
immigrant groups are economic niches that 
offer a competitive advantage to ethnic 
entrepreneurs. In particular, large ethnic 
markets and linguistically isolated labor pools 
are resources for ethnic entrepreneurs which 
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should stimulate business ownership. My 
results show that these group characteristics 
affect individuals’ probabilities of business 
ownership, even controlling for a wide range 
of individual characteristics including educa- 
tion, labor force experience, occupational 
status, and place af birth. 

Group size hypothesis. Men belonging to 
very large ethnic groups (say of 250,000. 
people) are about one and one-half times 
more likely to be business owners with 
employees than are men belonging to very 
small groups (of 100). I have argued that 
group size matters because the group provides 
an ethnic market, and larger markets stimu- 
late more entrepreneurship. 

Isolated labor pool hypothesis. My results 
also show that the greater the percent of 
adults in a group who are not fluent in 
English, the greater the probability that group 
members will be business owners. Men 
belonging to groups with a large portion of . 
adults not fluent in English (45 percent) are 
about two and a half times more likely to be 
business owners than are men in groups in 
which all adults are fluent in English. I have 
argued that group fluency matters because . 
adults not fluent in the host country’s 
language form a linguistically isolated labor 
pool whose skills can be tapped more 
efficiently by a coethnic entrepreneur than by 
an outsider. 

Thus, the evidence suggests that the partial 
separation of the ethnic group from the 
broader society stimulates the development of 
ethnic small business. But, on a cautionary 
note, in the tentative and exploratory model 
of women’s probabilities of business owner- 
ship, only the isolated labor pool hypothesis 
is supported; whether group size effects are 
genuinely absent for women or ‘whether their 
absence is an artifact of measurement difficul- 
ties remains to be seen. 

Individual-level human capital characteris- 
tics such as education, labor force experience, 
and occupational status affect men’s chances 
of business ownership, findings consistent 
with prior research linking business owner- 
ship into the status attainment model.’ 
Individual-level language skills have insignif- 
icant effects. 

It would be interesting to elaborate the 
model by exploring the effects of other group 
characteristics such as ethnic solidarity and 
social distance from the majority population. 
I have explored some “pull forces” that 
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increase the attractiveness of small business 
ownership, but future research should also 
explore the role of “push forces” such as 
disadvantage in the: broader labor market 
(Waldinger et al. 1985). | 


The tremendous diversity in socioeconomic 


fates of different immigrant groups was the 
Starting point for this paper. In particular, 
recent research has suggested that immigrant 
small business can buffer the effects of split 
labor markets and provide better opportunities 


for immigrants working within them. But if 


ethnic small business communities provide 
such clear benefits, why: haven't all groups 
developed them? My results suggest that the 
size of the ethnic market and of the labor pool 
which could benefit from enclave employ- 
ment affects the growth of immigrant small 
business. These results suggest the hypothesis 
that part of the reason that small business 
ownership is rare in some immigrant and 
immigrant-like groups— notably blacks in the 
industrial North of the United States, Jamai- 


'cans in Britain, and British immigrants in 


Australia—is that all these groups share a 
language with the host society and thus 
provide neither distinct markets nor linguisti- 
cally isolated labor pools to coethnic potential 
entrepreneurs, Obviously, there are some 
exceptions, such as West Indian immigrants 
in New York, but they do not invalidate the 
general picture. 

How do these findings relate to ethnic 
enclave development? A recent characteriza- 
tion of ethnic enclave economies stresses the 
importance of the extension of the market 


' beyond the ethnic group (Portes and Bach 


1985): that capital derived from selling to the 
broader society permits ethnic entrepreneurs 
to expand their firms and offer their workers 
very attractive opportunities (not merely 
opportunities not quite as bad as those 
available in the broader labor market). My 
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findings do not address this issue directly, but - 


it seems likely that an abundance of small 
businesses in a group increases the likelihood 
that some of them will discover effective 
strategies for selling to the broader market. 
On the other hand, the rarity of fully fledged 
enclave economies suggests that the likeli- 
hood of such discovery is extremely small. - 

Nonetheless, even a relatively small num- 
ber of entrepreneurs can offer employment 
opportunities to a substantial portion of their 
coethnics. For example, in a group with only 
average levels of entrepreneurship (6 per- 
cent), if each business owner employs just 
one coethnic, then 12 percent of the group's 
work force is involved in the ethnic economy 
and if each employer hire three coethnics, 
then 24 percent are in the ethnic economy. 
And the effects are even more striking at 
higher levels of entrepreneurship. For exam- ` 
ple, in a group with a high level of business 
ownership (12 percent), if each employer 
hires three coethnics, then 48 percent of the 
groups' work force are in the ethnic economy. 

It is possible that ethnic entrepreneurship 
and the employment opportunities it creates 
contribute to the rarity of clear-cut ethnic 
discrimination in occupational status and pay 
which we observe in the industrialized world. 
Knowing that jobs are available in the ethnic 
economy and that ethnic discriminatión would 
encourage immigrant employees to seek those 
jobs gives majority employers an incentive to 
offer equality of opportunity to their immi- 
grant and nonimmigrant employees. 
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ment of Sociology, Research School of Social 
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. Appendix. Alternative multilevel logistic regression analyses predicting individuals’ probability of being an 
entrepreneur (conducting own business and employing others) as a function of both group and individual 
characteristics. Sample is foreign born men age 16 to 64 who are working (N — 9,505). For comparison but 
subject to caveats noted in text, a parallel model is presented for foreign-born women age 16 to 64 working 
full-time (N = 3,140). 








MODELS WITH REGION OF ORIGIN INCLUDED 








Omitting Including 
‘ Core Omitting Group Group 
Model Occupation Variables Variables Women 
Group characteristics 
Ethnic group size (10,000s) .0223** .0214** — .0210** ~ .0086 
(.0059) (.0057) (.0076) (.0177) 
Percent bad English .0188** .0192** — .0251** .0238* 
] (.0052) (.0050) (.0066) (.0119) 
Individual characteristics 
Education (years) —.1089** .0719** —.1131** —.1085** — .0985* 
(.0186) (.0166) (.1876) (.0188) (.0412) 
Educated in — .0428 —.0952 —.0186 — .0501 .3675 
Australia (0 or 1) (1271) (.1245) (.1273) (.1285) (.2766) 
Labor force experience: 
in Australia (years) .0259** .0378** .0271** .0271** .0562** 
s : (.0051) (.0049) (.0052) (.0053) (.0113) 
Overseas (years) ~ .0218** — .0183* — .0230** — .0216** 0271 
(.0076) (.0074) (.0076) (.0076) (.0149) 
English-language fluency: 
English monolingual (omitted (omitted (omitted (omitted (omitted 
category) category) category) category) category) 
English very good .0891 222 .1662 .1138 .1948 
i (.1262) (.1236) (.1366) (.1377) (.3149) 
English good — .2168 —.3057 —.04543 —.1937 —.1558 
(.1631) (.1574) (.1657) (.1699) (.4007) 
English poor — .2850 —.4029 — .0569 —.2707 —.5445 
(.2304) (.2258) (.2290) (.2346) (.5089) 
Occupational status .0318** — .0319** .0320** .0298** 
(0 to 100) (.0020) (.0020) (.0020) (.0049) 
Region of origin: 
Anglophone lands — — (omitted (omitted (omitted 
category) category) category) 
Northwestern Europe — — .4105* .1693 .6864 
(.1597) (.1763) (.3712) 
Eastern Europe — — .0248 —.3178 —.6674 
(.2121) (.2259) (.5322) 
Mediterranean lands — — .6671** —.1524 ~ 1215 
(.1582) (:2433) (.5221) 
Asia — — .2374 —.1498 .0360 
(.1782) (.2094) (.3961) 
Constant —3.474 — 4.302 — 3.443 — 3.5062 —4.4623 
Log-likelihood — 1964 — 2096 — 1972 — 1961 — 484 
Restricted (slopes = 0) A 
Log-likelihood —2178 , —2178 -2178 —2178 —529 
Chi-squared 430 166 413 435 88 
Significance level «.001 <.00! <.001 <.001 <.001 
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CITY LIMITS ON RACIAL EQUALITY: 
. THE EFFECTS OF CITY-SUBURB BOUNDARIES 
ON PUBLIC-SCHOOL DESEGREGATION, 1968—1976* 


. DAVID R.: JAMES 
Indiana University 


: This research examines the determinants of public-school segregation in 65 


metropolitan areas in 1968 and segregation changes between 1968 and 1976. 
Models for central-city and suburban public schools permit estimation of the 
segregative effects that one system exerts on an adjacent system. In 1968, higher 
black proportions in cities enhanced segregation in suburbs, and higher 
proportions in suburbs promoted segregation in cities. Although city and suburban 
school segregation declined during the 1970s, segregation between cities and 
suburbs increased. Higher black proportions in cities put segregative pressure on 
their suburbs. The historic link between residential and public-school segregation 
was broken in southern but not in northern cities nor the suburbs of either region. 
Private-school utilization by whites had segregative: effects on southern city 
systems, but not elsewhere. The boundaries between city and suburban school 
systems appear to foster racial inequalities within those systems as class theories 


_| predict. 


The classic question prompting this study is, 
how do state structures create social inequali- 
ties? Recent debates concentrate on fragmen- 
tation of the lowest level of the state hierarchy 
(here termed the “local state"). Class theories 
argue that local-state fragmentation serves the 
interests of dominant classes by protecting 
class privileges, and by managing social 
conflicts and demobilizing insurgencies (e.g., 
Friedland, Piven, and Alford 1978; Cockburn 
1977). Institutionalists argue that fragmenta- 
tion hinders policy implementation and tends 
to make the metropolis ungovernable (e.g., 
Greer 1987; Yates 1977). Rational-choice or 
market models claim that local-state fragmen- 
tation serves democracy. If citizens are not 
satisfied with one local government, they may 
choose another in the metropolitan area that 
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provides them with the public services they 
prefer (e.g., Peterson 1981; Ostrom, Tiebout, 
and Warren 1961). Each theory has been used 
to explain the uneven results produced by 
public-school desegregative policies of the 
1960s and 1970s (e.g., Orfield 1969, 1978; 
Peterson 1981, 1985). This study evaluates 
the adequacy of these theories in explaining 
metropolitan ‘school segregation. The specific 
question examined is, how did the administra- 
tive division’ of metropolitan public schools 
between cities and suburbs affect racial 
segregation between and within those systems 
from 1968 to 1976? 

The matrix of metropolitan public-school 
systems confronts citizens with options and 
constraints. Metropolitan areas typically con- 
tain many school districts whose boundaries 
seldom conform to city or county boundaries. 
Parents who work in less affluent, multiracial 
neighborhoods may gain access to high- 
quality schooling in uniracial neighborhoods 
because they:can afford to live far from their 
work. Districts in prosperous neighborhoods 
usually spend more on teacher’ salaries, 
buildings, libraries, and other factors that 
affect school quality because local taxes 
produce much of the school revenues (46 
percent in 1976, the last year of this study 
[Aronson and Hilley 1986, p. 209]). Thus, 
dividing metropolitan schools into indepen- 
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dent school districts produces and maintains 
racial and economic inequalities in schooling. 

Recognizing these facts, plaintiffs in the 
1974 Detroit case (Milliken v. Bradley) 
argued that the Detroit School District could 
not be successfully desegregated unless all 
public schools in the metropolitan area were 
subject to a common desegregation order.! 
Many thought their arguments would prevail. 
The courts readily compelled changes in the 
boundaries of other local governments to 
remedy discrimination. For example, courts 
have often readjusted electoral district bound- 
aries in order to provide citizens in different 
local political districts equal protection of the 
law (e.g., Lawson 1976). In the Detroit 
school segregation case, however, the Court 
held that desegregative remedies must be 
confined to the districts guilty of discrimina- 
tion unless plaintiffs could prove that the 
boundaries separating . districts had been 
drawn with discriminatory intent (Orfield 
1978, pp. 12-36). Even showing that arbi- 


trarily drawn district boundaries had discrim- ` 


inatory effects was not enough to require 
cross-district remedies. 

Most comparative studies of public-school 
segregation ignore the possible segregative 
effects produced by the decentralized political 
structure of public schooling, although they 
do consider the effects of pressure from state 
agencies and the courts (e.g., Coleman, 
Kelly, and Moore 1975; Farley, Richards, 
and Wurdock 1980; Taeuber, Wilson, James, 
and Taeuber 1981; F. Wilson 1985). Ignoring 
metropolitan context implicitly assumes that 
school segregation is a function primarily of 
district internal characteristics such as racial 
composition. Yet, case studies suggest that 
School desegregative efforts are greatly af- 
fected by the quality of nearby public- and 
private-school systems (e.g., Orfield 1978; 
National Institute of Education 1977; Rossell 
1983). Boundaries between jurisdictions may 
not just impede policy implementation, as 


! The Detroit case involved 85 school districts 
ranging in size from 2000 to 285,000 pupils. The 
plaintiffs argued that "the State has regularly 
altered school district boundaries to suit its own 
purposes . . . [and] has not hesitated regularly to 
cross or alter these lines in countless instances for a 
variety of educational purposes" and should do so 
again to eliminate segregation that it helped 
generate in part (cited in Jones 1977, p. 67). 
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institutionalists argue, but promote greater 
inequalities within systems. 

The balkanization of metropolitan public . 
schooling may force adjacent school systems 
to compete for students as market models 
argue. Whites who prefer to avoid contact 
with blacks may leave or refuse to enter 
systems that have higher black concentrations 
if more attractive and affordable alternatives 
exist nearby. For example, higher black 
proportions in the city may encourage whites 
to migrate to the suburbs. If black proportions 
in the suburbs are also high, whites may 
redouble their segregative efforts in the city. 
Hence, citizens’ choices may circumvent or 
retard school desegregative policies if local- 
state institutions are fragmented over constit- 
uencies. But individual choices are con- 
strained by private resources as well as by 
local-state institutions. Political fragmentation 
in class-stratified societies produces inequali- 
ties by providing attractive choices to those 
with greater resources. 

Class theories argue that local-state frag- 
mentation creates and maintains inequalities 
within and between jurisdictions because 
local governments have little capacity to 
prevent powerful private economic actors 
from making decisions that affect local 
constituencies adversely. Institutionalist. and 
market theories complement class theories by 
identifying some of the causal mechanisms 
that create inequalities between systems. 
Neither focuses explicitly on how neutral 
political boundaries between systems create 
inequalities within systems. Class theories do. 

The most important split in metropolitan 
public-school administration is between central- 
city districts, with their large black propor- 
tions, and the surrounding network of smaller 
(and whiter) suburban districts. This study 
proposes a simple dual system of metropoli- 
tan schools separated by the political bound- 
aries of central-city districts. The model is a 
simplification because suburb systems are 
also politically fragmented, but it does 
provide a systematic account of how racially 
divisive splits in the administration of schools 
affect segregation within component systems. 
An important premise discussed below sug- 
gests that racial proportions, rather than 
segregation levels, are the most important 
school system characteristics that affect the 
segregation levels of adjacent systems. To the 
extent that the segregation processes are 
related, the political division of metropolitan 
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school systems causes racial segregation ` 


within as well as between systems. 
. School system internal factors and commu- 
. nity characteristics, such as residential segre- 
gation, income level, and the availability of 
private schools also affect public-school 
segregation. Previous. studies have, been 
biased because they did not analyze data on 
community characteristics that correspond to 
school system boundaries (e.g., Farley 1975, 
1977;. Rossell 1978). This study avoids such 
bias by analyzing school and census data for 


cities and suburbs of 65 SMSAs that enrolled 19 


60 percent of the black public-school students 
in 1968. ` 


A MODEL OF CHANGE 
IN METROPOLITAN 
' PUBLIC-SCHOOL SEGREGATION 


Milliken v. Bradley tends to confine state 
, imposed desegregative remedies to the bound- 
aries of school districts. Individual families 
can escape the consequences of desegregation 
by moving to other districts or sending their 


children to private schools as’ their resources . 


and personal preferences permit. To the 
extent that white parents avoid schools with 
higher black proportions, desegregated school 
districts with -higher black proportions may 
suffer disproportionate reductions in white 


students. Milliken v. Bradley therefore in-. 


creases the possibility that racial segregation 
will persist across districts even if all districts 
in a metropolitan area are completely deseg- 
regated within their respective boundaries.” 
Because suburban school systems are com- 


? This result holds even if no students relocate to 
adjacent systems. Consider a hypothetical metro- 
politan area with two independent school districts. 
Suppose that the proportion of students who are 
black is .8 in the first district, .3 in the second, and 

.5 in the metropolitan area. If both districts are 
. desegregated within their boundaries, 80 percent of 
the students in each school of the first will be 
black, whereas each school of the second will be 
30 percent black. Compared to the racial propor- 
tion of the metropolitan area, each school in the 
, first district will have more than its fair share of 
blacks (80 percent vs. 50 percent) and each school 
in the second will have too few (30 percent vs. 50 
percent). Thus, segregation between districts still 
exists although each is desegregated within its. 
boundaries. The one exception to this result is the 


improbable case in which all districts have the' 


same racial proportion. 
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posed of numerous áchioól districts, cross- 
district segregation within suburbs may in- 
crease even if, and perhaps because, 
desegregative remedies are imposed upon 
them or the central-city district. 

There are three important reasons for 
studying how city-suburb boundaries affect 
metropolitan school desegregation. First, fed- 
eral and state desegregative efforts were 
focused on the large central-city districts in 
metropolitan areas, which fueled the school- 
busing controversy of the 1970s (e.g., Orfield 
78). The Detroit school desegregation case 
that led to Milliken v. Bradley was an explicit 
attempt to fashion desegregative policy tools 
that would reach across city-suburb bound- 
aries. No other split in local public-school 
administration attracted as much attention. 
Southern suburban school systems were also 


' required to desegregate, but few (only four in 


this study) were desegregated as part of 
metropolitan plans that bused students across 
city-suburb district boundaries (Orfield 1978).3 
Northern suburban systems encountered less 
desegregative pressure because they con- 
tained few blacks or because white majorities 


‘were able to redraw school attendance zones 


or implement small-scale busing plans that 
reduced segregation enough to avoid further 
desegregative pressure.* No northern school 
system was subjected to a cross-district 
city-suburb desegregation plan during the 


'' period of this study. ` 


Second, the disparity between the central- 


3 The Supreme Court in Swann v. Charlotte 
(N.C.)-Mecklenburg Board of Education (1971) 
decided that busing could be used to transport 
students across central-city and suburban district 
boundaries if it was needed to eliminate the 
Southern dual-school systems that had -been 
imposed by law. Four school systems in this study 
were subjected to metropolitan desegregation plans 
between 1968 and 1976: Orlando, Louisville, 
Charlotte, and Nashville. The Milliken (1974) 
decision prevented the logic of Swann from being ' 
applied to northern systems guilty of de facto 
rather.than de jure segregation (Orfield 1978, pp. 
11-24): 

* Federal threats to enforce the 1964 Civil 
Rights Act by withholding funds available to local 
public-school districts under the 1965 Elementary 
and: Secondary Education Act (ESEA) was the 
primary tool used to desegregate most rural, 
systems in the South and some northern districts 
with small numbers of black students (Orfield 
1969, 1978). 
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city and suburban systems in the numbers and 
proportions of black students should produce 
detectable segregative effects on the two 
systems. If desegregating the central-city 
districts has no detectable .effect on the 
surrounding suburban systems, the effects of 
desegregating other smaller component sys- 
tems could hardly be detected. Finally, a very 
large proportion of black students in the 
United States have been crowded into the 
relatively small number of public-school 
districts in the central cities of large metropol- 
. itan areas. More than half of all black 
students were concentrated in less than 100 of 
the 22,000 school districts in the United 


States in 1968. The 65 central-city districts in ` 


this study accounted for more than 45 percent 
of all black public-school students in 1968 
and 1976. Adding the black students in the 
' suburban rings raises the proportion to over 
,60 percent. Thus, the segregation patterns 
identified in this study affect a significant 
proportion of black students.* 

A critical premise of this study is that racial 
concentrations and proportions provide the 
stimuli for racially. motivated enrollment 
, decisions, but segregation levels per se do 


*" not. Students and their parents can easily 
"ascertain whether or not there are larger 


numbers or proportions of blacks in a school 
or a school system. Whether or not a school 
has its fair share of blacks or whites ‘(or how 
equally students are assigned to schools by 
race) is harder to perceive. Measures of 
segregation are dispersion measures that 
indicate the degree of inequality (or uneven- 
ness) in the assignment of students to schools 
by race in a system (Massey 1988; James and 
Taeuber.1985). A measure of segregation is à 
convenient summary indicator for analysts 
interested in racial inequalities in pupil 
assignments. But racial inequalities in pupil 


5 The statistical analyses are representative of 
the 78 SMSAs that contained central-city school 
districts with black populations of 10,000 students 
or more in 1968. ‘See the discussion of sample 
selection bias below. The 78 SMSAs. contained 
about 65 percent of all black students in 1968. 

$ This theoretical claim appears justified in these 
data. Including the 1976 suburb segregation level 
in the 1976 city equation of the analyses reported 
., below had a small, statistically insignificant effect 
. on the results reported in Table 3. 
including the, 1976 city segregation level in the 
1976 suburb equation had negligible impact on the 
results reported in Table 4 (results not shown). . 


Similarly, © 


e 
' 
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assignments are not the proximate stimuli that 
encourage families to make racially motivated 
School enrollment choices. 

Higher black proportions and larger black 
student populations are the tangible factors 
that threaten whites who want to avoid 
contáct with blacks. A desegregated system 
with a high proportion of black students might 
trigger white avoidance because all schools in 
the system have significant black proportions 
even though some schools have less than their 
fair share of black students as compared to 
other schools in the system. Thus, white 
parents make decisions based on the actual or 
potential exposure of their children to blacks, 
not on how equally students are assigned to 
schools by race.? 

Figure 1 diagrams the determinants of 
metropolitan school segregation for 1968 to 
1976. No direct causal linkages are posited 
between the segregation levels of central-city 
districts and those of the surrounding subur- 
ban systems as a consequence of the premise 
that racial concentrations, rather than equal 
pupil assignments, motivate enrollment deci- 
sions. Only links indicated by solid arrows in 
Figure 1 appéar in the analyses that follow. 
Segregation levels are modeled as functions 
of current racial proportions and black 
population sizes, past segregation levels, and 
past levels of exogenous factors expected to 
affect changes in sepregation levels during the 


7 For example,:a white family that fears sending 
their children to. a black majority school may 
disenroll their child from a 60 percent black school 
even though the schoal may have less than its share 
of black students im a system that is 75 percent 
black. On the other hand, a 10 percent black 
desegregated system might cause whites no 
concern. Of course, rácial proportions, racial 
exposures, and segregation levels are linked. 
Desegregation will decrease racial inequality in 
pupil assignments and increase the exposure of 
whites to blacks and blacks to whites. The upper 
limit on racial exposure is set by a system's racial 
proportions. White students in a desegregated, 80 
percent black district will attend schools that are 80 
percent black whereas blacks would experience 20 
percent white environments (Lieberson 1981; 
Lieberson and. Carter 1982). As a consequence, 
whites opposed to black majority environments 
might withdraw. from desegregated systems with 
large black proportions but remain in systems with 
‘smaller proportions. 
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Note: Equation disturbances are not shown. The disturbances of the city equations (above the top bold line) and 
those of the suburb equations (below the second bold line) are assumed to be independent across sectors but correlated 
in time within sectors. The disturbances of the unanalyzed equations in the middle section (between the bold lines) are 
correlated, but are assumed to be independent of the disturbances of the city and suburb equations. 


A Model of the Determinants of 


Fig. 1. 
` Areas. 


intervening period. In the language of regres- 
sion analysis, these are reduced-form esti- 
mates.8 
City segregation levels (top section of 
Figure 1) and suburb segregation levels 
(bottom section) indicate racial inequalities in 
pupil assignments produced by variables in 
the middle section. Racial proportions and 
black population sizes are the factors that tie 
together the segregation levels of cities and 


5 The reduced-form equations express current 
endogenous variables in terms of lagged endoge- 
nous variables, current predetermined variables, 
exogenous variables, and disturbances. The reduced- 
form coefficient estimates obtained by two-stage 
least ‘squares are consistent even though the 
structural relations among intervening variables are 
not estimated (Theil 1971, pp. 429-41). If some of 
the unmeasured variables have direct effects on 
current segregation levels (contrary to the model of 
Figure 1), the estimates of the exogenous vari- 
ables' total effects net of the current predetermined 
variables will still be consistent. Similarly, the 

- estimates of the effects of the current predeter- 
mined variables and the lagged endogenous 
variables will be consistent. 


the Public-School Segregation within Cities and Suburbs of Metropolitan 


suburbs. Thus, current racial population sizes 
and proportions are predetermined with re- 
spect to current segregation levels. Other 
variables, including unmeasured ones, may 
intervene between the exogenous variables 
and the endogenous segregation levels as 
indicated in the middle section. Some of the 
middle-section relationships involve feed- 
back. I do not attempt. to model the complex 
relationships among the measured and unmea- 
sured variables in Figure 1, but this omission 
does not bias the estimates (see note 8). 
Segregation patterns tend to persist over 
time because they are linked to the available 
stock of school facilities and because admin- 
istrators seldom make radical adjustments to 
school attendance zones on their Ówn voli- 
tion. Parents" residential choices are based in 
part on ‘the stable relationship between 
residential location and school assignment. 
Thus, present segregation levels should be 
strongly related to past levels in the absence 
of external pressure to desegregate. This 
study seeks to isolate the most important 
determinants of changes in school segregation 
levels between 1968 and 1976, the period 


968 


when external desegregative pressure was 
strong. Figure 1 approaches this task by first 
modeling the level of segregation in 1968 and 
then the change in segregation levels between 
1968 and 1976. Including the 1968 value of 
segregation among the regressors in the 1976 
equation allows the coefficients of the other 
variables in that equation to be interpreted as 
effects on segregation change rather than 
effects on levels. 

Because Figure 1 does not include pre- 
1968 levels of segregation as a regressor, the 
coefficients in the 1968 equations indicate 
effects on 1968 levels. Modeling 1968 levels 
- rather than changes in levels seems appropri- 
_ ate for two reasons. First, identifying the 
determinants of 1968 levels, such as residen- 
tial segregation and school racial composi- 
tion, makes it possible to estimate how the 
effects of those factors were transmitted from 
the past to the 1976 segregation levels. 
Second, no metropolitan data on pre-1968 
segregation levels exist. Proxies for the 
missing data can be developed, but analyses 
based upon them may be unreliable.? Fortu- 
nately for this study, the greatest changes in 
metropolitan public-school desegregation oc- 
curred between 1968 and 1976.10 


Factors Affecting School Segregation Levels 
and Changes in Levels 


The suburb public-school system of Figure 1 


? A number of analyses were performed assum- 
ing that southern districts were completely segre- 
gated in 1960 and that no desegregation occurred 
in northern districts between that time and 1968. 
Higher levels of residential segregation and higher 
black proportions were associated with smaller 
. Segregation drops in cities and suburbs in 1968 as 
expected. 

© All southern districts were segregated by law 
on the eve of Brown (1954). Only a few showed 
significant changes between 1954 and 1968. Most 
of these were smaller suburban districts that were 
vulnerable to federal threats to withhold ESEA 
funds. Little or no school desegregation occurred 
outside the South before 1968 (Orfield 1969, 
1978). Table 1 shows that average 1968 segrega- 
tion levels were still 0.86 in central-city districts 
and 0.74 in the suburbs. After Green v. New Kent 
: County (1968), southern educators were required 
to use mandatory, rather than voluntary, pupil 
reassignments, including busing if necessary, to 
eliminate racially dual schools. See Orfield (1983, 
Tables 2 and 9) for evidence of the desegregation 
slowdown since 1976. 
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is actually composed of a number of politi- 
cally independent school districts. If a 
suburban system were a unified school 
district, as it is in some cities, it would be 
vulnerable to uniformly imposed desegrega- 
tive actions which could reduce its potential 
to provide sanctuary to those preferring 
segregated schools. None of the suburbs in 
the sample was politically unified. Whites 
who left city systems or in-migrants who 
wished to avoid contact with blacks could 
almost always find suburban enclaves with 
low black proportions, which would not 
threaten whites even if desegregated. 

Desegregative actions in the North and 
South differed in scope and intensity because 
the Supreme Court provided different reme- 
dies according to differences in the racial 
politics of the two regions. Because all 
southern districts had been segregated by Jaw, 
all came under pressure from the Courts or 
the Executive Branch to eliminate racially 
identifiable dual school systems. This pres- 
sure was weak before 1968, however, be- 
cause freedom of choice plans were deemed 
adequate until May of that year. By 
contrast, almost no desegregative pressure 
was focused on northern districts before 
1968. School segregation had not been legally 
imposed in the North and some integration 
had occurred. The more common result was a 
high level of de facto school segregation 
(Table 1), which was allowed to persist and 
sometimes encouraged by school administra- 
tors. 

Desegregative pressure increased dramati- 
cally after 1968 in both the North and the 
South. Over half of the central-city districts in 
this study, including 79 percent of the 
southern ones, implemented a public-school 
desegregation plan as a result of a federal 
court order during the period (see Table 1). 
Southern districts had to eliminate dual school 
systems in cities and suburbs, but suburban 
desegregative plans typically did not require 
transporting students across district bound- 
aries. Mandatory reassignment of students to 
eliminate segregation was much less likely in 
northern suburbs although a number of 


‘ The Supreme Court in Green v. New Kent 
County (1968) held that freedom of choice plans 
were inadequate if they did not eliminate racially 
identifiable school systems (Orfield 1969, pp. 
340-41). When they did not work, mandatory 
reassignment of pupils were required. 
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Table 1. Values of Selected Variables for 65 SMSAs* 

Proportion 

of Whites 

Central-City Public Schools Suburban Public Schools em 

SoU Pis uS Segregation Percent Black Segregation Percent Black Schools 1970 
NAME . ST 1968 1976 Diff. 1968 1976 Diff. Court” 1968 1976 Diff. 1968 1976 Diff. CITY SUBURB 
NORFOLK VA 86 1S -71 42 54 12 YES 26 48 -8B 28 26 -0 15 06 
COLUMBUS GA 92 B -.69 30 4 10 YES 95 47 -48 56 55 -00 03 05 
GREENVILLE SC 85 16 -—.69 22 24 02 YES 65 33 -31 12 .H —.01 05 02 
OKLAHOMA CITY OK 89 .26 -.63 22 M 12 YES 85 61 -23 01 .05 04 07 02 
WINSTON-SALEM NC ..86 .23 —.63 28 3 06 YES 69 34 -35 21 20 09 .06 02 
PORTSMOUTH VA 88 26 -—.6 52 59 07 YES P 46 -33 29 3l o2 OS 09 
GREENSBORO NC 82 21 -.61 31 43 i2 YBS 72 30 -43 21 28 o o 04 
NEWPORT NEWS VA 8 (29 -6 37 39 02 YES S 27 -B 24 B3 09. .06 05 
LOUISVILLE KY .8 22 -38 20 25 05 YES 57 47 -0 06 07 01 24 19 
CHARLOTTE NC B 15 -8 29 36 07 YES .62 33 -3 26 24 -02 05 01 
RICHMOND VA 87 30 -—.5] 68 81 12 YES B 51 -22 40 B 03 42 07 
AUGUSTA GA .93 38 -—.56 .36 48 2 YES 68 30 -38 30 3 03  .05 .06 
MONTGOMERY AL 94 45 —.50 .43  .50 07 YES 89 21 -69 35 32 -03 07 05 
JACKSON MS 95 47 -48 46 T 24 YES 92 54 -38 44 41 -02 118 A4 
FORT WORTH TX 89 47 -42 2 AL 4 YES .72 59 -B 02 B OL 05 03 
MEMPHIS TN 95 .56 -39 54 D 47. YES 93 45 —48 38 33 -05 a2 all 
NASHVILLE TN 82 43 -3 24 31 06 YES 62 43 -—19 12 09 -~.03 10 03 
COLUMBIA SC $2 42 -39 42 54 42. YES .9 37 -22 18 13 -065 06 03 
PENSACOLA FL 80) 45 -—.34 28 28 00 YES  .58 58 00 08 06 -.03 ais .05 
MOBILE AL ..89 .58 -3l 42 M 03 YES 98 23 -74 26 B -03 8 06 
ORLANDO FL 8 $4 -31 207 21 04 NO 2 56 -a47 24 45 -09 0 A2 
CHARLESTON SC .88 .58 -—.30 46 51 05 YES 84 .56 -28 39 33 -.07 .16 06 
DALLAS TX 94 65 -.29 .33 ' 55 22 YES 73 63 -—.10 08 10 62 0 04 
MACON GA 82 53 -.2 4| 53 2 YES 8&4 48 -36 2 26 04 09 03 
CHATTANOOGA TN 86 .62 —.24 49 AS ~.04 YES 62 53 —.10 05 03 -.02 08 7 
SAN ANTONIO TX 84 66 -48 35 $2 46 NO 32 52 -20 04 07 403 18 08 
ATLANTA GA .91 74 =-.17 .6 89 27 YES 4.74 m -V Q0 a3 06 2 04 
HOUSTON TX 92 7 -15 38 56 17 NO 2 .65 -07 alb IS 04 0 03 
BALTIMORE MD 82 68 -.14 65 77 12 NO 53 48 -05 .07 .10 03 30 AS. 
BIRMINGHAM AL 92 80 -—.12 .$1 6 7 YES 94 50 -4 29 22 -07 Q7 .06 
NEW ORLEAN LA .83 .76 -08 68 .83 15 YES 79 34 -44 D 2 42 43 31 
ST. LOUIS MO 88 85 -.03 64 .72 08 NO 80 74 -06 09 2 A2 38 21 
KANSAS CITY MO .80 .79 -.01 47 .68 21 NO 85 2 —43 06 10 OF 20 A0 
WASHINGTON DC .79 86 07 .94 96 02 NO 63 Sl -—.11 .09  .18 J0 47 A2 
(N34) SOUTH MEAN .86 48  -.38 A .5*  — 10 — 39 A 48 -26 149 20 Ol 44 -08 
NORTHERN SMSAs 
NAME 
DAYTON OH .87 22 ~.65 38 52 .4 YES .79 .66 -.13 03 14 Ji 48 10 
OMAHA NB 79 .30 -49 B 2B 04 YES 61 .57 -0 OL 01 -—.00 28 14 
DENVER CO 83 039 -.4 48 30 A2. YES 57 Si -06 © 0l Ol  .18 06 
BOSTON MA 73 32 —.4 28 49 -21 YES 59 45 -.14 OL 02 © 36 45 
SAN FRANCISCO CA .588 27 —34 40 51 Al YES 81 66 -.15 .14 35 A3 .09 
INDIANAPOLIS IN .77 47 —341 34 46 2. YES .7 .65 -06 .01 05 43 15 09 
MILWAUKEE Wi .85 .55 -—.30 25 40 45 YES 76 62 -44 (0 .O1 01 9 20 
OAKLAND CA 77 .59 —48 13  .18 05 NO 71.58 —43 01 (M 03  .08 06 
PITTSBURGH PA 71 .57 -.4 .39 46 07 NO 64 59 -05 .04 10 06 — 46 A7 
TOLEDO OH .78 65 —.13 27 3 03 NO 15 50 -25 01 02 01 8 14 
DETROIT M 7T 63) -438 60 Bi 21 YES St 87 -04 04 Il 06  .36 14 
KANSAS CITY KS 74 62 -.12 230 .39 09 NO .89 85 -04 15 21 06 a8 Al 
BUFFALO NY .70 59 -.a2 38 47 40 NO TI 55 -23 02 09 39 A8 
LOS ANGELES CA .90 Bi -09 30 4 40 | NO 87 77 -10 08 22 44 — 4 -10 
WILMINGTON DE .53 47 -%6 70 .9 20 NO 60 42 -.18 .06 08 02  .50 A3 
GARY IN 87 .82 —.05 68 .84 6 NO 86  .81 -.05 .07 I4 08  .20 AS 
FLINT MI 6&3 .59 —.04 238 54 .7 NO 4 68 -.15 02 01 -00 .17 07 
CINCINNATI OH 66 64 -~02 43 53 10 NO m .2 -0 2030 07 04 35 22 
ROCHESTER NY .53 .51 -02 .30 49 J9 NO 1 34 -27 03 (Q2 -0l 35 A4 
SAN DIEGO CA  .69 .68 -—.O01 .64 80 16 NO 46 .55 -21 12 28 16 2 JI 
CLEVELAND OH .89 90 OL 57 60 03 NO 833 B -a 04 28 24 32 22 
SEATTLE WA 64 65 OL i2 2 08 NO 48 .30 -—18 40 0l OL AS 05 
CHICAGO IL. 9 Nn- 02 58 n 12> NO 83O 74 ~09 05 12 07 43 19 
NEWARK N 79 83 04 .80 BB 08 NO 35. 66 —49 14 B J5 3 20 
NEW YORK NY 6 n 06 42 55 J4 NO J0 73 03 07 24 A7 — 40 AT 
NEW HAVEN CN 54 6 06 .53 .70 17 NO 56 41 -15 03 06 03 34 43 
PHILADELPHIA PA 75 8l 06 .60 66 .06 NO 6 63 -.03 09 17 07 .56 24 
PATERSON N .53 .6 07 54 73 19 NO S1 .59 -23 A 27 22 28 2 
HARTFORD CN 67 #74 0 52 69 8 — NO 48 (SI 03 02 05 03 M H 
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P : 
Central-City Public Schools Suburban Public Schools of 
in Private 

Segregation Percent Black Segregation Percent Black Schools 1970 
NORTHERN SMSAs Federal NEEDS Se. 
NAME 1968 1976 Diff. 1968 1976 Diff. Court? 1968 1976 Diff. 1968 1976 Diff. CITY SUBURB 
JERSEY CITY N 62 .6 08 49 65 16 NO 67 .56 -1 9» O 00 50 29 
YOUNGSTOWN OH .65 7 08 42 3 B NO 6 .59 -10 26 7 n 40 14 
(N*31) NORTH MEAN 72 -61 -d1 429 55 12° 26 TE OOP ane 5 12 07 31 AS 
N65) GRAND MEAN BO SH —29 A .S4 gd 544 33 S4 —.19 12 J6 04 xm ll 





Source: Special file that combines 1970 Census data, Office of Civil Rights (OHEW) annual school survey data for 
1968 and 1976, and surveys by DHHS (Rossell 1978; Cox 1979) and the U.S. Commission on Civil Rights (1977). 
* Districts are sorted in descending order of central-city segregation decline within regions. Segregation is measured 


by the Index of Dissimilarity (D). 


> A “YES” response indicates that a public-school desegregation plan was implemented as a result of a federal 


court order. Missing data were coded “NO.” 


* Rounding error prevents the difference in column means from equaling the mean difference. 


? Indicates the proportion of “YES” responses. 


districts implemented voluntary reassignment 
plans with varying degrees of success. 
Because northern school segregation was not 
statutorily imposed, many northern districts, 
especially the smaller ones, were never 
subjected to systematic desegregative pres- 
sure. Thus, the strength of desegregative 
interventions varied across regions and across 
city-suburb boundaries. 

A principal claim of this study is that black 
concentrations and proportions affect the 
segregation levels of adjacent school systems. 
Further, the effects of racial components on 
adjacent school systems should vary accord- 
ing to the desegregative pressure focused on 
them. Before 1968, higher black proportions 
in adjacent systems should have reduced their 
attractiveness as enclaves. Weak desegrega- 
tive pressures in the South and no pressure in 
the North left room for white avoidance 
within cities and suburbs if adjacent systems 
had high black proportions. Consequently, 
higher black proportions in the city should 
have produced higher segregation levels in 
the suburbs, and higher proportions in the 
suburbs should have produced more segrega- 
tion in the city before 1968. 

The strong federal desegregative actions 
focused on city districts after 1968 should 
have altered the effects of suburb factors on 
city segregation levels. Many city districts 
were forced to implement mandatory pupil 
reassignment plans to reduce segregation. If 
the interventions were strong enough, the 
effects of the suburbs’ racial characteristics 
on city segregation would have been reduced 
or broken completely. 


Suburban districts were also subjected to 
greater desegregative pressure after 1968, but 
not in a manner that eliminated city to suburb 
links. Suburban systems were still balkanized; 
only a few southern suburbs were subjected to 
cross-district desegregation programs. Faced 
with strong desegregative pressure and large 
black proportions in the city, whites could 
find suburban enclaves with small black 
proportions that provided little threat even if 
desegregated. Thus, high black proportions in 
the city should have retarded desegregation in 
the suburbs. 

Higher levels of segregation in 1968 shouid 
have been associated with higher levels in 
1976, but Federal Court pressure may have 
broken the link in central cities because that 
was the purpose of the desegregative plans. 
Because there was no statutory prohibition 
against school integration in the North, 
northern systems should have been less 
segregated than southern ones before 1968 in 
both cities and suburbs. Because the pressure 
to dismantle de fácto segregation was weaker 
than that directed against de jure, the 
segregation decline in the North between 
1968 and 1976 net of other factors should 
have been smaller than in the South. The 
effect of past segregation should have been 
stronger in suburban systems than in central- 
city districts between 1968 and 1976 because 
suburban systems were fragmented whereas 
city districts were not. Thus, desegregative 
actions in the cities should have been more 
effective in severing links with the past. 

Systems' racial characteristics also affect 
their own segregative levels and changes. 


CITY-SUBURB BOUNDARIES AND DESEGREGATION 


Numerous studies find that high black percent- 
age is one of the most important causes of 
white exit from and nonentrance to desegre- 
gated public-school systems (e.g., F. Wilson 
1985; Coleman et al. 1975; Rossell 1983). 
Hence, higher proportions of black students 
should have been associated with higher 
segregation levels for both cities and suburbs 
in 1968 and should have retarded desegrega- 
tive efforts between 1968 and 1976. : 

Whites may avoid large black populations 
for the same reasons that they avoid higher 
black proportions. Coleman, Kelly,: and 
Moore (1975), for example, found that white 
flight was greater from larger districts. Others 
have found strong relationships between 
population size and school segregation levels 
and changes in levels (Taeuber et al., 1981; 
Farley 1975). The effects of larger black 
population sizes on 1968 segregation levels, 
and subsequent changes from those levels, 
might be similar to those produced by higher 
black proportions for both cities and suburbs 
for the same reasons. Greater black popula- 
tion size also imposes logistical problems and 
transportation costs that reduce the effective- 
ness of desegregative efforts within systems 
(Orfield 1978; Rossell 1983; Rossell and 
Hawley 1981). Thus, larger black populations 
Should retard desegregative efforts in central- 
city districts. Black population size effects 
might be weaker or nonexistent in suburban 
Systems because there was no pressure to 
transport students across district boundaries 

before 1968 and little thereafter. 

The established practice of assigning stu- 
dents to neighborhood schools creates racially 
segregated school systems because residential 
segregation has been high as long as signifi- 
cant numbers of blacks have lived in U.S. 
cities (e.g., Taeuber and Taeuber 1965; 
Hershberg, Burstein, Ericksen, Greenberg, 
and Yancey 1979; Massey and Denton 1987). 
As a result, greater residential segregation 
should have produced greater public-school 
segregation in both cities and suburbs in 1968 
(e.g., Wilson and Taeuber 1978; Farley 1977; 
Logan and Schneider 1984). Desegregative 
pupil reassignment plans might have reduced 
the impact of residential segregation in 
central-city districts after 1968, but suburban 
school segregation is linked to between- and 
within-district residential segregation within 
the suburbs. Thus, residential segregation 
should have retarded desegregation in subur- 
ban systems. 
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Greater fiscal capacities may be associated 
with lower levels óf segregation in central- 
city districts because districts with more 
revenue per enrolled student can build newer 
Schools, maintain them in better condition, 


and hire better teachers (Rossell 1983). : 


Well-funded public systems tend.to attract 


and retain whites even if racial proportions ` 


are high (Monti 1985, pp. 138-47). Orfield 
(1978, p. 416) reports that there is an 
“increasing correlation between segregation 
and fiscal distress.” 
fiscal capacities in central cities should result 
in lower levels of segregation in 1968 and 
enhance desegregative efforts thereafter. 
Greater fiscal capacities may have just the 
opposite effect in the suburbs because the 
suburbs are politically fragmented. Because 
fiscal capacities are tied to property values 
through property taxes and because property 
values vary across suburban districts, higher 
average per-pupil expenditures were probably 
associated with the presence of relatively 
wealthy suburban enclaves which provide 
better schools and teachers for those who can 
afford to live in them. The higher average 
incomes of whites make it more likely that 
they would live in such areas in disproportion- 


ate numbers even if they did not intentionally - 


avoid contact with blacks. Thus, higher 
suburban fiscal capacities should be associ- 
ated with higher school segregation levels and 
smaller declines in segregation between 1968 
and 1976. 


Public-school desegregation may also in- 


duce whites to enroll their children in private 
schools if they are. available and affordable. 
Many northern and a smaller próportion of 
southern metropolitan areas had: extensive 
systems of private schools by 1968. Most of 


these private systems -were created originally _ 


for religious reasons or other purposes than to 
segregate blacks from whites. Because public- 
School systems were segregated by law in the 
South and by common practice in the North, 
there were no special racial motivations to 


create or attend private schools. Thus, there is | 


no necessary connection between the availabil- 
ity of private schools and. public-school 
segregation in 1968. . 

This was not the case, however, for the 
1968 to 1976 period. Federal pressure to 
desegregate first southern and then northern 
schools in the late 1960s and early 1970s 


For these reasons, higher - 


increased the likelihood that private schools - 
would be used as an avenue for white 
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avoidance (Rossell 1983, pp. 33-34). “Seg- 
Tegation academies” sprang up in the South 
.(Nevin and Bills 1976; Clotfelter 1976; 
. Munford 1973) and preexisting private sys- 
.tems were used as escape routes for whites 
"wherever they were available (e.g., Rossell 
1983; Hannon 1982). Thus, the availability of 
private schools should have retarded desegre- 
gative efforts in both cities and suburbs. 
Class differences between whites and 
blacks may also affect the school desegrega- 
tion process. Whites may be more willing to 
send their children to school with blacks from 
' higher than from lower class backgrounds. In 
addition, blacks with greater resources are 
better able to instigate litigation and pressure 
School boards for desegregative purposes 
(Farley 1975, 1977). Thus, higher class 
standings for blacks and class differences 
between whites and blacks should be associ- 
ated with lower levels of segregation in 1968 
and larger drops in segregation Verwee 1968 
and 1976. 


DATA, METHODS, AND 
VARIABLE DEFINITIONS 


This study uses a data set especially suited to 
an analysis of metropolitan public-school 
desegregation. Separate variables exist for 
central-city districts and the suburban rings of 
public and private schools that surround the 
central-city districts (James 1988). The use of 
non-school system measures that coincide 
with the boundaries of the school systems 
being analyzed is a unique strength of this 
study. For example, the central-city residen- 
tial segregation scores correspond to the 
boundaries of the central-city public-school 
districts in 1970. In the absence of variables 
tailored to school-district boundaries, previ- 
ous studies had to sutstitute scores computed 
for metropolitan or urbanized areas if city 
characteristics were considered at all (e.g., 
Farley 1977; Rossell 1978). Using metropoli- 
tan-level variables as proxies for school- 


` district level indicators introduces unknown 


" amounts: of measurement error that can bias 
_ analyses in unpredictable directions.!? Further- 


12? For example, Farley's (1975, 1977) analyses 
of the-relationship between residential and school 
segregation in central-city districts use 1970 
residential segregation scores for urbanized areas 
and 1967 and 1972 sublic-school segregation 


scores for school districts while noting that the: 
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more, it is not possible to investigate the 
relationship between residential and school 
segregation for cities and their suburbs unless 
separate measures are available for each area. 
This study improves on previous research 
because the data coincide with the boundaries 
that separate central-city districts from subur- 
ban schools. l 

The basic school data are from the annual 
surveys of public-school enrollments by race 
collected by the Office of Civil Rights (Office 
of Civil Rights 1968, 1979) (OCR) of the 
U.S. Department of Health and Human 
Services (DHHS). 1° In 1968, OCR collected 
data on 8491 school districts enrolling 
approximately 92 percent of all public-school 
students and about 98 percent of all black 
public-school students. The much smaller 
1976 survey collected data on only 3000 
districts. Both the 1968 and 1976 OCR 
suburb data are incomplete because of the 
sampling procedures employed by OCR. The 
problem is probably negligible in 1968, but 
more serious in 1976 because of the much 
smaller sample size for that year. Fortunately, 
the direction of bias in the segregation scores 
is known. The schcol districts most likely to 
be excluded from the suburban rings are those 
with small black enrollments (Office of Civil 
Rights 1979). Because schools with high 


boundaries of urbanized areas and school districts 
seldom coincide. The 1967 Farley data should 
compare closely to the 1968 data used here if 
measures based on dirferent areal units introduces 
little bias. Farley (1975, p. 16) reports a 
correlation between 1967 school segregation and 
1970 residential segregation of 0.61 for 61 
northern central cities and 0.68 for 33 southern 
ones. The correlations between 1968 school 
segregation and 1970 residential segregation from 
these data are 0.82 for 31 northern cities and 0.19 
for 33 southern ones. The southern correlation is 
lower because southern districts: were segregated 
by law, rather than by assignment to neighborhood 
schools. Because some desegregation took place 
between 1967 and 1968, the 1967 northern 
correlation should have been as high or higher than 
0.82, whereas the southern correlations should 
have been as low or lower than 0.19. Farley's data 
appear to exaggerate the southern correlation, 
underestimate the northern one, and rank order the 
correlations incorrectly by producing a southern 
correlation that is higher than the northern one. 

* DHHS was formerly the Department of 
Health, Education, and Welfare (DHEW). Refer- 
ences to DHHS funded research and data collection 
efforts includes those that were funded by DHEW. 
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white enrollments are underreported, the 1976 
suburb segregation scores are probably smaller 
than they would be if data for all schools in 
each SMSA were available. 

These data were consolidated into a file 
that links. 1970 U.S. Census (fourth count) 
data to public-school enrollment data within 
the boundaries of central-city public-school 
districts. Data for census tracts were aggre- 
gated to school district boundaries based upon 
a special census tape that identified the 1970 
census tracts contained within public-school 
districts (F. Wilson 1979; Wilson and Taeu- 
ber 1978). 

The scores for the system of schools 
surrounding each central-city district, here 
designated “suburb,” were estimated by 
aggregating the data remaining for each 
Standard Metropolitan Statistical Area 
(SMSA). The central-city district score was 
„calculated for all schools in that district using 
the Index of Dissimilarity (D); the suburb D 
score was calculated for all schools in the 
SMSA except those in the central-city dis- 
trict. Thus, a city segregation score corre- 
sponds to a single district whereas a suburb 
score reflects the segregation within and 
between all of the districts that surround a 
central-city district. Data for 1970 Census 
tracts outside of the central-city school district 
were aggregated and combined with the 
suburb school-segregation scores for 1968 
and 1976. i 

No 1980 census data for district boundaries 
circa 1976 and 1980 was available. Conse- 
quently, the 1970 census data were used in 
the analysis of both the 1968 and 1976 
panels, This data limitation prevented an 
investigation of how changes in the census 
variables during the period affected school 
desegregation. On the other hand, the 1970 
data should provide very good proxies for the 
levels on these variables in both 1968 and 
1976 because their relative magnitudes prob- 
ably changed little during the period (e.g., 
. Massey and Denton 1987). Farley (1975, 
1977) followed a similar strategy because 
census and public-school data are seldom 
collected contemporaneously (see note 12). 

Information on whether central-city dis- 
tricts were. under federal court orders to 
desegregate were consolidated from two prior 
studies (F. Wilson 1985; Rossell 1978). A 
(1,0) dummy variable indicates federal-court 
pressure. to desegregate (1 = Yes). An 
analogous variable was not available for the 
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suburban schools because suburban rings 
were composed of more than one public- 
school district. Some of the suburban districts 
were subjected to federal-court pressure, but 
many. were not, especially those ‘that con- 
tained small numbers: and proportions of 
blacks, Many southern city and suburban 
districts were also subjected to desegregative 
pressure from the Office of Education or OCR 
as well as from state and federal .courts 
(Orfield 1969, 1978; Farley 1975; Green 
1985). Data limitations preclude modeling 
those processes in this study. 

The 65 SMSAs (James 1988) analyzed 
below are a subset of the. 78 SMSAs ‘with 
ceniral-city districts. enrolling more: than 
10,000 black students in 1968. Thirteen were 
dropped because of missing suburb data.!4 
Deleting these cases did not appear to 
introduce sample selection bias.!5 Although ` 
this analysis seems representative ‘of the 
universe of 78 SMSAs with the largest black 
student populations, they may not capture the 
typical process in: smaller metropolitan ‘sys- 
tems or those with politically unified subur- 
ban school systems. i 

The Index of Dissimilarity (D) was used to 
measure ‘segregation levels among black and 
white students for the analysis of trends 
(James and Taeuber 1985).!6 Racial composi- . 
— * 4 

^^ There were 87 public-school districts with 
more than 10,000 black students in 1968 (Taeuber 
et al. 1981). Eight were not central-city districts, 
but were included in the suburb records of the 64, 
SMSAs in the sample. One was a rural distzict. 

15 T used sample selection models to evaluate the 
possibility of selection bias (Heckman 1979; Lee, 
Maddala, and Trost 1980). A first-stage probit 
analysis modeled the selection of the 65 SMSAs 
from the universe of 78 as functions of city black 
population sizes and proportions in 1968 and 1976. 
Second, hazard rates were calculated from the 
probit models and included among the regressors 
in the substantive equations. None was statistically 
significant, nor did they produce significant 
changes: in the other coefficients. The reported 
estimates exclude the hazard rate variables because 
no Sample selection bias was apparent. — 

$ The analyses did not appear to be sensitive to 
the segregation measure chosen. Parallel ‘analyses 
to the ones reported were performed using a 
measure of the exposure of whites to blacks, now 
known as a P* measure (e.g., Lieberson and Carter ` 
1982; Massey and Denton 1988), and the variance 
measure 5 preferred by sonde (e.g., Coleman et al. 
1975), transformed in the manner described for D . 
in equation 1 below. No significant differences in. 
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tion is usually the most important determinant 
of school segregation. The measure used here 
is the number of black students divided by the 
combined number of black and white stu- 
dents.!7 Size is indicated by the natural 
logarithm of the number of black students. 
Separate black proportion and size scores 
were computed for cities and suburbs in 1968 
and 1976. 

The availability of private schools is 
indicated by the number of white students 
enrolled in nonpublic schools expressed as a 
proportion of all white students attending 
public and nonpublic schools according to the 
1970 census. Separate scores were calculated 
for cities and suburbs.!8- Local-government 


the conclusions were required. D typically pro- 
duces empirical results more like those of other 
alternative measures than S does (James and 
Taeuber 1985), and is preferred because of ease of 
interpretation and the continuity provided with 
previous research (Massey and Denton 1988). It is 
unlikely that choosing a different measure would 
lead to substantially different conclusions. 

17 One reader suggested that the exposure of 
whites to blacks would have been superior to black 
proportions as a measure of the threat to whites, 
but the exposure measure acted more like an 
outcome of a desegregation process than a 
segregative force. The exposure of whites to blacks 
increased in those districts with the lowest 
segregation levels, but higher black proportions 
were associated with higher segregation levels. 
Both exposures and black proportions could not be 
included in the same equations because of 
multicollinearity problems. Black proportions were 
chosen to represent the threat to whites because it 
could not be manipulated by administrators, but 
exposures could be. 

8 Private schools might be attractive to whites 
for two reasons: the blacks who attend are from 
higher-class backgrounds and black proportions are 
smaller. The average black percentages for the 65 
- central cities in this study in 1976 were 12.7 
percent in nonpublic schools and 54.6 percent in 
public schools. The suburb black percentages were 
4.4 percent in nonpublic and 16.2 percent in public 
schools. All private-school systems had white 
majorities except the central-city system of Wash- 
ington, D.C., which was 54.3 percent black; 
Washington’s public schools were 96.4 percent 
black in 1976. Only three city private-school black 
percentages were greater than 30 percent: Washing- 
ton, Montgomery (31.8 percent), and San Diego 
(31.0 percent). Charlotte (31.0 percent) had the 
only suburb with private-school enrollments greater 
than 30 percent black. There were no black 
private-school students in the Greenville suburbs. 
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fiscal capacity is measured by per-capita local 
government revenue in 1970. Total revenue 
was used because neither school-district 
expenditures nor revenue was available. 

Black median individual income in 1970 
indicates average economic class standing of 
blacks. A number of. other inequality and 
economic class measures were evaluated, but: 
none had any effect net of black median 
income.!? Regional differences in the effects 
of a number of variables were expected 
because of the regional differences in racial 
politics before 1968. A dummy variable for 
region was multiplied by other independent 
variables to investigate regional interac- 
tions. Five were substantively different 
across regions and are reported below: city 
residential segregation in 1968, suburb me- 
dian income for blacks in 1968, the propor- 
tion of whites in city private schools in 1976, 
and the lagged dependent variable in both 
1976 city and suburb equations. 


TRENDS IN SEGREGATION LEVELS, 
1968 TO 1976 


Significant drops in public-school segregation 
occurred in central cities (25 points) and 
suburbs (19 points) between 1968 and 1976 
(Table 1). By 1976, city and suburb segrega- 
tion levels were equal (D — .54). 

Although average declines were substan- 
tial, they varied greatly across regions and 
across cities and suburbs within regions. 


19 The other class variables evaluated were 
white median income, differences in white and 
black median incomes, the ratio of black to white 
median income, white occupational status, black 
occupational status, racial differences and ratios of 
occupational status, and the proportion of black 
families that were low-income families. None had 
a substantively or statistically significant effect net 
of the other variables. They were excluded in order 
to reduce multicollinearity. 

? The District of Columbia and 17 states 
required segregated schools under penalty of law 
on the eve of Brown (U.S. Commission on Civil 
Rights 1959). All of these with the exception of 
Delaware are defined as "South." Wilmington was 
classified as a non-southern city because of its 
atypically low segregation score in 1968 (Table 1). 
Kansas, Arizona, and New Mexico allowed local 
enforcement of school segregation in 1954. All 
three were classified as non-South although a 
number: of school districts in those states were 
segregated by statute. 
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Southern city districts dropped an average of 
38 points, from .86 to .48. Southern suburbs 
dropped by 19 points to the same level of .48. 
Twenty-two of the 34 southern city districts 
and 14 of the suburban systems dropped by 
30 points or more. 

Seven of the 31 northern city districts 
dropped 30 points, but none of the northern 
suburban systems experienced drops this 
large. Declines were offset by increases in 
other districts so that, on average, northern 
systems dropped only 11 points to levels of 
.61 and .60 in cities and suburbs respectively 
(Table 1). In sum, southern city and suburban 
systems were less segregated than their 
northern counterparts in 1976 although they 
had been substantially more segregated in 
1968. 

Some systems experienced very little deseg- 
regation. In the South,’ Birmingham, St. 
Louis, Kansas City, Missouri, and Houston 
were comparatively successful in resisting 
change. Segregation in the central-city district 
of Washington, D.C., actually increased by 7 
points to D = .86. Los Angeles, Chicago, 
Gary, Newark, Cleveland, and Philadelphia 
were highly segregated with segregation 
scores greater than .80 throughout the period. 
Eleven northern city districts were more 
segregated in 1976 than in 1968 (Table 1) and 
the suburbs of two of these (Hartford and 
New York) were also more segregated. The 
greatest segregation declines in city systems 
occurred in districts subject to federal court 
orders (Table 1). The greatest segregation 
declines within suburban systems occurred in 
the South because federal pressure to deseg- 
regate was greater there. 

Racial segregation between central-city 
districts and their suburban rings actually 
increased between 1968 and 1976 (Table 2). 
The difference between city and suburb black 
proportions increased by 10 points (from .23 
to .33) in the South. The increase in the North 
was smaller (6 points), but the 1976 differ- 
ence of .43 was still greater than the southern 
difference. The average central-city district 
had a black majority by 1976, whereas the 
average suburban system had a white majority 
of more than 70 percent.?! The suburban 


?! The differences between city and suburban 
systems were actually larger than the values 
presented in Table 2 because white majority 
systems with small black enrollments were less 
likely to have been selected into either the 1968 or 
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public-school systems provided a potential 
escape route for whites who wanted to avoid 
contact, with blacks throughout the 1968 to 
1976 period.7? The boundaries between central- 
city and suburb school systems are buffers 
that permit, and perhaps encourage, increas- 
ing segregation between the two. 


PUBLIC-SCHOOL SEGREGATION 
DETERMINANTS IN CITIES 
AND SUBURBS 


Segregation measures vary between 0 (com- 
plete integration) and 1 (complete segrega- 
tion) and are therefore unsuitable for use as 
dependent variables in regression analyses. 
The dependent variable in the following 
analyses is the logit transformation of the 
index of dissimilarity, D, 


L= iIn[D/(1 — DJ, (1) 


where “In” implies natural logarithms. L 
indicates the degree to which blacks and 
whites are unequally distributed to the schools 
of a city or suburban system; positive values 
signify high levels of segregation, large 
negative values indicate low segregation 
levels.?? L incorporates the desirable feature 


1976 suburb samples. The differences of Table 2 
should be evaluated as lower bounds on the actual 
differences that existed in 1968 and 1976. 

2 Evidence that desegregation of central-city 
districts increased school segregation between 
cities and the suburbs was weak at best in these 
data. Two-stage least squares regression analyses 
of the differences between city and suburb. black 
proportions indicated that 1968 differences were 
the most important determinants of 1976 differ- 
ences. Federal court intervention in a central-city 
district may have increased the 1976 difference by 
about two points, but the increase was not 
Statistically significant in any of the models 
examined. A simple three-way ANOVA of region 
by period by federal-court action agreed with the 
regression analyses. The long-term difference in 
white and black suburbanization rates may not 
have been greatly affected by public-school 
desegregation of central-city districts. (Analysis 
results available upon request.) : 

? Because D can be interpreted as the propor- 
tion of black (or white) pupils that must be 
transferred to completely desegregate a system if 
only one race is to be reassigned (Duncan and 
Duncan 1955, p. 211), L is the log of the odds of 
selecting a particular black (or white) pupil under a 
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Table 2. Central-City and Suburb School Black Proportions in 1968 and 1976 for 65 SMSAs 


Southern SMSAs (N = 34) 








YEAR CENTRAL-CITY 

1968 42 

1976 53 

Northern SMSAs (N = 31) 

YEAR CENTRAL-CITY 

1968 42 

1976 55) 
Source: Table 1. 


of diminishing returns to segregative (or 
desegregative) pressures as the levels of 
segregation get large (or small) and avoids the 
ceiling and floor effect problem encountered 
by D. 


Determinants of 1968 Segregation Levels 


The model for 1968 central-city segregation 
levels based upon Figure 1 is 


Lci96s = fc, Xs), (2) 


where L,196g is the 1968 city dependent 
variable, and the X’s are sets of predeter- 
mined variables specific to central cities (C) 
and suburbs (S). The 1968 suburb model is 
identical to (2) except that the subscripts for 
city and suburbs are exchanged. Table 3 
contains ordinary least squares estimates of 
the central-city and suburb equations for 
1968. 

Three of the seven variables in central-city 
equation 1 produced increases in public- 


uniracial reassignment plan for complete desegre- 
gation. Transforming D in this manner avoids the 
problem for regression analysis that D is bounded 
by 0 and 1. See Massey and Denton (1987) for 
examples of the use of the logit of D in this 
manner. Some critics argue that D is an unsuitable 
dependent variable because it measures departures 
from the baseline of complete integration rather 
than from the level expected under random pupil 
assignments (Cortese, Falk, and Cohen 1976; 
Winship 1977, 1978; but see Massey and Denton 
1988). Choosing a different baseline to represent 
the condition of no segregation should have little 
effect on the results obtained here because none of 
the public school systems had small numbers of 
schools nor D values near the limits of 0 and 1, the 
situations where different baselines produce the 
greatest differences in measured levels of segrega- 
tion. 


SUBURB Difference 
.19 .23 
.20 33 
SUBURB Difference 
05 37 
12 43 


school segregation levels. Larger numbers of 
black students in the city, higher black 
proportions in the suburbs, and higher levels 
of city residential segregation were related to 
higher levels of segregation in the city as 
expected. Higher per-capita government rev- 
enue was associated with lower segregation 
levels as was northern location. Black median 
income had an insignificantly small effect in 
contrast to the large positive impact reported 
by Farley (1975, p. 17). Only one substantial 
North-South difference was discovered. Cen- 
tral-city equation 2 of Table 3 contains an 
interaction (line 4) between the region 
dummy and the residential segregation vari- 
able. The impact of residential segregation 
was over four times as large in the North as in 
the South. Residential segregation was the 
most important determinant of school segre- 
gation in the North, whereas southern districts 
tended to rely upon de jure school segrega- 
tion. 

Adding the region interaction with residen- 
tial segregation eliminated the effect of black 
population size, but the city black proportion 
coefficient became large and positive. The 
effect of northern location was large and 
negative. The effects of other variables 
changed little. Thus, the most important 
determinants of city segregation levels in 
1968 were the proportion of blacks in the city 
district, the proportion of blacks in the 
suburbs, and the level of residential segrega- 
tion. Cities with higher fiscal capacities had 
lower levels of segregation and northern 
districts were less segregated than southern 
ones net of the other factors. The size of black 
populations and black median incomes had 
little or no effect. 

The determinants of suburb segregation 
levels in 1968 were similar to the city 
determinants with two major exceptions. 
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Table 3. Determinants of 1968 Central-City and Suburb Public-School Segregation Levels for 65. SMSAs 
a) (2) 
Central-City Equations b t b t 
1. City black public-school population proportion in 1968 53 1.19 1.19* 3.90 
2. City black public-school population size in 1968 (log e) .17* 1.94 —.00 —.01 
3. City residential segregation in 1970 2.40* 4.37 1.42* 2.92 
4. Interaction: (variable 3) X (North=1) 4.70* 4.71 
5. City government revenues in 1970 per capita ($) —.14* —2.98 —.14* —4.15 
6. City black median income in 1969 ($1000) .02 .13 .12 .94 
7. Suburb black public-school population proportion in 1968 2.30* 3.47 2.08* 3.52 
8. Suburb black public-school population size in 1968 (log e) —.07 —1.29 —.04 —.99 
9. Region dummy variable (North = 1) —.59* —3.83 — 4.30* —5.54 
Constant — 1.36* ~2.01 45 60 
R 72 19 
(1) © Q) 
Suburb Equations b t b t 
1. Suburb black public-school population proportion in 1968 4.77* 4.54 2.96* 3.17 
2. Suburb black public-school population size in 1968 (log e) — .08 ~ 85 — .04 — .58 
3. Suburb residential segregation in 1970 1.54* 2.59 1.00 1.41 
4, Interaction: (variable 3) x (North= 1) 3.44* 3.66 
5. Suburb government revenues in 1970 per capita ($) .18* 2.38 .15* 2.50 
6. Suburb black median income in 1969 ($1000) —.15 — 1.04 09 1.15 
7. Interaction: (variable 6) X (South = 1) — .69* —2.58 
8. City black public-school population proportion in 1968 .85 1.50 .90* 2.75 
9. City black public-school population size in 1968 (log e) 14 1.26 .05 69 
10. Region dummy variable (North = 1) .10 47 —3.82* —4.32 
Constant -197* -204 — —.00 —.01 
R 52 61 


* p <.05, one-tailed test. 
Source: Same as Table 1. 


Note: All models were estimated by ordinary least squares. The model 2 city and suburb t-ratios were calculated 
after correcting standard errors for heteroskedasticity because of positive Breusch-Pagan tests (Judge et al. 1985, pp. 


446-47). : 


First, suburb government revenue was associ- 
ated with more, rather than less segregation. 
This was expected because suburban systems 
were fragmented politically. Higher govern- 
ment revenue probably reflects the presence 
of relatively wealthy enclaves that attract 
whites disproportionately because of their 
greater affluence. Second, black median 
income was associated with lower suburb 
segregation levels, but only in the South 
(suburb equation 2 of Table 3, line 7). Farley 
(1975), found strong negative effects of black 
income in the cities of both regions, but data 
problems may have confused city with suburb 
effects (see note 12). Farley did not analyze 
suburban segregation. Residential segregation 
had a stronger segregative effect in northern 
suburbs as it did in the city (line 4). Both city 
and suburb black proportions had segregative 
effects on suburban schools. 


These results are consistent with the claim 
that racial proportions (but not black popula- 
tion size) served as linkages between formally 
independent, but adjacent school systems. 
Higher black proportions in the suburbs were 
associated with greater school segregation in 
the city. Similarly, higher black proportions 
in the city produced higher segregation levels 
in the suburbs.?^ During the period of weak 
desegregative pressure before 1968, higher 


black proportions appeared to encourage 
segregative effects within school systems. 


24 The effects of an adjacent system's racial 
proportions tended to be larger in the North than in 
the South, but the regional differences were not 
statistically or substantially different (results not 
reported). 
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Determinants of Changes in City 
Segregation Levels, 1968 to 1976 


The model for changes in the segregation 
levels of central city districts between 1968 
and 1976 is : 


Lcis;6 = f(Lcises. Xo, Xs), 03) 


with variables defined as in (2). An analogous 
model applies to the suburbs. Table 4 
contains three versions of model 3 for the 
1976 city districts; all were estimated by 
two-stage least squares.25 

‘ Equation 1 of Table 4 is the basic additive 
model of segregation change for the central- 
city district. The signs of most coefficients 
are in the expected direction, but the equation 
appears deficient for two reasons. First, the 
dynamic component of the model appears 
unstable because the coefficient of the lagged 
dependent variable is greater than one (b = 
1.17).26 Second, the city residential segrega- 
tion coefficient, though statistically insignifi- 
cant at conventional levels, is very large and 
negative implying that residential segregation 
facilitated public-school desegregation, an 
unlikely result. Investigation of the correlates 
of 1970 city residential segregation indicated 
that it was very strongly and positively related 
to 1968 segregation levels in the North (r = 
.82), but not in the South (r = .19). This 
strong, regionally specific collinearity prob- 
lem produced the anomalous coefficients of 


75 OLS estimates produce inconsistent estimates 
when lagged values of the dependent variable are 
included among the regressors. The two-stage least 
squares procedure produces consistent, although 
not efficient, estimates of the structural parame- 
ters. Further, it has the advantage over more 
efficient, full-information estimation procedures 
that specification errors in any one equation of a 
system are not communicated to the point- 
estimates in the other equations (Theil 1971, pp. 
528-29). 

26 If the point estimate of the lagged dependent 
variable coefficient b = 1.17 is accurate, the 
average value of D in central cities would have 
been driven rapidly toward 0 under equation | of 
Table 4 assuming the X’s remained constant. The 
value of D should have dropped to less than .10 by 
1988, an improbable outcome that did not occur 
{calculations available upon request). The coeffi- 
cient is not statistically different from 1.0 at 
conventional levels, however. Thus, the apparent 
instability of the dynamic component of. the 
equation may be due to sampling variability. - 
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equation 1, Table 4. Because the effects of 
residential segregation were being carried by 
the lagged dependent variable, the statistically 
insignificant residential segregation variable 
was deleted. Two region interactions were 
added.” They were highly collinear with the 
North dummy so it was also deleted from 
equation 2. 

Black population size and proportions in 
city districts had strong segregative effects as 
expected. Federal court intervention had a 
strong desegregative effect (b = — 1.14) as 
did higher per-capita local government reve- 
nue. Higher black median incomes had a 
small, statistically insignificant desegregative 
effect. 

The availability of private schools had a 
strong segregative effect, but only in the 
South. The effect for the South —the omitted 
category in the "region" dummy —is given by 
the coefficient of the proportion of white 
students in private schools (b = 1.72) in line 
7. 'The northern effect is obtained by adding 
the coefficient on line 7 to that of the 
interaction term with the North Dummy on 
line 8 (b = —2.01, which produces a 
negligible sum. This result was unexpected 
because private school systems were more 
available in the North and a higher proportion 
of northern white students attended private 
schools in both cities and suburbs (see Table 
1). Perhaps large-scale white flight to private 
Schools occurred only when desegregative 
actions were strong and effective. Most 
southern city districts, but only a few northern 
ones such as Boston, came under strong 
federal court desegregative pressure (See 
Table 1 and Rossell 1983, pp. 23-25).?3 


27 The possibility of region interactions with the 
other predetermined variables was investigated. 
The two interactions discussed here were the only 
ones that were statistically or substantively signif- 
icant. 

78 Most non-southern private schools were 
Catholic whereas most southern private schools 
were Protestant or secular. Many of the southern 
schools were started in order to escape public- 
schoo] desegregation. One reader suggested that 
the regional differences in the effects of private 
schools might be a function of regional differences 
in the proportion of Catholic schools. I think that 
this is unlikely. Catholic schools were used as 
white escape routes when Boston was under a 
strong desegregation order (Hannon 1982). Data 
limitations prevented further examination of this 
issue. 
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Table 4. Determinants of Change in Central-City Public-School Segregation Levels in 65 SMSAs between 1968 and 
976 l 





Central-City Equations 











(1) (2) (3) 





b. t b t b t 
1. City black public-school population proportion in 1976 1.04" 1.80 1.80* 3.95 1.64* 3.91 
2. City black public-school population size in 1976 (log e) 13 | (Bl 28* 21 .23* 245 
3. City residential segregation in 1970 —2.44 -—1.59 
4. City government revenues.in 1970 per capita ($) —.03  —.35 —.11* —1.70 —.15* —3.06 
5. City black median income in 1969 ($1000) . 401 06 —.20 —.95 
6. Federal court action against city district (YES = 1) —1.37* —6.63 —1.14* —6.58 —1.08* —7.30 
7. City proportion of white students in private schools in 1970 1.87* 1.64 1.72* 1.66 2.22 2.26 
8. Interaction: (variable 7) x (North= 1) —2.01* —2.05 —2.39* -—2.87 
9. Suburb black public-school population proportion in 1976 —2.60* —2.32 —.75 | —.94 
10. Suburb black public-school population size in 1976 (log e) .04 48 -.03 —-.37 
11. Region dummy variable (North=1) : .40 1.34 d 
12. City public-school segregation level in 1968 1.17* 2,10 .03 12 
13. Interaction: (variable 12) x (North= W 40* 2.34 39* 2.51 
Constant — 1.48 —2.81 








* p <.05, one-tailed test. 
Source: Same as Table 1. 








—117 —2.30* —2.64 —2.44* 


Note: All models were estimated by two-stage least squares. The F-test comparing model 3 to model 2 is not 


statistically significant; F (4,53 df) = .759, p = .56.. 


On the other hand, the linkages to 1968 
segregation levels and their determinants were 
broken in the South (b = .03), but remained 
strong and significant in the North (b = .40). 
For example, northern residential segregation 
had a segregative effect on 1976 city districts 
via 1968 school segregation levels (see Table 
3). Multiplying the northern residential segre- 
gation coefficient of Table 3 times the 
coefficient of the 1968 segregation level (line 
13 of Table 4) suggests that the impact of 
residential segregation was still strong ([1.42 
+ 4.70] times: .40 = 2.45). By contrast, the 
weaker link between residential segregation 
and school segregation in southern central 
cities was severed. Perhaps residential segre- 
gation coupled with weak desegregative 
pressure in the North made the use of private 
schools to avoids blacks less necessary than it 
was in the South. 

The dynamic component of equation 2 is 
more reasonable than -that of equation, 1. 
Given the values of the other variables in the 
equation and ‘a coefficient for the lagged 
dependent variable of .40, the model implies 
an equilibrium D value of about .50 for 
northern central-cities.29 This is a moderate 


2 The equilibrium value of logit (D) = a'x/(1 
— b), where b is the value of the coefficient for the 
lagged dependent variable (the logit of D in 1968), 
0 < b < 1, a is.the vector of coefficients of 


predicted drop from the 1976 value of .61 for 
northern city schools and suggests that great 


declines in central-city school segregation are 


unlikely without further desegregative inter- 
ventions. 

Higher suburban black ` proportions no 
longer had direct segregative effects on the: 
city as they had on levels in 1968. The 
equation 2 coefficient of Suburb Black 
Proportion was negative, but of modest size 
and nonsignificant (line 9, b =  —.75). 
Equation 3 omits all variables in equation 2 
with r-ratios less than 1, which produces little 
changé in the remaining coefficients and an 
insignificant F-test for omitted variables (see 
Table 4 note). Suburb black population size 
was also insignificant. Of course, northern 
suburb black proportion in 1968 had an 
indirect segregative effect via 1968 city 
segregation levels (2.08 x .40 = .83) as was 
the case with residential segregation. 

The’ determinants of central-city school 
segregation were apparently, altered dramati- 
cally by federal intervention between 1968 
and 1976. The link with past segregation was 
broken in the South, but not in the North. The 
direct segregative effect of suburban black 
proportions was broken, but a modest indirect . 





equation 2 and x is a vector of independent variable 
means. Substituting a’x =: —.0077 and b = .436 
produces a solution for D of .497. 
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link survived in the North. Residential 
segregation also appeared to have strong 
segregative effects in the North, but not in the 
South. On the other hand, private schools had 
a strong $egregative effect in the South, but 
not in the North. The stronger desegregative 
actions focused on southern city districts 
probably reduced the effectiveness of residen- 
tial segregation in maintaining segregated 


schools. As residential location lost its power . 


to segregate public-school students, southern 


whites appeared to use private schools for. 


segregative purposes more often than northern 
whites did. Thus, private schools may be a 
functional alternative to residential segrega- 
tion. 


t 


Determinants of Changes in Suburb 
Segregation Levels, 1968 to 1976 


' Table 5 contains estimates of the determinants 


of school-segregation change for the suburbs 
and they differ from those in the city. 
Government revenue and black median in- 
come appeared to have no effect on suburb 
* School segregation (equation 1). The availabil- 
ity of private schools also had no effect (the 
coefficient was modest in size, but in the 
opposite direction expected: b = — .56). City 
black proportions had a strong segregative 
effect on 1968 suburb segregation levels 
(Table 3), but no effect (b = .09) on change 
between 1968 and 1976. 

Residential segregation had a strong segre- 
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gative effect (b = 1.78) as did past 
segregation levels (b = .37). Not only did 
residential segregation levels appear to retard 
the desegregative process between 1968 and 
1976, but they also slowed the desegregative 
process via their impact on 1968 school 
segregation levels. For example, the indirect 
effect of a ten-point increase in residential 
segregation in northern suburbs was .16, 
which nearly equaled the .18 direct effect. 
The indirect effect of residential segregation 
was smaller in the South, but still important. 
The direct school segregative effect of 
residential segregation appeared to be the 
same across regions. City black population 
size and northern region also produced 
segregative effects, but neither reached statis- 
tical significance in equation 1. 

More troubling was the large negative 
effect of Suburb Black Population Size (b = 
— 1.79). This result seemed so anomalous 
that I investigated it in more detail. It appears 
to be a result of a selection mechanism rather 
than an unusual causal mechanism operating 
only in the suburbs of metropolitan areas. 
Black proportions and segregation levels 
could have been negatively related if greater 
desegregative pressure was focused on subur- 
ban districts with higher black proportions 
than those with lower proportions. ` 

To investigate this possibility, I included 
an interaction term with a dummy variable 
that equaled 1 if black proportion was greater 
than .16, the mean level for the suburbs, and 


Table 5. Determinants of Change in Suburb Public-School Segregation Levels in 65 SMSAs between 1968 and 1976 











Suburb Equations 


1) (2) (3) 








“1. Suburb black public-school population proportion in 1976 
2. Interaction: (variable 1) X (dummy = 1 if variable 1 > .16) 


—1.79 —1.58 1.18 .64  2.24* 1.56 


—2.30* —1.99 —2.56* —2.36 


3. Suburb black public-school population size in 1976 (log e) .09 1.12  .06 .87 
4. Suburb residential segregation in 1970 1.78* 3.42 .1.58* 3.05 1.89* 4.64 
5. Suburb government revenues in 1970 per capita ($) —.01  -.13 .01 ll 
6. Suburb black median income in 1969 ($1000) .02 5 .04 38 
7. Suburb proportion of white students in private schools in 1970 —.56 -.52 —.20 —.21 
8. City black public-school population proportion in 1976 .09 23 .05 14 
9. City black public-school population size in 1976 (log e) 12 1.31 10 1.14 .15* 2.46 
10. Region dummy variable (North = 1) 25 1.44 i 
11. Suburb public-school segregation level in 1968 .37* 1.82 .28 1.54 .20* 1.78 
12. Interaction: (variable 11) x (North = 1) .25* 172  .23* 2.04 
Constant —3.29* —4.81 —2.89* —4.01 —3.02* —4.85 





* p <.05, one-tailed test. 
Source: Same as Table 1. 





Note: All models were estimated by two-stage least squares. The F-test comparing model 3 to model 2 is not 


statistically significant; F (5,53 df) = .315, p = .90. 
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0 otherwise (see lines 1 and 2 of equations 2 
and 3). Higher black proportions appear to be 
associated with higher segregation levels if 
proportions are low. When black proportion 
is greater than .16, however, the impact drops 
to zero or becomes modestly negative and 
insignificant.” These results are consistent 
with the presence of a selection mechanism. 
Higher black proportions still retarded deseg- 
regative efforts when proportions were low, 
but higher proportions might have attracted 
externally imposed desegregative | pressure 
which broke the link between higher black 
proportions and segregation level changes in 
the suburbs. 

An interaction term was added to equation 
2 to determine whether there was a North- 
South difference in the effects of past levels 
of segregation (see lines 12 and 13).?! The 
region dummy (line 11) was dropped because 
it was collinear with the region interaction. 
The effect of past segregation was stronger in 
northern suburbs than in southern ones, but 
the segregative impact of past segregation 
levels in the South did nor disappear as it did 
in southern cities. This was expected because 
between-district segregation in the suburbs 
would remain even if all suburban districts 
were desegregated within their boundaries. 

Dropping those variables that appeared to 
have no impact on suburban segregation 
levels (t < 1.0) produced the results in 
equation 3. The change in suburb segregation 
levels was still linked to the racial demogra- 
phy of the central-city district as it was to 
levels in 1968. Although higher black propor- 
tions had no discernible effect, larger black 
city populations were associated with higher 
segregation levels in the suburbs.. Federal 





?? The coefficient of Black Proportions (BP) in 
line 1 of equation 2 (b = 1.18) indicates a positive 
effect on segregation for BP « .16. The effect of 
BP when BP > .16, is negative for high 
proportions (1.18 [line 1] — 2.30 [line 2] = 


—1.12). The total effect of -—1.12 ‘is not. 


statistically different than zero (f = -—]1.02). 
When nonsignificant terms are omitted, both 
individual coefficients become strong and signifi- 
cant (equation 3, lines 1 and 2), but their effects 
almost cancel for high proportion (b — 2.24 — 
2.56 = —.32). The total effect for BP > .16 
is not statistically different than 0 at the .05 level. 
?! Only significant interactions with the region 
dummy variable are reported. None of the other 
predetermined variables of Table 5 appeared to 
have different effects in the North and South. 


were completely desegregated, 
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intervention in (ibis suburbs was not strong 
enough to break the ties with the past or with 
the black population characteristics of the 
city. City-suburb boundaries protected racial 
inequalities in the suburbs. As many suggest, 
the suburbs may have provided attractive 
enclaves to whites during a period of strong 
desegregative pressure 'in the city (e. 8- i 
Orfield 1978; Rossell 1983). 


CONCLUSION 


Local-state fragmentation is an important struc- 
tural determinant of racial inequalities in public’ 
schooling in metropolitan areas of the Untied 
States. Many have noted that the boundaries 
between central-city districts and suburban’ 
school systems permit increasing segregation 
between systems. This study is no exception. 
Average segregation between cities and sub- 
urbs increased for the 65 SMSAs analyzed be- 
tween 1968 and 1976. Less obvious is the pos- 
sibility that boundaries between school systems. 
exacerbate segregation within systems. In 1968, 
higher black proportions in the suburbs pro- 
duced more segregation in the city and higher 
black proportions in the city produced more 
segregation in the suburbs. These results are - 
consistent with the proposition that whites acted 
in ways that increased school segregation when 
they were confronted with modest desegrega- 
tive pressure and unattractive escape routes to 
adjacent systems with higher black propor- 
tions. Before 1968, "freedom of choice" de- 
segregation programs provided some maneu- 
vering room for whites who were opposed to 
desegregation. 

Federal desegregative actions between 1968 
and 1976 were much stronger. Mandatory pupil 
reassignments replaced freedom of choice plans 
as the principal desegregative tool, but the tool 
was applied unevenly across regions. The ef-: 
fort to dismantle southern de jure segregation 
was stronger and affected more districts than 
the pressure to reverse northern de facto segre- 
gation. As a result, segregation dropped by 
greater. amounts to lower levels in the South 
than in the North. Further, the pattern of seg- . 
regation determinants in the cities and suburbs 
of both regions was altered. 

Whites in suburban areas of the 65 SMSAs 
in this study retained some, capacity to avoid 
schools with high black proportions after 
1968 because the suburbs were politically 
fragmented. Even if each suburban district 
significant 
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segregation between districts of suburban 
systems would still exist. The link between 
city racial proportions and suburb segregation 
levels survived the post-1968 federal interven- 
tion, perhaps because of suburb fragmenta- 
tion. Larger black populations in the city were 
associated with smaller segregation declines 
in the suburbs. Thus, suburb desegregative 
pressure was not sufficient to uncouple the 
suburbs from city segregative effects. The 
link with past segregation levels also survived 
in the suburbs of both regions. The effect of 
past segregation was twice as strong in the 
North as the South, which is consistent with 
the stronger desegregative pressure focused 
on the South. Finally, segregated neighbor- 
hoods hindered school desegregation in the 
suburbs, but not in the cities. 

The patterns in the city-districts after 1968 
were radically different. Desegregative pres- 
sure on cities was strong enough to break 
their links to the suburbs. Suburban racial 
characteristics no longer affected central-city 
segregation trends. Further, the historic ties 
with past school segregation levels and 
current residential segregation patterns were 
broken in southern central-city districts, but 
not northern ones, consistent with the stronger 
interventions in the South as compared to the 
North. Whites' utilization of private school- 
ing retarded central-city desegregation in the 
South, but not in the North. Private schools 
may be the last alternative to residential 
segregation when desegregative actions are 
strong enough to disrupt the bonds between 
residential and school segregation. - 

The patterns observed are representative of 
the 78 SMSAs with central-city school 
districts containing more than 10,000 black 
students in 1968. No apparent sample selec- 
tion bias was introduced by analyzing the 
subset of 65 SMSAs for which data were 
available. Of course, the specific patterns 
identified may not apply to smaller, less 
fragmented metropolitan areas with simpler 
systems of schools. 

The desegregation patterns observed in this 
study are strongly conditioned by the political 
boundaries that divide metropolitan schools 
into formally independent districts. District 
boundaries buffer the suburbs from strong 
desegregative actions focused on cities, but 
city district racial characteristics still tie the 
segregation trends of the suburbs to the city. 
Desegregative actions in the city may have 
the unintended consequences of fostering 
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greater racial inequalities within the suburbs 
as well as between the city and the suburbs. 
These patterns are consistent with class 
theories, which predict that local-state frag- 
mentation protects the class privileges of 
dominant classes and perpetuates inequalities. 
Institutionalist and market theories, which 
identify some of the causes of urban inequal- 
ities, complement class theories. District bound- 
aries did prevent uniform policy implementa- 
tion across metropolitan areas, as institutionists 
argue, but not without the strong defense of 
those boundaries by suburban officials and in- 
terest groups who were reluctant to forfeit the 
privileges of urban locations (e.g., Orfield 
1978). Further, as market-model proponents 
argue, whites do appear to shop for other school 
opportunities if they find greater contact with 
blacks undesirable. The structure of public- 
and private-school systems in metropolitan ar- 
eas provides parents with a variety of options. 
Public systems vary greatly in economic re- 
sources, racial composition, and the extent to 
which school attendance zones are linked to 
neighborhood boundaries. If school integra- 
tion policies break the link between school and, 
residential segregation, whites may opt to en- 
ter a nearby public-school system. If all nearby 
public systems have large black proportions, 
private schools may provide an alternative for 
families with sufficient resources. f 
But class differences strongly condition the 
capacity of parents to act on their preferences. 
Many white parents do not have sufficient 
economic resources to place their children in 
private schools no matter how much they 
oppose contact with blacks. Few private 
schools were available in the South during the 
late 1960s and early 1970s and start-up costs 
were high. Most studies have found that 
affluent whites were more likely than poorer 
whites to withdraw from the public schools in 
response to desegregation (Rossell 1983). 
This study found that economic class 
standing had little or no direct effect on 
public-school segregation, but the indirect 
effects of class were strong. The fiscal 
capacities of school systems, which are 
directly related to the financial status of the 
communities they serve, had strong effects on 
school segregation in both cities and suburbs, 
Residential segregation is also an indirect 
indicator of class: because it too is highly 
related to income and wealth distributions. 
When blacks move to the suburbs, they tend 
to concentrate in areas close to the central-city 
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that have the highest tax rates, the weakest 
tax base, and the most difficulty in providing 
adequate public services (Logan and Schnei- 
der 1984).: To the extent that residential 
segregation is linked to between-district 
school segregation, the fragmentation of 
metropolitan public-school systems serves the 
interests of the more affluent whites who can 
take advantage of it as class theories suggest. 
The fact that the national school-busing 
controversy abated once it became clear that 
desegregation of central-city districts would 
be confined to the boundaries of those 
districts, and that large numbers of black 
students would not be transported to the 
suburbs except in rare instances, is also 
consistent with this view (Orfield 1978). 

The class model incorporates the institution- 
alist and market models to explain how 
system inequalities are generated. Multiple 
local state jurisdictions do impede policy 
implementation and provide citizens with 
choices. But unequal resources among citi- 
zens produce unequal choices. Recent re- 
search on the problems of the urban under- 
class suggests that metropolitan residential 
segregation, school segregation, and poverty 
are more closely related than ever before 
(e.g., W. Wilson 1987; Danziger and Wein- 
berg 1986). This study indicates that local 
state boundaries foster racial and economic 
inequalities in the consumption of public 
schooling when they split metropolitan areas 
into autonomous units. Ostensibly neutral 
State structures can facilitate unequal policy 
outcomes. Favored classes and racial groups 
can substitute private means for public goods. 
The less privileged have fewer options. 
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CITY SIZE, ECONOMIC DEVELOPMENT, AND QUALITY OF 
LIFE IN CHINA: NEW EMPIRICAL EVIDENCE* 


York W. BRADSHAW 
Indiana University 


ELvis FRASER 
The Ohio State University 


The dearth of case studies on Third World urbanization and city size is unfortunate 
because, in contrast to cross-national research, these studies can explain special 
patterns of development. This study utilizes the recently published Population Atlas 
of China to analyze the relationship among city size, economic development, and 


: physical quality of life. These data are unusually complete, covering most of 


China's counties, districts, cities, and other administrative units (N — 2,306). The 
data analysis produces three major results. First, all-sized cities enjoy higher 
levels of economic development and physical quality of life relative to nonurban 
areas. Second, externally oriented coastal cities possess higher levels of economic 
development but not a better quality of life. Third, education and industrial 
employment contribute to both economic development and physical quality of life. 
Overall, these findings support modernization theories and some aspects of the 


dependency/world-systems perspective. 


INTRODUCTION 


At the beginning of their book Urban Life in 
Contemporary China, Whyte and Parish 
(1984, pp. 2—3) speculate on the potentially 
exceptional character of Chinese urbanization: 


Cities [in China] had apparently not grown much 
in relative terms in spite of considerable 
economic development, slums and squatter 
settlements seemed absent, conspicuous consump- 
tion and foreign-oriented life styles were not 
visible, a high degree of economic equality and 
security seemed to prevail, unemployment 


seemed absent, close-knit neighborhoods and 


families seemed to persist, and crime, drug 
addiction, prostitution, and other forms of 
deviance seemed minor or nonexistent. Did 
China perhaps hold a lesson of how the benefits 
of urbanism could be enjoyed without the ills? 
Might there be features of the ‘Chinese model’ 
of urbanism that could be borrowed by the West 
or by the Third World to help them cope with 
their urban problems? 


Although China's urban areas are certainly 
not free of problems (see Whyte 1988), they 
do exhibit unusual characteristics that deviate 
from many underdeveloped countries. 

In contrast to countries plagued by "over- 


* We would like to thank Krishnan Namboodiri, 
Charles Ragin, Michael Wallace, and three ASR 
reviewers for very helpful comments .on earlier 
versions of this paper. À previous draft was 
"presented at the 1989 meetings of the Midwest 
Sociological Society in St. Louis. 
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urbanization" and other structural imbalances 
(see Kentor 1981; Timberlake and Kentor 
1983; Bradshaw 1985, 1987; London 1987; 
London and Smith 1988), China may possess 
an urban system that promotes economic 
development and physical quality of life. This 
assertion receives some support in Table 1, 
which reports findings for several variables 
related to these concerns. Economic develop- 
ment is measured as per capita income (in 
Chinese yuan), and quality of life is measured 
as infant mortality (number of infant deaths 
per 1,000 births), death rate (number of 
deaths per 1,000 population), and illiteracy 
(percentage of population aged 12 and over 
that is illiterate or semiliterate). These 
variables are reported for seven different 
categories, ranging from nonurban areas to 
extremely large (metropolis) cities.! The data 
exhibit a clear pattern showing that, com- 
pared to nonurban areas, cities have higher 
per capita incomes and lower rates of infant 
mortality, total death, and illiteracy. More- 
over, the largest cities— metropolises —have 
the highest per capita incomes and the lowest 


! As noted below, this data set is unusually 
complete. It contains data on 2,378 cases at the 
local level, that is, at the level of the administra- 
tive unit (including counties, cities, etc.). The only 
missing data include 72 cases for infant mortality. 
Because this variable is used throughout the 
analysis, we restrict our sample to the 2,306 cases 
that have complete data for each variable (see 
below for a detailed discussion of the data). 
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Table 1. Descriptive Data on Economic Development and Quality of Life in Different Regions of China, 1981-1982 


(1) 


Economic 

Sample Development 
Full sample (n = 2,306) 

mean 666.49 

st. dev. 965.05 
Nonurban (n = 2,076) 

mean 501.71 

st. dev. 722.00 + 
All urban (n = 230) . 

mean 2153.82 ` 

st. dev. 1477.44 
Small urban (n = 64) 

mean 1546.44 

st. dev. 1248.81 
Medium urban (n = 83) 

mean 2269.63 

st. dev. 1325.90 
Large urban (n — 47) 

mean 2339.06 

st. dev. 1867.62 
Metropolis (n = 36) 

mean 2724.15 

st. dev. 1297.69 





987. 
Variables 
Q) (3) 
Infant Death (4) 
Mortality Rate Illiteracy 
39.00 6.76 34.14 
28.98 2.16 13.88 
40.87 6.93 36.00 
29.45. 2.15 13.20 
22.14 5.15 17.33 
16.73 . 1.37 6.70 
26.44 5.08 17.33 
22.70 1.95 6.88 
21.95 5.17 17.70 
17.28 1.19 6.32 
20.06 5.02 18.03 
835 . 1.06 . 6.38 
17.64 548 15.52 
7.24 0.77 7.54 


Notes: Nonurban = all administrative units not listed as cities of at least 50,000 people; All urban = all urban 
areas that have a population of at least 50,000; Small urban = urban areas between 50,000 and 199,999; Medium 
urban = urban areas between 200,000 and 499,999; Large urban = urban areas between 500,000 and 999,999; 


Metropolis = urban areas of at least 1 million people. 
Source: People's Republic of China 1987. 


infant mortality and illiteracy rates. These 
descriptive data suggest that urbanization 
(especially on a large scale) is a positive 
feature in China, leading to higher levels of 


. personal wealth and a better quality of life.? 


This paper continues to investigate the 
special character of Chinese urban life by 
examining the relationship among cities, 


? China has a long history of large urban areas. 
In the eighth century it became the first country 
with a city of over 1 million people (Goldstein 
1988, p. 187). By 1982, China had 236 cities; 38 
of these were over 1 million and 8 exceeded 2.5 
million (Goldstein 1988, p. 208). The largest city, 
Shanghai, had an urban population of 6.3 million 
and a metropolitan population of 11.8 million 
(Chen 1988, p. 233), making it the fifth largest 
metropolis in the world behind Mexico City, 
Tokyo, São Paulo, and New York (Dogan and 
Kasarda 1988, p. 13). Despite many large cities, 
however, China remains a highly underdeveloped 
and predominantly rural society. The urban 
population of 211 million people represents only 
20.8 percent of the total population of over i 
billion people. This percentage has nearly doubled 
since 1949 (Goldstein 1988, p. 201). 


economic development, and physical quality 
of life. The first part discusses several general 
theories of Third World urbanization and then 
situates the Chinese case within these perspec- 
tives. While China partially supports several 
theoretical orientations, it also possesses 
some unique characteristics that defy any 
Western-style theory. To test various theo- 
ries, the second part of the paper utilizes 
recently published data to analyze the role of 
different-sized cities in the development 
process. These data are unusually complete, 
covering most of China’s counties, districts, 
cities, and other administrative units (V = 
2,306). In fact, this data set contains the most 
complete aggregate population data ever 
published on China. The analysis builds on 
several excellent studies of Chinese urban 
life, including (1) descriptive and analytical 
papers (e.g., Goldstein 1988; Chen 1988), 
and (2) quantitative analyses based on individ- 
ual-level data gathered from interviews and 
surveys (e.g., Whyte and Parish 1984; Lin 
and Xie 1988). Finally, the paper concludes 
by comparing urbanization in China to 
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urbanization in other underdeveloped and 
socialist states. 


CITY SIZE, ECONOMIC 
DEVELOPMENT, AND 
QUALITY OF LIFE: GENERAL 
THEORETICAL ARGUMENTS 


Although most theories of urbanization focus 
on international variation in city size and 
urban patterns, they:can also be useful guides 
‘in case study research. For example, recent 
research has used general theories to examine 
urban primacy and/or urban hierarchies in 
Guatemala (Smith 1985b), Australia (Clark 
1985), South Korea and the Philippines 
(Nemeth and Smith 1985), and Thailand 
(London 1985). Thus, to help explore the 
special character of Chinese policies regard- 
ing modernization. and city size, it is 
important to outline conventional theories 
concerning these issues. After examining 
modernization and dependency/world-system 
arguments, we study the Chinese case and 
assess its conformity to such perspectives. 
According to modernization theory, eco- 
nomic and social development is a unilinear 
and evolutionary process. Poor countries 
should follow the path established by devel- 
oped nations, moving as rapidly as possible 
from "traditional" agricultural societies to 
“modern” urban-based economies (see Kelley 


and Williamson 1984). In contrast to scholars. 


who emphasize the "parasitic" nature of 
Third World urbanization, traditional módern- 
ization theory argues that urban areas— 
particularly large urban areas—have a benefi- 
cial effect on national economic development 
(Mehta 1964; Mera 1973; see also Berliner 
1977; Walters 1985). Moreover, urban areas— 
particularly large urban areas— possess two 
other characteristics relevant to life in devel- 
oping countries. First, larger cities have 
higher average incomes than other areas of 
the country, especially nonurban regions. 
Second, all-sized cities have a superior 
quality of life relative to nonurban areas. 
Berry and Kasarda (1977, p. 373) make this 
argument for Third World urbanization: 
"Whereas the new industrial cities in the 
West were death traps, the cities of the Third 
World are usually more healthful than their 
rural hinterlands and are almost as healthful 
as cities ‘in the most advanced countries" 
(emphasis .added). The authors stress urban 
improvements in health (including "the mirac- 
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.ulous fall in mortality"), welfare, housing, 


and other features related to quality of life. 

In addition to the structural improvements 
created by burgeoning urbanization, this 
process will, according to modernization 
theory, facilitate the development of certain 
values and. attitudes deemed necessary for 
economic development (Inkeles and Smith 
1974; see also Delacroix and Ragin 1978). 
Inkeles and Smith argue that "modern" 
individuals are created by “modernizing 
institutions” like the school, the factory, and 
the mass media. These institutions are 
disproportionately located in ‘Third World: 
cities and therefore it is important for 
underdeveloped areas to urbanize as rapidly 
as possible. Following exposure to these 
institutions, Third World citizens supposedly 
will possess the requisite qualities for large- 
scale national development. Perhaps most: 
important, they will be more productive and 
better educated, characteristics associated 
with the modernization process. By exten- 
sion, this logic posits that urbanization may 
interact with phenomena like industrialization 
and education. Industrialization and education 
should have an especially strong positive 
effect on economic development and quality 
of life within urban areas. Traditional mod- 
ernization arguments assert that this effect 
should be particularly strong in large urban 
areas. 

However, the emergence of extremely 
large "primate" cities throughout the Third 
World has caused grave concern among some 
modernization scholars and most human 
ecology and dependency theorists (see Berry 
and Kasarda 1977, pp. 386—400; Smith 
1985a). These theorists argue that primacy 
stunts and distorts balanced development in 
Third World societies by channeling dispro- 
portionate resources to large urban areas. 
Although large cities often experience greater 
economic expansion, they must also confront 
a growing problem throughout poor regions. 
The disease-ridden slums and squatter settle- 
ments of Latin America, Africa, and Asia 
illustrate the potentially negative relationship 
between big cities and quality of life. This 
association underscores the severe inequality 
within large urban areas. On the one hand, 
average income usually is much higher in 
these areas relative to both smaller urban and 
nonurban regions. On the other hand, income 
inequality is severe and is manifested through 
extreme poverty among the mass population. 


QUALITY OF LIFE IN CHINA 


` The deperidency/world-system perspective has 


attributed this syndrome of high average 
incomes and stagnant or declining quality of 
life to foreign capital penetration. External 
capital increases uneven economic expansion 
but does not enhance quality of life among 
most citizens. 

More specifically, dependency and world- 
systems theories argue that Third World cities 
represent the primary target for multinational 
investment..Foreign companies take advan- 
tage of low Third World wages by establish- 
ing local factories that make products at a 
relatively low cost. These products are then 
exported back to developed nations or sold to 
local elites, thereby enhancing multinational 
profits’ while further underdeveloping. the 
Third World. The relatively low wages 
earned by local laborers do not appreciably 
enhance their income or quality of life (see 


Portes and Walton 1981; Portes 1985; Timber-- 


lake and Lunday 1985). Although average 
urban income may increase (because of a 
small and wealthy elite), it will not be 
. distributed Nidey among the poor popula- 
tion. 

This perspective also asserts that many 
Third World cities are expanding too rapidly, 
given their relative scarcity of employment, 

. housing, and other amenities. Some countries 
are "overurbanized" as indicated by their 
inability to provide basic employment and 
social services for urban residents (Timber- 
Jake and Kentor 1983; Bradshaw 1985, 1987; 
see also Gugler 1982). "Overurbanized" 
nations often exhibit the following character- 
istics which contribute to underdevelopment: 
(a) their urban employment structure is 
increasingly capital intensive because of 
foreign investment, a structure that fails to 
enhance local employment; (b) their overall 
rate of national productivity is inhibited by 
increasing urban unemployment and underem- 
ployment; (c) they must expend scarce 


resources on social welfare and social control 


in urban areas; and (d) their quality of life 
may decline due to the lack of basic needs 
provisions, especially in disease-ridden urban 


slums. Large cities are most problematic . 


‘because they receive substantial foreign 
investment and are least able to provide 
employment and amenities for urban resi- 
dents. 

` Several scholars sei about the poten- 
tially deleterious effects of extremely large 
(primate) cities have stressed the importance 


a 


- 989 


of urban “growth centers” (Berry 1971, pp. 


142-43; Berry and: Kasarda 1977, p. 399). 
Accordingly, Third World countries should 
decentralize urban planning by stimulating 
expansion of intermediate-sized cities (see 


Hansen 1971). This strategy would achieve 


the objective of “urban modernization” (the 
concern of traditional modernization theory) 
while avoiding the problems associated with 
giant cities (the concern of several other 


theories). Chinese urban planners apparently 


have been cognizant of these concerns. As 
noted by Goldstein (1988, p. 211), Chinese 
government policies in the 1960s and 1970s 
“attempted to develop policies designed to 
promote wider income distribution, reduce 
regional inequalities, and create a more 


_ balanced urban hierarchy, which would lead 


to a greater decentralization of economic 
activities. In doing so, the intention was to 
slow population growth in the largest cities, 
while allowing continued increases in medium- 
sized and smaller urban centers.”3 In the next 
section, we tum to an elaboration ‘of. the 
Chinese case, especially as it relates to 


. various theoretical orientations. 


MODERNIZATION, DEVELOPMENT, , 
AND. CITY SIZE: THE CHINESE CASE" 


"China has a long and varied history of 


urbanization. According to conventional West- 


. ern thought, China is a relatively “closed” 


society that has not been substantially influ- 
enced, by external factors. vIn fact, however, 


? The results of this policy have been fairly 
successful, with previous studies indicating that, 
on balance, urban primacy is not prevalent in many 


.areas of China (see Goldstein 1988, pp. 211-13 for 


a review of these studies). Using a four-city 
primacy index where 100 or more indicates a 
condition of primacy, earlier research shows that 
China has an overall score of only 43. When the 
six regions of China are measured separately, only 
the East Region (which contains Shanghai) exhibits 
a condition of primacy, although the score is still a 
rather modest 111.4. When the provinces and 
autonomous regions are examined individually, 
there is evidence of urban primacy in some areas. 
For.example, in the Central-South Region, the . 
Southwest Region, and the Northwest Region, 6 of 
the 11 provinces for which a primacy index was. 
calculated had scores over 100. The highest score 
was 309.1 in Hubei Province, mainly because of ' 
Wuhan which had a population of 3.3 million in 
1982. 
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China’s urbanization patterns have been 
affected by foreign powers, especially during 
the 19th century (see Buck 1978). In 1842, 
: the country was defeated by Britain in the 
Opium War and was forced to open its treaty 
ports for foreign extraterritorial control. 
China was compelled to sign more treaties in 
subsequent years, leading to additional port 
cities (Murphey 1974). These cities were 
divided, occupied, and controlled by the 
British, French, Germans, Russians, Ameri- 
cans, and Japanese. The treaty ports devel- 
oped into an urban network system consisting 
' of approximately 120 towns and cities, 
located primarily along the coasts and major 
rivers of China (Pannell 1984). The structure 
of these cities was basically commercial, 
differing from the administrative orientation 
of the city system during China’s imperial 
past. 

Although these .urban areas facilitated 
industrialization and economic expansion, 
they also symbolized the negative features of 
Westernization which damaged Chinese cul- 
ture and values. Whyte and Parish (1984, p. 
14) capture this sentiment: 


The treaty ports symbolized both the hope of a 
strong, industrial modern China and her shame 
at foreign domination and control. Foreign 
domination was apparent not only in the profit 
which flowed abroad, but also in public symbols 
of Western superiority, such as the signs in the 
Western-style park along the waterfront in 
Shanghai stating that no Chinese and no dogs 
were allowed entry. These new urban trends 
undermined the distinctive characteristics of 
traditional Chinese cities, such as cellular 
associational life and close ties with the 
countryside, and spawned many of the urban 
problems that have since come to plague other 
societies in the Third World. 

These problems include crime, prostitution, 

drug use, avarice, and severe income inequal- 

ity. 

Not surprisingly, the Chinese Communist 
Party (CCP)—founded in 1921—disparaged 
the Western influence of urban life. The Party 
was pushed into rural areas in 1927 by Chiang 
Kai-shek, where, with the help of urban 
intellectuals, it mobilized the peasantry to 
fight against the Chinese Nationalists (Whyte 
and Parish 1984). After defeating the Nation- 
alists in 1948 and establishing the People’s 
Republic of China in 1949, the CCP assumed 
control of urban areas across the country. The 
new government cfficially adopted an anti- 
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urban policy based on the belief that cities 
were a negative influence: “Western-style and 
‘modem’ cities—notably the former treaty 
ports but increasingly most growing cities— 
were seen as soul-destroying, antipeasant 
(hence anti-Chinese), and filled with corrup- 
tion, crime, hypocrisy, suffering, squalor, 
and pollution” (Murphey 1988, pp. 158-59). 

Despite the ostensibly antimetropolitan 
ideology of the CCP, its first five-year 
development plan (1953-1958) was biased in 
favor of “key point cities” and “new 
industrial cities” (Buck 1981, p. 122). This 
strategy reflected Soviet-style development 
policies which stressed heavy urban industri- 
alization during the Stalin era. In fact, the 
Soviet Union provided assistance to develop 
urban infrastructure throughout Chinese cit- 
ies, including roads, sewers, housing, trans- 
portation, electricity, and other factors that 
increase heavy industry (Buck 1981, p. 122). 
Growing urban industrialization— combined 
with rural land reform, collectivization, and 


.communization— prompted massive town- 


ward migration (Buck 1981, pp. 122-23; 
Whyte and Parish 1984, p. 17). Conse- 
quently, the urban population as a percentage 
of the total population, expanded from 10.6 
percent in 1949 to 19.7 percent in 1960 
(Goldstein 1988, pp. 199-201). The in- 
creased population: was particularly pro- 
nounced in key-point cities designated for 
industrial expansion. For example, the popu- 
lation of Zhengzhou expanded from 150,000 
in 1948 to 595,000 in 1953 and 785,000 in 
1958 (Buck 1981, pp. 122-23). 

Even though the early Chinese strategy was 
adopted from the Soviet Union, it still had 
similarities to classic modernization theory. 
China stressed heavy industrialization, urban 
expansion (especially in large cities), and 
massive rural-to-urban migration. During the 
first five-year development plan, however, 
the Chinese government became concerned 
about the concentration of industry and 
population in large cities. In 1956, the State 
Council] announced a policy change which 
reflected this concern over the emergence of 
extremely large cities: 


From henceforth the scale of new cities 
generally should fall within the range of several 
tens of thousands up to 100,000 or so. In areas 
where conditions are appropriate we can estab- 
lish cities of 200,000 to 300,000. For special 
needs, we can consider building cities of more 
than 300,000 and under conditions of geograph- 
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ical necessity or the special needs of mining, we 
will build independent workers’ towns. (quoted 
in Buck 1981, p. 123) 


This statement was an early sign of China’s 
intent ‘to pursue a different socialist policy 
concerning urbanization. 

At thé start of the Great Leap Forward in 
1958, China implemented a series of controls 
to limit townward migration. Individuals were 
permitted to migrate only if they had a 
specific reason to do so (e.g., an urban job or 
a position in a university). Moreover, urban 
families had to maintain household registra- 
tion books that listed each member of the 
family, thereby enabling the state to monitor 
city residents. Visitors had to register at the 
local police station to obtain “temporary 
household registrations” that would permit an 
urban visit (Whyte and Parish 1984, pp. 
17-19). These controls were part of a radical 
shift in government policy that began to 
emphasize rural instead of urban develop- 
ment. For instance, after Soviet withdrawal in 
1960, China virtually halted all urban construc- 
tion and stressed communization in both rural 
and urban areas. This change in government 
policy reduced the number of urban jobs in 
the 1960s which, when combined with 
migration controls, greatly decreased the 
amount of townward migration (Buck 1981). 
Consequently, the proportion of the urban 
population began to decline in the early 1960s 
and continued throughout the mid-1970s 
(Goldstein 1988, p. 201). 

This trend was especially evident during 
the Cultural Revolution (1966-1976), when 
urban high school graduates were required to 
migrate to rural and frontier areas for 
agricultural training (Bernstein 1977). This 
program, along with other resettlement efforts 
(e.g., transfer of workers to small factories 
outside urban areas), resulted in the greatest 
out-migration from large cities in China’s 
history. Shanghai may have lost as many as 
one million residents a year during the 
resettlement period (Buck 1981, p. 135). 
Altogether, estimates indicate that more than 
17 million youth were involved in this 

. relocation policy (Whyte and Parish 1984, p. 
19; see also Seybolt 1977), a practice that 
helped reduce the urban population to only 
17.2 percent of the total population by 1974 
(Goldstein 1988, p. 201). Importantly, how- 
ever, government policy did not denigrate the 
importance of all urban areas during the 
Cultural Revolution, as it stressed develop- 
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ment of medium-sized cities, small-sized 
cities, and towns throughout this period. The 
years 1969-1976 emphasized national "self- 
reliance," a policy based on rapid production 
especially in rural areas and smaller urban 
regions (Buck 1981, pp. 135-36). 

The Chinese government neglected urban 
infrastructure and services during the Cultural 
Revolution, sharply reducing the quality of 
life in cities—especially large cities. This 
neglect became painfully evident after the 
government lifted restrictions on urban migra- 
tion and allowed “rusticated youth” to return 
to urban areas following the Cultural Revolu- 
tion. The urban population increased after this 
period, to 19.4 percent of the total population 
in 1980 and 20.8 percent in 1982 (Goldstein 
1988, p. 201). Increased urbanization— 
especially big city growth—further strained 
inadequate urban resources and created addi- 
tional unemployment because state industries 
cannot provide a sufficient number of jobs for 
urban residents (Perkins 1986, pp. 51-57). 
Thus, in 1980 the Chinese government 
enunciated a policy that stressed limiting the 
growth of large cities and promoting the 
expansion of medium-sized and especially 
small cities. The intellectual rationale of this 
policy is clear (Goldstein .1988, p. 189; 
quoted from Li 1983, p. 9): 


In our circumstances, if only the big cities have 
good job opportunities, housing, schools, stores 
and services, and cultural and recreational 
facilities, then of course people will want to live 
there. But if all these things are developed in 
small and medium sized cities, people will be 
much more willing to live in these places and the 
population pressure on big cities will be eased. 


Big cities, however, also face a lack of 
adequate services. They have housing short- 
ages, water shortages, pollution, lack of 
transportation, and other problems connected 
to social services (Chen 1988; Murphey 
1988). Although these cities are more indus- 
trialized and possess higher incomes than 
other areas of the country, they still are 
plagued by problems .that threaten their 
quality of life. By contrast, smaller cities— 
though less industrialized and less wealthy— 
are potentially better able to provide housing 
and other social services that facilitate quality 
of life. China's urbanization policy is congru- 
ent with theories stressing the necessity of 
controlled urbanization, that is, urbanization 
that avoids the potentially parasitic drain - 
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caused by primate cities. To help slow urban 
growth, China has continued to enforce a 
rigid system of registration designed to inhibit 
rural-to-urban migration. A person must be 
registered to live in a specific city before he 
or she can obtain permanent residency there. 
Residents of rural areas, towns, or smaller 
cities cannot become residents of larger urban 
areas unless they are assigned to a university 
or a job in such areas (Whyte 1988, p. 265; 
see also Goldstein 1988, p. 193). 

In 1984, China further modified its urban- 
ization policy in several ways, two of which 
should be mentioned here. First, the govern- 
ment designated 14 coastal cities—including 
some large cities—-as centers for foreign 
investment and joint ventures (see China 
International Economic Consultants 1985; 
Goldstein 1988; Chen 1988).* These cities, 
some of which were once penetrated and 
controlled by foreign powers, are centers of 
manufacturing, shipping, and trade. Their 
ports constitute an important link to the 
outside world, a link that helps shape their 

_ internal structure and character. For example, 
“the city of Shanghai—the nation’s largest 
industrial and port city—had 756 projects 
involving foreign investment and financing by 
the end of 1984, including numerous joint 
ventures. The total amount of foreign invest- 
ment in the projects was the equivalent of 
1.51 billion U.S. dollars (China International 
Economic Consultants 1985, p. 103). This 
new investment policy is consistent with 
China’s slowly emerging strategy of emphasiz- 
ing increased private-sector involvement and 
foreign. capital. Second, the government 
released a document allowing peasants to live 
in towns-—on a nonpermanent basis— in order 
to work in industry, business, or some type of 
service labor (Goldstein 1988, pp. 217-18). 
This policy is consistent with China's effort to 
emphasize industrial expansion in smaller 
urban and even rural areas. If industry can be 
shifted outside larger urban areas, then 
migration to big cities may be discouraged 





-*The 14 coastal cities include Dalian 
(1,478,978), Qinhuangdao (403,701), Tianjin (4,607,- 
530), Yantai (384,336), Qingdao (1,173,872), 
Lianyungang (395,730), Nantong (389,988), Shang- 
hai (6,320,829), Ningbo (468,230), Wenzhou 
(508,611), Fuzhou (1,129,251), Guangzhou (3,148,- 
281), Zhanjiang (346,024), and Beihai (168,441) 
(China International Economic Consultants 1985). 
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and the country can proceed with a more 
decentralized and balanced development plan. 

The Chinese model of urbanization par- 
tially diverges from Western patterns of city 
growth. Perhaps most important, China has 
followed numerous socialist policies that give 
the centralized government power to enforce 
laws designed to control city size. Specifi- 
cally, registration laws, strict migration con- 
trols, and outright relocation are not typically 
endorsed by Western-style theories of modern- 
ization. Nonetheless, these actions make it 
possible to test an unusual strategy of limiting 
city size, a strategy that is possible precisely 
because the state exercises more power than 
in many "capitalist" developing countries. 
Although one study has quantitatively exam- 
ined the validity of traditional ecological or 
modernization arguments in China (Chen 
1988), no study has tested competing argu- 
ments relevànt to the role of different-sized 
cities in the country. This is an important 
omission because much of China's changing 


. urbanization policy has revolved around the 


effect of city size on economic development 
and physical quality of life. The next section 
proposes a method which will examine these 
issues. 


DATA AND 
VARIABLE CONSTRUCTION 


All data for this paper are derived from a 
recently published data source titled The 
Population Atlas of China (People's Republic 
of China 1987). This is a truly unique and 
comprehensive work that was compiled and 
edited by the (1) Population Census Office of 
the State Council of the People's Republic of 
China and (2) Institute of Geography of the 
Chinese Academy of Sciences. Data are 
based primarily (though not exclusively) on 
results from the 1932 population census, the 
nation's third census following those com- 
pleted in 1953 and 1964. The 1982 census 
was distinctive for several reasons. First, it 
was more comprehensive in scope, covering 
over 1 billion people; second, it provided data 
for more variables than before, 19 in all; and 
third, it utilized computers for the first time 
when processing census results, enhancing ^ 
accuracy and completeness (People's Repub- 
lic of China 1987, p. v). In addition, The 
Population Atlas of China contains 137 
detailed maps (80 percent of which were 
drawn by computers) on various demographic 
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concems. The census used 5,180,000 enumer- 
ators (working for one month), 1,090,000 
supervisors (working for six weeks), 130,000 
coders (working for three months), over 
4,000 key-punch operators (working for 15 
months), over 1,000 computer and technical 
specialists (working for three years), and over 
1,000 other census workers (working for six 
years) (People's Republic of China 1987, P. 


v). 

Data were collected at the level of the 
administrative unit, including counties, dis- 
tricts, cities, and other localities unique to 
China (e.g., autonomous regions) (People’s 
Republic of China 1987, p. xiii). The total 
number of cases is 2,378; the only missing 
data are for the infant mortality variable, 
which does not have information for the 72 
cases in Tibet. Because infant mortality is an 
important variable throughout this analysis, 
our results are based on 2,306 cases—an 
impressive, number for any development 
study. 

The analysis tests numerous mihe 
related to economic development, quality of 
life, education, industrial employment, city 
size, and other phenomena. All data are for 
the years 198f or 1982. 

Economic development. Studies often mea- 
sure economic development in terms of per 
capita gross domestic product (or gross 
national product). The Population Atlas of 
China does not include data on per capita 
total gross domestic product; however, it does 
contain information on per capita gross 
domestic values of industrial and agricultural 
output, a reasonably close substitute for per 
capita gross domestic product. In fact, the 
World Bank reports that the combined values 
of industrial and agricultural output accounted 
for 78 percent of total gross domestic product 
in China for 1982 (World Bank 1984, p. 
222). Thus, this paper measures economic 
development as the combined gross per capita 
values of industrial and agricultural output.5 

Physical quality of life. Previous quantita- 
tive, cross-national studies have used a 
physical quality of life index (PQLD based on 





* Unfortunately, this variable does not include 
data on the service sector. Because urban areas 
possess more ‘service activities than nontrban 
regions, this measure underestimates productivity 
especially in the former. Again, however, agricul- 
tural and industrial output accounts for the vast 
majority of total output in the.country. - : 
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infant mortality, literacy rates, and life 
expectancy (Morris 1979; Moon and Dixon 
1985; London and Williams 1988). Because 
our data set does not contain data on life 
expectancy, we computed an index based on 
infant mortality (number of infant deaths per 
1,000 births), illiteracy (percentage of the 
population aged 12 and over that is illiterate 
or semiliterate), and death rate (number of 
deaths per 1,000 population). Each indicator 
is logarithmically transformed (using the 
natural log) to reduce skewness and facilitate 
linearity between the variables. The variables 
are sufficiently correlated to measure a single 
underlying construct. Infant mortality and 
death rate have a correlation of .683; infant 
mortality and illiteracy have a correlation of 
.471; and death rate and illiteracy have a 
correlation of .526. Most important, these 
three variables have a Cronbach's Alpha 
score of .713, indicating that they exhibit a 
sufficient level of internal reliability to 
constitute a single index (Bohmstedt and 
Knoke 1988, pp. 384-86). 

To compute the index, infant mortality, 
illiteracy, and death rate were transformed to 
Z-scores (to standardize the measures) and 
then the three scores were summed. This 
composite term is called the misery index 
because it measures qualities considered 
opposite a good quality: of life.$ If a variable 
has a negative effect on the misery index, 
then it has a positive impact on quality of life. 
In addition to using this index, the analysis 
also disaggregates the composite term to 
examine each aspect of misery individually. 

Education. Modernization theorists argue 
that education will promote both economic 
development and quality of life. To examine 
this assertion, the analysis includes a variable 
that measures the number of people per | 
10,000 population with a middle school 
education. 

Industrial employment. Modernization the- 
orists also hypothesize a positive relationship 
between industrial (“modem-sector”) labor 
and both economic development and quality 
of life. Thus, the analysis includes a variable 





$' Thé misery index actually counts infant 
mortality twice, as death rate measures mortality at 
all ages. The data set does not contain a variable 
measuring only postinfancy deaths. Weighting 
infant mortality is. not problematic, however, since 
this .variable is probably the most Koponan 
indicator of.quality of life. 
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measuring industrial employment as a percent- 
age of total employment.. 

. Urbanization and city size. Recent research 
on China has examined the effect of different- 
sized cities on national development (see, 
e.g., Goldstein 1988; Chen 1988). These 
studies often divide cities into four broad 
categories: Small urban areas (50,000 to 
199,999 people), medium urban areas (200,000 
to 499,999 people), large urban areas (500,000 
to 999,999 people), and extremely large 
urban areas or metropolises (at least one 
million people). Although these categories 
may seem somewhat arbitrary, they represent 
a reasonable division of city size that permits 
an examination of the effects of different- 
sized cities (sée below). 

‘The unit of analysis in this data set (i.e., 
administrative unit) does not permit a contin- 
uous measure of urbanization because these 
units are classified as either predominantly 
urban or predominantly nonurban (e.g., 
' county). Thus, to test the effects of different- 
sized cities, 
created, These dummy variables can be 
divided into three different categories: 

1. Total urbanization. This variable codes all 
urban administrative units over 50,000 peo- 
ple as 1, and all other administrative units as 
0. 

2. Size of urban areas. This category includes 
four dummy variables: 

(a) small urban areas, with cities between 
50,000 and 199,999 coded as 1 (0 
otherwise); 

(b) medium urban areas, with cities be- 
tween 200,000 and 499,999 coded as 1 
(0 otherwise); 

(c) large urban areas, with cities between 
500,000 and 999,999 coded as 1 (0 
otherwise); i 

(d) metropolises, with cities of at least 1 
million residents coded as 1 (0 other- 
wise). f 

3. Externally oriented urban areas, with the 14 
coastal cities linked to outside countries 
coded as 1 (0 otherwise).? 


7 It is important to note that China's policy to 
encourage foreign investment was implemented in 
1984, two or three years after this data set was 


‘ collected. This does not mean, however, that 


foreign penetration and exploitation are new 
phenomena in China. In fact, some of these cities 
were profoundly affected during the years of 
colonial occupation and exploitation by foreign 
powers. In 1982, these cities were still externally 


six dummy variables were . 
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Control variables. It is also important to 


. control for the effects of three other indicators 


when examining economic development and 
quality of life. These variables include the (1) 
total population, measured simply as the 
number of people in each administrative unit; 
(2) elderly population, measured as the 
percentage of the total population aged 65 and 
over; (3) birthrate, measured as the number 
of births per 1,000 population.? 

Not surprisinglv, many of the variables in 
the analysis are highly skewed. Moreover, 
examination of scatterplots for the relation- 
ship between each dependent variable and 
each independent variable revealed that the 
relationships are not always linear. To correct 
for the problems of nonlinearity and skew- 
ness, all variables except for the misery index 
were logarithmically transformed (using the 
natural logarithm). (The components for the 
misery index were logged prior to the creation 
of the Z-scores.) These logarithmic transfor- 
mations eliminated the skewness for each 
indicator and also created approximately 
linear relationships between the independent 
and dependent variables, thus complying with 
the assumptions of OLS regression (see 
Hanushek and Jackson 1977). 


RESULTS OF THE ANALYSIS 


The analysis begins by using OLS regression 
to look at the effects of several variables on 
economic development.? Results reported in 
the first equation of Table 2 show that 


oriented coastal cities specializing in manufactur- 
ing, shipping, and foreign trade. We readily 
acknowledge, however, that this variable provides 
only: a crude test of dependency theories. This 
perspective should be reexamined in the future, 
after China has received foreign investment for 
several years. 

5 Some readers may raise questions about the 
use of ratio variables in this analysis (see 
Firebaugh and Gibbs 1985). However, ratio 
variables are needed to standardize variables across 
the different administrative units of China. Ratio 
variables are justified when theoretical and substan- 
tive considerations suggest that the denominator 
has an interactive effect with the specified 
independent variables (numerators). ` 

On balance, multicollinearity is not a severe 


l problem in any equation in the analysis. The -only 


variables that have a zero-order correlation of .70 
or higher are economic development and industrial 
employment (r = .773). 
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Table 2. OLS Estimates of the Effects of Independent Variables on Economic Development (N = 2,306) 














Dependent Variables 
q) Q) (3) (4) 
Independent Economic Economic . Economic ' Economic 
Variables Development Development Development Development 
Education 90*e - | d62se* .l66*** 183444 
: (.024)° (.023) ` (.023) (.024) 
Industria! employment .S20*** .420*** 415*** Spree 
(.013) (.013) (.013) (.013) 
Birthrate — .192"** —.134*** —.127*** —.163*** 
(.044) . C039). '(.039) (.041) 
Total population ` —.029** — — — 
(011) = B = 
Elderly population .180**+* .205*** .184*** ^ 796 
(.044) (.040) (.039) (.042) 
Urban dummy — .589*** — — 
= (034) > = = 
Small urban dummy — — .311*** — 
i — —- - » (053) — 
Medium urban dummy a _ .666*** = 
N = — (.048) : — 
Large urban dummy — _ .656*** — 
- -. 7 (.061) e 
Metropolis dummy — -— | 7 QBA6*9* es 
— am (.069) '— 
External urban dummy — — — .802*** 
— — — (413) 
Constant 4.280*** 4.061*** 4.055*** 3.994»** 
(.294) (.238) (.236) (.251) 
Rm i .620 .663 .671 .627 
F-test na na . 90.782*** ‘pa 


Note: a = unstandardized regression coefficient; b = standard error; *** = P <.01 (two-tailed test); ** = P< .025 
(two-tailed test); * = P <.05 (two-tailed test); na = not applicable. 


education, industrial employment, and elderly 
population have a strong positive impact on 


' economic development, whereas birthrate and 


total population demonstrate a negative ef- 
fect. The second equation indicates that the 
dummy term for urban population is signifi- 
cantly positive, while all other variables 
retain their original sign and level of 
significance.!? The third and fourth equations 
show that (1) all four urban subgroups and (2) 
externally linked cities also have a strong 
positive influence on economic development. 
Moreover, the F-test for equation 3 indicates 
that this set of dummy variables should be 
included in the model, as they explain 


10 One reviewer strongly objected to the inclu- 
sion of both total population and the urban dummy 
variables in the same equation. Because the urban 
dummy variables represent a discrete measure of 
population, the equation would include both a 
continuous and a discrete indicator of the same 
phenomenon. 


significantly more variation than the equation 
without such variables.!! 

These findings are interesting from both a 
theoretical and a substantive standpoint. 
Theoretically, they support the central logic 


11 The F-test examines whether equations with a 
set of intercept dummy variables and equations 
with interaction terms explain significantly. more 
variation than equations without dummy variables 
or interaction terms. Thus, the third equation was 
rerun without the dummy variables (results not 
shown here). The F-test is calculated as: . 

(R3 - Rive — ki) 
(1 — RON - k — 1), 
where k = the number of explanatory variables 
used to estimate R2 (the third equation); and kj = 
the number of explanatory variables used to 
estimate R? (the third equation without the dummy 
variables) (Bohrnstedt and Knoke 1988, pp. 
413-14). The other equations with dummy vari- 
ables are not tested because they simply include 
one intercept dummy variable. l 
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of most development theories, which assert 
that urban areas have a higher level of. 
average income than nonurban regions. Mod- 
ernization arguments receive added support 
from the strong positive impact of both 
education and industrial employment, charac- 
teristics of a "modem" country. Substan- 
tively, the- three control variables warrant 
discussion. On the one hand, the negative 
findings for population and birthrate are not 
surprising: these phenomena logically are 
associated with a decline in per capita 
income. Again, however, urban areas demon- 
strate a positive-effect, suggesting that cities 


. possess a higher level of economic develop- : 


ment. On the other hand, the positive impact 
of elderly population may seem curious. This 
result at least partially reflects the fact that (1) 
most older people have been employed (either 
formally or informally). for many years, and 
(2) China's pay scale is based heavily on 
seniority, meaning that old age often is 
associated with higher levels of per capita 
income.!? 


The next set of equations extends the 


analysis and examines the effect of each 
independent variable (and economic develop- 
- ment) on the misery index. If a variable has a 
negative effect on misery, then it has a 
positive impact on quality of life. Results 
. reported in the first three equations of Table 3 

support traditional modernization arguments, 


cis 


2 An earlier, version of this paper included a 
series of slope dummy variables to investigate 
_,whether the effects of education and industrial 
employment vary according to size and type of 
urban area (see Hanushek and Jackson 1977, pp. 
106—8). More specifically, we created 12 interac- 
tion terms by multiplying each urban dummy 
variable by education and by industrial employ- 
ment (ie., Urban Dummy * Education, Urban 
Dummy * Industrial Employment, Small Urban 
' Dummy * Education, Small Urban Dummy * 
' Industrial Employment, etc.). The results showed 
that all slope dummy variables demonstrate a 
significant positive effect on economic develop- 
ment, indicating that both urban education and 
urban industrial employment have a stronger effect 
than nonurban edücation and nonurban industrial 
employment. Nonurban education and nonurban 
industrial employment also have a significant 
positive effect, but the impact is significantly 
stronger in each urban area. Although space 
limitations prevented us from showing this part of 
the analysis, we would be happy to supply the 
`. results upon request. 
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as education, industrial employment, and 
all-sized urban areas have a strong negative 
impact on the misery index. Three control 
variables are also consistently significant, 
with total population and economic develop- 
ment demonstrating a negative effect and 
birthrate exhibiting a positive impact.!? (El- 
derly population demonstrates statistical sig- 
nificance in only the first equation.) Findings 
reported in the fourth equation, however, 
diverge from the established pattern. Specifi- 
cally, the dummy variable for externally 
linked cities does not demonstrate statistical 
significance, indicating that these 14 cities do 
not possess a superior physical quality of life 
relative to other regions. 

Results reported in Table 3 seem to provide 
a unique combination of support for both 
traditional modernization and dependency 
arguments, especially when examined in 
conjunction with earlier findings on economic 
development. Specifically, following tradi- 
tional modernization theory, all-sized urban 


. areas have a superior level of economic 


development and a better quality of life than 
nonurban areas. However, following depen- 
dency theories, externally linked cities do not 
enjoy a better physical quality of life (Table 
3) even though they do possess higher levels 
of economic development (Table 2). In 
accord with such theories, this finding 
suggests that economic development is not 
distributed in a way that enhances physical 
quality of life in these cities. Of course, this 
explanation must remain speculative until 
more direct measures of inequality are 
available for China.!4 - 


33 The significant negative coefficient for total 


' population indicates that areas with a larger 


population may provide better health care and 
other services relative to less populated areas. This 
possibility will be further examined throughout the 
analysis. ' 

'^ Again, an earlier version of this paper created 
12 interaction terms by multiplying each urban 
dummy variable by education and by industrial 
employment. The results showed that 'only two 
slope dummy ‘variables fail to demonstrate a 
significant negative effect on the misery index. 
The two exceptions are (External Urban Dummy * 
Education) and (External Urban Dummy * Indus- 
trial Employment), which have a nonsignificant 
effect on. the dependent variable. Overall, two 
conclusions can be drawn from these results. First, 
urban education and urban industrial employment 
have a stronger negative effect on misery than 
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Table 3. OLS Estimates of the Effects of Independent Variables on Misery Index (N = 2,306) 
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Dependent Variables 
0) Q) (3) (4) 
Independent Misery Misery Misery -Misery 
Variables Index Index Index Index 
Education —2.612*** — 2,721 8** -2.8*- —2.24*** 
(.094) (.097) (.098) (.098) 
Industrial employment .— 465*** — 407*** — .404*** — .457*** 
(.063) (.067) (.067) (.066) 
Birthrate .789*** 1.517*** 1.517*** 1.528*** 
(.166) (.165) (.165) (.166) 
Total population —.632*** — — — 
(.043) — — — 
Elderly population .905*** .015 011 118 
(.169) (.167) (.168) (.167) 
Economic development — .407*** — .206** — .220** (33798 
= (.079) (.088) (.089) (.084) 
Urban dummy _ — 103*** i ae 
— (151) — - 
Small urban dummy — — —.924*** — 
: — — (.226) _ 
Medium urban dummy — — —.5]5*** ~ 
— — (.213) -— 
Large urban dummy — — — .563* — 
~ — (.265) — 
Metropolis dummy — — —.716** — 
— — (.302) -— 
External urban dummy — — — —.335 
ned -— — (.459) 
Constant 26.730*** 17.473*** 17.539*** 18.135*** 
(1.166) (1.061) (1.063) (1.059) 
g .601 .568 .569 .564 
F-test na 6.660*** na 


na 
Note: See nóte to Table 2. l 


Although quality of life is a multidimen- 
sional concept best measured by a reliable 
composite term like the misery index, it is 
also useful to examine the components 
individually to see whether infant mortality, 
death rate, and illiteracy are influenced by the 
same factors. Thus, the next three tables use 
the natural logarithm of (1) infant mortality, 
(2) death rate, and (3) illiteracy as dependent 
variables. 

With a few exceptions, results reported in 
Table 4 resemble earlier findings. The 
exceptions are noteworthy. First, elderly 





nonurban education and nonurban industrial em- 
ployment. Although education and industrial em- 
ployment in nonurban areas also have a negative 
impact, the effect is significantly stronger in urban 
aras. Second, externally oriented cities may 
possess an internal structure that militates against 
the generally positive impact of education and 
industrial employment. We would be happy to 
supply these results upon request. 


population has a strong and consistently 
negative effect on infant mortality, a logical 
finding because an older population obviously 
would not experience infant mortality. Sec- 
ond, small urban areas and metropolises do 
not have significantly lower infant mortality 
rates relative to other regions. This may 
suggest that small cities and extremely large 
cities do not possess the capacity to provide 
adequate health care for infants. As before, 
externally linked cities exhibit a nonsignifi- 
cant impact on the dependent variable, 
indicating that they do not have lower levels 
of infant mortality. 

Prior to drawing any broad conclusions 
about the effects of different variables, it is 
helpful to examine indicators that influence 
the death rate in China. Results reported in 
the first two equations of Table 5 show that, 
in support of modernization arguments, edu- 
cation and industrial employment have a 
negative impact on the death rate. By 
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Table 4. OLS Estimates of the Effects of Independent Variables on Infant Mortality (N = 2,306) 





Q 
Independent Infant 
Variables Mortality 
Education —.433*** 
(.028) 
Industrial employment — .052*** 
(.019) 
Birthrate .107* 
(.049) 
Total population —.198*** 
(.013) 
Elderly population —.475*** 
(.050) 
Economic development — .084*** 
(.023) 
Urban dummy — 
Small urban dummy — 
Medium urban dummy — 
Large urban dummy — 
Metropolis dummy — 
External urban dummy — 
Constant 10.295 
(.342) 
R 414 
F-test na 


Note: See note to Table 2. 


contrast, birthrate and elderly population 
increase the national death rate, a logical 
finding because areas with a high birthrate 
and a large elderly population are expected to 
experience greater mortality. Although total 
urbanization does not have a statistically 
significant impact on death rate in equation 
two, it is possible that different-sized cities 
will demonstrate an effect. In support of most 
theoretical perspectives—except traditional 
modernization theory—the third and fourth 
equations indicate that only small cities have 
a strong negative effect on death rate. These 
findings underscore the primary concern 
expressed by critics of traditional moderniza- 
tion arguments, namely, that larger cities do 
not possess sufficient resources to improve 
quality of life. 

The final set of equations examines the 
impact of various indicators on illiteracy, 
another component of the misery index. The 
first three equations of Table 6 strongly 
support traditional modernization arguments, 





Dependent Variables 
(2) (3) (4) 
Infant Infant Infant 
Mortality Mortality Mortality 
= ,A67*** e .A68*** = .468*** 
(.029) (.029) (.029) 
— .039* — .039* — .049** 
(.020) (.020) (.019) 
.336*** .336*** .339*** 
(.049) (.049) (.049) 
- Jat - age — Tages 
(.050) (.050) (.049) 
— .036 — .034 — .061** 
(.026) (.026) (.025) 
pas, .l46*** a aaa 
(.045) — = 
_ —.123 — 
— (.067) — 
— —.159** — 
— (.063) _ 
~ ~ .154* -= 
— (.078) — 
— ~.155 _ 
— (.089) — 
— — —.152 
— — (.135) 
7.478*** 7,47])*** 7.597*** 
(314) (.315) (.313) 
.356 .356 .353 
na 2.675* na 


as education, industrial employment, and 
all-sized cities have a strong negative impact 
on illiteracy. The findings with respect to 
urban areas are not surprising because educa- 
tion is heavily concentrated in Chinese cities, 
especially large cities (Banister 1987, pp. 
138-43; see also Skinner 1977; Andors 
1978). Once again, however, equation 4 
shows that externally linked cities are not 
statistically related to improvements in an- 
other aspect of quality of life. Three other 
variables—birthrate, economic development, 
and total population—are also negatively 
associated with illiteracy. The effects of the 
last two variables are not difficult to explain, 
as economic development and population size 
often are linked to increased resources for 
educational attainment. The finding for birth- 
rate is also logical and consistent because the 
declining illiteracy rate in China is due largely 
to increasing education among the younger 
population. Thus, people born now likely will 
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Table 5. OLS Estimates of the Effects of Independent Variables on Death Rate (N = 2,306) 





(1) 


Independent Death 
Variables Rate 
Education —.193*** 
(.011) 
Industrial employment — .026*** 
(.007) - 
Birthrate .193*** 
(.019) 
Total population — .063*** 
(005) 
Elderly population .378*** 
(.019) 
Economic development .004 
(.009) 
Urban dummy — 
Small urban dummy — 
Medium urban dummy — 
Large urban dummy — 
Metropolis dummy = 
External urban dummy — 
Constant 3.012*** 
(.130) 
gm .458 
F-test na 


Note: Sce note to Table 2. - 


possess lower illiteracy rates than elderly 
people, many of whom are uneducated. 


DISCUSSION AND CONCLUSION 


This case study of China produces several 
interesting results. First, education, industrial 
employment, and all-sized cities are associ- 
ated with superior levels of economic devel- 
opment and physical quality of life. These 
findings provide general support for tradi- 
tional modernization arguments. Next, al- 
though externally linked coastal cities have a 
greater level of economic development than 
many areas of China, they do not possess a 
better physical quality of life. This is 
consistent with some dependency arguments, 
which assert that externally linked regions 
may experience uneven development. High 
levels of economic development will not be 
transformed into a superior quality of life for 
most people. Third, there is little evidence to 
support China's claim that small- and medium- 








Dependent Variables 
(2) (3) (4) 
Death Death Death 
Rate Rate Rate 
— 204*** — 203v — 204v» 
(011) (.011) (.011) 
— 024**«* — 023*** — 025*** 
(.007) (.007) (.007) 
.266*** 266*** 266*** 
(.018) (.018) (.018) 
296068 E 294909 2995+ 
(.019) (.019) (.019) 
.015 —.012 .010 
(.010) (.010) (.009) 
—.024 -— — 
(.017) — — 
— | —,050* — 
— (.025) — 
— — .016 — 
— (.024) — 
pe — 026 = 
— (.029) — 
— —.021 — 
— (.034) — 
— —-— i .012 
— _ (.051) 
2.141*** 2.150**» 2.168*** 
(.118) (.118) (.117) 
419 .420 .419 
na 1.979 na 


sized cities provide a better physical quality 
of life relative to larger cities. In fact, there is 
but one instance where only small (or 
medium-sized) cities have a statistically 
significant negative effect on a component of 
human misery, namely, the death rate (see 
Table 5).. Of course, this situation may 
change if pollution and other problems 
associated with large cities continue to 
worsen. 

A significant change in China involves the 
nation’s increased emphasis on foreign invest- 
ment, which may indicate that dependency 
and world-systems arguments are becoming 
increasingly relevant for the country. We wish 
to stress that our analysis provides extremely 
tentative support for the dependency perspec- 
tive, mainly because China’s new foreign 
investment policy was not implemented until 
1984, two years after our data were collected. 
Despite this fact, however, this analysis does 
show that several important externally linked 
coastal cities are associated with economic 
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Table 6. OLS Estimates of the Effects of Independent Variables on Illiteracy (N = 2,306) 














Dependent Variables 
Independent Q Q) (3) (4) 
Variables Illiteracy Illiteracy Hliteracy Illiteracy 
Education — 459*** — 462*** — .462*** — .463*** 
(.015) (.014) (.014) (.015) 
Industrial employment —.113*9* —.102*** — .10i*** —.H3*** 
(.010) (.010) (.010) (.010). 
Birthrate — .076*** — .054* — .056** — .052* 
(.026) (.024) (.024) (.025) 
Total population — .020*** _ — — 
(.007) om = = 
Elderly population .049 .004 .005 .025 
: (.026) (.025) (.025) (.025) 
Economic development —.119*** — .O87*** — .089*"* —.]15*** 
(.012) (.013) (.013) (.012) 
Urban dummy — —.152*** — — 
. — (.022) — — 
Small urban dummy — — -.215*** — 
— — (.033) — 
Medium urban dummy — — — .104*** — 
— — (.031) — 
Large urban dummy — — ~ .084* — 
= — (.039) — 
Metropolis dummy — ~ —.228*** — 
= = (.044) — 
External urban dummy — — — — .056 
— — — (.068) 
Constant 8.250*** 7.817*** 7.834*** 7.964*** 
(.180) (.157) (.156) (157) 
R .648 .654 .656 .647 
F-test na na 15.020*** na 
Note: See note to Table 2. 
development but not with physical quality of basic institutional framework was borrowed 


life. These cities have received an extensive 
amount of foreign investment since 1984, 
possibly exacerbating the gap between eco- 
nomic development and physical quality of 
life. 

While China represents a fascinating case 
study, it should also be examined in compar- 
ative perspective. An obvious comparison is 
with other socialist societies. On the one 
hand, the similarities to such societies have 
been discussed by Whyte and Parish (1984, p. 
359), who assert that many characteristics of 
Chinese urbanization 

are shared to one degree or another with other 

socialist societies—an emphasis on production 

rather than consumption, controls on expression 
and religion, high female labor force participa- 
tion, and so forth. Of all our comparative 
frameworks, the similarities with urbanism in 

Eastern Europe and the Soviet Union are most 

apparent. This is not too surprising, for the ideas 

that animated the Chinese urban reforms have 
their roots in Marxist-Leninist ideology, and the 


from the Soviet Union in the 1950s. 


On the other hand, Chinese urbanization 
retains some distinctive features compared to 
Eastern Europe and the Soviet Union. China 
has been more effective in restricting urban 
migration as it requires urban household 
registration and also tightly controls access to 
jobs and educational openings in cities. The 
country has maintained an amazing level of 
social control even compared to other social- 
ist countries. !5 

Chinese urban patterns also contrast sharply 


15 One example of China's social control 
pertains to its enforcement of population restric- 
tions in urban areas. The nation has implemented a 
variety of incentives (e.g., extra pay) to encourage 
one-child families, especially in urban areas. 
Partially because of these (sometimes coercive) 
policies, China's urban fertility rate dropped from 
3.3 to 1.4 between 1970 and 1982 (Shaoyu 1988, 
p. 26; see also Banister 1987). 


QUALITY OF LIFE IN CHINA 


with those found in the ideal-typic Third 
World country. Many underdeveloped coun- 
tries are considered “overurbanized” because 
they cannot control rapid urban migration that 
results in more unemployment (and underem- 
ployment), inadequate housing, and burgeon- 
ing squatter settlements. “Overurbanized” 
societies have not achieved the level of social 
control exhibited by China. These societies 
have weak states that are (1) unable to control 
their own populations (e.g., through migra- 
tion controls), and (2) unable (and sometimes 
unwilling) to circumvent “external” factors 
that contribute to excess urbanization. For 
example, many Third World residents have 
been “pushed” to urban areas by multina- 
tional corporate investment in rural areas. 
Moreover, they have been “pulled” to cities 
by foreign investment in urban enterprises 
(Bradshaw 1987). By contrast, post- 
revolutionary China has remained a relatively 
"closed" society that has not permitted 
extensive foreign investment. Although the 
country has allowed more foreign investment 
in recent years (especially in several coastal 
cities), it still tightly regulates foreign capital 
penetration. 

In conclusion, this study of China should 
be compared to other case studies and to 
cross-national research. Although cross- 
national studies are useful in establishing 
global trends, they are not especially belpful 
in explaining unique patterns of development 
in particular countries. Case studies also 
inform general theories of development by 
examining societies that exhibit special cir- 
cumstances and conditions. These character- 
istics are studied and then used to modify and 
adapt general theoretical perspectives. Case 
studies make a valuable contribution to 
development research and therefore should be 
combined with cross-national research to 
probe a variety of difficult issues. 
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Past research has treated self-esteem either as a social force or as a social 
product. However, this research has not given adequate attention to the reciprocal 
effects of the self-concept and various social and personal factors. A panel of 1886 
adolescent boys is used to explore the reciprocal relationships between self-esteem 
and three problems of youth: juvenile delinquency; poor school performance; and 
psychological depression. We find that low self-esteem fosters delinquency and that 
delinquency may enhance self-esteem. These reciprocal effects differ among 
Socioeconomic status groups. The relationship between self-esteem and school 
performance is primarily attributable to the effect of school performance on 
self-esteem. Finally, the causal relationship between self-esteem and depression is 
bidirectional. Substantive, methodological, and policy implications of these 


findings are discussed. 


The self-concept is a topic that bridges a 
variety of disciplines and has been the subject 
of thousands of studies (Ostrow 1982). The 
vast majority of them (McGuire and Padawer- 
Singer 1976) deal with self-esteem. Charac- 
teristically, these studies adopt one of two 
approaches: they focus on the self-concept 
either as a social product (a consequence of 
social influences) or as a social force (a cause 
of social behavior) (Rosenberg 1981; Kaplan 
1986). Some of these studies, however, 
prejudge the issue because they do not specify 
whether the self-concept is the cause or the 
effect of the correlated variable. In many 
cases, the causal direction is unequivocal. 


* The data and tabulations utilized in this article 

were made available (in part) by the ISR Social 
Science Archive. The data were originally col- 
lected by Dr. Jerald G. Bachman, Survey Research 
Center, Institute for Social Research, The Univer- 
sity of Michigan. Neither the original collector of 
the data nor the archive bears any responsibility for 
the analyses or interpretations presented here. 
. We are indebted to Ronald Schoenberg for 
statistical advice and to Maya Khallouf and Zita 
Givens for help in preparing the figures. The 
models in this paper were estimated by MILS, an 
advanced version of LISREL developed by Ronald 
Schoenberg. -The preparation of this article was 
facilitated by the award of a Guggenheim Fellow- 
ship to the first author. Request for reprints should 
be sent to Morris Rosenberg/Carmi Schooler, 
National Institute of Mental Health, Federal 
Building, Room B1-A14, 7550 Wisconsin Ave- 
nue, Bethesda, Maryland 20892. 
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The self-concept cannot be the cause of most 
sociodemographic variables. In several impor- 
tant instances, however, each variable can 
potentially affect the other. This paper 
examines three such instances. It deals with 
the causal connections between low self- 
esteem and juvenile delinquency, poor aca- 
demic performance, and psychological depres- 
sion. 

Although many investigators have explored 
the association between global self-esteem 
and various social and psychological prob- 
lems,! they have frequently been uncertain 
about whether low self-esteem is the cause or 
the effect of the problem. Some investigators 
have sidestepped the issue by suggesting that 
their data were descriptive and did not involve 
causal claims. Others (e.g., Gold and Mann 
1972; Purkey 1970) have acknowledged the 
possibility of reciprocal effects but have 
adopted the position that their theoretical 
focus was only on the effect of one variable 


! Kaplan (1975) has provided a literature review 
of the relationship between self-esteem and 
delinquency, crime, alcoholism, drug abuse, men- 
tal disorder, aggressive behavior, and suicidal 
behavior. Other investigators have examined the 
relationship between self-esteem and certain spe- 
cific problems such as delinquency (Wells and 
Rankin 1983; Gold 1978), school failure (Purkey 
1970; Wylie 1979), anxiety (R. Bums 1979), 
depression (Rosenberg 1985; Kaplan and Pokorny 
1969), alcoholism (Berg 1971), unemployment 
(Shamir 1986; Cohn 1978), and others. 
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on the other. Still others (e.g., Jensen 1972) 
have simply. acknowledged that each variable 
might affect the other. 

These approaches were adopted because no 
viable alternative seemed available. Today, 
however, it is possible, through the use of 
linear structural equation-based reciprocal 
effects analysis, to estimate the extent to 
which each variable affects the other (Joreskog 
and Sorbom 1976a, 1976b). 

One advantage of reciprocal effects analy- 
sis is that it enables us to uncover the complex 
dynamic forces at work. For example, if we 
consider any particular correlation (say, 
between self-esteem and delinquency), we do 
not know whether (1) each variable affects the 
other significantly and equally (bidirectional), 
(2) each variable affects the other signifi- 
cantly but the effect of one is greater than the 
effect of the other (predominant), (3) the 
bivariate association is entirely attributable to 
the effect of one of the variables upon the 
other (unidirectional), or (4) the effect of one 
variable on the other is positive while the 
effect of the second variable on the first is 
negative (countervailing).? 

These are not simply logical possibilities. 
As we shall see, each of the three zero-order 
correlations with which we start —self-esteem 
and delinquency, self-esteem and school 
marks, and self-esteem and depression— 
involves a different causal pattern. We shall 
attempt to show that such data can enrich our 
understanding of the zero-order relationships. 
Furthermore, as we shall see, the identifica- 
tion of the nature and direction of the causal 
influences can have both theoretical and 
policy implications. 


Self-Esteem. and Adolescent Problems 


Research has shown that global self-esteem is 
correlated with juvenile delinquency, aca- 
demic performance, and psychological depres- 
sion. In summarizing the research on the 
relationship between self-esteem and delin- 
quency, Wells and Rankin (1983, p. 13) 
concluded that these studies "demonstrate a 
consistent association between evaluative 
social experiences, self-evaluation, and a 
variety of delinquent behaviors.” Kaplan’s 





? We assume, of course, that a causal connec- 
tion exists between the variables, that i is, that the 
relationship is not spurious. 
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literature review (Kaplan 1975, pp. 79-83) 
also identified and described studies that 
showed .an inverse relationship between 
self-esteem and delinquency. Although these 
relationships are usually statistically. signifi- 
cant, they are rarely strong—usually between 
—.1 and —.2 (Wells and Rankin 1983). 

Research also shows that self-esteem and 
school marks are positively and significantly 
related. On the basis of a review of 22 
studies, Wylie (1979, p. 361) concludes: 
“The correlations between grade point aver- 
ages and the tests of over-all self-regard for 
which information was found are . mostly 
around .30." Most of these relationships are 
statistically significant and some are based on 
large well-selected samples (e.g., Bachman 
1970) and use well-established global self- 
esteem measures. In fact, according to Wylie 
(1979, p. 106), "significant associations : 
between achievement level and over-all self- 
regard indices have come more often from 
studies using well-known and relatively 
well-regarded instruments, as opposed to 
unknown ones with unknown psychometric 
characteristics." 

Finally, one of the most firmly established 
findings is the inverse association .between’ 
self-esteem and depression (Wylie 1979; 
Rosenberg 1985). Studies of children, adoles- 
cents, adults, and the aged all show. this 
pattern. For example, Bachman’s (1970) 
study of 2213 tenth-grade boys showed a 
correlation of —.51 between self-esteem and 
a.measure of "depressive affect" (Rosenberg 
1985). Pearlin and Lieberman's (1979) study 
of 2300 adults in the Chicago Metropolitan 
Area showed a relationship of — .49 between 
self-esteem and depression. Similar findings 
appear in Kaplan and Pokorny (1969) and 
Rosenberg and Simmons (1972). Clinical 
evidence (e.g., Beck, Rush, Shaw, and 
Emery 1979) supports these quantitative 
results. 

Although these data clearly show that 
self-esteem is significantly (though un- 
equally) related to each of these problems, 
they are not clear whether it is primarily the 
cause or the effect. There is, in fact, 
theoretical support for both views. Although a | 
number of theories are potentially relevant, in 
this paper we focus on two: self-esteem and 
self-consistency theory. 

Self-esteem theory. A core proposition of 
self-esteem theory is that self-esteem is a 
fundamental human motive. The self-esteem 
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Motive, also called the “self-maintenance 
motive” (Tesser and Campbell 1983), the 


“motive for self-worth” (Covington 1984), . 


and the. “self-enhancement” motive (Kaplan 
1975), has been identified by Maslow (1970) 
as one of the “prepotent” human needs. 
McDougall (1932) viewed the "sentiment of 
self-regard” as the “master sentiment." 
Hobbes expressed it both earlier and more 
vividly when he asserted that “the greatest joy 
of the human soul” is to have a high opinion 
of oneself (quoted in Allport 1968, p. 16). All 
these theories share the view that there exists 
a universal desire among human beings to 
protect and enhance their feeling of self- 
regard. 

Among the principles of self-esteem forma- 
tion (Rosenberg 1986) are reflected apprais- 
als, social comparison, and self-attribution. 
The principle of reflected appraisals, implicit 
in Cooley's (1912) looking-glass metaphor 
and Mead's (1934) conception of taking the 
role of the other, holds that people's feelings 
about themselves are strongly influenced by 
their judgments of what others think of them. 
Self-esteem is thus viewed as a product of 
social interaction. The principle of social 
comparison (Festinger 1954) holds that, in 
the absence of objective information about 
themselves, people judge themselves on the 
basis of comparisons with others (see also 
Pettigrew 1967). Self-attribution considers 
how naive observers characteristically at- 
tribute motives, intentions, causes, disposi- 
tions, etc: to themselves on the basis of their 
observation of their actions. When people 
draw conclusions about themselves by observ- 
ing the success or failure of their efforts, they 
are exemplifying the self-attribution process. 

Self-consistency theory. Self-consistency 
theory (also called “self-verification” theory) 
(Swann 1983) holds that people resist change 
in their self-concepts because of an "inborn 
preference for things that are predictable, 
familiar, stable, and uncertainty reducing" 
(Swann 1983, p. 34). This preference is 
exhibited in a number of ways. People make 
active efforts to obtain self-confirmatory 
feedback. They attempt to interact with others 
and to place themselves in situations that will 
support their opinions of what they are like. 
They also utilize modes of self-presentation 
(Goffman 1959) that will elicit from others 
responses that are consistent with their 
self-concepts. They attend to, remember, and 
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interpret information in ways that support 
their views of what they are like. 

Both self-esteem theory and self-consis- 
tency theory suggest certain causal connec- 
tions between self-esteem and these three 
problems of youth. Self-esteem theory sug- 
Bests that low self-esteem causes delin- 
quency. Kaplan (1980) contends that young- 
sters with low self-esteem have frequently 
undergone unsatisfactory experiences in the 
conventional society-experiences that have 
created painful feelings of doubt about their 
self-worth. Seeking to alleviate these feel- 
ings, many turn to the delinquent reference 
group to enhance their self-esteem. The 
delinquent group provides more favorable 
reflected appraisals, social comparisons, and 
self-attributions. First, it replaces the fre- 
quently negative reflected appraisals of the 
conventional societv with the positive re- 
flected appraisals of the delinquent group. In 
addition, as Gold (1978) notes, the brazen 
defiance of authority in the school is often 
played out before an appreciative audience, 
yielding positive reflected appraisals from 
classmates. Second. the delinquent group 
may provide more favorable social compari- 
sons. The youngster may compare more 
favorably with delinquent peers in terms of 
delinquent activities (for example, stealing, 
fighting, vandalizing) than he does with 
straight peers in conventional activities (for 
example, excelling at school work, getting on 
the honor roll). Third, the youngster may 
anticipate that he can make more positive 
self-attributions by observing the success of 
his efforts at delinquent activities than by 
judging himself in terms of the valued 
standards of the conventional society. Self- 
enhancement theory thus suggests that low 
self-esteem youngsters may turn to delin- 
quency in order to strengthen their feeling of 
self-worth. 

Kaplan’s (1980, p. 24) data are consistent 
with this view. In his large-scale study of 
junior high school pupils, he found that 
nondelinquents in the seventh grade with low 
self-esteem were significantly more likely 
than those with high self-esteem to become 
delinquent by the eighth grade. 

At the same time, if the delinquent 
experience improves reflected appraisals, 
social comparisons, and self-attributions, then 
it should enhance self-esteem. In this view, 
delinquency is expected to have a positive 
effect on self-esteem. Kaplan (1980, p. 207) 
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found that delinquent eighth graders were though it may seem strange, in some 
significantly more likely than nondelinquent instances youngsters may seek poor marks to 
eighth graders to experience enhanced self- protect their self-esteem. Covington and 
esteem when they reached the ninth grade. Omelich (1979) have suggested that poor 

Although self-enhancement theory suggests school performance may at times represent 
that self-esteem and delinquency affect one attempts by pupils to protect their self-esteem 
another, it also suggests that these influences by attributing their substandard academic 
may be countervailing. Self-esteem is ex- outcomes to lack of effort. Poor marks that 
pected to have a negative effect òn delin- can be attributed to lack of effort are much 
quency but delinquency is expected to Rave a less damaging to self-esteem than poor 
positive effect on self-esteem. The w performance that must be ascribed to lack of 
zero-order correlation between se estes, abiit Both self-consistency and self- 
and delinquency, then, may actually conceal "enhancement theory, then, agree that self- 
the operation of forces that partly counteract esteem can be a cause of academic perfor- 
one another. mance. : 

Self-esteem theory also leads to the predic- Finally, self-esteem theory suggests that 
tion that school marks are a cause of low self-esteem can cause depression. If the 
self-esteem. Pupils who do well in school desire for positive self-regard is a major 
should experience more favorable reflected motive of human beings, then the frustration 
appraisals, social comparisons, and self- of such a motive would almost inevitably be 
attributions. Schooling is generally a major experienced as depressing. Depression, how- 
arena of achievement among adolescents; ever, may also be a cause of self-esteem. The 
School marks are a particularly public, reason lies in the Pii of consistency. 
visible, overt indicator of an important aspect One of the most distinctive features of the 
of a pupil's worth; and academic success is depressed personality, according to cognitive 
generally valued in the society. Thus, those therapists (e.g., for Beck et al. 1979; D. 
who do well in school should receive more Burns 1980), is a pervasive system of 
favorable reflected appraisals from significant negative cognitions. The depressed person 
others. In addition, high marks in school thinks that the world is a rotten place, that 
should yield positive social comparisons. people are selfish, and that the future is 
Third, high grades would be expected to  hopeless. Consistent with this negative out- 
produce favorable self-attributions as young look on life is the expectation that the 
people, looking at their test marks and report depressed person would also develop negative 
cards, attribute these outcomes, whether good attitudes toward the self —for example, to feel 
or bad, to their own efforts and, on that basis, worthless, guilty, and unimportant. If this is 
draw corresponding conclusions about their so, then it would suggest that depression 
own worth. These three principles of self- causes low self-esteem. 
esteem formation, then, suggest that marks There are thus strong theoretical grounds 
would have an effect on self-esteem. for expecting self-esteem to cause certain 

But, according to self-consistency theory, social and personal problems and for expect- 
self-esteem should also affect school marks. ing these problems to cause self-esteem. The 
Lecky (1945), who originally formulated this aim of this analysis is to attempt to specify the 
theory, observed that many children with degree to which self-esteem and each of these 
adequate spelling ability nevertheless contin- problems affects the.other. Although some 
ued to spell poorly because they had defined writers. (e.g., Kaplan 1980) have provided 
themselves as poor spellers. This exemplifies evidence that self-esteem and delinquency do 
the principle that people may unconsciously affect one another, the aim of this article is to 
attempt to resist alteration of the pictures they go beyond such .findings.to ask: Which 
have developed of themselves. In this view, variable has the more powerful effect on the 
then, high and low self-esteem youngsters are other, what is the direction of this effect, and 
motivated to maintain the level of perfor- what is its strength? 
mance in school that is consistent with their i 
positive or negative judgments of their DATA: 
ability. 

This explanation does not necessarily The data for this secondary analysis are drawn 
conflict with self-enhancement theory. Al- from Youth in Transition, a panel study 
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directed by Jerold G. Bachman (1974) at the 
Institute for Social Research at the University 
of Michigan. Wave 1, conducted in the fall of 
1966, is based on a probability sample of 
2213 tenth-grade boys in 87 high schools 
throughout the contiguous 48 states; 1886 of 
these subjects participated in the second data 
collection phase in the spring of 1968. 

Although the study was based on a 
carefully selected sample of a defined popula- 
tion, it has certain limitations. First, it is 
restricted to boys. The nature and prevalence 
of juvenile delinquency is so different for 
boys and girls that it would be completely 
inappropriate to generalize from the former to 
the latter (Kaplan 1980). Second, we do not 
know whether the causal dynamics that 
obtained at the time of the initial study persist 
today. Third, it is necessary to consider the 
possibility of panel bias, especially repeated 
measurement effects and panel attrition. 
Bachman, O’Malley, and Johnston (1978) 
examined these problems and found that 
randomly selected young men who had not 
participated in earlier waves were very similar 
to those who had participated throughout the 
study. Panel attrition, however, did produce 
some biasing effects. Comparing the young 
men who participated in five waves of 
interviews with panel leavers, they found that 
the panel leavers differed from those who 
remained in the sample in a number of ways. 
However, those who remained in the panel 
were very similar to those who entered the 
original sample. Finally, except for the 
interviewer’s assessment of the respondent’s 
physical appearance, complexion, and physi- 
cal maturity, all measures were based on 
self-reports. 


MEASURES 


The self-esteem measure used in this study 
consists of 6 of the 10 items of the Rosenberg 
Self-Esteem Scale. (Methodological discus- 
sions of this measure appear in Rosenberg 
1986, 1989; Carmines and Zeller 1979; Wylie 
1974). Since this scale serves as a measure of 
global self-esteem, it is essentially content- 
free; that is, it excludes items dealing with 
specific attributes. Self-esteem, as reflected in 
this measure, does not imply feelings of 
superiority or perfection, but feelings of 
self-acceptance, self-respect, and generally 
positive self-evaluation. The items used in 
this study are the following: (1) I feel that I'm 
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a person of worth, at least on an equal plane 
with others. (2) I feel that I have a number of 
good qualities. (3) I am able to do things as 
well as most other people. (4) I feel I do not 
have much to be proud of. (5) I take a 
positive attitude toward myself. (6) At times I 
think I am no good at all. 

School performance is measured by the 
student's self-reported grade point average. 

Juvenile delinquency is measured by the 
following indices or indicators: (1) a 7-item 
index of delinquent behavior in school (e.g., 
“damaged school property on purpose," “hit 
a teacher"), (2) a 9-item index of frequency 
of delinquent behavior (e.g., how often "run 
away from home," "taken part in a fight 
where a bunch of your friends are against 
another bunch"), (3) a 10-item index of 
seriousness of delinquent behavior (e.g., 
"hurt someone badly enough to need ban- 
dages or a doctor," “used a knife or gun... 
to get something from a person"), (4) a 
9-item index of theft and vandalism (e.g., 
"taken something from a store without paying 
for it," "set fire to someone else's property 
on purpose"), (5) a single item dealing with 
whether the youth was ever expelled from 
school, (6) a single item dealing with trouble 
with police. With respect to the indices, it is 
the total index score, not its component items, 
that is entered into the confirmatory factor 
analysis. 

The depression measure is based on five 
items: the respondent's feeling that the future 
does not look bright; that things are hopeless; 
that he is bored; that he is down in the dumps; 
and that he is depressed. 

Finally, socioeconomic status is measured - 
by an index consisting of the following 
equally weighted indicators: father's .occupa- 
tional status; parent's education; possessions 
in the home; number of books in the home; 
and number of rooms per person. 


ANALYSES 


Self-esteem, delinquency, and depression 
were each estimated at two points in time as 
parts of full-information linear structural 
equation models. All of the correlations of 
residuals of the same variables at both points 
in time were estimated. In addition, signifi- 
cant correlations of residuals among different 
variables were included where suggested by 
an examination of the first-order partial 
derivatives. 


SELF-ESTEEM AND ADOLESCENT PROBLEMS 


Several features of the causal aspects of 
these models should be noted. First, panel 
data (Waves 1 and. 2, based on 1886 
respondents) are utilized, helping us to solve 
the structural equation models and to estimate 
the causal influences. In order to estimate 
linear structural equation models with recipro- 
cal effects, it is necessary to utilize instrumen- 
tal variables. An instrumental variable is one 
that, on theoretical grounds, would be 
expected to exercise a direct effect on one of a 
pair of reciprocally affected variables but not 
on the other. Cross-lagged effects are omitted 
from the model in order to provide a source of 
instrumentation for the reciprocal effects that 
are modeled at the second time period.? These 
Wave 2 effects are viewed as outcomes of 
long-term processes and thus represent the 
sum of the lagged and contemporaneous 
effects. (For a discussion of the rationale for 


' this procedure, see Heise 1975, pp. 184—85.) 


Wherever possible, we have utilized multi- 
ple instrumental variables to reduce our 
dependence on a single instrumental variable. 
In addition, in all of the models, we included 
a number of other exogenous variables that 
serve as controls. The analyses were carried 
out using covariance matrices, and standard- 
ized solutions are presented. 


RESULTS 
Zero-Order Correlations 


Table 1 shows that, with one exception 
(self-esteem and depression), the correlation 
between self-esteem and the other variables 
are similar at both time periods. 

These results correspond closely to those 
that generally appear in the literature. The 
weakest relationship is between self-esteem 
and delinquency; a stronger one exists 
between self-esteem and grades; and the 
strongest is between self-esteem and depres- 
sion. One advantage of the reciprocal effects 
analysis, we shall see, is that it helps us to 


? Analyses carried out using cross-lagged rather 
than total effects produced patterns of results 
similar to our reciprocal effects linear structural 
equation models. The sizes of the coefficients, 
however, are generally smaller because they do not 
include the contemporaneous effects. The advan- 
tages of linear structural equation models over 
cross-lagged panel correlations are discussed in 
Kessler and Greenberg (1981). 
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Table 1. Zero-Order Correlations of Self-Esteem and 
Social and Personal Problems at Two Points in 








Time 
Time 1 Time 2 
Self-esteem-delinquency —.09 -—.07 
Self-estecm-grades T.24 T.25 
Self-esteem-depression —.45 ~ 58 





understand why these relationships are so 
unequal in strength. 


Self-Esteem, and Delinquency 


Besides Time 1 self-esteem, six instruments 
were used for delinquency at Time 2 (that is, 
they were expected to affect self-esteem but 
not delinquency). The first three were mea- 
sures of anxiety: general anxiety (ANX-A) 
(7-iiem index); anxiety and tension (ANX-B) 
(5-item index); and psychophysiological indi- 
cators of anxiety (ANX-C) (18-item index). 
The other three instruments were the interview- 
er’s rating of three features of the respon- 
dent’s appearance: (1) general physical appear- 
ance, (2) complexion, and (3) physical 
maturity. Unattractive physical appearance 
would be expected to damage self-esteem but 
to have no direct effect on delinquency. The 
reasons for expecting anxiety to affect 
self-esteem are presented in Rosenberg (1989); 
anxiety was not expected to have a direct 
effect on delinquency. 

Instruments for self-esteem (that is, vari- 
ables expected to affect delinquency but not 
self-esteem) were (besides Time 1 delin- 
quency) the following values: academic 
achievement (4-item index), honesty (7-item 
index), social skills (6-item index), social re- 
sponsibility (4-item index), and religiousness 
(4-item index). For obvious reasons; we ex- 
pected that those less likely to embrace con- 
ventional values (that is, those for whom ac- 
ademic achievement, honesty, social skills, 
social responsibility, and religiosity were not 
highly valued) would be more likely to engage 
in delinquency. On the other hand, there were 
no compelling reasons to think that value con- 
formity would directly affect self-esteem. 

The following exogenous variables were 
also included: age, race, intact family struc- 
ture, father's education, mother's education, 
father’s and mother's Duncan occupational 
prestige scores, family socioeconomic status, 
and number.of siblings. Although multicollin- 
earity.may exist among some of the sociode- 
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mographic variables, that is not a problem 
here because our purpose is not to specify 
their independent effects but to control their 
combined effects. 


Results 
The model in Figure 1 fits the data reasonably 
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well, with a chi square/df ratio of 3.19. Using 
Joreskog’s goodness of fit measure, this index 
(and all other models included in this paper) 
is .999. 

Figure 1 shows that low self-esteem is 
significantly associated with an increase in 
delinquency (— .19). On the other hand, the 
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data suggest that delinquency may possibly be 
associated with an increase in self-esteem 
(+.08). This latter figure approaches, but 
does not quite attain, the conventional 
significance level (t— 1.82). (These figures 
are based on a model that includes the 
correlation of the residuals of self-esteem and 
delinquency. Had the model excluded this 
correlated residual, the effect of delinquency 
on self-esteem would be +.12, which is 
highly significant. More important, how- 
ever, is the fact that the positive path from 
delinquency to self-esteem is significant for 
certain subgroups but not for others. This 
becomes evident when we examine the 
reciprocal effects of self-esteem and delin- 
quency in different socioeconomic status 
groups. 

Socioeconomic status. Short and Strodt- 
beck (1965) and Hall (1966) have suggested 
that the relationship between self-esteem and 
delinquency may not be the same in different 
social classes. If, for example, deviance were 
more widely practiced and accepted in the 
lower-class culture, then deviance in this 
group might be an expression of conforming 
behavior (Jensen 1972). If this is so, then the 
reciprocal effects of self-esteem and delin- 
quency would be expected to differ in 
different socioeconomic status groups. One 
expectation would be that lower-class: delin- 
quency would be more a product of normative 
influences whereas higher-class delinquency 
would be more a product of personality 
influences. In other words, many lower-class 
youngsters may engage in delinquent behav- 
ior because they are conforming to group 
norms, not because they are searching for 
ways to bolster their shaky self-esteem. The 
reverse would be true for higher-class boys. If 
this were so, then low self-esteem should play 
a less important role in explaining lower- than 
higher-class self-esteem. 

Conversely, if higher-class youngsters are 
more apt to have been socialized to consider 
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delinquent behavior to be wrong, to be more 
severely condemned by most peers and 
parents for such behavior, and to recognize 
the potential damage to their future life 
prospects that a delinquent life style may lead 
to, then these influences might partly nullify 
the generally positive self-esteem effects of 
delinquent behavior. In this event, we would 
expect delinquency to have a less positive 
effect on self-esteem in the higher than in the 
lower class. 

In order to test these ideas, we examined 
the reciprocal effects of self-esteem and 
delinquency in different socioeconomic status 
groups. The sample was divided into thirds 
and respondents were assigned to low, 
medium, and high SES groups. The correla- 
tions between self-esteem and delinquency 


-were —.09, —.16, and —.07 in the low, 


medium, and high SES groups respectively at 
time 1, and —.06, —.09, and —.13 at Time 
2. 

Table 2 shows the reciprocal effects of 
self-esteem and delinquency in the three SES 
groups separately. As predicted, self-esteem 
plays a somewhat greater role in producing 
delinquency in the higher than in the lower 
SES group, but delinquency is a more 
effective device for boosting self-esteem in 
the lower than in the middle and higher SES 
groups. The reciprocal effects of self-esteem 
and delinquency thus appear to be conditional 
on the norms that prevail among adolescent 
males in different SES groups. 


Self-Esteem and School Marks 


Besides Time 1 grades, the following vari- 
ables are treated as instruments for self- 
esteem at Time 2; that is, they are assumed to 
have a direct effect on school marks but not 
on self-esteem: number of hours spent on 
homework; negative school attitudes index 
(8-item index); positive school attitudes index 
(15-item index); school motivation (4-item 


Table 2. Reciprocal Effects of Self-Esteem and Delinquency in Three Socioeconomic Status Groups 


Self-Esteem 
on Delinquency 
Total sample —.19* 
Low SES —.17* 
Medium SES —.22* 
High SES —.24* 





* Significant at .05 level. 


Effects of 
Delinquency 
on Self-Esteem x7/df ratio 
+.08 3.64 
+,14* 2.63 
+.05 1.69 
+.01 2.51 
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index); and valuation of academic achieve- 
ment (4-item index) We reasoned that" 
diligence in performing school assignments, 
positive orientation to school, and a high 
valuation of successful academic performance 
would tend to foster high grades but would 
not, in themselves, either raise or lower 
self-esteem. 

Variables treated as instruments for grades 
at Time 2 were self-esteem at Time 1, 
positive family relations (19-item index), 
number of best friends, physical attractive- 
ness, and stability of self-concept (5-item 
index). We reasoned that youngsters who get 
along well with their parents, have close 
friends, and are physically attractive would 
tend to have higher self-esteem. These 
qualities, in themselves, were not expected to 
affect school marks. The reasons for expect- 
ing stability of the self-concept (for example, 
whether or not attitudes toward the self 
seemed to change from moment to moment) 
to affect self-esteem are presented in Elliott 
(1986). 

In this model the following variables are 
entered as exogenous variables: race, age, 
intact family structure, father’s education, 
mother’s education, family socioeconomic 
status, father’s and mother’s Duncan occupa- 
tional prestige scores, and number of siblings. 
The effects of these variables are not 
discussed; they are treated as controls. 

In modeling the relationship between glo- 
bal self-esteem and grade point average, the 
data fit reasonably well (see Figure 2). They 
indicate that grades have a decidedly stronger 
positive effect on self-esteem than self-esteem 
has on grades. Whereas the effect of grades 
on self-esteem is +.15, the effect of 
self-esteem on grades is +.08. The latter 
figure approaches, but does not quite attain 
(t= 1.90), the conventional significance level. 
(The reciprocal effects are similar in all three 
SES groups.) 

The significant effect of school marks on 
self-esteem lends support to self-esteem 
theory. If superior performance within a 
valued arena of achievement — academic per- 
formance— produces favorable reflected ap- 
praisals, social comparisons, and self- 
attributions, then we can understand why 
grades exercise a positive effect on global 
self-esteem. On the other hand, the relatively 
weak effect of self-esteem on academic 
achievement lends little support to self- 
consistency theory. One reason may be that 
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we are dealing with global rather than specific 
self-esteem. Had we studied academic self- 
concept, for example, the effect might have 
been stronger. 


_ Self-Esteem and Depression 


Besides the exclusion of the cross-lagged 
effects, only two instrumental variables are 
employed in the analysis. Academic self- 
concept is an instrument for depression. We 
assume that it affects global self-esteem 
directly but does not have a direct effect on 
depression. Subjective physical health status 
is an instrument for self-esteem. Problems of 
physical health are commonly symptomatic of 
anxiety and depression and may have etiolog- 
ical significance. Unless physical health is a 
major source of pride to the individual (it 
does not usually seem to be), it should in 
itself have little effect on self-esteem. 

The following exogenous variables are 
included in the model for purposes of control: 
race, age, intact family structure, father’s 
education, mother’s education, socioeco- 
nomic status, father’s and mother’s Duncan 
occupational prestige scores, and number of 
siblings. 

Figure 3 presents the results. The goodness 
of fit test indicates a chi square/df ratio of 
4.80. Though fitting somewhat less well than 
the other models, the goodness of fit appears 
acceptable, especially in light of the large 
sample utilized. 

Both self-esteem and depression, the data 
show, significantly affect one another, but the 
negative relationship between the two vari- 
ables is due somewhat more to the effect of 
depression on self-esteem ( — .27) than to the 
effect of self-esteem on depression (— .21). 
(Both effects are stronger in the lower than 
the higher SES groups.) 

The finding that low self-esteem leads to 
depression is consistent with self-esteem 
theory, which holds that people are motivated 
to maximize their self-esteem. But it is also 
clear that depression can be responsible for 
low self-esteem. Many different causes of 
depression— biological, psychological, and 
sociological— have been identified (Hippius, 
Klerman, and Matussek 1986). Whatever the - 
cause, once the depression takes root, it 
comes to include the self in its orbit of 
pessimistic thoughts. This is to be expected in 
the light of self-consistency theory. 
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Fig. 2. Self-Esteem and Grades 


SUMMARY AND DISCUSSION 


This paper has explored the reciprocal effects 
of global self-esteem and three important 
social and personality problems of male 
adolescents: juvenile delinquency, academic 
performance, and psychological depression. 
These data, we suggest, have substantive, 
methodological, and policy relevance. 


Substantive Relevance 
Unlike other reciprocal effects investigations 
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of self-esteem and delinquency, which focus 
on lagged effects (Bynner, O'Malley, and 
Bachman 1981; Rosenberg and Rosenberg 
1978), the present study examines the total. 
effects (lagged and contemporaneous) of each 
variable on the other. These effects are 
estimated by means of a linear structural 
equation analysis that takes account of 
measurement error both in the sample as a 
whole and in separate SES subgroups. In 
general, our findings correspond most closely 
to those reported by Kaplan (1980). Bearing 
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Fig. 3. Self-Esteem and Depression 


in mind the marginal significance of the effect 
of delinquency on self-esteem in the total 
population, we find that low self-esteem 
fosters delinquency and that delinquency may 
enhance self-esteem. We also find, as Kaplan 
does, that the effect of self-esteem on 
delinquency is stronger in the higher tban in 
the lower SES group but is significant in 
both. On the other hand, the effect of 
delinquency on self-esteem is stronger in the 
lower SES group, where it is highly signifi- 
cant, than in the higher SES group, where it is 
. not. 
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Studies of the reciprocal effects of self- 
esteem and school marks are few and are also 
generally based on the examination of lagged 
effects (e.g., Calsyn and Kenny 1977). Our 
data indicate that the well-established relation- 
ship between self-esteem and academic per- 
formance is primarily attributable to the 
effects of school marks on self-esteem rather 
than the reverse. Global self-esteem appears 
to have littl or no effect in enhancing 
academic performance. 

Finally, we are not aware of any studies 
that have examined the reciprocal effects of 
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self-esteem and depression. Our data indicate 
that each has a substantial effect on the other. 
Further research on this question is obviously 
needed. f 

In general, our findings are in agreement 
with self-esteem theory. They are consonant 
with the view that human beings are moti- 
vated to protect and enhance their self-esteem 
and that their self-esteem levels depend 
heavily on reflected appraisals, social compar- 
isons, and self-attributions. Self-consistency 
theory is also supported by the- finding that 
depression causes self-esteem but not by the 
finding that global self-esteem has little or no 
effect on school marks. In this study, as in a 
number of others (Jones 1973), self-esteem 
theory receives greater support than self- 
consistency theory. 


Methodological Relevance 


One of the most striking findings of this study 
is its demonstration of how varied the 
“mutual effects” (Lazarsfeld 1972) of two 
variables may be. Each of the three bivariate 
relationships shows a distinctly different 
pattern of effects. One pattern is countervail- 
ing, the second chiefly unidirectional, and the 
third, bidirectional (mutually reinforcing). 

These different patterns of effect help us to 
understand why the relationships are so 
unequal in strength. We noted in Table 1 that 
the weakest relationship is between self- 
esteem and delinquency; intermediate in 
strength is the relationship between self- 
esteem and school marks; and the strongest is 
between self-esteem and depression. By 
“getting inside” these relationships to specify 
how each variable affects the other, it is 
possible to gain a better understanding of why 
the strength of these zero-order correlations 
differs so greatly. 

For example, the weak bivariate relation- 
ship between self-esteem and delinquency is 
not solely a reflection of the fact that the 
causal influences are weak (although that is 
certainly the case); it also reflects the fact that 
they are countervailing. Although high self- 
esteem lowers delinquency, high delinquency 
may raise self-esteem. Each variable does 
affect the other but, because each partially 
cancels the other’s influence, the total rela- 
tionship is small. 

Although the effects of self-esteem and 
school marks on one another are not counter- 
vailing, neither are they mutually reinforcing. 
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The data show that school marks have a 
significant effect on self-esteem. This effect, 
in the absence of an effect of self-esteem on 
School marks, would be a clear example of a 
unidirectional effect. However, since the 
effect of self-esteem on marks nearly reaches 
the conventional significance level, it may be 
more appropriate to say that the relationship is 
chiefly attributable to the effect of school 
marks on self-esteem. The possible effect of 
self-esteem on school marks contributes little 
to the strength of the relationship. 

Finally, the comparatively strong associa- 
tion between self-esteem and depression is 
partly attributable to the fact that each 
psychological disposition reinforces the other 
(bidirectional effect). Feelings of depression 
lead to low self-esteem and low self-esteem 
fosters depression. Although depression exer- 
cises a somewhat stronger effect, both 
variables function to strengthen the relation-. 
ship. 

In sum, the weakest relationship appears 
when the two causal effects function in 
opposition to one another; the intermediate 
relationship when one of the causal variables 
exercises a clear causal effect whereas the 
other exercises little or no effect; and the 
strongest relationship when both causal vari- 
ables reinforce one another. These dynamic 
processes, of course, are not solely responsi- 
ble for the unequal sizes of the zero-order 
correlations, but they do contribute to these 
outcomes. 


Policy Relevance 


In 1986 the California Legislature voted to 
establish a State Task Force to Promote 
Self-Esteem, and Personal and Social Respon- 
sibility (State of California 1986). Among the 
main charges to the Task Force was “to 
promote public and personal awareness of the 
role of developing healthy self-esteem as a 
way of preventing social problems" (empha- 
sis supplied). Among the social problems 
specified were “violence and crime, alcohol- 
ism and drug abuse, welfare dependency, 
teen-age pregnancy, academic failure, recidi- 
vism, child and spousal abuse," and, gener- 
ally, "failure of responsible citizenship" 
(State of California 1986, p. 10). 

The clear implication of this charge is that 
self-esteem is a cause of these problems. But 
if self-esteem is the effect rather than the 
cause of the social problems, then the effort 
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to develop healthy self-esteem “as a way of 
preventing social problems” would be largely 
pointless. 
As far as delinquency is concerned, our 
data, suggest that the California Commission 
is, in a general way, on the right track. 
` Whether low self-esteem is responsible for 
the full range of deviant behaviors identified 
by the Commission is unlikely, but our 

findings indicate that it does contribute to the 
occurrence of delinquency. Efforts to raise 
self-esteem would thus seem to be warranted. 
But our data also highlight two problems that 
the Commission must face. First, any general 
social policy will not necessarily be equally 
effective among all social groups. Enhancing 
self-esteem may be somewhat more success- 
ful in forestalling delinquency in the higher 
than in the lower social class. But if 
delinquency is less widespread in the higher 
class, then the general effectiveness of such a 
strategy would be weakened. In addition, the 
data call attention to one of the difficulties 
that the Commission may face in attempting 
to induce delinquents to abandon their 
antisocial behavior, namely, that by sacrific- 
ing their deviance, they may be sacrificing 
their self-esteem. 

Another social problem that the California 
Commission must address is "academic 
failure." Several writers (Glasser 1969; Gold 
1978) have süggested changes that should be 
introduced into the school system to enhance 
self-esteem. Some of these proposals are 
quite radical and clearly entail increased 
costs. But if the relationship between self- 
esteem and academic failure is attributable to 
the effect of academic performance on self- 
esteem more than the other way around (as 
our data suggest), then the costly and radical 
proposals for change in the school would be 
highly questionable. Research shows that 
efforts to improve academic performance by 
raising self-esteem have generally not met 
with success (see Scheirer and Kraut 1979). 
Whether improvements in academic self- 
concept (rather than global self-esteem) would 
help school performance is unknown. 

Finally, with respect to the relationship 
between self-esteem and depression, the main 
*policy" implication would appear to involve 
the choice of therapeutic strategy. From this 
perspective, the data suggest that one way to 
reduce depression is to raise self-esteem and 

one way to raise self-esteem is to reduce 
depression. At present, more is known about 
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how to reduce depression (including the use 
of physical, pharmacological, and cognitive 
devices) than how to raise self-esteem. At 
the same time the data suggest that the 
therapist who seeks to.overcome depression is 
well advised to attempt to enhance the 
individual's self-esteem. 

We believe that much more has to be 
known before more general and integrated 
theories of self-esteem and self-concept can 
be developed. Much of what has to be learned 
involves ascertaining the nature of the causal 
connections between self-esteem and the 
sociological and psychological phenomena to 
which it is empirically and conceptually 
related. Our reciprocal effects analyses and 
their findings are thus necessary steps toward 
the development of a unified, empirically 
based and sociologically informed theory of 
the self-concept. 
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This research joins sociological and demographic traditions in a study of villages 
as contexts for contraceptive behavior in rural Egypt. Using survey data collected 
in the early 1980s, we explore the effects of village, household, and individual 
characteristics on contraceptive use and expectations about future use. Primary 
interest centers on the effects of the village variables, including the structure of the 
village economy, modernization of agriculture, level of school participation, and 
family planning service environment. The analysis demonstrates clearly that 
contraceptive behavior in rural Egypt varies systematically with these features of 
the village setting. In addition, village effects appear to vary according to 
characteristics of individual respondents: women respond differently depending on 
the stage in their reproductive career and their motivation to regulate fertility. 


How does large-scale social and economic 
change transform the lives of individuals 
living in a society? How do individuals in 
turn transform the society in which they live? 
These questions lie at the heart of sociology 
and have guided a broad program of theory 
and research since the inception of the 
discipline (Alexander and Giesen 1987). 
Despite this interest, sociologists are gener- 
allp unaware of the intensity of recent 
demographic research addressing questions of 
micro-macro linkage: studies of communities 
as contexts for reproductive behavior, largely 
in rural areas of developing countries (see 
Bilsborrow and Guilkey 1987; Casterline 
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1985a; Tsui 1985 for reviews). The demo- 
graphic literature takes up many issues of 
interest to sociologists, especially of a 
methodological nature. These studies also 
constitute an important collection of empirical 
findings about the effects of local context in 
diverse settings. 

Here we join sociological and demographic 
traditions in a study of villages as contexts for 
contraceptive behavior in rural Egypt. From 
sociology comes a clear conceptual frame- 
work for examining contextual effects (Bla- 
lock 1985; Boyd and Iversen 1979). The 
demographic tradition supplies the specific 
substantive hypotheses to be tested. We 
reformulate and apply, at the local level, key 
ideas from demographic transition theory as it 
has evolved over the decades (Caldwell 1982; 
Coale 1973; Notestein 1953). We are also the 
beneficiaries of explicit attempts to theorize 
directly about community effects (Casterline 
1985a; McNicoll 1980, 1984; Potter 1983). A 
wide range of community factors are of 
potential importance to contraceptive behav- 
ior, including characteristics of labor and 
commodity markets, norms concerning roles 
of women and children, and the density of 
health and family planning services. These 
factors are hypothesized to influence contra- 
ceptive behavior by altering the value of 
children to parents and households, costs of 
childbearing, and barriers to adopting and 
continuing to use a contraceptive method. 
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We argue that social and economic change 
translates into a diversity of effects that may 
vary from one community to the next, and 
from one individual to the next. Rural Egypt 
provides an interesting setting in which to 
explore these ideas because of the extent and 
pace of the changes that have characterized 
the period since the 1952 Revolution, in the 
structure of local economies and extension of 
services.! Egyptian villages differ in the 
relative size of the agricultural sector, the 
mechanization and commercialization of agri- 
culture, educational opportunity, and family 
planning program characteristics. What con- 
sequences might these differences have for 
contraceptive behavior? Given a dependent 
variable measured for individuals and a 
mixture of individual- and village-level inde- 
pendent variables, there are a variety of 
potential effects: direct and indirect; additive 
and interactive (Blalock 1985; Mason, Wong, 
and Entwisle 1983; Pullum, Billy, and 
Brewster 1987). As described in further detail 
below, we expect all types of village effects 
to operate in rural Egypt. i 

To model these effects we draw on data at 
two levels of analysis. Data on characteristics 
of women and their households come from 
the 1980 Contraceptive Prevalence Survey 
(CPS) (Khalifa, Sayed, El-Khorazaty, and 
Way 1981), which interviewed 5,313 ever- 
married women aged 15-49 living in rural 
Egypt. Information about the 124 villages 
which composed the primary sampling units 
of this survey was gathered in a separate data 
collection effort two years later (Sayed 1982). 
The 1982 Community Survey obtained some 
500 separate items on these (and other) 
villages, drawing on 1976 census tabulations 
for local areas; administrative records for 
village agricultural cooperatives, schools, and 
clinics; and interviews with key informants in 
the village. Our. analysis is based on a 
merging of these two data sets. 


MODELING CONTRACEPTIVE 
BEHAVIOR 


In rural Egypt, contraceptive prevalence is 





l Qur discussion draws on the large social 
scientific literature on rural development in Egypt, 
especially Abdel-Fadil 1975; Abdel-Khalek and 
Tignor 1982; Adams 1986; Hopkins 1987; Hopkins 
and Mehanna 1981; Ikram 1980; Marsot 1985; 
Radwan 1977; Richards 1982. 
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relatively low. Women interviewed in the 
CPS were asked: “Do you or your husband 
currently use now, or have you used during 
the past month, any method to avoid getting 
pregnant?” (Q306). In response, 17.1 percent 
of currently married women aged 15—49 said 
yes (Khalifa, Sayed, El-Khorazaty, and Way 
1982).? Those not using a method were then 
asked: “Do you think you or your husband 
will do something in the future to avoid 
getting pregnant?" (Q402). Almost half (46 
percent) responded affirmatively, thereby 
indicating predisposition toward future use. 
Especial in a population such as Egypt's 
where use of modern contraceptives has just 
begun to diffuse, we expect that effects of 
community characteristics and other determi- 
nants of contraception are as likely to be 
expressed in receptivity toward future use as 
in use per se. 

Using this information, we construct a 
three-category dependent variable: currently 
using (USE), expecting to use in the future 
(EXPECTS), not currently using nor expect- 
ing to use (NEITHER). When stratified by 
age, the distribution of the sample across 
these three categories is as follows: 


AGE 
15-24 25-34 35-49 15-49 
Currently using 8.2 20.5 20.5 17.1 
Expects to use 50.0 46.8 19.6 38.1 
Neither 41.8 32.7 59.9 44.8 
Total 100.0 100.0 100.0 100.0 
Number of cases 971 1,253 1,210 3,434 


The sample, smaller than the total inter- 
viewed in the 1980 CPS, is restricted to 
currently married women for whom complete 
information is available at the individual and 
village levels.? 





? The oral contraceptive is the dominant method: 
11.3 percent of women (two-thirds of current 
users) report current use of the pill. 

3 Sample attrition occurs mainly because of 
missing data for villages. Comparing means for 
selected individual and household variables in the 
full and restricted samples does not point to any 
obvious selectivity. For village-level variables, 
there is a slight tendency for villages in the full 
sample to be less dominated by agriculture, more 
mechanized and commercialized, with higher 
levels of schooling, more pharmacies, and more 
outreach workers. The differences are minor, 
however. 
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There are several ways to model the 
dependent variable. Questionnaire design 
leads naturally to a sequential specification: a 
decision about current use followed, condi- 
tionally, by a decision about expected future 
use. We do not opt for this specification, 
however, because sequential models make 
assumptions about the decision-making pro- 
cess that go beyond our province. Do 
respondents first decide whether they want to 
practice contraception now, and then, if not, 
whether they expect to do so in the future? 
Or, do they first decide whether they plan to 
use a method at some point in their 
reproductive lives, and then, if so, whether to 
use one now or put it off into the future? 
Although either sequence is reasonable and 
may hold for some subset of respondents, 
each focuses attention on a different set of 
contrasts among categories of the dependent 
variable, and neither captures movement 
among all three states. Indeed, there is 
interest in each of the possible contrasts. A 
woman occupying any one of the three states 
could move to any of the other two states over 
some period of time (e.g., the next 12 
months). We therefore prefer a simultaneous 
to a sequential approach to modeling the 
dependent variable and treat the dependent 
variable as a simple polytomy, estimating 
multinomial logistic regressions to obtain the 
full set of effects. 

The statistical model can be expressed as 
follows: 


Pon Gea exp (B, Z) A 
Ors 2; exp (B, Z) 


Where j = 1,2,3 refers to categories of the 
dependent variable (USE, EXPECTS, NEI- 
THER), i = 1, .. . N indexes individuals, 
and Z is the vector of independent variables to 
be described below (Maddala 1983).4 The 


“The small number of respondents in a 
significant minority of villages, in combination 
with insufficient within-village variation on key 
individual and household variables, places con- 
straints on the statistical modeling that can be 
done. Villages were selected with probability 
- proportional to population size from six strata 
defined by region of residence (Upper and Lower 
Egypt) and participation in and exposure to the 
Population and Development Program. Within 
villages, segments were selected according to size 
of constructed area and households sampled 
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equations are estimated by maximum likeli- 
hood methods. An important assumption of 
the model is that the probability of choosing 
any one category of the dependent variable is 
unaffected by the existence of other catego- 
ries in the choice set, that is, the indepen- 
dence of irrelevant alternatives (Maddala 
1983). We tested this explicitly with tests 
described by Hausman and McFadden (1984) 
and found no consistent evidence to reject the 
IIA assumption.5 

To understand villages as a context for 
contraceptive behavior, the challenge is to 


identify characteristics of potential relevance 


which also vary in a meaningful way from 
one village to the next. Following the 
theoretical framework developed by Easterlin 
(1978) and adapted by Hermalin (1983), two 
general classes of village variables are of 
interest. The first set affects the costs and 
benefits of childbearing and consists mainly 
of social and economic characteristics reflect- 
ing opportunity structure (e.g., the structure 
of labor and commodity markets, the nature 
of agricultural activity, and schooling oppor- 
tunities for children) but also includes norms 
about the roles of women and children. The 
second class of variables influences the costs 

of contraceptive use and involves the accessi- 
bility of family planning services and also 
local norms about fertility regulation. To 
represent them, we have selected seven 
indicators: the extent to which agriculture 
dominates the village economy (NONAG); 
commercialization (MARKET); agricultural 
mechanization (TRACTOR); school participa- 
tion levels (EDPARTIC); and various aspects 
of the family planning environment (PHAR- 
MACY, MEDICAL, RAIYDA).5 According 


randomly within segments. The number of respon- 
dents per village thus varies, from 16 to 79. 
Methods that require separate estimation within 
each village (e.g., Wong and Mason 1985) cannot 
be applied to enough villages to make the statistical 
refinement associated with these approaches worth- 
while. "s 

5 Hausman-type test statistics were calculated 
for the equations reported in Tables 2 and 3 below. 
For the analysis of village effects only (Table 2), 
the IIA assumption is not rejected. After the 
addition of selected individual and household 
characteristics (Table 3), the assumption fails two 
of six times, for the USE/NEITHER contrast in the 
15—24 age-group and the EXPECTS/NEITHER 
contrast in the 35—49 age-group. 

© Despite the richness of the data set, we do not 
have a comprebensive set of indicators for all 
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to analyses reported elsewhere (Entwisle, 
Casterline, and Sayed 1989; Entwisle, Pic- 
cinino, and Sayed 1988), these measures 
capture key dimensions of variability among 
villages with a minimum of redundancy 
among measures. 

Table 1 provides definitions and descriptive 
statistics for the village variables, for rural Egypt 
as a whole and by major region (Lower and 
Upper Egypt). Lower Egypt consists of the Nile 
Delta (north of the Cairo metropolis) and areas 
bordering the Suez Canal. Upper Egypt incor- 
porates the narrow band of cultivated area and 
settlements stretching southward along the Nile 
from Cairo to the border with Sudan. Lower 
Egypt is more densely populated, is character- 
ized by more modernized agricultural practices 
and a longer history of technological innovation 
in agriculture (because of its more intensive 
integration in the world economy beginning in 
the early 19th century), and the population is 
culturally and ethnically more diverse (because 
of the presence of Cairo and Alexandria and the 
proximity of Mediterranean society and cul- 
ture). Also, women are less secluded in Lower 
Egypt, participating more in agricultural field 
work and circulating more freely in towns (Ad- 
ams 1986). In our analysis, variation in contra- 
ceptive behavior between the two regions is 
controlled through inclusion of an indicator vari- 
able for residence in Upper Egypt (UPPER). 

The individual variables are of interest 
mainly to the extent that they mediate or 
condition the effects of the village variables. 
Motivation to restrict fertility is represented 
by two measures: the first (LIMIT) is coded 1 
for respondents wanting no more children, 0 
otherwise; the second (SPACE) is coded 1 for 


village variables of potential interest. For example, 
we lack information on land distribution, kinship 
Structures (which span households and represent an 
important means of integration within and between 
villages), authority patterns, the views of the local 
' imam regarding matters of contraception, other 
aspects of ideological climate, the content of 
educational instruction, and the quality (as distinct 
from the quantity) of family planning services. 
Moreover, not every aspect of diversity among 
villages will be important in defining the context of 
contraceptive behavior, nor will every relevant 
characteristic vary among villages. An example of 
the latter concerns the position of women in the 
village and especially their educational and employ- 
ment opportunities, which despite their obvious 
importance do not vary much in our cross-section 
of villages. 
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respondents wanting more children but not for 
at least one year, 0 otherwise. Socioeconomic 
status, which for Hermalin (1983) represents 
an indirect measure of the costs of fertility 
regulation, is also represented by two indica- 
tor variables: ATTEND is coded 1 for 
respondents who have completed at least one 
year of schooling, 0 otherwise; NOTINAG is 
coded 1 for respondents whose husbands 
work outside of agriculture and who live in 
households owning no land, 0 otherwise. 
Finally, to allow for differences in the 
implications of village context depending on 
stage of the reproductive career, we stratify 
the analysis by age. All equations are 
estimated separately for women ages 15-24, 
25-34, and 35-49. Differences in the contra- 
ceptive behavior of these three age-groups 
also reflect norms concerning appropriate 
ages for childbearing and birth cohort effects 
(e.g., receptivity to innovative behavior). 
Within age strata, age (in years) is entered as 
a continuous predictor (AGE) to control for 


.perceived and actual fecundability and to 


some extent for achieved parity.” 


VILLAGE EFFECTS 


To understand villages as a context for 
contraceptive behavior in rural Egypt, a 
logical place to begin is with an examination : 
of village effects only. Do village-level 
differences in economic structure, school 
participation, and family planning services 
lead to differences in contraceptive behavior? 
This question logically precedes questions 
about the degree to which variables 
measured at the individual and household 
level might mediate the effects of village 
variables. Indeed, given the likelihood of 
indirect effects involving individual-level 
variables, a focus on direct effects net of 
individual-level variables will tend to under- 
state the importance of community context 
and ultimately lead away from an under- 


7 We explicitly do not include achieved parity 
(or number of living children). To the extent that 
parity affects contraceptive behavior, it does so 
largely through increased motivation to limit and 
space births, which are already in our model. 
Moreover, because parity is a consequence as well 
as a determinant of contraceptive use, it would 
introduce substantial causal complexity at the 
individual level (which we wish to avoid in this 
analysis). 
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Table 1. Descriptive Statistics* on Village Variables, by Region 
OVERALL LOWER EGYPT. UPPER EGYPT 
Stan- Stan- ‘ Stan- 

Variable Variable Devia- Devia- Devia- 

Name Description Mean tion Min. Max. Mean tion Min. Max. Mean tion Min. Max. 
NONAG Proportion of 0.29 0.16 0.04 0.77 0.30 0.18 0.04 0.77 0.27 0.14 0.06 0.65 
male labor force ` 

in nonagriculture 

MARKET Proportion of 0.16 0.23 0.00 0.95 0.19 0.22 0.00 0.82 0.11 0.23 0.00 0.95 
agricultural land 
in cash crops . . 

TRACTOR Tractors per 100 1.39 2.20 0.00 20.00 1.56 2.65 0.00 20.00 1.11 1.07 0.10 4.67 
feddans? in 
conventional 
crops : 

EDPARTIC , Proportion of 0.04 0.03 0.00 0.22 0.05 0.03 0.00 0.22 0.03 0.01 0.00 0.07 
population with 
preparatory 
Schooling^ or 
greater 

PHARMACY Number of 0.92 1.27 0.00 6.00 0.83 1.19 0.00 6.00 1.06 1.38 0.00 6.00 
pharmacies 
(public and 
private) 

MEDICAL Number of 1.23 1.31 0.00 10.00 1.31 1.57 0.00 10.00 1.09 0.65 0.00 3.00 
medical workers i 
providing family 
planning services 

RAIYDA Number of family 1.08 1.22 0.00 10.00 1.38 1.32 0.00 10.00 0.58 0.82 0.00 3.00 
planning : 


extension workers 


* Statistics calculated for 3,434 currently married women aged 15-49, 2,150 in Lower Egypt and 1,284 in Upper 


one = 1.03 acres. 


* Completion of preparatory schooling requires nine years of schooling. 


standing of how community characteristics 
affect current use and expectations about 
future use. This is why we reverse the 
customary approach in recent social- 
demographic analyses. Rather than beginning 
with the individual and household determi- 
nants of contraceptive behavior and then 
adding potential macro determinants, we 
begin with village characteristics. Only after 
we have examined the effects of the village 
characteristics will we assess the degree to 
which these effects are mediated by variables 
measured at the individual and household 
level. The one exception is respondent age. 
As will be apparent, age is a powerful 
predictor of contraceptive behavior in rural 
Egypt, and we choose to control for age 


'compositional differences among villages 


when examining effects of village variables. 
Multinomial logistic regressions containing 

the full set of village variables (and also includ- 

ing the continuous AGE term and controlling 


regional differences) are estimated for the three 
age strata; see Table 2. (We provide the means 
and standard deviations of the independent vari- 
ables for each age-group, and correlations 
among these variables for women 25-34, in the 
Appendix.) For purposes of presentation, the 
statistical model shown earlier is reparameter- 
ized so that the coefficients (Bj) refer to con- 
trasts of USE/NEITHER, EXPECTS/NEI- 
THER, and USE/EXPECTS. The coefficients 
represent effects on the log-odds of choosing 
the “numerator” category as opposed to the 
“denominator” caiegory. Statistical tests of ef- 
fects of specific terms are carried out assuming 
that the ratio of the logistic regression coeffi- 
cient to its estimated asymptotic standard error 
is a standard normal variate (Maddala 1983). 
We devote the remainder of this section to a 
discussion of Table 2, describing our theoreti- 
cal expectations about the effect of each village 
variable and comparing these expectations with 
the empirical results. 
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Table 2. Logistic Regressions for Contraceptive Use and Expectations, Village Variables Only: By Age 


Age 15-24 
Use/ | Expect! | Use/ Use/ 

Contrast neither neither expects neither 
INTERCEPT —8.25* —1.22 —7.04* —5.47* 
NONAG —1.74 2.30€ | —4.04* 1.62* 
LNMARKET —2.59* 0.58 —3.17* 0.80 
LNTRACTOR 007 -01l 0.18 0.23 
EDPARTIC 16.02* 1.39 14.63* 9.36* 
LNPHARMACY 0.58* -—0.30 0.88* 0.19 
LNMEDICAL —0.09 | —0.08 —0.01 0.08 
LNRAIDA 0.33 0.01 0.31 0.16 
UPPER —1.11* -0.47* —0.64  -—1.46* 
AGE 0.32* 0.06 0.27* 0.14* 
Log-likelihood — 831.74 
Model x? 116.96 

(df) (18) 

N 971 


* Denotes significant at .05 level (two-tailed). 
For definition of variables, see Appendix A. 


The Village Economy 


In rural Egypt, agriculture dominates the vil- 
lage economy (Table 1) and its labor-intensive 
nature (Ikram 1980; Richards 1982) probably 
bolsters desires for large families: indeed, some 
of the essential tasks in Egyptian agriculture 
(field work in cotton, for example) are partic- 
ularly well suited for children (Levy 1985). 
Thus, we expect current contraceptive use and 
plans for future use to be positively related to 
tbe proportion of the labor force outside of ag- 
riculture (NONAG). This expectation is fully 
realized at ages 25—34 and 35—49, but only 
partially at ages 15—24, as illustrated through 
predicted probabilities calculated from Table 2 
coefficients with all other independent vari- 
ables set at their means: 


USE  EXPECTS NEITHER 

Ages 15-24 
NONAG = 0.10 .09 Al .S0 
= 0.30 .05 54 41 
= 0.50 .03 65 .32 

Ages 25-34 
NONAG = 0.10 .16 .46 .38 
. = 0.30 .18 .50 32 
= 0.50 . .20 54 26 

Ages 35—49 
NONAG = 0.10 13 .13 75 
= 0.30 .18 .18 .64 
= 0.50 .23 25 52 


Among younger women, the likelihood of 
intending to use in the future is substantially 














Age 25-34 Age 35-49 
Expects/ Use/ Use/ —-Expects/ Use/ 
neither expects neither neither expects 
0.23 — 5.69" 5.51* 8.38%  —2.88* 
1.31* 0.31 2.37* 2:54* ~0.17 
0.39 0.41 1.34* 0.75 0.60 
—0.11 0.33* 0.00 —0.05 0.14 
4.66 4.69 5.95* —1.67 7.62* 
—0.05 0.24 0.03 0.04 ~0.01 
0.06 0.00 —0.01  -0.15 0.14 
0.10 0.06 0.53* 0.21 0.32 
—0.73* —0.73* -—1.17* -—0.47* | -—0.70* 
—0.01 0.15* ~0.20* -—045* 0.06* 
— 1,215.36 — 983.24 
190.31 335.11 
(18) (18) 
1,253 1,210 
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greater where the village economy is less 
dependent on agriculture, although the effect 


. on current use is contrary to that hypothe- 


sized. Among older women, the likelihood of 
current use as well as expectations about 
future use are greater where the nonagricul- 
tural sector is more important. 

Given the dominant role of agriculture, 
particular characteristics of agriculture may 
also influence contraceptive behavior. We 
consider two characteristics of theoretical 
interest for which the village survey provides 
information: commercialization and mechani- 
zation. Commercialization of agriculture is 
measured indirectly through data on cropping 
patterns. In rural Egypt, two general types of 
cropping patterns can be distinguished, “con- 
ventional” and “unconventional” (Adams 
1986; Ikram 1980). The conventional or 
traditional pattern consists of cotton, rice or 
wheat, pulses, and maize grown in rotation. 
These crops offer low returns because of 
government pricing and subsidy policies, but 
nevertheless represent the cropping pattern of 
most small and subsistence farmers. In 
contrast, the unconventional pattern — which 
includes some of the conventional crops but 
also sugar cane, fruits and vegetables, and 
berseem (Egyptian clover)— offers relatively 
high profit but also demands high levels of 
input and is typified by income lags and risks 
the small farmer often cannot tolerate. In the 
conventional rotation, the labor contributions 
of children are well defined and potentially 
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large; in particular, ample opportunities for 
wage labor during harvesting of field crops 
exist. The roles of children in the unconven- 
tional pattern are less clearly defined, and 
opportunities for wage earning outside the 
household appear to be distinctly less. Child 
labor would thus be less valuable in villages 
where unconventional crops are of more 
importance, and the likelihood of using or 
expecting to use contraception should be 
greater in those villages where a greater 
proportion of land is devoted to unconven- 
tional crops. However, these expected effects 
largely fail to emerge.. 

Data from the community survey indicate 
only the amount of cultivated land planted in 
vegetables and fruits (market crops), from 
which we calculate the proportion of village 
land in market crops, termed MARKET 
(Table 1).8 The natural logarithm transforma- 
tion is applied (yielding LNMARKET) to 
reduce the impact of extreme observations on 
the regression estimates. Results shown in 
Table 2 indicate a tendency for LNMARKET 
to increase the odds of contraceptive use 
among older women, but the major impact of 
this variable is to discourage use at young 
ages, an outcome contrary to hypothesis. It 
would be premature to conclude that commer- 
cialization of agriculture has little relevance 
for contraceptive behavior, though. Some 
intriguing patterns are observed later in the 
analysis where we find interaction between 
LNMARKET and LIMIT (the desire for no 
more children), especially evident for women 
aged 25-34. 

Mechanization of agriculture should also 
reduce the demand for child labor (Richards 
1982 claims this has occurred in rural Egypt), 
and by this means increase the likelihood that 
an individual uses contraception or expects to 
do so in the future. As an indicator of the 
intensity of mechanization, we consider the 
number of tractors per 100 feddans in 
conventional crops (TRACTOR), again tak- 
ing the logarithm to lessen the impact of 
extreme observations. Tractors and related 
motorized equipment are becoming more 





? Households receive monetary income from 
crops other than vegetables and fruits; indeed, 
crops in the conventional rotation also bring 
market income. Vegetables and fruits are distin- 
guished by the extent to which they are used for 
generation of cash income and by the level of 
profit they offer. 
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common in land ‘preparation and hauling of 
produce to market (Richards 1982), although 
use of this equipment is largely confined to 
land holdings of 20 feddans (feddan = 1.03 
acres) or more (rich peasants) (Abdel-Fadil 
1975). Significant effects emerge for LN- 
TRACTOR only at ages 25-34 (Table 2). For 
this age-group, the likelihood of contraceptive 
use is greater in more mechanized villages, 


but this comes about because EXPECTS . 


declines with increased mechanization. Mech- 
anization appears to motivate women to use 
sooner than might otherwise be the case, 
rather than stimulating use that might other- 
wise not occur. The estimated effects of 
mechanization are not substantial in size, 
though. It appears that mechanization is less 
salient to contraceptive behavior than the 
overall structure of the village economy 
(NONAG) or the extent of commercialization 
of agriculture (LNMARKET,, the latter being 
especially significant for certain subgroups 
(described below). i . 


School Participation 


Schooling may influence reproductive behav- 
ior through a variety of paths (Casterline 
1985b; Cochrane 1979), and effects of 
individual's educational attainment on repro- 
ductive behavior are routinely observed in 
data from contemporary developing societies. 
A more contentious issue: is whether village- 
level schooling effects also exist, net of 
educational background at the individual 
level. On theoretical grounds, there is ample 
reason to expect such effects. One source can 
be termed schooling opportunity effects: 
where school participation by children is less 
costly—in economic or social terms— parents 
are more likely to expect their children to 
attend school; this will reduce their involve- 
ment in economic roles; as a result, family 
Size desires will decrease (Caldwell 1982). A 
second source of village-level effects can be 
termed normative schooling effects: the con- 
tent of the curriculum in a modern educatjonal 
system often legitimizes small families and 
fertility limitation; therefore, the larger the 
proportion of a social group that has attended 
modern schools, the more acceptable fertility 
limitation will be for individual members of 
that group: 

In rural Egypt in the early 1980s, schooling 
beyond primary level can be expected to show 
the fertility-depressing effects just described. 
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We test this hypothesis by considering 
effects of EDPARTIC, the proportion of the 
village population with preparatory schooling 
(nine or more years completed), on contracep- 
tive behavior.? The estimated effects on 
contraceptive use (although not expectations) 
turn out to be statistically significant and 
impressively large in magnitude, as illustrated 
by probabilities predicted from the equations 
in Table 2: 


USE EXPECTS NEITHER 


Ages 15-24 
EDPARTIC = 0.00.03 52 45 
= 0.05 .07 52 Al 
r =0.10 14 .49 37 

Ages 25-34 
EDPARTIC = 0.00  .14 .49 37 
— 0.05  .19 51 31 
| 20140 24 51 25 

Ages 35-49 
EDPARTIC = 0.00 — .14 20 .66 
— 0.05  .18 .18 .64 
=0.10 .23 .16 61 


Source: Table 2 (with other variables set at mean 
_ values). 


At each age, the likelihood that a woman is 
currently using contraception is substantially 
higher in villages where a larger proportion of 
the population has progressed beyond primary 
school. The effects estimated in Table 2 do not 
take account of individual-level educational at- 
tainment, of course. Later on, we assess the 
extent to which the effects of school participa- 
tion at the village level are mediated by the 
School participation of the respondents. 


Family Planning Services 
In the early 1980s, rural women could obtain 


?' The Egypt Community Survey reports the 
number of village residents completing primary, 
preparatory, and secondary schooling. It does not 
provide information about school attendance per 
. 8e, nor does it distinguish between the participation 
of males and females. As a consequence, it is not 
possible to construct a direct analogue to AT- 
TEND, the individual-level measure of school 
participation. Nor is it our purpose to construct a 
classic contextual-effects model. Based on a 
thorough examination of different measures of 
educational participation at the village level, and 
also previous experience with other data sets 
(Casterline 1985b; Casterline and Eid 1988), we 
selected the proportion with preparatory schooling 

' as the measure that best distinguishes educational 
opportunity between villages. 
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family planning supplies from a variety of 
sources, including family planning clinics, 
health centers, .private clinics, hospitals, 
pharmacies, extension workers, and combina- 
tions of outlet types. It is difficult to measure 
the salient features of this multidimensional 
service environment through a few quantita- 
tive indicators. We choose three measures 
that appear, on the basis of related work 
(Entwisle et al. 1988) and knowledge about 
changes underway at the time (Sayed, El- 
Khorazaty, and Way 1985), to capture the 
essential features. The first is the number of 
pharmacies in the village (PHARMACY), 
including private ones and those connected to 
clinics or hospitals. The second measure is 
the number of medical workers (doctors and 


: nurses) providing family planning services in 


the village (MEDICAL), based in health or 
family planning clinics (the latter are usually 
integrated into the former) or in private 
practice. The third measure is the number of 
family planning outreach workers (RATYDA). 


. These workers are local female volunteers 


(known as raiyda riyfia, Arabic for rural 
pioneer), recruited and trained through sev- 
eral distinct projects in the 1970s. The natural 
logarithm transformation is applied to all 
three family planning measures for the 
regression analysis. 

It is clear from Table 2 that the impact of 
family planning variables varies among age- 
group. At the youngest ages, the number of 
pharmacies (LNPHARMACY) has a positive 
impact on current use, in particular influenc- 
ing whether women are users currently as 
opposed to planning future use. Since current 
use is low at these ages, the fertility 
implications of this differential are not of 
great consequence. Among older women, 
presence of raiydas in the village appears to 
stimulate higher levels of use and, as we 
show later on, does so especially for women 
motivated to restrict their fertility. No signif- 
icant effects are observed for LNMEDICAL: 
apparently the number of clinic-based person- 
nel (doctors and nurses) has no bearing on 
current use and expectations about future use. 


MEDIATION OF VILLAGE EFFECTS 


Thus far, we have analyzed the effects of 
village variables without regard for the extent 
to which these effects might operate through 
variables measured at the individual or 
household level. Given a full complement of 
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potential “bridge variables” (Wippler and 
Lindenberg 1987), not to mention perfect 
measurement, a well-specified model of 
contraceptive behavior would show all of the 
village effects to be indirect — mediated by the 
perceptions, attitudes, and decision making of 
users and potential users. Direct effects found 
in empirical analysis thus signify the inade- 
quacy of the micro data available to capture 
fully the processes intervening between the 
village context and behavior in response to 
the subjective experience of that context 
(Bilsborrow and Guilkey 1987; Hauser 1970). 
Following Hermalin (1983), we conceive 
of the individual-level determinants of contra- 
ceptive behavior as involving motivation to 
regulate fertility and costs (monetary and 
nonmonetary) of doing so. Measurement of 
motivation is direct: the respondent expresses 
a desire to delay her next birth (SPACE) or to 
stop childbearing altogether (LIMIT). Costs 
are assumed to be implicit in the effects of 
background characteristics such as education 
(ATTEND) and attachment to agriculture 
(NOTINAG), after controlling for motiva- 
tion. We make no claim that these four 
indicators exhaust the micro variables that 
might mediate the effects of village context 
on contraceptive behavior. But they do 
represent major microlevel determinants in 
social demographic theory, and as such 
Should suffice to illustrate the principle of 
mediation. . 
Estimates of the effects of village and 
individual variables on contraceptive behavior 
are presented in Table 3. Of particular interest 
are the effects of NONAG and EDPARTIC, 
because individual-level measures of eco- 
nomic sector and schooling also appear in the 
analysis (NOTINAG, ATTEND). A priori, 
we expect the effects of NONAG and 
EDPARTIC to be weakened with the addition 
of NOTINAG, ATTEND, and other individ- 
ual variables.!0 One of the surprises of our 





10 Note that the individual- and village-level 
measures are not exact analogues, nor were they 
intended to be. Rather, our goal is to specify and 
estimate a model that best captures the determi- 
nants of contraceptive behavior at each level of 
analysis, given the data available. We opt for a 
dichotomous representation of educational attain- 
ment at the individual level (ATTEND) because 
the small proportion of respondents at higher levels 
makes further refinement pointless. Our rationale 
for selecting the proportion with preparatory 
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analysis is how minor the attenuation turns 
out to be. i 

Indeed, comparison of coefficients reported 
in Tables 2 and 3 reveals very similar effects 
for NONAG, regardless of whether or not its 
individual-level counterpart, NOTINAG, ap- 
pears in the regression equation. Only a minor 
portion of the impact of the effect of NONAG 
on contraceptive behavior is mediated by 
NOTINAG, or any of the other individual or 
household variables, for that matter. This 
result also holds for other village-level 
measures of agriculture (including LNMAR- 
KET if partial derivatives of the log-odds of 
each contrast in the dependent variable are 
taken with respect to this measure and solved 
using values of LIMIT equal to the mean for 
the appropriate age-group [Stolzenberg 1980]). 
. Likewise, comparison of Tables 2 and 3 
indicates that the addition of the individual- 
level education measure (ATTEND) results in 
only slight attenuation of the estimated effects 
of EDPARTIC. Not only are the estimated 
effects of EDPARTIC relatively unaffected 
by the inclusion or exclusion of individual 
variables, among them ATTEND; the effects 
of EDPARTIC outweigh the effects of 
ATTEND in almost every instance. While 
EDPARTIC might seem to be simply captur- 
ing the upper tail of the educational distribu- 
tion for our respondents and its estimated 
effects to include some portion more appro- 
priately considered at the individual level, 
this is not the explanation. Substituting 
completed years of schooling for ATTEND 
does not substantially alter the results.!! 

In short, the effects of the village variables 
are quite resilient in the face of individual- 
level controls. How can we explain this 
surprising result? We believe that the answer 
lies, fundamentally, in the rapid pace of 


schooling as the village-level measure is explained 
in footnote 8. Similarly, the individual-level 
measure of economic sector (NOTINAG) takes 
account of land ownership as well as labor force 
participation, because preliminary analysis re- 
vealed both aspects to be important in the 
determination of contraceptive behavior. The 
village-level measure (NONAG) is based solely on 
labor force participation; information about house- 
hold ownership of land is not available in the 
village data. 

!! Nor does the addition of husbands' education 
lead to a substantial reduction in the estimated 
effects of EDPARTIC. 
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Table 3. Multilevel Logistic Regressions for Contraceptive Use and Expectations: By Age 
Age 15-24 Age 25-34 Age 35-49 

Use/ | Expects/  Use/ Use/ Expects/ Use/ Use/ Expects/ | Use/ 
Contrast neither neither expects neither neither expects neither neither expects 
INTERCEPT —8.45* —0.66 | —7.79* —5.41* 1.06 —6.47*  2,62*  8.55*  —5.93* 
AGE . 0.26* 0.01 0.25* 0.06 —0.06* 0.12* —0.22* —0.27* 0.05 
SPACE 1.98*  0.47* 1.50* 1.775  1.05* 0.7. — — — 
LIMIT 2.89*  0.73* 2.16* 2.69* 1.02* 1.67" 4.21*  0.77* 3.44* 
ATTEND 0.22 0.35*  —0.13 0.75* 0.45* 0.30 0.55* 0.02 0.53* 
NOTINAG 0.14 0.03 0.11 0.36 0.03 0.33* 0.13 0.20 | —0.06 
NONAG —2.46*  2.09* —4.55* 1.15 1.18 —0.03 2.33"  2.48* —0.16 
LNMARKET — 1.37 0.60 —2.06 3.14* 0.10 3.04* 2.45  —1.56 4.01 
LNTRACTOR 0.18 | —0.09 0.27 0.32  —0.04 0.37* 0.13 -—0.04 0.17 
EDPARTIC 15.53* 0.24 15.29* 9.31 4.23 5.08 4.38  —1.74 6.12 
LNPHARMACY 0.74* —0.28 1.02* 0.31 0.03 0.28 0.17 0.11 0.06 
LNMEDICAL —0.27 -0.13 -0.14 -0.01 0.03 —0.04 -0.02 -—0.15 0.13 
LNRAIYDA 0.46  —0.09 0.56 —0.62 -—0.29 —0.33 1.33 -0.02 1.35 
LNMARKET*LIMIT -2.66 —0.37  —2.29 -—3.94* —0.52 —3.4]* —1.58 2.35 —3.93 
LNRAIYDA*LIMIT —0.29 0.60 | —0.89 1.30*  0.88* 0.42 —0.81 0.28 —1.09 
UPPER —,90* —0.39* —0.5} —1.25* —0.62* —0.62* —1.09* —0.44*  —0.65* 
Log-likelihood — 790.98 —1,111.75 — 908.95 
Model x? 198.48 397.54 483.71 

(df) (30) (30) (28) 
N 971 1,253 1,210 


* Denotes significant at .05 level (two-tailed). 
For definition of variables, see Appendix A. 


social and economic’ change in rural Egypt 
over several decades. 

Our approach to the question of mediation 
has involved comparison of the total and 
direct effects of key village variables. We did 
not estimate or examine indirect effects 
explicitly, because of problems of temporal 
priority involving the village and individual 
variables. Consider women aged 35-49, for 
example. Their background characteristics are 
the consequence of opportunities and con- 
straints that prevailed some two to five 
decades before any of the data were collected. 
Even for the youngest women in the sample, 
those aged 15-24, village context probably 
exerted its influence on school attendance and 
attachment to agriculture well before attempts 
were made to measure key aspects of this 
context in the early 1980s. Only if villages 
have changed very little will characteristics in 
the early 1980s serve as good proxies for their 
characteristics in the 1950s, 1960s, and early 
1970s. This assumption cannot be defended 
in the case of rural Egypt. 

Indeed, since the 1952 Revolution, major 
changes have taken place in the structure of 
local economies in rural Egypt and in the 
provision of service to village residents. In 
the agricultural sector, Land Reform Acts of 


1952, 1951, and 1969 modified access to land 
(chiefly by eliminating the largest land 
holdings and by regulating the terms of 
tenancy agreements), with repercussions for 
the modes of production typifying Egyptian 
agriculture. Paralleling developments in agri- 
culture was the expansion of government- 
supported social services including the expan- 
sion of public (and secular) schools at all 
levels. Cn one hand, these changes generated 
much of the diversity among villages that 
makes contextual analysis possible and inter- 
esting. On the other hand, these same changes 
mean that the village characteristics that 
helped to shape the life course of our 
respondents at an early stage are not the 
village characteristics we observe today. (As 
a furthe> complication, we note that many of 
our respondents will have migrated away 
from their natal villages in any case.) Given 
these circumstances, it is not surprising that 
covariation among contextual and individual- 
level variables is relatively low in rural Egypt 
and tha: the latter do not appreciably mediate 
the effects of the former. 


VARIATION IN VILLAGE EFFECTS 


Finally a key hypothesis of our research is 
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that women may respond differently in their 
contraceptive behavior to key features of their 
village context depending on the stage in their 
reproductive career and their motivation to 
regulate fertility. To test this hypothesis, we 
posit two sets of interactions among the 
village and individual variables. Interactions 
involving stage of reproductive career are 
modeled by stratifying the analysis by age 
(15-24, 25-34, 35-49). Formal tests reveal 
the age interactions as a block to be highly 
significant (p < .001) for all models we 
estimate. Within age strata, interactions 
among village variables and motivation to 
restrict fertility (LIMIT) are represented 
explicitly as multiplicative terms. We ex- 
plored possible interactions with each village 
variable in the analysis, one at a time to 
reduce problems of multicollinearity. Those 
involving LNMARKET and LNRAIYDA 
proved important (Table 3). 


Motivation to Restrict Fertility 


At the time of interview, some women 
express a desire to limit their fertility, while 
others want more children or are unsure. 
Motivation to restrict fertility should increase 
the likelihood of contraceptive use and, to a 
lesser extent, expectations to use in the 
future. Estimates reported in Table 3 confirm 
that this is the case in rural Egypt, for all 
three age-groups.!? In addition, the contracep- 
tive behavior of women wanting no more 
children might be affected differently by the 
opportunities and constraints of their village 
context. There are two possibilities, neither to 
be preferred on a priori grounds. On one 
hand, highly motivated women may be 
relatively immune to the opportunities and 
constraints of their village context compared 
with less highly motivated women. In a 
village where the role of children in the local 
economy is- decreasing, and the costs associ- 
ated with children on the rise, the latter may 
be led to restrict their fertility even though 
they personally might like to have more 
children. Such women many also be prime 
targets for outreach activities of the family 


12 To give a specific example: setting MARKET 
and RAIYDA to their means for the entire sample, 
we find that a desire for no more children 
multiplies the odds of EXPECTS/NEITHER al- 
most threefold among women aged 15-24. This is 
its weakest effect! ' 
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planning program. On the other hand, it may 
be highly motivated women who are espe- 
cially affected by the opportunities and 
constraints of the local environment. A local. 
economy that accentuates the costs and 
diminishes the benefits of children may 
encourage women who wish to restrict their 
fertility anyway to act on the basis of their 
desires. Similarly, the availability of family 
planning services in the village may enhance 
the ability of motivated women to convert a 
desire for no more children into contraceptive 
use, immediately or possibly at some time in 
the future. ' 

We find evidence for both patterns of 
interaction in rural Egypt. Consider first the 
interaction between LNMARKET and LIMIT 
(Table 3). For women wanting more children 
or unsure of their desires (LIMIT — 0), 
LNMARKET positively affects the likelihood 
of contraceptive use; for women not wanting 
more children (LIMIT=1), LNMARKET 
negatively affects the likelihood of contracep- 
tive use. The latter effect is substantially 
smaller in magnitude than the former, which 
constitutes the heart of the interaction. Our 
interpretation is that among women who have 
already concluded that they wish to stop 
childbearing, the incentive to use contracep- 
tion provided by greater commercialization of 
agriculture is superfluous. Among those who 
express a desire for more children, or are 
unsure, commercialization of agriculture ap- - 
pears to encourage counterbehavior; personal 
desires run against the realities of the village 
economy. Although the interaction between 
LNMARKET and LIMIT is significant only 
at ages 25-34, the same pattern is evident at 
ages 15-24 and 35—49. 

We find the other pattern of interaction for 
family planning outreach. At ages 25—34, the 
effect of LNRAIYDA depends on motivation 
to restrict fertility such that the presence of 
raiydas in the village has essentially no 
impact on the contraceptive behavior of those 
who want more children (or who are unsure), 
but has a relatively large impact for women 
who want no more children. Entwisle and 
associates (1984) report similar results for 
rural Thailand. It is interesting that in both 
analyses the combination of personal desires 
and a context in which they can be realized is 
especially important for women midway 
through their reproductive years. The interac- 
tion between LNRAIYDA and LIMIT is not 
significant for younger. or older women, 
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either in rural Egypt (Table 3) or in rural 
Thailand (Entwisle, Hermalin, Kamnuan- 
silpa, and Chamratrithirong 1984). 


Stage in Reproductive Career 


We explore potential variation in the effects 
of village context further through comparison 
of results obtained for each of the age strata. 
In general, we expect the opportunities and 
constraints embodied in the local setting to 
have more of a bearing on the contraceptive 
intentions of younger women and the actual 
use of older women because of a mixture of 
age and cohort effects. Older women are 
further along in their life course and even if 
not ready to cease childbearing altogether are 
approaching that point. Younger women are 
much less likely to have reached this stage in 
their reproductive career, and yet they may be 
as receptive to contraception as older women, 
probably more so. 

The evidence is mixed. The effects of 
EDPARTIC, our measure of educational 
opportunity in villages, stand in contradiction 
to the hypothesis (Table 3): Effects on the 
contraceptive use of women aged 35—49 are 
less than half as strong as they are for women 
aged 15-24, with effects for women aged 
25—34 intermediate. These results may reflect 
the greater receptivity of younger women to 
new ideas about family size and fertility 
regulation that accompany increased participa- 
tion in a modern educational system (i.e., the 
normative schooling effect). We can think of 
no reason why increased costs and decreased 
benefits of children resulting from their 
school attendance (i.e., the schooling oppor- 
tunity effect) would affect younger more than 
older women, however, especially since the 
latter are more likely to have children old 
enough to enroll in the preparatory and 
secondary schools. 

In contrast, characteristics of the village 
economy tend to exert more influence over 
the contraceptive use of older than younger 
women, consistent with our original hypothe- 
Sis. The proportion of the male labor force not 
in agriculture (NONAG) and also the commer- 
cialization of agriculture (LNMARKET) have 
their strongest positive effects among women 
over the age of 25. For example, if we 
consider the effect of commercialization, the 
distinction between respondents over and 
under 25 is clear and substantial. 

Differences among age-groups in the ef- 
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fects of LNRAIYDA most closely conform to 
expectations. For women who wish to limit 
their fertility (LIMIT — 1), effects on current 
use are larger for older women, enough to 
create a three-to-one difference in the impact 
of LNRAIYDA between the oldest and 
youngest age-groups: 





USE EXPECTS/ USE 
AGE NEITHER NEITHER EXPECTS 
15-24 17 51 ~ 33 
25-34 .68 59 .09 
35-49 .52 26 26 
Source: Table 3. 


In contrast, expectations about future use are 
more strongly affected by family planning 
outreach at the younger than older ages, again 
as hypothesized. 


CONCLUSION 


To summarize, our analysis demonstrates that 
contraceptive behavior in rural Egypt varies 
systematically with the village setting. The 
level of nonagricultural activity in the village 
economy, the modernization of agriculture, 
the level of school participation, and (to a 
lesser extent) the family planning service 
environment influence contraceptive use and 
intentions. Estimated effects are significant in 
equations containing village variables only, 
and in equations combining village and 
individual variables. Most are substantial in 
size, for example modifying expected levels 
of contraceptive use by 10 percentage 
points or more (converting partial regression 
coefficients into predicted probabilities). Not 
all hypothesized effects emerge as statistically 
significant, but a large proportion of theoreti- 
cal expectations are confirmed. The results 
strongly reinforce the view that contraceptive 
behavior is determined in part by structural 
factors, as realized at the village level. 

Not only does contraceptive behavior vary 
systematically with village setting; village 
effects vary according to characteristics of 
individual respondents. Not all respondents 
are similarly sensitive to features of the 
village setting; some individual characteris- 
tics, such as stage in reproductive career and 
fertility preferences, would appear to heighten 
or dampen responsiveness to village condi- 
tions. Or, applying an alternative behavioral 
interpretation to the same set of multilevel 
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interactions, the results indicate that village 
circumstances not only influence contracep- 
tive behavior but also condition the influence 
of individual characteristics on the likelihood 
of contraceptive use or expecting to use 
contraception in the future. In the case of 
fertility preferences, for example, the results 
suggest that some village settings are more 
conducive than others to the implementation 
of expressed desires to restrict fertility. 

The regression estimates indicate that in 
tural Egypt, among women aged 25 and over, 
contraceptive use is more likely where the 
village economy depends less on agriculture, 
agriculture is more commercialized (espe- 
cially for women wanting more children), in 
the past a higher proportion of the population 
has advanced beyond primary schooling, and 
more family planning extension workers are 
active in the village (particularly for women 
aged 25-34 not wanting more children). This 
is an impressive array of village effects. They 
demonstrate the responsiveness of contracep- 
tive behavior in rural Egypt to the social and 
economic changes that have been under way 
during the past three decades. It is difficult to 
see how the fundamental relationship between 
these changes and reproductive behavior 
could be better documented than through 
multilevel analysis of the sort we present. 

If we step back for a moment for in-depth 
analysis of rural Egypt, what can be said about 
how these findings fit into the growing body of 
multilevel analyses? More to the point, is rural 
Egypt unique? Are village characteristics more 
powerful determinants of contraceptive behav- 
ior in Egypt than elsewhere? 

In responding to this question, the first 
point we want to underscore is the richness of 
the village data to which we have had access 
in this analysis. With respect to the range of 
characteristics measured and the amount of 
detail provided for each, these Egyptian data 
far exceed village data collected in most 
national fertility surveys during the past 15 
years (by comparison, see Casterline 1987). 
A related point, of considerable significance 
in our view, is that the multilevel analysis of 
contraception we present was preceded by 
extensive analysis of the village data, from 
which the village indicators used here were 
derived (see Entwisle et al. 1988; Entwislé et 
al. 1989). If the issues addressed here are to 
be pursued through similar research designs, 
we emphasize the importance of collecting an 
appropriate and reasonably comprehensive set 
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of macro variables and of investing in 
thorough analysis of the macro data. 

Having said this, we must also note that an- 
alysts have been especially successful in iden- 
tifying village effects on reproductive behavior 
in rural Egypt. Analyses of the Egyptian Fer- 
tility Survey (Casterline and Eid 1988; Easter- 
lin, Crimmins, and Khodair 1988) and of two 
Rural Fertility Surveys (Kelley, Khalifa, and 
El-Khorazaty 1982; Stycos, Sayed, Avery, and ` 
Fridman 1988) all identify direct effects of vil- 
lage variables net of determinants measured at 
the individual and household level. In this re- 
spect, the Egyptian analyses have been, as a 
set, more successful than studies in other set- 
tings. While not ruling out the possibility that 
data and analytic approaches have been supe- 
rior in Egypt, we tentatively conclude that the 
village determines the environment for decision 
making concerning fertility to an unusual extent 
in rural Egypt. Why rural Egypt? Or, to put the 
question in more general terms, under what cir- 
cumstances might we expect net effects of vil- 
lage characteristics to be relatively large? 

We suggest that one such circumstance arises 
in the presence of far-reaching social and eco- 
nomic changes over several decades, if these 
changes are expressed in terms of local contexts 
but not the characteristics of individual adults. 
Indeed, characteristics such as educational at- 
tainment tend to become fixed early in the life 
course and change little in response to subse- 
quent contextual changes. Reproductive deci- 
sions, however, are inherently forward looking 
and probably take into account more recent de- 
velopments. In rural Egypt, the modernization 
of agriculture, growth of employment in indus- 
try and services, expansion of educational op- 
portunity, and implementation of health and fam- 
ily planning initiatives have greatly altered 
village circumstances since the 1952 Revolu- 
tion. As a result, many rural dwellers now con- 
front a very different set of opportunities and 
constraints than those that shaped their lives at 
an earlier stage. Given the data typically avail- _ 
able in fertility and contraceptive prevalence 
surveys, this gap between the past and present. 
will tend to increase the measured impact of 
current or recent contextual characteristics. This 
is but one of several plausible explanations. 
Other factors of potential significance include 
degree of cultural homogeneity and the extent 
to which villages are integrated into a national 
society (through internal migration patterns, the 
development of extensive transportation and 
communication networks, and the like). 
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But these comments are speculative. We look 
forward to comparative analyses that explore 
more formally and more rigorously the condi- 
tions under which structural factors at the local 
level constrain reproductive behavior. 
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APPENDIX A 
Means (1) and Standard Deviations (o) of Independent Variables, By Age 

Variable Age 15-24 Age 25-34 Age 35-49 

Name Variable Description p cg I c LB c 

AGE Age of respondent 20.10 2.39 28.50 2.73 40.30 4.19 

SPACE Want more children but wish 201 .401 .091 .288 — —- 
to delay at least 1 year 
(1 — yes; 0: otherwise) 

LIMIT Want no more children 202 .402 .599 .490 .740 439 
(1=yes; 0 — otherwise) 

ATTEND Ever attended school .424 .494 375 484 347 .476 
(1=yes; 0=otherwise) 

NOTINAG Husband works outside of 424 .494 .428 .495 .348 — .477 
agriculture and household 
owns no land (1- yes; 
0 — otherwise) 

NONAG Proportion of male labor .276 .160 .293 .162 -295 .164 
force in nonagriculture 

LNMARKET Proportion of agricultural .123 .163 .138 .176 1287  .169 
land in market crops 
(logged) 

LNTRACTOR Tractors per 100 feddans in -706 .521 .685 .513 .687 .510 
conventional crops (logged) 

EDPARTIC Proportion of population 037 027 .041 .028 042 — .032 
with preparatory schooling 
Or more E 

LNPHARMACY Number of pharmacies .481 .524 .501 519 500 — .530 

(logged) 
LNMEDICAL Number of medical workers .659 472 .683 .497 .681 .488 
s providing family planning 

services (logged) ; 

LNRAIYDA Number of family planning 575 472 -630 494 609 .491 
extension workers (logged) 

UPPER Region of residence 395 489 358 479 374 484 
(1 = Upper Egypt; 0 = Lower 
Egypt) 

N 971 1,253 1,210 


CONTRACEPTIVE BEHAVIOR IN RURAL EGYPT 


APPENDIX B 


1033 


Correlations among Independent Variables (Including Interaction Terms), Women Aged 25-34 (N= 1,253) 














L NL 
L N B P N L L L 
N N T D H M N N N 
A o M R P À E R M* R’ 
S L T T N A A A R D A AL AL U 
P I T I o R C R M I I RI II P 
A A M E N N K T T A c Y KM YM P 
G C I N A A B o I C A D BI ODI B 
B E T D G G T R C Y L A TT AT R 
AGE 1.000 
SPACE —.120 1000 
LIMIT 277] -—.386 1.000 
ATTEND 015 —.010 056 1.000 
NOTINAG 006 —.027 053 036 1.000 
NONAG 085 —.020 — 095 158 235 1.000 
LNMARKET 052  —.069 .133 .136 — .096 -209 1.000 
LNTRACTOR -.025 ~.038 035 — .006  .026 — .088 414 1.000 
EDPARTIC 084 —,022 — .117 218  .102 588 170 —.144 1.000 

ACY —.011 —.016 .002  .020 120  .344  .200  .032 094 1.000 

ICAL 013. —.021 .047 .026 = .066 A81 .158 003 O51 .433 1.000 
LNRAIYDA .035  —.085 .135 .142  .026 .168 .430 -081 .245  .091 .197 1.000 
LNMARKET* 

LIMIT .20 —.185 — .478 — .124 071 — .169 — .769 — 294 163  .176  .13$  .336 1.000 
LNRAIYDA* : 

LIMIT 85  —.258 .668 106 .050 163 332 041 214 O75 .156 654 -586 — 1.000 
UPPER — 07)  .135 —262 -.100 .031 -.116 —.169 -.026 —.357  .108  .033 -—.394 —.186 -—.316 1.000 
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ON THE ESTIMATION OF UNION THREAT EFFECTS* 


KEVIN T. LEICHT 
The Pennsylvania State University 


Some researchers view unions primarily as functioning to increase earnings 
inequality among the working class while others see them as threatening employers 
so that they raise wages for unorganized workers and minimize working-class 
inequality. Threat effects are defined as the tendency for nonunionized employers 
to respond to the threat of unionization by increasing the earnings and benefits of 
their workers. This paper probes a contextual method for estimating the effects of 
unionization on the earnings of unorganized workers. This method controls for 
structural characteristics that affect workers? earnings and incorporates 
information about the organizational strength and vitality of specific unions. Using 
data from seven manufacturing industries in Indianapolis, the results suggest that 
(1) unionized workers’ earnings positively affect the earnings of unorganized 
workers in unionized plants, (2) unionized workers’ earnings negatively affect 
nonunionized workers’ earnings in nonunionized plants, and (3) when plant size is 
controlled, unionized workers’ earnings have no significant effect on the earnings 
of nonunionized workers in nonunionized plants. These results highlight the 
complexity and difficulty of estimating the effects of union threat and their 


importance for understanding earnings inequality in the workplace. 


THEORETICAL BACKGROUND 


Labor economists and sociologists have 
studied the consequences of union activities 
for many years. While union membership has 
declined from 34 percent of the labor force in 
the 1950s to under 20 percent in 1985, unions 
still represent 20 million workers in the U.S. 
economy (U.S. Bureau of Labor Statistics 
1985). An important issue still remains, how 
do unions affect the earnings of nonunionized 
workers? The debate continues between those 
who view unions as collectors of “monopoly 
rents” that increase working-class inequality 
and those who view unions as threats to 
employers who raise benefits and thus 
inadvertently economically homogenize the 
working class. 

Responses to union threat have both 
economic and sociological interpretations. 
Economically, nonunionized employers may 
have to raise wages to prevent their workers 


* Larry Griffin deserves special thanks for his 
ideas and contributions to earlier versions of this 
work. I also gratefully acknowledge the construc- 
tive comments made by Michael Wallace, Robert 
Kaufman, Arne Kalleberg, Robert Robinson, 
Gerald Wright, Elton Jackson, James N. Baron, 
two anonymous reviewers, and participants in the 
Ohio State University Sociology Department Lec- 
ture Series. 


from leaving for higher-wage firms. The 
standard sociological interpretation: is that 
employers raise wages to avoid unionization. 
Employers are willing to pay to keep unions 
out for a variety of reasons, including the 
avoidance of work rules and other limitations 
on managerial prerogatives (cf. Freeman and 
Medoff 1984, pp. 230-38). A third interpre- 
tation is that employers largely ignore mar- 
kets and unions. They simply react to 
situations as they occur, which is rather 
infrequently. Further, workers may not have 
adequate market information, and unions may 
not be particularly aggressive or effective at 
organizing workers (cf. Cornfield 1986). 
These alternatives cannot easily be weighted 
or tested analytically. The sociological litera- 
ture does indicate that some employers view 
unions as a threat (for whatever reasons) that 
they want to keep out of their firms. The 
growing use of labor consultants in union 
certification elections is circumstantial evi- 
dence for this position (cf. Cornfield 1986; 
Freeman and Medoff 1984, pp. 230-38). 
These questions can best be answered where 
individual, firm, and economic data are 
available within specific industries. Since 
these kinds of data are relatively rare, this 
study has the advantage of analyzing avail- 
able data on seven industries within a single 
metropolitan area (Indianapolis).. The data 
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refer to a functioning local labor market 
where employers are likely to be aware of the 
extent of union threat within their industry. 

This paper proves a contextual method for 
estimating the economic effects of unioniza- 
tion on unorganized workers, often referred to 
as “union threat” effects. Threat effects are 
the tendency for nonunionized employers to 
respond to unionization threat by increasing 
the earnings and benefits of their workers. 
This analysis uses a model of the workplace, 
where work structures are defined as sets of 
stable, regularized behaviors backed by insti- 
tutional sanctions and the force of law. The 
model is based on the premise that industrial 
relations and the structure of earnings in firms 
are affected by several overlapping work 
structures: (1) the industry, (2) the plant, (3) 
the cccupation, (4) the job, and (5) the 
worker’s personal characteristics. Further, the 
analysis assumes that unions differ in their 
potential to benefit organized workers, and 
that these differences are reflected in their 
ability to alter the earnings of nonunionized 
workers as well. Most importantly, the 
method I develop permits testing of different 
hypotheses on union threat effects at the 
individual level. 

Prior research has used a number of 
different strategies for assessing the effect of 
“union threat” on the earnings of unorganized 
workers. These strategies share several char- 
acteristics: (1) all control for human capital 
characteristics (such as education) that may 
affect wages; and (2) all (except Podgursky 
1986; see below) deal with aggregated, 
industry-level data and complex modeling 
techniques for separating out the effects of 
unions on unorganized workers. 

Some research findings support the pres- 
' ence of positive union threat effects. Rosen 
(1969) found positive union coverage effects 
within industries, suggesting that union threat 
effects increase as coverage increases. Martin 
and Rence (1984) found similar evidence at 
the industry level by examining the effects of 
union coverage on earnings. They found that 
threat effects appear to disseminate from 
buying industries to selling industries as 
increases in coverage in the former positively 
affect earnings in the latter. Podgursky (1986) 
found that the earnings of nonunionized 
workers increased as the proportion of 
workers organized in an industry increased. 
The effects were greatest in medium and large 
establishments. Allen (1986) found that 
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unionized construction workers are more 
productive than nonunionized workers, and 
that the productivity advantage is not offset 
by union wage gains. As unionized and 
nonunionized contractors begin to compete 
for the same jobs, the wage rates of the two 
groups may converge as nonunionized contrac- 
tors begin to share productivity gains with 
workers. Finally, Hirsch and Connolly (1987) 
challenge the notion that unions capture 
monopoly profits generated by industry con- 
centration, finding instead that the union/ 
nonunion wage differential is lowest in 
concentrated industries (defined using four- 
firm concentration ratios). This result, too, 
suggests the existence of union threat effects 
in concentrated industries. 

While previous analyses of union threat 
effects are suggestive, three problems have 
emerged. First, conducting aggregated analy- 
sis at the industry level requires making 
untested assumptions about the distribution of 
earnings between union members and non- 
unionized workers in an industry. Second, 
these analyses assume that all unions are 
uniform across all firms and industries in 
producing union threat. My analysis takes 
into account differences in the ability of 
specific unions to convey “threat” and affect 
the earnings of nonunionized workers. Third, 
each study fails to consider the existing 
structures surrounding industrial work such as 
plants, occupations, and jobs. The lack of 
controls for the host of contextual effects 
known to affect wages represents a serious 
flaw because unionization may interact with 
other work structures to affect workers’ 
earnings and job rewards (see also Kalleberg 
and Berg 1987, pp. 138-51; Kalleberg and 
Lincoln 1988). 

In this analysis, I present a method for 
estimating union threat effects at the individ- 
ual level that incorporates the effects of 
industries, plants, occupations, jobs, and 
individual characteristics. This method also 
begins to account for differences in the 
effectiveness and vitality of specific unions. 
The major empirical observation driving this 
analysis is that managers confront a complex 
world where a lot of information is available 
about unions and union activities. I explore 
various assumptions about how this informa- 
tion is used and combined to allow managers 
to determine the earnings of nonunionized 
workers. Again, I assume that one of the 
primary reasons that employers pay attention 
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to unionized earnings is to avoid the negative 
consequences of successful union organizing 
in their firms. A 

Models of competition markets assume that 
the wage rate is the major means of 
communication about the supply and demand 
for labor. If this wage is known in the context 
of a competitive market, the effects of unions 
on this wage should affect the earnings of 
unorganized workers in the same industry. If 
union earnings increases do not spill over to 
affect unorganized workers, then the in- 
creases attributed to unions may negatively 
affect the earnings of unorganized workers. If 
positive.union threat effects exist, increases in 
earnings attributed to unions should increase 
the earnings of nonunionized workers. 

Even if the wage rate itself is the only 
information that managers use, there are 
different ways of taking it into account. For 
example, managers may pay attention to 
earnings differences between union members 
and nonunionized workers. This difference 
would conform to the widely held belief that 
labor markets are markets for individuals. 
This view would account for within-firm 
variation in union membership and collective 
bargaining coverage. A variation on this 
assumption would be that managers compare 
earnings differences between organized plants 
with collective bargaining agreements and 
unorganized plants. This assumption would 
take into account differences between firms 
rather than workers. In both variations, other 
characteristics that structure labor markets 
and affect the behavior of. unions and 
managers would not affect managerial deci- 
sions regarding the earnings of nonunionized 
workers. 

Another complexity not incorporated into 
prior research is the possibility that managers 
respond to the earnings differentials, organi- 
zational coverage, and vitality of specific 
unions. Obviously, managers do not face a 
labor market where they are confronted by a 
“generic” union. Rather, managers are faced 
with specific unions, some that are organiza- 
tionally effective and others less so. It is 
possible that managers take into account 
membership ‘size, the number of firms 
organized, and the vitality of specific unions 
when setting the earnings for nonunionized 
workers. Unions with large earnings differen- 
tials, large and active memberships, and 
broad coverage across firms are probably 
noticed and taken into account more than 
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unions with the opposite characteristics. 
Further, unions may extract large earnings 
differentials and represent only a small 
number of workers in a few firms in the labor 
market. Such unions may be regarded as less 
threatening than unions with smaller earnings 
differentials which represent more workers 
across a greater number of firms. i 
Finally, - managers probably: incorporate 
subjective perceptions of a union’s vitality in 
assessing the relative strength of a union and 
(consequently) the probability that specific 
unions will target their firms for organizing. 
While the likely indicators of union vitality on 
a day-to-day basis are probably indirect, 
union participation can be viewed as an 
indicator of the ability of specific unions to 


‘involve their members in union affairs and 


activities. Managers may find unions with 
active members more threatening that unions 
with inactive or lethargic members. 

My analysis explores five methods for 
considering union strength in the context of 
àn analysis of union effects on the earnings of 
unorganized workers. The first two methods 
assume that managers account for only the 


‘earnings differential that unions produce in 


setting wages for unorganized workers. The 
other methods take into account the earnings 
differential, membership coverage, and orga- 
nizational vitality of specific unions. These 
measures are designed to provide preliminary 
insights into ways of accounting for manage- 
rial decision making in wage setting while 
assuming that employers are interested in 
preventing the organization of their firms. 


DATA AND METHODS 


The data in this analysis are taken from the 
U.S. portion of the Indianapolis/Tokyo Work 
Commitment Study (Kalleberg, Lincoln, and 
Near 1982a, 1982b). The primary sampling 
units are manufacturing establishments in 
seven industries: transportation. equipment; 
chemicals; electronics and electronic equip- 
ment; food processing; prefabricated metals; 
nonelectrical machinery; and printing and 
publishing. The researchers determined the 
population of manufacturing plants in these 
industries from lists obtained from Chambers 
of Commerce, government agencies, and 
other sources. À sampling frame of plants 
was stratified by employment size and + 
industry, and then plants were sampled 
randomly within each category. Complete 
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data from. organizational informants and 
workers were obtained from 52 plants in 
Indiana for an access rate of 35 percent. 
Given that relatively high demands were 
made of these plants for data, this access rate 
compares well with other surveys of manufac- 
turing plants (e.g., Blau, Falbe, McKinley, 
and Tracy 1976). 

After obtaining agreement from the plant’s 
» management and (for unionized plants) union 
leaders, structured interviews were conducted 
with plant managers and personnel managers. 
These interviews yielded information on the 
history, operations, market environment, per- 
sonnel policies, and structures of each plant. 
The sampling procedures differed slightly 
among plants. Random samples were some- 
times not possible, and in very small plants 
all employees were surveyed. In all cases, 
three groups of employees were administered 
Separate questionnaires: managers, supervi- 
sors, and workers. A total of 4,567 question- 
naires were completed. This analysis uses a 
subsample of 3,861 nonsupervisory workers, 
2,323 union members, and 1,538 nonunion- 
ized workers.! 

These data are relevant for estimating 
union threat effects for several reasons. First, 
all plants are located in a single metropolitan 
area, which makes the concept of "union 
threat" salient in local markets where employ- 
ers and workers interact with each other and 
can witness the operations and market activity 
of other actors. Second, unionized workers 
are well represented from a variety of unions 
within and across plants.? The fact that 


! The results presented here use all nonsupervi- 
sory workers in the sample as the relevant targets 
of union threat effects. However, the results can be 
replicated using samples of direct production 
workers only. ' 

? The 17 unions in this analysis include. the 
Bakery, Confectionery, and Tobacco Workers 
International Union, the International Brotherhood 
of Electrical Workers, the Teamsters, the Interna- 
tional Typographical Union, the International 
Union of Electrical, Radio, and Machine Workers, 
the Oil, Chemical, and Atomic Workers Interna- 
tional Union, the International Brotherhood of 
Painters and Allied Trades, the Printing and 
Graphic Communications Union, the Sheetmetal 
Workers International Association, the United 
Auto Workers Union, the United Electrical, Radio, 
and Machine Workers of America, the United 
Food and Commercial Workers Union, the United 
Rubber, Linoleum, and Plastic Workers of Amer- 


Business Patterns, 
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specific unions are included allows for 
incorporating information about the represen- 
tation and vitality of specific unions. Finally, 
the data include measures of plant, occupa- 
tion, and job characteristics that also affect 
workers' earnings. These characteristics act 
as controls in the analysis. 

This analysis assumes that unions operate 
in an environment where the activities of 
unions and.employers are constrained by 
employee characteristics and the structure of 
industries, plants, occupations, and jobs. 
Accordingly, I control for these environmen- 
tal characteristics so that the estimates of 


.union threat are not contaminated by them 


(see Appendix for characteristics). The analy- 
sis incorporates all these measures into a 
general contextual model of the workplace. 

Because union threat is measured at the 
industry level, I include two additional 
controls: the general prosperity of the industry 
(the average earnings of workers within 
industries in Marion County, Indiana),3 and 
union density in each industry (measured as 
the proportion of workers in each industry 
that are unionized).^ The control for industry 
prosperity is necessary because the wage 
increases that unionized and nonunionized 
workers received may be a function of 
industry prosperity, rather than the concrete 
actions on the part of unions to raise wages. 
Prior research has shown that union density 
increases the earnings of nonunionized work- 
ers (Podgursky 1986; Martin and Rence 
1984). This leaves open the possibility that 
the effects examined will be due to the effects 
of union density at the industry level. Hence, 
union density is also controlled in the analysis 
below. 


ica, the United Steelworkers of America, the 
Cummins Diesel Workers Union, the Graphic Arts 
International Union, and the International Associ- 
ation of Machinists and Aerospace Workers. 

3 The average ezrnings measure for workers in 
the seven industries in this analysis is a five-year 
moving average of values taken from County 
1977-1981. Using average 
earnings of workers for each industry for the state 
of Indiana (taken ss a three-year moving average 
from the National Income and product Accounts, 
1983) does not change the results reported here. 

4 The union density measure is taken from the 
sample itself. This is due to the lack of published 
data on unionization by industry at the local or 
national level for years after 1977. 


. UNION THREAT EFFECTS 
Variations on a Method for Estimating 
. Union Threat Effects 


To assess the impact of union threat effects on 
workers’ earnings in this analysis, I use a 
- general two-step process. First, I estimate an 
earnings model within the seven industries in 
the sample that predicts earnings as a function 
of union membership and a set of control 
variables: ` 


a, + (EByX) + Bx UNION 
+ ej (i=Industry,, Industry, 
. Industry, . . . Industry7. (1) 


i 
Sinuuyi = 


This equation is estimated within each 
industry (i). The B5; coefficients are used as 
industry-level data points to construct a 
variable representing the effect of union 
membership on earnings within industries. In 
: step two, this industry-level variable is used 
in an analysis of the earnings of nonunionized 
workers. This analysis examines nonunion- 
ized workers’ earnings as a function of the 
effects of.unions on earnings (Bj), control- 
ling for characteristics of industries (J), plants 
(P), occupations (O), jobs (J), and workers' 
personal characteristics (W): 


$a =a t Cil + CP + C430 + CJ 
T CSW + C5(B2;) e 
(i= Industry;, Industry», 
Industry3, . . . Industry;. (2) 


Because B; is in dollars, the coefficient Cg 
in equation 2 represents the amount that 
nonunionized workers’ earnings increase or 
decrease with changes in union/nonunion 
earnings differential.5 If unions are producing 
threat effects in the labor market, the 
coefficient Cg should be positive. In this case, 
tlie results would represent the increase in 
nonunionized workers' earnings between in- 
dustries as unionized workers’ earnings in- 


5 The method described here resembles Mason, 
Wong, and Entwisle's (1984) method of estimating 
contextual, multilevel models. However, single 
- equation estimates are complicated here by the fact 

that nonunionized workers are a subset of all 
- workers in the sample, and that I am interested in 
examining the effects of unions on the earnings of 
this specific subgroup. Further, because this 
analysis deals with different conceptions of union 
Vitality and strength, it is not possible in this 
analysis to examine reciprocal relationships be- 
tween unionization and earnings (cf. Rosen 1969). 
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crease within industries. If union activities 
depress the earnings of unorganized workers, 
the coefficient Cs should be negative. A 
negative coefficient would reflect the net 
contribution of unions to intra-industry earn- 
ings inequality. 

One complication of this analysis is that the 
earnings of nonunionized workers appears 
twice in the estimation of union threat effects: 
once in producing the industry-level variable 
(equation 1), and again in the estimation of 
the actual affect of union threat itself 
(equation 2). Stated differently, the estimate 
of the union threat effect (Ce in equation 2) is 
dependent on the level of nonunionized 
workers’ earnings used to create the within- 
industry unionized earnings effect (Bz) in 
equation 1. This sets up the possibility that 
error terms are correlated across the two 
equations. To counteract this potential bias in 
estimating of union threat effects, the non- 
unionized workers in the sample were ran- 
domly assigned to one of two groups. The 
first group was used in combination with the 
unionized workers to estimate union member- 
ship effects on earnings within each industry 
(equation 1). The second group was used to 
estimate the effects of unions on the earnings 
of nonunionized workers (equation 2). I used 
a stratified random sampling technique to 
assure that each plant would be proportionally 
represented in both groups of nonunionized 
workers.$ 

Table 1 outlines several modifications of 
the basic method of estimating union threat 
effects. All of the measures of union threat 
represent modifications to coefficients esti- 
mated using equation 1 above. With respect 
to the alternative perspectives, my perspec- 
tives contain different assumptions about how 
managers make decisions to set wages and 
what information they use. 


General Measures of Union Threat: 
Two Variations 


Research that emphasizes the workings of 


$ [n the random sampling technique, nonunion- 
ized workers were sorted by plant, and then 
even-numbered cases were assigned to a group 
used to estimate the union wage effect within 
industries and the odd-numbered cases were used 
to assess the effect of unionization on nonunion- 
ized workers. The results are similar when the 
groups are reversed. 
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Table 1. Five Estimates of Union Threat Effects 





Variables 
General union threat . 
1. Union membership 





Operationalizations 
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$u, nu = a + (X ByX) + Bu Union Membership + c 


(within seven industries) 
By, = Union Threat 


2. Collective bargaining 


$y, ne = a + È BUX) + 


. Bu Collective Bargaining + c 
(within seven industries) 
Bo; = Union Threat 


Specific union threat 
3. w/proportion of firms 
organized by union; 


$y, nu = a + (È B1X) + ZBj,Union + c 
where Union = UAW, IBT, IBEW, etc. 


(within seven industries) 


( (ny) B2y) (YN) X FuFt, = 


Union Threat where ny = number of 


members of union i in an industry, fy = (firms organized by union (VFt,, 


and N, = 
4. w/proportion of workers same as (3), 


organized by uniony 


total number of union members in an industry. 


CS (ny) (B2,) (UN) X FuFt, = Union Threat where U; = nT, and 


T; = total number of workers in an industry. 


5. w/union participation 
in uniony 


same as (4), except py is substituted for U, in the weighting equation, 
where py = (mean union participation score for unionyYQ;, and Q; = 


mean union participation score for all union members in an industry. 


competitive markets focuses on the wage rate 
as the sole means of communication about the 
supply and demand for labor. This informa- 
tion is viewed as all that is necessary for the 
rational manager to set wages for his or her 
workers. It leads to the first two operational- 
izations in Table 1. 

The first measure of the effects of unions 
on unorganized workers uses a measure of 
union membership (1 if the respondent is a 
union member, 0 otherwise). The estimation 
technique used: with this measure is identical 
to the procedure outlined in equations 1 and 2 
above. The second measure uses an organiza- 
tion-level variable that differentiates orga- 
nized plants with collective bargaining agree- 
ments from unorganized plants (1 if the plant 
has a collective bargaining agreement, 0 
otherwise). This measure is a more conven- 
tional one that economists often use (cf. 
Rosen 1969). 


Specific Measures of Union Threat: 
Three Variations 


The third, fourth, and fifth measures in Table 
1 incorporate the complexities discussed 
earlier. These measures consider differences 
in the ability of specific unions to extract 
t n . . 

wage gains. They also incorporate different 


conceptions of union growth and vitality in 
the form of weights that are applied to the 
union wage effect. By incorporating informa- 
tion concerning the wage effects, coverage, 
and vitality of specific unions, these measures 
begin to account for differences in the ability 
of unions to convey threats of possible 
organizing drives. ; 
Incorporating union coverage across plants. 
The third measure in Table 1 begins to take 
into account differences in the effectiveness 
and “threat potential” of specific unions. To 
capture this diversity, dummy variables for 
membership in each union in an industry are 
used in place of the union membership and 
collective bargaining variables in equation 1: 


Sinu = a; + (SBjiX) + Boy + en (3) 


where Uy = United Auto Workers, Team- 
sters, etc. 

Each coefficient is weighted by the number 
of members of union j within an industry (nj), 
and the proportion of plants in an industry 
that are organized by union ij (fj): 


nyfiBaij. (4) 


These respective weights increase the size 
of wage effects that apply to large numbers of 


UNION THREAT EFFECTS 


union members across several plants, and 
discount union wage effects that apply to only 
a few union members across a small number 
of plants. These weighted values for each 
union within an industry are summed, and the 
total is divided by the total number of union 
members in an industry: 


(X nyfyBoy)/Ni- (5) 


This method is similar to that used for 
calculating weighted means (see Hanushek 
and Jackson 1977). Finally, the total from 
equation 5 is weighted by the proportion of 
plants in an industry that are unionized 
(F,fF,).7 This weight incorporates the assump- 
tion that the threat potential of unions to 
nonunionized employers increases as the 
proportion of unionized plants increases in an 
industry. 

Incorporating membership coverage in 
specific plants. 'The fourth and fifth measures 
in Table 1 are variants of the weighting 
scheme just described. In the fourth measure, 
the proportion of unionized plants in an 
industry (fj) is replaced by the proportion of 
workers in uniony in the industry (Rj): 


(X njRyByj)/N, 

Ry =n (6) 
where nj; is the number of members of union y 
(as before) and T; is the total number of 
workers in an industry. This measure assumes 
that managers assess a union's coverage in an 
industry by the proportion of all workers in an 
industry that are organized by a specific 
union, rather than the proportion of plants 
organized by that union. This proportion, 
then, adds greater weight to a union's wage 
gain if the union represents a large proportion 
of all workers in an industry, and discounts 
the effects of unions that represent a small 
proportion of workers in an industry. Like the 
other measures of union threat effects (except 
measures 1 and 2), this sum is weighted by 
the proportion of plants in an industry that are 
unionized (F,/F;;). 


‘7 The proportion of firms organized in an 
industry is taken from the sample itself. Since the 
firms in the sample are representative of the firms 
in their respective industries, the ratios should be 
the same as those for the population of firms in 
each industry in the Indianapolis area (see Lincoln 
and Kalleberg forthcoming). 
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Incorporating Union, Vitality. The fifth 
measure in Table 1 attempts to account for the 
vitality of unions in terms of their ability to 
involve the membership in union affairs and 
activities. With this measure, I assume that 
union vitality will not be noticed by managers 
of nonunionized plants unless that vitality is 
unusually high or low. To take into account 
union vitality in this analysis, the mean union 
participation score for each union is divided 
by the mean union participation score for the 
industry, and this value is substituted for the 
proportion of plants organized by a spen 
union Vy) in equation 4:8 


(Z nyP yBoy)/N; (7) 
Pj (mean participation score for uniony)/ 
(mean participation score in industry,),. 


where nj; and Bo, are as they were in equation 
4, and Py is the mean participation score for 
union ij divided by the mean participation 
score for the entire industry. 

One weakness of the five methods is that 
the magnitude of the union wage effects in 
equation 1 can be affected by the other 
independent variables in the equation.? As an 
example, this analysis examines three differ- 
ent sets of controls for estimating the threat 
effects variables in equation 1: (1) an equation 
including only the personal characteristics 
listed in the Appendix, (2) an equation 
including personal characteristics, measures 
of occupation and job skill, and company 
tenure from the Appendix, and (3) an 
equation that adds plant size to the variables 
in (2). These are not the only variations 
possible, but they serve to illustrate the 
importance of specific control variables in the 
analysis. 

Each measure of union threat is placed in a 
regression equation controlling for character- 
istics of industries, plants, occupations, jobs, 
and workers’ personal characteristics. The 
industry measures are industrial prosperity 


5 The union participation measure used here was 
originally asked of individual union members in 
the sample. The question asks, “How often do you 
take part in union activities?” with response 
categories ranging from “seldom or never” (0) to 
“once a week or more” (5). Individual unions were 
then assigned the mean union participation score 
for all members within a specific industry. 

? [ am indebted to Jim Baron for suggesting this 


possibility. 
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Table 2. Weighted Least Squares Estimates of the Effects of Union Threat on Nonunionized Workers’ Earnings 
Controlling for the Effects of Industrial Prosperity, Union Density, Plants, Occupations, Jobs, and Worker's 


Personal Characteristics (N= 769) 








Generel union threat? 

]. Union membership 

2. Collective bargaining 

Specific union threat 

3. w/proportion of firms organized by union 

4. wiproportion of workers organized by union; 
5. w/union participation in union; 








Functional Form of Equation (1) 





ay Q» By 

— .7§5**° — .671** — ,666** 

.231* .302** .291** 
3.65** 3.89** 1.77** 
2.76** 2.52** 1.39** 

1.41** .874** ,433** 


* Cantrolling for age, race, sex, and education in equation 1 (see Appendix). ' 

b Controlling for age, race, sex, education, cognitive complexity, motor skills, physical demands, hazards, 
interpersonal relations, substantive complexity, autonomy control and company tenure in equation 1 (see Appendix). 

* Controlling for all variables in (b) plus plant size in equation 1 (see Appendix). i 

? Numbers correspond to the order of the threat effects measures in Table 1. 


* Urstandardized regression coefficients. 
*Ps 05. **Ps O01. 


and union density. The remaining controls 
and their measures are outlined in the 
Appendix. Because the analysis involves 


aggregating data from samples of varying | 


proportions from each plant, weighted least 
squares is used here to reduce the heteroske- 
dasticity of the estimates (see Hanushek and 
Jackson, 1977, pp. 150—53).10 

The analysis in Table 2 contains regression 
coefficients for the effect of union threat 
effects variables on a randomly assigned half 
of the nonunionized workers in the sample 
(see note 6). The column entries in the table 
represent the coefficients of different threat 
effects measures constructed from equations 
using a specific set of controls in equation 1. 
The row entries represent the effect of each 
measure of union threat calculated using 
different sets of controls in equation 1. With 
these guidelines in mind, the results are 
presented below. 


RESULTS 


The results in Table 2 are generally consis- 
tent. Regardless of the controls, the general 
measure of union threat using union member- 
ship uniformly and negatively affects the 
earnings of nonunionized workers (row 1, 
Table 2). With the assumptions regarding the 
communication of the supply and demand for 


10 The weighting scheme was adjusted so that 
the original number of cases would be preserved. 
This is necessary for the tests of significance to be 
accurate. Details are available from the author. 


X 


t 
‘ 


labor through wage rates in competitive 
markets, the results are consistent with 
economic predictions concerning union mo- 
nopoly power. The depressing effect of union 
membership on nonunionized workers’ earn- 
ings varies from 67 to 76 cents per dollar of 
earnings increase due to unionization. These 
results are not affected by the controls used in 
equation 1. | 
The other measure that accounts for general 
union earnings differentials is the collective 
bargaining measure in row 2 of Table 2. All 
of the coefficients are positive and significant 
and are not affected by the controls used in 
equation 1. Apparently, the exercise of union 
monopoly power is less relevant when 
comparing workers in plants with and without 
collective bargaining agreements than in 
comparisons of union members and nonunion- 
ized workers. When unionization is conceptu- 
alized as a structural characteristic of the 
workplace rather than a characteristic of 
individuals, the results begin to support the 
existence of positive union threat effects. ` 
Rows 3-5 of Table 2 display results for 
measures that take into account the effective- 
ness, membership coverage, and vitality of 
specific unions. These results also strongly 
support the notion that unionized earnings 
increase the wages of unorganized workers. 
The magnitude of these effects ranges from 
43 cents for every dollar increase in the union 
wage rate (row 5, column 3) to $3.89 for 
every dollar increase in the union wage rate 
(row 3, column 2). When specific unions are 
taken into account, the results strongly 


| 
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support the existence of positive union threat 
effects. f 

To summarize, when union threat is 
conceptualized to take into account directly 
the effectiveness and: coverage of union 
activities within industries, the results clearly 
support research on the positive effects of 
unionization on unorganized workers. As: the 
earnings of unionized workers increases 
relative to nonunionized workers in an 
industry, the earnings of those same nonunion- 
ized workers rise relative to workers in 
industries where the gap between the earnings 
of unionized and nonunionized workers. is 
smaller. In effect, the nonunionized workers’ 
earnings gap widens even as these workers 
increase their absolute earnings. 

While these results are interesting, most 
research on union threat effects assumes that 
threat effects are generated between unionized 
and nonunionized plants. Consequently, it is 
important to examine the effects of unions on 
workers in nonunionized plants. To examine 
this contingency, the effects of the threat 
effects measures are examined among non- 
unionized workers who do not work in 
unionized plants, and nonunionized workers 
who work in unionized plants.!! 

The results in panel A of Table 3 clearly 
show that the earnings of nonunionized 
workers in nonunionized plants decline with 
increases in the earnings of unionized work- 
ers. However, the results differ depending on 
which measure is used and which set of 
controls: are used in equation 1. For the first 
two functional forms, the earnings of non- 
unionized workers in nonunionized plants 
decline with increases in the earnings differ- 
ential between unionization and nonunionized 
workers. The effects range from 42 cents for 
every dollar difference (row 2, columns 1 and 
2) to $2.66 (row 3, column 1). However, 
when plant size is taken into account in 
equation 1, or when job skills and tenure are 
controlled when taking into account union 
vitality, specific unions have no significant 
effects. The results in column 3 suggest that 
the size of the plant plays a critical role in the 
assessment of union threat effects, supporting 
the results of Podgursky (1986). 


H Unionized plants were defined using two 
methods: (1) the presence of union members within 
firms in the sample or (2) responses to questions 
put to CEO's that unions were present in their 
firms. 
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The results in Panel B are similar to the 
results in Table 2 with the exception that the 
measure of general union threat using union 
membership fails to reach statistical signifi- 
cance. However, the other results show that 


€ 


managers respond to union wage effects when : 


any union members are present within plants. 
While it is possible that managers are 
responding to the immediate threat of further 
union organizing, these results may reflect 
concerns over earnings equity within plants. 
Union-nonunion wage disparity is potentially 


more disruptive within.plants than between : 
them. This is especially possible since unions | 


traditionally have organized plants where 
homogeneous work forces and task interde- 


pendence make internal inequity more divi- ' 


sive. When unionization is conceptualized in 
a manner that takes into account the effective- 
ness of individual unions, unions appear to 


exert positive effects on the earnings of . 


unorganized workers in unionized plants. 


DISCUSSION AND CONCLUSION 


This paper has attempted to explore a method 
for estimating union threat effects that takes 
into account structural characteristics ' that 
affect earnings as well as the earnings 
differentials, membership coverage, and orga- 
nizational vitality of specific unions. The 
results suggest that unions are, at best, 


selectively effective at raising the earnings of ` 
unorganized workers. Nonunionized workers’ ' 


earnings increase when unions are present 
within plants. This effect is probably due to 
concerns about equity and internal divisive- 
ness rather than threats of additional union 
organizing. In nonunionized plants, unionized 
workers’ ‘earnings negatively affect the earn- 


ings of nonunionized workers. However, : 


when plant size is controlled for in the 
contextual method, unionized workers’ earn- 
ings do not affect the earnings of unorganized 


workers in nonunionized plants. In sum, these - 
results suggest that managers respond to ' 


unions only when they are confronted by 


-them directly. Otherwise, the unresponsive- : 


ness of nonunionized workers' earnings to , 
union earnings differentials contributes to | 


working-class inequality (see also Form 1985, 
pp. 27-30). a 


The results for the five different variations , 
of the threat effects measure shed some light ` 
on the consequences of making different sets : 
of assumptions about the functioning of labor ' 
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Table 3. Weighted Least Squares Estimates of the Effects of Union Threat on Nonunionized Workers’ Earnings 
Controlling for the Effects of Industrial Prosperity, Union Density, Plants, Occupations, Jobs, and Worker’ 8 
Personal Characteristics, Unionized and Nonunionized Plants 





Panel A. Nonunionized Plants (V=485) 
General union threat? 

1. Union membership 

2. Collective bargaining 

Specific union threat 

3. w/proportion of firms organized by uniony 

4. w/proportion of workers organized by uniony 
5. w/union participation in union, 

Panel B. Unionized Plants (N — 284) 

General union threat? 

1. Union membership 

2. Collecztive bargaining 

Specific union threat 

3. w/proportion of firms organized by uniony 
4. w/proportion of workers organized by uniony 
5. w/union participation in uniony 


Functiona' Form of Equation 1, 


a) 2 ON 





—].01*** —]1.05** —1.20** 
— 419** — .423** -.351** 

—2.66** —2.51* — 500 

—2.25"* — 1.48* ~ 398 
—.990** — .325 ~ 092 
—.394 —.121 114 

.864** .825** 683** 

6.98** 6.62*» 23** 
523** 3.83** 1.76** 
2.32** L11** 


* Controlling for age, race, sex, and education in equation 1 (see Appendix). 

> Controlling for age, race, sex, education, cognitive complexity, motor skills, physical demands, hazards, 
interpersonal relations, substantive complexity, autonomy control, and company tenure in equation 1 (see Appendix). 

* Controlling for all variables in (b) plus plant size in equation 1 (see Appendix). 

d Numbers correspond to the order of the threat effects measures in Table 1. 


* Unstandardized regression coefficients. 
*Ps 05. **Ps 01. 


markets and the decision making of manag- 
ers. Analyses that take into account the 
strength and vitality of specific unions in a 
structural context, controlling for plant size, 
suggest that unions do not contribute to 
earnings inequality between ‘unionized and 
nonunionized plants. Analyses that ignore this 
structural context and that do not take into 
account the strength and vitality of specific 
unions suggest that unions contribute substan- 
tially to earnings inequality between union- 
ized and nonunionized plants. Future analyses 
will want to begin to take into account 
differences in the strength and vitality of 
specific unions since this is probably closer to 
the reality that managers face in setting wage 
rates for unorganized workers. 

While the results suggest that a broad 
contextual approach to analyzing earnings is 
fruitful, these results are meant to be an 
illustrative rather than a definitive analysis of 
union threat effects in local labor markets. Six 
problems still remain in this type of analysis. 
First, the results pertain to a specific regional 
sample and may not apply to other industries, 
particularly those outside of manufacturing. 


Second, analyses of union threat effects that 
use the actual earnings differentials between 
unionized and nonunionized workers cannot 
truly estimate the effects of unionization 
because those earnings are already “contami- 
nated” by threat effects and market practices. 
Third, a longitudinal analysis of union threat 
effects needs to be done to take into account 
actual cause-and-effect relationships between 
union activities and their implications for 
unorganized workers. Fourth, the conclusions 
about threat effects must be restrained, 
refined, and specified concerning the relation- 
ship between unions and earnings inequality. 
At most, the analysis has shown that, as the 
earnings of unionized workers increase rela- 
tive to the nonunionized, the earnings of the 
latter may increase or decrease under varying 
conditions. These results cannot speak to the 
overall effects of unions on earnings inequal- 
ity in manufacturing or elsewhere. Fifth, it is 
possible that these results may be accounted 
for by some unmeasured difference between 
unionized and nonunionized workers (e.g., 
hours or conditions of work). 

Finally, the results cannot speak to the 


E 
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relative merits of the economic and sociolog- 
ical interpretations raised in the introduction. 
Economically, employers may raise the earn- 
ings of nonunionized workers to prevent them 
from seeking employment elsewhere or they 
may raise wages to avoid the organization of 
their plants. Future research will want to 
examine the decision making of managers by 
asking them under what conditions they 
respond to union threats, whether they have 
done so in the past, and the extent of their 
knowledge about local labor markets. Future 
research should begin to take into account 
differences in the organizational vitality and 
effectiveness of specific unions as a means of 
specifying the mechanisms that produce 
social inequality within and between firms. 
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Further efforts in these two areas will bring 
advances and refinements in theories for 
explaining the role of unions and managers in 
the stratification system. 


KEVIN T. LEICHT is Assistant Professor of 
Sociology at Pennsylvania: State University. 
His research interests include the organiza- 
tional context surrounding U.S. labor rela- 
tions, organizational determinants of techno- 
logical change and innovation, and the 
political economy of deindustrialization. He 
is currently doing research on political and 
economic determinants of state-level direct 
investment in high-technology development 
with J. Craig Jenkins. 
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Variables Used as Controls in Estimating Union Threat Eifects* 


AMERICAN SOCIOLOGICAL REVIEW 
Appendix 





Variables 

Personal characteristics 
Age. 
Race 
Sex 
Education 

Plant characteristics 
Plant size 


Independent company 
Within-firm training 


Quality circles _ 
Promotion opportunities 
-Company tenure 
Formalization 


Units 


Occupational characteristics" 


Cognitive complexity 
Motor skills 

Physical demands 
Hazards 


Interpersonal 
Job characteristics 
Substantive complexity 


Autonomy 


Operationalizations 


In years. 

White = 1. 

Female = 1. - 
Measured ordinally from "None" (0) to "More than College" (7). 


Number of employees in plant. 

0 = Branch plant or subsidiary, 1 = Independent company. 

^How important à source skill was: 

a. informal learning, 

b. on-the-job training." (4= very important, 3— somewhat important, 2=a little 
important, 1 — not at all important, O= never had) ras = .30. 


1 = Employee poarticipates in quality circles, 0 — employee does not participate in 
quality circles. 

1 = Employee expects to be promoted, 0= not. 

Years worker has been employed in plant. 

Aston formalization scale. Proportion of a total of 28 possible documents present 
from job descriptions to congratulations and condolences. 

Number of organizational subunits (departments -- sections) in plant. 


Complexity of involvement with data and the intelligence and verbal aptitudes 
required for occupational performance (see Kalleberg and Leicht 1986). 
Complexity of involvement with things and the need for motor coordination and , 
manual dexterity (see Kalleberg and Leicht 1986) 

The degree to which behaviors such as "climbing and balancing" are required to 
perform occupational activities (Kalleberg and Leicht 1986) 

Exposure to extremes such as heat, noise, and vibration: (Kalleberg and Leicht 
1986). 

The degree and type of involvement with people (Kalleberg and Leicht 1986). 


The level, scope, and integration of mental, interpersonal, and manipulative tasks 
in a job (Miller, Treiman, Cain, and. Roos 1980; Spenner 1983; Kalleberg and 
d 1986). 

. “How long would it take to train someone else to do your work?" (0— A few 
hours, 1 =A few days to a week, 2= Several weeks, 3 = Two to five months, 
4-: Six months to a year, 5=a few years, 6= Five years or more) 

2. "My job requires a high level of skill.” (1- Strongly disagree, 2 = Disagree, 
3 = Neither agree nor disagree, 4= Agree, 5= Strongly agree) 

3, “My job makes me keep learnings new things.” (Same response categories as 
*2) 

Hc. EAE AKA Joke variety in the kinds OPES eae Lov my JODY i 
(Same response PER as #2) . 

reliability (alpha) = t 

The discretionary e of job tasks (Spenner 1983; Kalleberg and Leicht 1986) 

1. “My job lets me decide the speed that I work.” (Same response categories as 
#2 above) 

2. “My job gives me freedom.as to how I do my work.” (Same response cate- 
gories as #1 above) 

3. Ps of the following best describes your relationship with your supervi- 

" (0 My supervisor decides both what I do and how I do it, 1— My 

tied decides what I do, but I decide how to do it, 2 =I decide what to do 
and how I do it) 

reliability (alpha) = .58 


* All variables come from the Indianapolis/Tokyo Work Commitment Study (see Lincoln and Kalleberg 1985). 
b All occupational characteristics are factor scores generated from a factor analysis of characteristics derived from 
the Dictionary of Occupational Titles (1965) (see Kalleberg and Leicht 1986; Cain and Treiman 1981). 
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NEW EVIDEN CE ON RELIGIOUS PLURALISM, URBANISM, 
AND RELIGIOUS PARTICIPATION* 


KEVIN D. BREAULT 
Washington University 


In a recent issue of ASR, Finke and Stark (1988) reported results that contradict 
long-standing views on the relationship between religious pluralism, urbanism, 
and religious participation. They argue that religious pluralism and urbanism are 
positively related to religious participation. The present paper disputes the 
generality of Finke and Stark’s findings with both methodological criticisms and 
new data. With regard to pluralism, although the traditional views are supported, 
a new theoretical interpretation is introduced and empirically supported: religious 
pluralism appears to have differential effects on religious participation, depending 
on the religious makeup of the religious community along a liberal-to-conservative 
continuum. Finally, the best evidence suggests that there probably is not an 
important rural-urban difference in the rate of church adherents today. 


Robert Finke and Rodney Stark in a recent 
issue of ASR (February 1988) report striking 
results on the relationship between religious 
pluralism and religious participation. With 
1906 census data for 150 cities, Finke and 
Stark find that religious pluralism has a 
positive effect on religious participation. 
Analyzing these data, they also show that 
urban areas have higher rates of religious 
participation than nonurban ones. They con- 
clude that the “received wisdom about the 
effects of urbanism and pluralism on religion 
may be wrong.” 

Based on data compiled by the Glenmary 
Research Center (Quinn, Anderson, Bradley, 
Goetting, and Shriver 1982), the present 
paper reports results that contradict those of 
Finke and Stark. The reasons for the 
contradictory findings are explored, and a 
new theoretical interpretation is introduced 
and empirically supported. 


PLURALISM AND RELIGION 


. Quinn et al. (1982) report church membership 
statistics for religious bodies in the United 
States for 1980 by region, state, and county. 
Table 1 presents regression results for county 
groups and states based on these data. The 
first equation is for a 10 percent random 
sample (N — 310) of the 3,102 counties and 

‘county equivalents in the Quinn et al. (1982) 





* Direct all correspondence to K.D. Breault, 
Department of Sociology, Washington University, 
St. Louis, Missouri 63130. 
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compilation. The second equation is for 
highly urban counties, the 300 largest coun- 
ties in 1980. The third equation is for the 50 : 
states, 1980. Because the Quinn et al. study 
undercounts black church adherents, two 
additional samples are employed. The fourth 
equation is for the 10 percent random sample 
(above) minus the most urban counties, those. 
with a population of more than 150,000, and 
counties in the states Stark (1980, 1986) has 
identified as subject to the greatest enumera- 
tion errors, that is, Alabama, Georgia, 
Louisiana, Maryland, Mississippi, and South 
Carolina. After the deletion of these urban 
and southern counties, equation four is based 
on a sample of 242 counties. The fifth 
equation is for the 50 states using Stark's 
(1980, 1986) corrected data for church 
adherents, previously used in research by 
Stark and others (Stark, Doyle, and Kent 
1980; Stark, Kent, and Doyle 1982; Stark, 
Doyle, and Rushing 1983; Breault 1986, 
1988a, b; Breault and Kposowa 1987). The 
variables used here are equivalent to those in 
the Finke and Stark paper, religious diversity 
and percentage Catholics as independent 
variables and rate of church adherents as the 
dependent. ! 





! A control variable suggested by Finke and 
Stark, population growth, had negligible effects on 
the results. For example, in the sample of states 
with Stark’s corrected religious adherents data, the 
addition of population change to the equation 
slightly increased the betas for religious diversity 
(from B = —.36, b = —.30to B = —.45,b = 
— .38). 
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Table 1. Standardized and Unstandardized Regression Coefficients for Religious Diversity, Percent Catholic, with 
Rate of Church Adherents as the Dependent Variable for Samples of Counties and States, 1980** 


Rate of Church Adherents 


1. 310 Random 
Counties 


2. 300 Urban 
Counties 


8 b B b 
—21.24* —.20 —15.09* —.29 


Religious diversity —.21 
Percent catholic .35 40* > 42 .36 
R= .196 .346 


* p «001. 
** See text for description. of samples. 


Contrary to Finke and Stark's paper, the 
present results show a highly significant, 
consistently negative relationship between 
religious pluralism and religious participa- 
tion. Moreover, the results remain the same 
across samples that are subject to the 
measurement errors of Quinn et al. (1982) 
and samples that are altered to reduce these 
errors to a minimum. ' 


RECONCILING THE 
CONTRADICTORY FINDINGS 


Can the contradictory findings of this and the 
Finke and Stark paper be reconciled? The 
following possibilities suggest themselves. 
(1) There are unknown measurement errors in 
either the 1906 report or the 1980 compilation 
that are the source of systematic bias in the 
samples studied.? (2) The contradictory re- 
sults are real and reflect historical change. 
Accordingly, one must account for why the 
positive effects of pluralism have turned 
negative over the intervening 74 years. (3) 
Either the 1906 or the 1980 data reflect 
unique or unusual historical events. For 
example, the 1906 data may reflect unusually 


? The Quinn. et al. (1982) study covers 111 
religious bodies, and, according to the authors, 91 
percent of U.S. religious adherents. In addition to 
undercounting black church adherents, the study 
neglects other small religious bodies. Stark (1986) 
argues that the following religious groups or 


. denominations representing about 5 percent of 


religious adherents’ are among those omitted: 
Eastern Orthodox, Buddhists, Reorganized LDS 
(Mormons), Jehovah’s Witnesses, Old Catholic 
groups, Christian Science, Plymouth Brethren, 
The Wesleyan Church, Baptist groups, Pentecostal 
groups, other small Protestant groups, and other 
small Christian groups. These groups are small and 
they are distributed such that their omission is not a 
source of systematic bias in the present samples. 


4. 242 Random 
Counties 


5. States— 


3. States Corrected Data 





B b pB b B b 





—21.37* —.20 —23.61* —.36  —30.72* 
20 17*  .36 .45* 03 .04 
.179 .205 .143 





high rural-to-urban migration and immigra- 
tion, with consequences for the religious 
makeup of urban areas (see below). (4) 
Methodological reasons. I argue below that 
the reason for the contradictory findings is 
primarily methodological. 

In the Finke and Stark paper, the zero-order 
relationship between religious diversity and 
religious participation is in fact negative, 
—.40. However, the negative relationship 
becomes strongly positive in the multiple 
regression analysis. This change from a 
negative to a positive relationship is mainly 
due to the high degree of collinearity between 
religious diversity and percentage Catholic, 
—.88. The other factors are the compara- 
tively weak relationship between religious 
diversity and religious participation, and the 
strong relationship between percentage Cath- 
olic and religious participation (r = .66). 
That Finke and Stark's strong positive effect 
for religious diversity (B = .84, b = .76) is 
in large part contingent on the high degree of 
collinearity between the independent vari- 
ables can be seen with simple calculations of 
B's from the zero-order matrix in Finke and 
Stark’s paper.? In fact, if religious diversity 


? The formula for computing standardized betas 
from correlation coefficients for three variables is 
19.15 in Blalock 1979, p. 480). Note that Finke 
and Stark (1988, p. 45) assert that “Multicollinear- 
ity obscures relationships; it does not create false; 
strong relationships." This is precisely what 
happened in their study, sometimes referred to in. 
the specialized literature as the "bouncing beta" 
effect. Note that a good if not conclusive technique 
for detecting multicollinear inefficiencies is the use 
of variance inflation factors. VIF's measure the. 
amount the variance in parameter estimates are 
inflated by collinearity. Parameters with VIF's 
greater than four should be omitted or treated 
cautiously. Estimation procedures such as ridge 
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and percentage Catholic had been only a little 
less collinear, very different. results would 
have obtained. For example, if the magnitude 
of collinearity had been —.70 instead of 
— .88, religious diversity would have been 
insignificant instead of strongly positive. 

The critical question is whether Finke and 
Stark’s results can be replicated without high 
collinearity. If Finke and Stark’s thesis -is 
correct, then presumably the same results 
should obtain in the absence of high collinear- 
ity. In samples with low collinearity, the 
present results show that a positive effect does 
not obtain.^ 

A second source of concern is Finke and 
Stark’s sample. Urban samples are biased 
because they are disproportionately Catholic 
and because they neglect areas in which other 
religiously conservative or traditional reli- 
gions are strongly represented. Such places 
often have ‘very high religious participation 
and low religious diversity. For example, the 
state of Utah ranks second (fourth, with 
Stark’s correction) on religious participation, 
but 50th on religious diversity. In addition, 
this points to a key weakness in Finke and 
Stark’s theoretical discussion of their results: 
strong religious monopolies are not always or 
necessarily “lazy” as the authors suggest; 
they often are able to maintain their monopoly 
power—even in our highly secular world of 
today. Thus, Catholic Rhode Island, which 
ranks first on religious participation, ranks 
49th on religious diversity. 

Highly urban samples also dibprivortioi: 
ately neglect places where the rate of religious 
participation is low and where religious plural- 
ism may be quite high, notably the unchurched 
West. Alaska, for example, ranks 49th on re- 
ligious participation and third on pluralism. Sim- 
ilarly, Washington ranks 48th on religious par- 
ticipation and first on pluralism. 





regression may also be used if serious collinearity 


is suspected. 
* Note that in equation 2, religious diversity and 
percent Catholic are sizably collinear (r = —.74). 


In this case, the problem of collinearity is less 
severe compared to that of Finke and Stark's 
equation, not only because of the reduced degree 
of collinearity (—.74 compared to —.88), but 
_ because of the much smaller difference between 
the correlation for religious diversity and percent- 
age Catholic and that for religious diversity and 
church adherents (—.88 and —.40 in the ‘Finke 
and Stark paper, —.74 and — .52 for equation 2). 


AMERICAN SOCIOLOGICAL REVIEW 


Finally, one of the reviewers of this paper 
asked me to comment on Christiano's (1987) 
work on religious diversity as it is based on 
some of thé same census data used by Finke 
and Stark. Originally a doctoral dissertation, 
Christiano's investigation produced few consis- 
tent or strong findings. One exception to this is ` 
his finding that illiteracy is negatively related to 
religious diversity. This result may prove to be 
useful in future work. Of relevance here are 
Christiano's claims that there is, as expected, a 
negative relationship between religious diver- 
sity .and total church adherents and, interest- 
ingly, a positive relationship between religious 
diversity and Protestants. :Both these claims, 
however, are based on the direction of the re- 
gression coefficients as neither reached statis- 
tical significance.5 More doubt is cast on these 
weak findings by Christiano's own frank and 
detailed discussion of the limited and unrepre- 
sentative nature of his sample, 120 or so urban 
areas. I would point out that in many other 
methodological respects, such as the use of de- 


.nomination-specific diversity measures, the use 


of lagged time effects, and different estimation 
procedures, Christiano's work is noteworthy. 


URBANISM AND RELIGION 
Finke and Stark provide descriptive data to show 


that urban areas have higher rates of religious 


participation. The present results dispute the 
historical generality of this claim. The mean for 
religious adherents in the sample of urban coun- 
ties is 49.2 percent, while it is 54.6 percent and 
55.1 percent for the 310 and 242 county sam- 
ples respectively. After correction for the un- | 
dercounting of black church adherents in the 
sample of urban counties, the mean is 54.9 per- 
cent.$ Thus, the evidence suggests that today 
there probably is not an important rural-urban 
difference in the rate of church adherents — still 
a very important finding. This contradictory find- 
ing between the two studies may well indicate 
real historical differences in the religious par- 
ticipation of urban areas, and Finke and Stark's 
claim when restricted to 1906 is undoubtedly 
correct. Because of rural-to-urban migration and 


53 The t for total diversity was .73; 1.36 for 
Protestant diversity, N = 117. The diversity 
measures were taken from the 1890 census, the 
dependent variables from the 1906 census. 
The correction formula is given in Breault 
(1988a). It is an adaptation of Stark's (1986) 
correction formula for states. 
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immigration (notably of Jews and Catholics), 
urban areas in 1906 did enjoy greater religious 
participation than they do today. This effect has 
diminished over time perhaps for a variety of 
reasons, including demographic changes.’ 


HOW WISE RECEIVED WISDOM? 


With regard to pluralism, it is likely that 
Finke and Stark’s results cannot be replicated 
without high collinearity. This possibility and 
the empirical results presented here cast doubt 
on the generality of Finke and Stark’s new 
findings. In the case of pluralism, received 
wisdom may be correct. 

But how much confidence should we place 
in the received view? While I show across var- 
ious samples that a negative relationship be- 
tween religious diversity and religious partici- 
pation obtains, the negative effect is not as strong 
as it might be. If received opinion is correct, 
why is not the negative effect stronger? The 
answer that I propose is similar to that sug- 
gested by Kelley (1972) in the context of church 
growth rates: religious pluralism is differen- 
tially effective in undermining religion. In places 
where there are traditional, religiously conser- 
vative groups, religious pluralism will be less 
destructive. Where liberal or ecumenical de- 
nominations predominate, and where conserva- 
tive faiths are weak, pluralism will have a greater 
effect and religious participation will be lower— 
as received opinion would predict. Religiously 
conservative denominations are less vulnerable 
to the effects of pluralism and secular society. 
Reform Judaism, for example, is more suscep- 
tible because on many levels it is actively en- 
gaged with modem society. Conservative de- 
nominations are protected because they reject 
modern society. The strength of the sacred can- 
opy or the rate of adherents in a community 
may therefore depend not so much on the ex- 
ternal forces of religious pluralism or secular- 
ism but on the religious strength or traditional 
nature of the canopy itself. 

This may be a thesis worthy of further 
investigation. Table 2 presents regression 
results for measures of religious conservatism 


T One reviewer pointed out that the idea that the 
city is not necessarily ungodly is not new. Indeed, 
it is discussed by almost every classical writer, 
notably Weber (e.g., 1978, p. 471). It is the 
generation following the classical one that is most 
associated with the view that the city is a source of 
social pathology. 
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Table 2. Standardized and Unstandardized Regression 
Coefficients for Religious Conservatism, Reli- 
gious Liberalism and Religious Diversity, with 
Rate of Church Adherents as the Dependent 
Variable for 310 Randomly Selected Counties, 





1980 
Rate of 
Church Adherents 
g b 
Relisious conservatism .98 .98* 
Religious liberalism 43 1.06* 
Religious diversity .30 .30* 
= 646 
* p <.001. 


(the sum of Catholic, Mormon, and Southern 
Baptist adherents) and religious liberalism 
(the sum of Episcopal, United Church of 
Christ, United Methodist Church, and United 
Presbyterian Church adherents), controlling 
for religious diversity with the sample of 310 
randomly selected counties used previously ,® 
Because of high collinearity between religious 
conservatism and religious diversity, these 
results should be interpreted with caution.? 
However, in support of the thesis outlined 
above, they suggest that religious conserva- 


8 Similar regression results were obtained for 
states and the set of 242 counties used above. 
Kelley (1972) was the source for characterizing the 
denominations along a liberal-to-conservative con- 
tinuum. Specifically, he argues that the Episcopal 
Church, the United Church of Christ, the United 
Methodist Church, and the United Presbyterian 
Church are much more religiously liberal than the 
conservative denominations used. 

? It is. the. betas for religious diversity that are 
subject to the greatest inefficiency from the 
collinearity (r = —.72) because of the compara- 
tively weak bivariate relationship between reli- 
gious diversity and church adherents (r = ‘— .27), 
and the strong relationship between religious 
conservatism and church adherents (r = .64). The 
result is a change in the direction of the 
relationship between religious diversity and church 
adherents similar to that obtained by Finke and 
Stark. Rather than constituting support for Finke 
and Stark’s argument, this finding underlines one 
of the main points of this paper: researchers must 
be cautions in making claims for variables that are 
both weakly related to the dependent variable and 
highly collinear with an independent variable. Note 
also that a much stronger negative relationship 
between religious diversity and church adherents in 
the Finke and Stark paper (r = — .40 compared to 
— .27 in the present equation) resulted in a much 
higher positive Beta (.84 compared to .30). 
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tism is much more strongly related to church 
adherents than is religious liberalism. 


KEVIN D. BREAULT is Assistant Professor 
of Sociology and Psychiatry at Washington 
University. A 1986 graduate of the University 
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of Chicago, and a 1987 scholar at the Center 
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ences, Professor Breault is interested in 
research methodology and social problems. 
His work has appeared in a variety of 
journals. 





Appendix I 
Correlations, Means, and Standard Deviations for Variabies and Samples used in Table 1 
Rateof Religious Percent 
Adherents Diversity Catholic 
Equation 1 N = 310 Random counties 
Religious diversity -.27 
Percent Catholic .39 -.17 
Mean 55 .67 13 
Standard deviation 17 A7 16 
Equation 2 N = 300 Urban counties 
Religious diversity -.52 
Percent Catholic 57 -.7A 
Mean 49 .67 23 
Standard deviation 13 17 15 
Equation 3 N = 50 States 
Religious diversity —.40 
Percent Catholic .36 -.57 
Mean : 51 .70 19 
Standard deviation 12 .16 14 
Equation 4 N = 242 Random counties 
Religious diversity -.27 
Percent Catholic .40 —.19 
: Mean 55 69 13 
Standard deviation 18 AT» s 15 
Equation 5 N = 50 States— Corrected data 
for religious adherents 
Religious diversity —.38 
Percent Catholic 24 -.57 
Mean ; .55 .70 .19 
Standard deviation A3 i .16 .14 
Appendix 2 
Correlations, Means and Standard Deviations for Variables used in Table 2 
Rate of Religious Religious Religious 
Adherents Conservatism Liberalism Diversity 
Religious Conservatism 64 
Religious Liberalism 25 —.28 
Religious Diversity —.27 —.72 36 
Mean i 55 .28 B .67 
Standard deviation 7 AT. 07 47 


DR 


V 


RELIGIOUS PLURALISM AND URBANISM 


REFERENCES 


Blalock, Hubert M. 1979. Social Statistics. New 
York: McGraw-Hill. 

Breault, K.D. 1986. “Suicide in America: A Test 
of Durkheim’s Theory of Religious and Family- 
Integration, 1933-1980.” American Journal of 
Sociology 92:628—56. 

Breault, K.D. 1988a. "Beyond the Quick and 
Dirty: Problems Associated with Analyses Based 
on Small Samples of Ecological Aggregates: 
Reply to. Girard." American Journal of Sociol- 
ogy 93:1479-85. ` 

Breault, K.D. 1988b. “A Structural Analysis of 
U.S. Crime Rates.” Presented, Department of 
Sociology, Washington University. : 

Breault, K.D. and Augustine J. Kposowa. 1987. 
“Explaining Divorce in the United States: A 
Study of 3,111 Counties, 1980." Journal of 
Marriage and the Family 49:549—58. 

Christiano, Kevin J. 19878. Religious Diversity 
and Social Change. Cambridge: Cambridge 
University Press. 

Finke, Roger and Rodney Stark. 1988. “Religious 
Economies and Sacred Canopies: Religious 
Mobilization in American Cities, 1906.” Amer- 
ican Sociological Review 53:41—49. 

Kelley, Dean M. 1972. Why Conservative Churches 
are Growing. New York: Harper and Row. 

Quinn, Bernard, Herman Anderson, Martin Brad- 


1053 


ley, Paul Goetting, and Peggy Shriver. 1982. 
Churches and Church Membership in the U.S. 
Atlanta: Glenmary Research Center. 

Stark, Rodney. 1980. "Estimating Church Mem- 
bership Rates for Ecological Areas." Funded by 
the National Institute of Juvenile Justice and 
Delinquency Prevention, Law Enforcement As- 
sistance Administration, U.S. Department of 
Justice. Washington, DC: Government Printing 
Office. 

1986. “Correcting Church Membership 
Rates: 1971 and 1980." Manuscript obtained 
through personal communication with author. ' 

Stark, Rodney, Daniel P. Doyle, and Lori Kent. 
1980. "Rediscovering Moral Communities: 
Church Membership and Crime." Pp. 43-51 in 
Understanding Crime: Current Theory and 
Research, edited by Travis Hirschi and Michael 
Gottfredson. Beverly Hills: Sage. 

Stark, Rodney, Lori Kent, and Daniel P. Doyle. 
1982. “Religion and Delinquency: The Ecology 
of a ‘Lost’ Relationship.” Journal of Research 
in Crime and Delinquency 19:4—24. 

Stark, Rodney, Daniel P. Doyle, and Jesse Lynn 
Rushing. 1983. “Beyond Durkheim: Religion 
and Suicide.” Journal for the Scientific Study of 
Religion 22:120-31. 

Weber, Max. 1978. Economy and Society, edited 
by Guenther Roth and Claus Wittich. Berkeley: 
University of California Press. 





` COMMENTS 
————————————— 


EVALUATING THE EVIDENCE: 
RELIGIOUS ECONOMIES AND 
SACRED CANOPIES 


(Comment on Breault, ASR, this issue.) 


ROGER FINKE 
Purdue University 
RODNEY STARK 
University of Washington 


Breault claims that recent data do not support our 
proposition that religious diversity increases reli- 
gious mobilization. Moreover, he suggests that our 
finding is no more than an artifact of multicollin- 
earity. We find no merit in either claim. Long 
before seeing Breault’s note, we concluded that the 
data he has used were not suitable for testing our 
proposition. We remain convinced that the data are 
inadequate, despite the fact that an obviously 
necessary control for percent Mormon converts 
Breault’s strongly negative result to one that is 
solidly positive! In addition, there are no definitive 
tests for discovering when results are excessively 
influenced by multicollinearity, but our results pass 
muster against reasonable rules of thumb suggested 
by econometricians. 

Our primary objection to Breault’s note, how- 
ever, has to do with the puzzling fact that the 
results Breault reports, which were based on 
analysis of two randomly selected samples of 
counties, are utterly at variance with results 
obtained by a neutral investigator who based his 
analysis on all counties. Since Breault did not 
retain copies of his data files nor could he provide 
a list of the counties that made up either of these 
samples, we could not examine how his sampling 
went awry. 

We shall briefly spell out each of these points. 


UNREPRESENTATIVE SAMPLES 


Three of Breault's five equations are based upon 
subsets of counties. The first is a 10 percent 
random sample of all counties, the second is a 10 
percent random sample of all counties minus the 
urban counties, and the final subset consists of the 
300 most populous counties in 1980. Breault 
reports that for each of these samples the "results 
show a highly significant, consistently negative 
relationship between religious pluralism and reli- 
gious participation." Since we could not recon- 
struct Breault's samples, we could not check this 
for ourselves. Therefore, we asked Laurence R. 
Iannaccone of the Department of Economics, Santa 
Clara University to replicate Breault's equations on 
the full set of U.S. counties. Not only did he fail to 


find a negative result, he found a highly significant 
positive beta of .21 with a standard error of .04 
(t = 5.99).! 

However, using the subset based on the 300 
most populous counties, we were able to replicate 
Breault's findings. When entered into an equation 
with percent Catholic, diversity does display a 
negative effect on religious participation. How- 
ever, this equation omits a vital control. Examina- 
tion of the data showed that four counties with 
extremely low diversity scores were among the 
very highest on religious adherence. All four are in 
Utah and are more than 80 percent Mormon. Just 
as the Catholics dominated many urban areas at the 
turn of the century, and continue to do so today, 
Mormons dominate the urban areas of Utah. We 
have never suggested that it is impossible for a 
denomination to dominate a local market while | 
sustaining a high level of religious mobilization, 
especially if this is a faith that regards itself as 
somewhat of an embattled minority on the national 
scene. Indeed, this is why we controlled for 
Catholics when we used the 1906 date. 

When percent Mormon is entered as a control, 
as shown in Table 1, the negative effect of 
religious diversity obtained by Breault shifts to a 
positive effect. And the effect of diversity is even 
greater when the four Mormon counties simply are 
excluded from the analysis. Despite these agree- 
able results, however, we are unwilling to regard 
this as a worthwhile confirmation of our proposi- 
tion because the data are so flawed. 


LIMITATIONS OF THE DATA 


The 1980 statistics on church membership used by 
Breault were collected by Bernard Quinn and his 
associates at the Glenmary Research Center on the 
basis of information supplied to them by various 
American religious denominations. The Glenmary 
data are an extremely valuable resource which we 
have often utilized. However, they are not suitable 
for testing our theoretical propositions about the 
impact of diversity on religious participation. 
Why? Because hundreds? of denominations did not 
cooperate in the data collection and thus must be 
omitted from the diversity calculations. Nor is it 
simply that the Glenmary data are limited to the 


! Despite this strong coefficient, the explained 
variance was only a minuscule 1.3 percent — again 
not in agreement with Breault's findings. 

? The Glenmary data set is based on 111 
denominations while Melton (1987) provides 
profiles of 1,347 separate religious denominations 
functioning in the United States today. 
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Table 1. Standardized and Unstandardized Regression 
: Coefficients for Religious Diversity, Percent 
: Catholic, and Percent Mormon with Rate ‘of 
Adherents as the Dependent Variable d 


B 
1 


! Rate of 
Church Adherents ^ 
1 — 2* 
1 E ee Mtr CE t ——— ey 
l B b B b 
Diversity ^ ' 15 2 32 .18* 
% Catholics .' .70 62" 7 .67* 
% Mormon  . .35 .59* * 
R = 4 36 36 | 


s Equation 2 omits the four counties that are over 80% 
Mormon. : 
* At least twice its standard error. 


larger bodies and omit only the tiny ones. In fact, 
the data include six denominations, each of which 
has five or fewer congregations nationwide, and 
omit denominations -with thousands of congrega- 
tions and millions of members. These. omissions 
seriously truncate diversity calculations in nonran- 
dom ways. In particular, the immensely rich 
pluralism of black Protestantism is almost com- 
pletely ‘eliminated from ‘the .diversity scores. 
Breault is able to include only four black Protestant 
groups, two of them extremely small (one with 5 
congregations nationally, the other with 31), while 
the two largest black denominations (with 2.7 and 
5.5 million” members respectively) are missing 


along with scores of others, many of which have a . 


very significant presence in some cities. Breault 


cites Stark’s procedures for correcting the data to. 


reflect blacks, but this applies only to estimating 
the total rate of church membership in a county and 
offers no means for estimating the missing black 
component of diversity. The omission of the major 
.black denominations causes serious underestimates 
of diversity in all counties having large black 
populations. 

We used the 1906 data, not because they 
included so many more denominations than did the 
1980 Glenmary statistics, but because they in- 
cluded a much more complete enumeration of the 
groups existing in the religious economy ,of the 
time. Moreover, the 1906 data are for cities, not 
counties. Because of the much smaller size and 


more politically delimited structure of cities at that . 


time, cities seem a much more realistic unit than 
counties for assessing local religious economies. 
That is, people ‘experience diversity in a very local 
way. No one spends half of Sunday commuting to 
and from church. Diversity consists of the number 


of nearby religious alternatives. This also is the . 


reason why we did not: use states as units of 
analysis, and why Bréault’ s use of states ‘is 


irrelevant. i 
1 


E , 
Breault uses bold tones in the text to flatly assert 


_ diversity,” 


|: "competitive and work h 


^high rate of adherence. Indéegthro 
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that our findings are due to high collinearity. But, 
‘sotto voce, in a footnote he hastens to the 


test based on variance inflation factors as “not 
conclusive.” Since macroeconomists constantly 
:work with highly correlated variables (ofterr above 
:.90), econometrics textbooks give. the most 
complete: and sophisticated discussions ‘of the 
subject. And all leading texts point out that while 
there are many suggestive tests for multicollinear- 
‘ity, there is none that is authoritative. Kennedy 


' (1981) states flatly that “no formal tests exist” to- 


‘determine when multicollinearity is too: great. 
Judge et al. (1980) begin their review of diagnostic 


“techniques for assessing multicollinearity thus: 


"Contrary to conventional wisdom these measures 
are for the most part ineffective rules of thumb." 
Schroeder, Sjoquist, and Stephan (1986) agree: 
"there is no statistical test that can determine 
whether or not it really is a problem." Finally, Rao 


‘and Miller (1971) counsel: "One should not, a 
. priori, rule out estimation of any regression 


equation because of high simple correlations 
between any two independent variables." 

' Among, the best ways to assess potential 
problems of multicollinearity are the following 
three. First, are the standard errors low? Ours are. 
Second, do the betas remain stable when other 
variables are added to the equation? Ours do. 
Third, do. the equations remain essentially un- 


' changed for subsets of the sample? We obtained 


the same results for subsets made up of larger and 
smaller cities. 


CHRISTIANO'S RESEARCH 


We find Breault's review of Christiano’s research 
highly misleading. What Christiano (1987) re- 
ported was that the greater the extent of “Protestant 
the higher the level of Protestant 
participation, which precisely supports our results. 
What seems to have misled Breault is ‘Christiano’s 
report that greater “ethnic diversity” (Protestant- 
Catholic-Jew) results in lower rates of religious 
participation. That is quite another matter. ‘Chris- 
tiano's text might ‘have been clearer on this 
distinction, but we have checked our reading of 
these passages with the-author who assures üs that 
-we are correct about what he found and what he 


CONSERVATISM Ms 
Breault concludes by suggesting that "religious 
pluralism is differentially effective in undermining 
religion.” He argues that pluralism will be less 
destructive to religiously conservatiye- reg 

In part, we agree with thi 
conservative religious gro jo 





areas dominated by these 
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two centuries, it has always been the more 
conservative religious groups that prospered and 
their growth always has come at the expense of the 
“liberal” denominations (Finke and Stark 1989). 
But despite our agreement that some groups are 
more effective firms in their religious economies, 
we are not convinced that Breault has provided any 
evidence to demonstrate that pluralism undermines 
religion, differentially or otherwise. 
:  Breault seems to suggest that Table 2 in his note 
- supports his claim that it is religious conservatism 
rather than diversity that sustains high membership 
rates. Several things must be noted, however, 
First, the results rest on one of his random samples 
of counties. These have been found to be 
unrepresentative of the full set of counties. 
Secondly, we find it hard to imagine why Breault 
would think an acceptable measure of religious 
conservatism could be created by adding together 
the Catholics, Mormon, and Southern Baptists— 
surely one of the strangest denominational clusters 
ever assembled. In fact, American Catholics aren’t 
very conservative these days (nor are they gaining 
, members). And where are all of the other 
evangelical Protestants? Whatever Breault’s rea- 
sons for these choices, the result is to have selected 
the three denominations whose membership rates 
are most negatively related to diversity—since 
each enjoys huge majorities in some counties. 
Given how religious conservatism was measured, 
how could the results have been otherwise? 
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A REEXAMINATION OF 
THE RELATIONSHIP BETWEEN 
RELIGIOUS DIVERSITY AND 
RELIGIOUS ADHERENTS 


(Reply to Finke and Stark, ASR, this issue) 


KEviN D. BREAULT 
Washington University 


1. I have completed an exhaustive examination of 
the entire set of U.S. counties and a large number 
of subsamples. Hundreds of hours, were spent on 
the project; literally hundreds of equations were 
analyzed. More than ever I am confident of my 
earlier findings. 

Investigators who use the religious diversity 
formula (1 - X(X/X)*, where X is adherents) must 
do so with care. The finding reported secondhand 
by Finke and Stark can be obtained if the formula 
is used incorrectly. Remembering advice I had 
received early in my. methods education, I 
embarked on the herculean task of checking the 
validity of my conclusions by examining every 
observation, laboriously computing each of the 
(3,100 +) diversity scores. I then went back and 
systematically checked every 10th observation for 
errors. Equation 6 of Table 1 presents the results of 
this work, a highly significant negative finding for 
religious diversity.! Table 2 presents the zero-order 
matrix, means, and standard deviations for equa-: 
tion 6. Close examination of the data suggested 
that for small counties a positive relationship might 


! These results were confirmed by' her ` 
Gilbert of the Department of Political Science and 


. the Social Science Computing Facility, Washington . 


University, and Augustine J. Kposowa of the De- 
partment of Sociology, Ohio State University. In my . - 
analysis, I follow the practice of including Virginia's. 
independent cities, and I hold church rates to 99.9 
percent (Quinn et al. report 31 counties greater than 
100 percent). With these changes, the strength of the 
negative relationship is somewhat decreased. Equa- 
tion 22 is for a random sample-of 100 counties.-. 


.Here, every 31st county in order of population rank 


was selected. County ranks for 1980 may be found 
in the County and City Data Book. The ad in- 
cludes no Mormon counties. f. 
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Table 1. Standardized and Unstandardized Regression Coefficients for Religious Diversity, % Catholic, and % Black 
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Rate of Church Adherents 
6 7 8 9 10 11 
B b B b B b B b B b 8 b 
D —.16 -17.75*  .11 13.45" -—.03 -—3.63ms —.06 —7.84* 08 7.35ns —.04  —3.01ns 
c 32 .38* — 44 .58* — 38 53% 39 .54* — 8 67* — 53 43* 
B —06 —.07*  .00 ins —.08  —:13* -—.07 -—.09* 23 26* — .09 .10* 
f 12 13 14 15 16 17 
B b B b B b B ^b p b B b 
D —.18 —12.43* .07  6.25ns —.09 ,—8.75ns —.10 —8.52ns —.27 ~—23.43" —.26 —24.79* 
C Al 36% — 54 .62* — 46 .43* 39 34* 29 26* ' 6 .26* 
B 06 Ons —.14 12" 12 .16*  .06  .07*  .05 06ns —.01 —.Oins 
18 .— 19 20 21! 22 
B b B b B b B 'b B b 
D —.18 -16.14* —.09 —10.94* —.16 —18.53* —.25 —27.53* —.17 -21.56us 
C 36 32* — 35 42* 34 .42* .32 .41* 
B œ .09* —.03  —.04ns ~16 —.20* —.16  -.16ns 





Note: See text for description of equations; D = religious diversity, C = % Catholic, B = % black. 


* p< 05. 
ns = not significant. 


obtain. For the 286 counties with less than 5,000 
population (1.9 percent of U.S. pop.), the 
relationship between religious diversity and church 
adherents is positive (equation 7, p = .048). 
While this finding is deserving of more attention, it 
evaporates when counties that are slightly larger 
are added.? For counties under 10,000 population 
(4.1 percent of U.S. pop.), there is no relationship 
(equation 8, N = 736). For counties less than 
15,000 (7.2 percent of U.S. pop.), the relationship 
becomes negative (equation 9, N = 1,136). In no 
samples other than those composed of the smallest 
counties does a positive relationship result. 

Finke and Stark argue that when they control or 
eliminate Mormon counties in the set of the largest 
300, positive results obtain. I did replicate this find- 
ing, but what Finke and Stark neglect to mention is 
that the degree of collinearity between diversity and 
percent Catholic over the non-Mormon counties is 
— .84 (compared to —.74 for all 300). For Finke 
and Stark’s purposes this collinearity is too high (see 
below) and it again raises my original question: 7f 
Finke and Stark are right, why can they not produce 
results favorable to their theory without high collin- 
earity? In an effort to reduce the effects of collin- 
earity in this data set, I reestimated it (without the 
four Mormon counties) with the WLS procedures 
originally used in this context by Christiano. Equa- 
tion 10 presents these results. For the same reasons, 


? As these are the smallest counties, measure- 
ment error is a strong possibility. Note also that 
these counties are also religiously unrepresenta- 
tive. While Catholicism is by far the largest 
denomination in the United States, these counties 
have virtually no Catholic membership. 


ridge regression was used (eq. 11). Ridge regression 
is specifically called for in cases where untoward 
collinearity effects are suspected. Because collinear- 
ity problems basically boil down to a lack of infor- 
mation, one of the best ways to deal with high col- 
linearity is to increase sample size. Equation 12 (OLS) 
is based on the largest 500 counties. Equation 13 
(OLS) is based on these 500 minus Mormon coun- 
ties. Equation 14 presents WLS and equation 15 NRR 
estimates for this sample. Equations 16 and 17 present 
OLS estimates for the 800 (50,000+) and 1,600 
(20,000 +) largest counties respectively. In addi- 
tion, the significant negative finding for the largest 
800 does not change when Mormon counties are 
omitted (eq. 18, OLS, N = 795)—this is also true 


for the full set of counties (eq. 19, OLS). In sum, 


this research conclusively demonstrates that, for ur- 
ban counties, support for Finke and Stark's theory 


‘depends on high collinearity. When various methods 


are used to reduce collinearity (WLS, NRR, and 
larger samples), the theory fails. 

.2. Finke and Stark question the quality of the 
county-level data, and I agree that for many 
purposes the Glenmary data are undesirable. But 
Finke and Stark have lost sight of the key 
mathematical issue at the core of this debate. Finke 
and Stark’s data yield a. negative r between 
diversity and adherents. My problem is with their 
multiple regression results, the strongly positive 
beta for diversity (.76, B = .84), which is the 
result of the —.88 r between diversity and percent 
Catholic. We disagree over whether this collinear- 
ity is a cause for concern, but since both data give 
a similar r, the data are not at issue. Perhaps Finke 
and Stark are saying that, with better data, the r 
between diversity and adherents would be —.4 (as ' 
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Table 2, Correlations, Means, alid Standard Deviations for the Entire Set of US. Counties, 1980 w = 


3, BS is hd 


.Rate of 
; Adherents 
Religious diversity 1 ~.22 
Percent’ Catholic 37 S 
. Percent black s —.2 s 
Mean 2. 3:5 i 


Standard deviation .18 


^in their study) compared to. —.52 (for the largest 


300 counties) in mine. If this were true, it would 


‘neither damage my argument nor protect their | 


theory from criticism. Indeed, the similarity of the 
results supports the usefulness: of Quinn et al.’ s 
data in this research. The main way in which the 
results differ is in the degree of collinearity. This is 
because Finke and Stark look at the 150 largest 


units, and I, in an effort to reduce collinearity, 
analyze larger samples. The collinearity in the 150 ' 


largest counties (— .85) is similar to that i in Finke 
and Stark’ s 1906 cities.3 - 
Finke and Stark contend that analyses based on 


c, _ the Glenmary data may be biased by the omission 


eee — 


: of many black churches. To assess this possibility, 


I omitted. all counties with more than 20 percent 
black, leaving a sample size of 2,623. The findings 


(eq. 20) are virtually unchanged from those'for the ` 


entire set. In addition, most of the churches 
omitted by Quinn et al. are small. The few large 
ones are widely distributed over many. counties, 
and, with the exception of a handful of counties 
that have very large black populations, nonblack 


. churches by, virtue of their sizé mainly determine 
' the religious diversity score (see formula). For 


example, Cook County, Illinois, has a large black 
population, 26 percent. The Quinn et al. compila- 


*tion omits more than 1,000 black churches in Cook 


alone. This seems like a huge number, yet the 
cumulative impact of the difference changes the 
diversity score. from .39 to .41. Extrapolating to 
other black counties, the difference is negligible. 


"Thus, the omission of black churches has a 


minuscule effect in comparison to the other effects 
observed with this data set, notably the negative 


relationship between religious diversity and church 
.adherents. This is not to say the Quinn et'al. data 


are useful for every pubes ore they are 


"ASR editorial policy does not'permit introduction 
of new data, in exchanges, but, it seems likely that 
if ridge regression were applied to Finke and 
Stark’s 1906 data, their positive relationship, might 
not survive. The issue seems to be one of 
collinearity and not quality of data. 


? Substituting r = —.4 for .— .52, the resulting 
B would be .05, not significant for the largest 300 
counties: the positive’ relationship disappears when 
solnm is i rediced; E 


i 4 


Percent 


Religious Percent 
Diversity Catholic Black 
.69 
— 24 
—-45 — 27 
6 . . 35 .08 
` d6 .15 "14 


3. It is my view that Finke and Stark have 
misinterpreted the literature on multicollinearity. 
They also igre ee key points of my paper. 
They allege that I hold different positions ‘on, 
VIF's, while my criticism of their work had: 
nothing to do with VIF’s. They also say their 
findings are not due to collinearity. How are we to 
understand this. If not the — .88 r, what do they- 
think caused the —.4 r to turn into a .84 p? 
Theoretical controls have a mathematical existence 
that is all too real. " 

Researchers should address three questions 
about collinearity: (1) does high collinearity exist, ` 
(2) does it significantly affect results, and (3) if so, 
what if anything should be done about it? 
Beginning with the first question, difficulty arises 
with equations containing many variables. Groups 
of variables may be strongly collinear, even when 
pairs of variables do not show high collinearity. 
For pairs of variables, the convention is .8, but r's 
above .6 should be considered, especially if the 7's - 
between the X's and Y are significantly lower than 
those between the X's. One fairly reliable test for 
high collinearity is when r° = R?. For example, the 

in Finke and Stark's research is .62 while their 
r? = ,T], indicating high collinearity. Note that 


. this test is not useful when Ri is lower than about 


35.5 


There is a simple mathematical relationship 
between r's and B's in the case of three variables. - 
If there had been no collinearity in Finke and 
Stark's research, the Q for diversity would be, 
—.40. By increasing the collinearity, .the B's 
decline. For example, the B which results when the. 
collinearity is —.6 is —.01. Additional increases 
in the collinearity, however, cause the B to tum 
positive or "bounce." When rjik = —8, B = .36; . 
rjk = —.85, B = .58; rik = —.87, B — -.72;. 
rjk = —.89, B = .90. Note the accelerated rise in. 
B as a function of very high collinearity. " 

$ Every test has its limitations, and the quotes . 
Finke and Stark take from various econometric 
texts are in general unobjectionable. Certainly they 
are no justification for Finke and Stark's compla- 


' cency in regard to high collinearity. Not only is Finke 


and Stark's view not tbe standard in the economet- 
rics literature, but it is not the standard in sociology. 
Needless to say, econometricians are (and were) the 
first to point out that collinearity represents a, 
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Once high collinearity is suspected, the re- 
searcher should determine if it significantly affects 
the estimates. Here, a variety of tools come into 
play, including tests such as VIF’s now standard 
on statistical software. The controversy about these 
tests is that they do not detect all collinearity 
effects, and that, under some conditions, these 
effects are not unacceptable. They should not, a 
priori, lead researchers to do something about high 
collinearity. VIF’s are a good measure of the 
inflation of variance in parameter estimates—one 


of the most common effects of high collinearity — 


but they do not pick up other effects. Researchers 
should use several ways of investigating suspected 
collinearity problems. Besides the formal tests, 
there are other signs such as low :'s and high 
errors. However, high ?'s and low SEb's do not 
rule out problems, because it is quite likely that 
collinearity will produce high betas (and high £s). 
Another indication is whether specification changes 
significantly alter the betas. For example, when 
Finke and Stark add the variabe SUNDAY 
SCHOOL to their equation, the beta for diversity 
drops 66 percent from .76 to .26. Another strategy 
is to change the number of observations. As we 
saw, when Finke and Stark deleted a small group 
of Mormon countries, large changes to the beta 
indicative of severe collinearity resulted (from 
—.17 to .18). In general, any unexpected or 
unusual beta should be suspect. 

What should researchers do about significant 
collinearity effects? In some cases, there is no need 
to do anything. For example, in comparing 
different models of the economy, well-understood 
collinearity effects may be discounted. Moreover, 
we, and often economists, are not always inter- 
ested in making causal inferences. In such cases, 
collinearity may not be relevant. - 

When should we do something about a serious 
collinearity effect? There is little disagreement 
here: when a significant collinearity effect is 
suspected and the research objective is to establish 
causation, investigators are obligated to take 
appropriate steps. Because sociologists are mainly 


serious problem for causal interpretation (see 


below), that with sufficient care investigators can 
diagnose, with a high degree of probability, cases 
where collinearity may be a problem, and that 
above all, the scientifically responsible investigator 
must err on the side of caution. The fact that there 
is no simple cookbook recipe for the multicollin- 
earity problem means that researchers must be 
more cautious, not less. 

6 In addition to using representative samples, 
increasing sample size and the use of different 
estimation techniques, a more risky option is to 
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interested in making causal inferences, multicollin- 
earity is an issue that should be taken seriously. 
Note that Finke and Stark’s primary causal 
interpretation concerns one of their collinear 
variables, an especially risky leap of faith. 

4. Christiano did indeed report results on 
Protestant and “ethnic” diversity, yet, -as I said, 
neither variable was significant. In a discussion, 
Christiano indicated that neither proposition was 
statistically supported: his work plainly fails to 
support Finke and Stark's theory, and the t values I 
report for him are correct. Finke and Stark’s claims 
about Christiano’s work are based on the direction 
of his finding. 

5. It occurred to me that much of this enterprise 
was suspect, including my own test of the 
conservatism theory—and for far more serious 
reasons than Finke and Stark suggest. Finke and 
Stark conflate two meanings of causation with 
respect to religious adherents. To suggest that 
religious diversity is a cause of adherents is quite 
different from saying the percent Catholic is a 
cause. Adherents is the sum of Catholicism and the ` 
other denominations. In an equation in which 
Catholic is X and adherents is Y, not only is the 
disturbance term not independent, but we know 
both what the omitted variables are (the other 
churches), and the proper specification of the 
model (all churches and Y). To include diversity 
(or any other variable) in such an equation is à 
serious specification error: either diversity or the 
denominational variables should be excluded. 
Without percent Catholic in the equation the 
relationship between religious diversity and church . 
adherents is always negative. This applies equally 
to 1906 and 1980, and it is true for every sample 
and estimation procedure (for the full set of 
counties see eq. 21, OLS). 

In conclusion, the best available evidence 
suggests that pluralism does indeed undermine 
religion. Finke and Stark's new attempt to argue 
otherwise is flawed for serious methodological 
reasons. Their claim of a positive relationship 
between diversity and church adherents rests on the 
scientific shoals of high collinearity and misspeci- 
fication. 


drop one or more collinear variables—the risk 
being specification error. At the same time, 
researchers need to pay attention to whether their 
equations are properly specified in the first place. 
Investigators may also wish to change the 
functional form of their equations or combine 
variables when theoretically feasible. If none of 
these options is possible, researchers must caution 
readers that the causal interpretation of their results 
is problematical. 
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éé Provides new insights into the operations of 
the American workforce- Lozano accomplishes 
this in the old-fashioned way by examining the 
motivations and experiences of individuals and 
companies rather than generating numbers. Á 
thoroughly illuminating book. 9 
— Sar A. Levitan, Director, 
Center for Social Policy Studies 

1989 0-02-919442-3 $19.95 
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Mayers, Raymond Sanchez — FINANCIAL MAN- 
AGEMENT FOR NONPROFIT HUMAN SERVICE 
- AGENCIES: Text . Cases . Readings. '89, 358 pp. (7 
' x 10), 18 il., 52 tables, about $53.75. 


C Thorman, George—DAY CARE... 
CRISIS. '89, 148 pp. (7 x 10), About $28.78. 


[3 Thyer, Bruce A. and Marilyn A. Biggerstaff—PRO- 
FESSIONAL SOCIAL WORK, CREDENTIALING, 
AND LEGAL REGULATION. ‘89, 112 pp. (7 x 10), 
about $24.75. 


O Jaynes, Judith H. & Cheryl A. Rugg— ADOLES- 
_ CENTS, ALCOHOL AND DRUGS: A Practical Guide 


pp. (7 x 10), 4 tables, $31. 75. 


a pare Steven E.—A GUIDEBOOK TO REHABIL- 
ITATION AND HUMAN SERVICES MANAGEMENT: 
Fine Timing for Excellence. '89, 310 pp. (7 x 10), 6 
il., 13 tables, about $47.75. 


CI Rubin, Phyllis B. & Jeanine Tregay — PLAY WITH 
THEM —THERAPLAY GROUPS IN THE CLASS- 
ROOM: A Technique for Professionals Who Work 
With Children. '89, 184 pp. (7 x 10), 1 il., 2 tables, 
about $31. 75. . 


a Smis Géorge P., II—FINAL CHOICES: Autonomy 
in Health Care Decisions. ‘89, 234 pp. (7 x 10), 
about $37.75. 


Q Gregg Charles H., Judy L. Robertus & J Blair Stone 
—THE PSYCHOLOGICAL ASPECTS OF CHRONIC 
ILLNESS. ‘89, 344 pp. (7 x 10), 7 il; 15 tables, 
$49.75. 


O Johann, Sara Lee & Frank Osanka~ REPRESENTING 
... BATTERED WOMEN WHO KILL. ‘89, 416 pp. 
(7 x 10) $64.75. 


EJ Rosenthal, Uriel, Michael T. Charles, & Paul T. 
Hart —COPING WITH CRISES: The Management 
of Disasters, Riots and Terrorism. '89, Bis pp. (7 x 
10), 22 il. ‘about $84.75. 


C] MacNamara, Roger D. — FREEDOM FROM. ABUSE 
IN ORGANIZED CARE SETTINGS FOR THE El- 
DERIY AND HANDICAPPED: Lessons from Human 
Service Administration. '88, 104 pp. (7 x 10), 2 
tables, $25.25. 


C] Underwager, Ralph. and Hoilida Wakefield — THE 
REAL WORLD OF CHILD INTERROGATIONS. '89, 
300 pp. (7 x 10), About $49.75. 











CHARLES C THOMAS » PUBLISHER 


© Plumer, Erwin H.— DIRECTION AND MANAGE- 
AN EMERGING - 


_ for Those Who Work With Young People. ‘88, 210 . 












' Books sent on approval * Catalog erit on request « Write or call (217) 789-8980 
2600 South First.Street Springfield * Illinois e 62794-9265 





C) Thorman, George — HOMELESS FAMILIES. '88, 152 
-PP- (7 x 10), $29.75. 


` 


MENT OF CHILDREN’S INSTITUTIONS AND 
AGENCIES: Responsible Guidance and Procedures. 
‘89, 144 pp. (7 x 10), 4 il., $28.75. 


CI Blank, Thomas O.—OLDER PERSONS AND THEIR 
HOUSING —TODAY AND TOMORROW 88, 322 
pp. (7 x 10), $52.75. 


C Wakefield, Hollida & Ralph Undenwager — ACCUSA- 
TIONS OF CHILD SEXUAL ABUSE. ‘88, 530 pp. (7 
- x 10), 16 tables, $75.50. 


O Jewell, David L—CONFRONTING CHILD ABUSE 
THROUGH RECREATION. '89, 384 pp. g x 10), 
.1 il, 28 tables, about $58.75. : 


a Lakebrink, Joan M. — CHILDREN AT RISK, '89,.398 |. 
pp. (7 x 10), 21 ll., 6 tables, $63.75. 


O Bierker, Susan B. ABOUT SEXUAL ABUSE. '89, 
232 pp. (7 x 10), $38.75. 


O Steen, Charlene & Barbara Monnette— TREATING 
ADOLESCENT $EX OFFENDERS IN THE COM- 
MUNITY. '89, 212 pp. (7 x 10), 61 il., $39.75. 


O Thomas, R. Murray — THE PUZZLE OF LEARNING 
. DIFFICULTIES: Applying a Diagnosis and Treat- 
ment Model. '89, 330 pp. (7 x 10), 19 il., 7 tables, 
about $4975. . i 


O Conti, John V.—COUNSELING PERSONS WITH 
CANCER, '89, 148 pp. (7 x 10), about $23.75. 


O Forrest, Gary C. — GUIDELINES FOR RESPONSIBLE 
DRINKING. '89, 178 pp. (7 x 10), about $31.75. 


C Foxman, Joel—A -PRACTICAL GUIDE TO EMER- 
GENCY AND . PROTECTIVE CRISIS INTERVEN- 
TION. ’89, 322 pp. (7 x 10), About $49.75. 


C France, Kenneth — CRISIS INTERVENTION, Second 
Edition. “90, 316 6 pp. (7 x10), 3 il., About $42.75. 


a Henderson, A PRACTITIONER'S GUIDE 
TO UNDERSTANDING INDIGENOUS AND FOR- 
EIGN aa ndr An Analysis of Relationships 

Between Ethnicity, Social Class and Therapeutic 
Intervention Strategies With Third World Peoples 
From Other Countries. ^90, 306 pp. (7 x 10), About 
$40.75. 
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SOCIAL CURRENTS 





AMERICAN INDIANS 

The First of This Land 

C. MATTHEW SNIPP. A benchmark chroni- 
cle of the position of American Indians within 
the larger context of American society, this 
unprecedented demographic study deals 
with the size, distribution, and social and 
economic characteristics of American 
Indian and Alaskan Native populations. 
Census Monograph Series. October/ 

448 pages/Cloth $42.50 


RURAL AND 

SMALL TOWN AMERICA 

GLENN V. FUGUITT, DAVID L. BROWN, 
and CALVIN L. BEALE. This important new 
book offers in-depth analyses of trends in 
the size, growth, and composition of Ameri- 


ca's nonmetropolitan population, a popula- . 


tion whose special characteristics and 
needs constitute a significant dimension 
of our nation's social structure. Census 
Monograph Series. October/512 pages/ 
Cloth $45.00 


‘BECOMING A MIGHTY VOICE 


Status Conflict and Leadership 
Change In the United Furniture 
Workers of America, 1932-1987 
DANIEL B. CORNFIELD. A detailed socio- 
logical analysis of union leadership change, 


focusing on the past half-century of activity - 


within the United Furniture Workers of Amer- 
ica. This vivid case history explains why 
status conflict recurs in the labor move- 
ment and how ethnic minorities and 

women emerge into leadership positions. 
December/320 pages/Cloth see 


LEADING EDGES IN SOCIAL 


: AND BEHAVIORAL SCIENCE 


R. DUNCAN LUCE, NEIL J. SMELSER, and 
DEAN R. GERSTEIN, editors. Based on . 
consultations with hundreds of scholars and 
professionals throughout the country, these 
indispensable reports identify and outline 
the "leading edges" of research in 30 topical 
areas of the social and behavioral sciences. 
November/576 pages/Cloth $55.00 


MARGINALISM AND ` 
DISCONTINUITY 

Tools for the Crafts of 

Knowledge and Decision 

MARTIN H. KRIEGER. A provocative and 
inventive anthropology of the technology of 
models employed in the social and natural 
sciences, showing how such models are 


tools for getting hold of the world and asking 


how we might take our "toolishness" seri- 
ously. October/224 pages/Cloth $25.00 


New in paperback 


THE POLITICS OF NUMBERS 
WILLIAM ALONSO and PAUL STARR, 
editors. "Sure to become the standard in 
the field." — Choice. Available/480 pages/ 
Paper $18.50 


BIG STRUCTURES, 

LARGE PROCESSES, 

HUGE COMPARISONS 

CHARLES TILLY. "A ioy to read. He writes 
elegantly and amusingly, he is insightful and. 
tough-minded." — Contemporary Sociology. 
Available/ 167 pages/ Paper $8.95 





RUSSELL SAGE FOUNDATION 


c/o CUP Services, P.O.Box 6525, Ithaca, NY 14851, or call 1-800-666-2211 (for orders only) 
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DOWN AND OUT IN AMERICA 
The Origins of Homelessness - 

Peter H. Rossi 

“A new landmark in the study of homelessness and ex- 


` : 
An A ] y S 1 S , treme poverty in the United States. His careful empirical re-. 


"LE 


boy 


pie) 


UNIVERSITY OF 
CHICAGO PRESS 


5801 South Ellis Avenue 
Chicago, IL 60637 


search, skillful data analysis, and thoughtful policy 
recommendations will make this highly readable and 
interesting book a standard reference for those concerned 


.about the problem of severe economiic deprivation in our 


affluent society."— William Julius Wilson, author of 
The Truly Disadvantaged 
Cloth $15.95 264 pages 


COSMOPOLITANS. 
AND PAROCHIALS 


Modern Orthodox Jews in America 
Samuel C. Heilman and Steven M. Cohen 
A provocative and fascinating portrait of what Jewish Or- 
thodoxy has become and how it survives and flourishes in 
America today. 
“This book should be read and contemplated by all stu- 
dents of the sociology of religion as well as by those with a 
articular interest in the sociology of Jewry."—Chaim I. 
axman, Rutgers University 
Paper $12.95 (est.) 216 pages (est) 
Library cloth edition $32.00 (est.) 


THE ANALYSIS OF IDEOLOGY 


Raymond Boudon 
Translated by Malcolm Slater 


Boudon presents here a critical history of the concept of 
ideology, providing an exhaustive and subtle critique of 
sociological explanations of ideology from early concep- 
tions to its current usage in the works of Barthes, Foucault, 
Habermas, Sartre, and others. 

Cloth $39.95 250 pages 


Now in Paper 

THE TRANSFORMATION OF THE 
AVANT-GARDE 

The New York Art World, 1940-1985 

Diana Crane ` 

"As the first comprehensive sociological analysis of the 


postwar art world, [this book] breaks with the convention- 
al, purely stylistic art histories which have been written of 


the period. . . . Crane sketches out the social context in 
which artists moved and worked."— Brian Wallis, Art in 
America 


Paper $12.85 204 pages 32 halftones 
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A Critical Dictionary 


of SOCIOLOGY LV. 


Raymond Boudon and Francois Bourricaud 
Selected and Translated by Peter Hamilton 


This distinguished reference work is much more than a lexicon or ordinary dic- 
tionary. Each entry is a small essay, complete with bibliographical references, 
which taken together form a coherent and sophisticated ny ve of the sociologi- 
cal tradition that synthesizes American, British, French, and German contribu- 
tions. Covering a broad range of topics, the entries deal with key sociological 
concepts and paradigms, rmeuiodo one: issues, classes of social phenomena, 


and major classical figures in the fiel l 
CHICAGO 


Cloth $49.95 624 pages 
University of Chicago Press 
5801 South Ellis Avenue Chicago, IL 60637 








1989 Guide to Graduate 
Departments of Sociology 


This edition of the Guide to Graduate Departments of Sociology includes information 
on 244 graduate departments of sociology. The Guide is a valuable resource for 
students, faculty members, advisors, department chairs, and researchers. 
Information on each department includes: 


D Department address and telephone 

D Chair and Graduate Advisor/Director 

O Degrees offered 3 

U Special programs and specialties offered 

O Tuition, financial aid, and application deadlines 

D Statistics on new admissions and degrees granted 

O Teacher training 

D Roster of full-time, part-:ime, and jointly appointed faculty 
O PhDs awarded in 1987-83 


Send your prepaid order ($5 to ASA members and students, $10 to non-members 
and institutions) to: American Sociological Association, 1722 N Street NW, 
Washington, DC 20036; (202) 833-3410. 





| The Fourth 
National 
Conference on 
Undergraduate 
Research 


April 19-21, 1990 
Union College, Schenectady, NY 
Abstracts must be submitted by February 13, 1990. 
For information, write or call: NCUR-90, Wells 
House, 1 Union Avenue, Schenectady, NY 12308 


UNION 


Life Course Studies 
David L. Featherman and David l. Kertzer, General Editors 


Family, Political Economy, and Demographic Change 


The Transformation of Life in Casalecchio, Italy, 1861-1921 
. David |. Kertzer and Dennis P. Hogan 


As in so many other parts of Europe, the northern Italian community of Casalecchio 
experienced massive social and economic changes in the nineteenth and early twentieth 
centuries. Characterized by sharecropping agriculture and large, complex family 
households, the community faced the effects of industrialization, urbanization, and 
dramatic political changes. Family, Political Economy, and Demographic Change repre- 
sents an unprecedented interdisciplinary effort to discover how changes in family life and 
demographic behavior actually occurred in this crucial period. 


ISBN 0-299-12190-9/Cloth $42.00 ISBN 0-299-12194-1/Paper $19.50 


THE University oF WISCONSIN PRESS 
| Ju 4+ Neri Murray Street, Madison. E ota 7799 
at your beckstore, or call (G03) 262-8782 for ordering information 
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Literature, Politics, 
and Culture in 


Postwar Britain 


ALAN SINFIELD 
New in cloth and paperback—"A major work 
of cultural criticism in the British Marxist 
tradition. It makes a significant 'interven- 
tion' in the very thing Sinfield is talking 
about, the social construction of literary and 
political values." —Patrick Brantlinger, 
. author of Rule of Darkness: 
The New Historicism: Studies in Cultural 
Poetics, 12 $45.00 cloth, $14.95 paper 


Reproduction and 
Social Organization 
in Sub-Saharan Africa 


Edited by RON J. LESTHAEGHE 
“A unique and pathbreaking-work that will 
exért a lasting influence on the thinking of 


demographers, anthropologists, and develop-. 


ment specialists working in Africa." 
—Etienne van de Walle, 
Population Studies Center, 
University of Pennsylvania 
Studies in Demography, 4 $50.00 


Leningrad 
Shaping a Soviet City 
BLAIR A. RUBLE - 


NEW FROM CALIFORNIA 





: | 
Losing Face 
Status Politics in Japan: 
SUSAN J. PHARR 


"Social scientists specializing in Japan will 


'certainly welcome this book as one of the 


most significant publications of recent years. 
It is remarkably successful iri linking Japa- 
nese culture to real politics and micro- to 
macro-processes, all in a'comparative per- 
spective. ... The book will be much cited in 
years to come." —]ohn Campbell, 

EE University of Michigan 
Lilienthal Imprint, 5 $25.00 


Collective Action 
and Community 


Public Arenas and the Emergence: 
of Communalism in North India 
SANDRIA B. FREITAG 

Freitag examines one of the central problems 
of modern Indian history—the Hindu-Mus- 
lim conflict. She challenges long-standing 
interpretations by defining this conflict as a 
developing social process involving complex 
and contradictory participation by numer- 
ous social groups—not simply “Hindu” or 
"Muslim". 2n $32.00 


1 


Throughout much of this century, cities around the world have sought to gain control over their 
urban destinies through concerted government action. Nowhere has this process of state 
intervention gone further than in the Soviet Union. This volume explores the ways in which 
local and regional political, economic, and cultural leaders in Leningrad determine the physical 
and socioeconomic contours of their city and region. . ' 
Lane Studies in Regional Government, 8, A $45.00 . 





NEW FROM CALIFORNIA 


Thinking Fragments 


Psychoanalysis, Feminism, and Postmodernism in the Contemporary West 


JANE FLAX 


“Flax’s long-awaited book is even better than I thought it would be. There are few schol- 
ars—if any—who could bring such a comprehensive, rich, and both appreciative and critical 
perspective to psychoanalysis, feminist theory and postmodernist philosophy. Her even- 
handed attitude toward all three—and the balanzed scholarly and practical background she 
brings to her analysis—is just about unique."—8Sandra Harding, University of Delaware 


Social Change and 


Modernity 


HANS HAFERKAMP and 

NEIL J. SMELSER, Editors 

New in cloth and paperback—Here is a wide- 
ranging collection of original essays examin- 
ing new directions in the study of social 
dynamics, both in Europe (especially West 
Germany] and in the United States. The 
authors explore new models of social change, 
including the controversial idea of autopoei- 
sis [self-regulation of society], feminism, the 
anti-nuclear movement, and.environmental 
politics. $49.95 cloth, $17.95 paper 


Powers and Liberties 


The Causes and Consequences of 
the Rise of the West 


JOHN A. HALL 

New in paperback—"The major strength of 
this book is as a theoretical account that 
takes both history and comparison seriously. 
The goal of understanding our contemporary 
situation is kept clearly in view throughout." 
—American Journal of Sociology 
“An engaging and wide-ranging book .... 
Should spur discussion of the crucial problem 

of how Western liberties survived." 
—Contemporary Sociology 
$12.95 paper 


$25.00 


The Power of 
Collective Purse 
Strings 

The Effect of Bank Hegemony on 
Corporations and the State 
DAVITA SILFEN GLASBERG 
Through a series of case studies, including 
one of Chrysler, another of the 1982 debt 
crisis of Mexico, and one of W. T. Grant, the 
author explores how financial institutions 
play an important role in the way corpora- 


tions deal with a variety of crises. 
$30.00 


Theories of Civil 


Violence 
JAMES B. RULE 


New in paperback——"This is a rich and imagi- 

native work steeped in common sense and di- 

rectives for improving our understanding. 

Rule is a master a: reflexive discourse. Theo- 

ries of Civil Violence will surely endure as a 
chronicle and commentary." 

—Gary T. Marx, 

author of Undercover: 

Police Surveillence in America 

$11.95 paper 
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Fleeing the Iron 
Cage 


Culture, Politics, and Modernity : 


in the Thought of Max Weber 
LAWRENCE A. SCAFF 
“The quality of scholarship in this work is 
superior. Its originality comes precisely in 
the author's willingness to have spent ex- 
traordinary time and intelligence translat- 
ing Weber's manuscripts and published 
works, and then showing how his socio! 
logical and philosophical passions worked 
themselves out in close relations with 
Michels, Simmel, Lukács, and others in 
his circle." —Alan Sica, author of 
Weber, Irrationality, and Social Order 
$35.00 


Immigrant 


Entrepreneurs 


Koreans in Los Angeles, 
1965-1982 

IVAN LIGHT and 

EDNA BONACICH 

"This book is an excellent case study, and 
much more.-It deals in broad scope with 
factors that affected these emigrants, such 
as the role of the US and Korean govern- 


ments in facilitating emigration, and how ‘ 


US capitalism participated in both. the 

creation and destruction of entrepre- 

peuhip at the national and international 
level. . . Interesting and readable." 

~-Choice 

$47.50 cloth 


NEW FROM. CALIFORNIA 


Rationalism, 
Religion, and 


Domination 


A Weberian Perspective 
WOLFGANG SCHLUCHTER 


. Translated by NEIL SOLOMON 


“Publication of this material in English 
should be a major event in American Weber 
studies. Together with Economy and 


Society, Weber's comparative studies in 


the sociology of religion represent not only 
his own central contribution to theoreti- 
cal sociology, but also one of the most 


' ambitious and fruitful research programs 


in the history of modern social theory. 

Schluchter analyzes both of these projects 

and shows how they are related." 

—Guy Oakes, Monmouth University 
$75.00 


Of Rule and 
Revenue 


: MARGARET LEVI 


New in paperback—"A marvelous read, 
full of interesting lore and powerful ideas. 
. Levi has pushed the historical study of 
resources mobilization to a new height." 
: —Aaron Wildavsky, 
University of California, Berkeley 
"Levi's approach is a fascinating blend of 
rational choice theory.and neo-Marxism. 
[She] offers a powerful and provocative 


í model of rulers and ruling.” 


— Society Magazine 

California Series ọn Social Choice and 
Political Economy, 13. 
“is $12.95 paper 





A Quarterly Journal of Social Science Methodology 
and Quantitative Research 


OClal James D. Wright 


Tulane University, New Orleans, Louisiana 


CIence Senior Advisory Editor 


Peter H. Rossi 


Social Science Research publishes papers devoted to quantitative social science 
research and methodology. Featuring articles that illustrate the use of quantitative 
methods in the empirical solution of substartive problems, the journal emphasizes 
those concerned with issues or methods taat cut across traditional disciplinary 
lines. Special attention is given to methods that have been used by only one 
particular social science discipline, but that may have application to a broader 
range of areas. 


ACADEMIC PRESS, INC. 


Please enter my subscription for: 
Social Sclence Research O American Express 
O Volume 19 (1990), 4 issues O Diner's Club 
ISSN 0049-089X : 
In the U.S.A. and Canada: $111.00° Expiration Date (Mo/Yr): 
A other countries: $148.00 


Imm ee COT 
[J A sample copy. number; 


Privieged personal rate information. Personal subscription 
rates are avallable onty on orders placed directly with the Pub- Signature: 
lisher and paid for with personal funds. Checks and money D 
orders must be drawn against a U.S. bank. x 
Method of payment: ! 
Purchase order O Check is CJ Credit information 
is enclosed. enctosed. is given. 

O Enter as a Standing Order (Institutions only) hereby authorizing 
Academic Press to service and bill our account each calendar 
year untll cancelled. 

Please note: Payment or credit card Information must accompany 

order. Prices inctude postage, handling, and alr freight where ap- 

plicable, and are subject to change without notice. Payment must 
be In U.S. currency, U.S. Bank Draft, Intemational Money Order, 
or UNESCO coupons. S0005 - 





















































The 
— ASA — 
Presidential Series 


Volumes of outstanding scholarship 
edited by former Presidents of the . . 
American Sociological Association 


> Social Change and the Life Course : Matilda White Riley, Editor 
(Order from: Sage Publications, 2111 West Hillcrest Drive, Newbury 
Park, CA 91320) ` 


Volume I: Social Structures and Human Lives $45.00 cloth/$19.95 paper 
Volume II: Sociological Lives : $29.95 cloth/$14.95 paper 
Two-volume set $66.40 cloth/$32.20 paper 


> The Social Fabric James F. Short, Jr., Editor 
‘(Order from: Sage Publications, 2111 West Hillcrest Drive, Newbury 
„Park, CA 91320) $35.00 cloth/$16.95 paper 
_ > Gender and the Life Course Alice S. Rossi, Editor 
(Order from: Aldine Publishing Company, 200 Saw Mill River Road, 
Hawthorne, NY 10532) . $37.50 cloth/$18.95 paper 


> Sociological Theory and Research: l a 

- A Critical Approach Hubert M. Blalock, Editor 
` (Order from: The Free Press, 866 Third Avenue, New York, NY 10022) 
$34.95 cloth . 


> Societal Growth: Processes and Implications Amos Hawley, Editor 
(Order from: The Free Press, 866 Third Avenue, New York, NY 10022) 
' $22.95 cloth. 


Members of the American Sociological Association are eligible for a special discount. For information, 
contact: ASA, 1722 N Street NW, Washington, DC 20036; (202) 833-3410. 





Teaching Sociology 


Theodore C. Wagenaar, Editor 


. Puts the emphasis on teaching : 
. . Helps you grow as an effective teacher 
. . Uses sociology to understand teaching 
. . Presents current resources : 


While there are many journals in the social sciences dedicated to golar 
research, there is now a publication from the ASA especially focusing on the 
transmission of knowledgė, on teaching, with direct application to sociology. 

This journal is useful for sociologists beginning their careers, for teaching 
assistants, and for seasoned teachers who look for new ideas and resources. The 
journal reflects ASA's commitment to effective teaching of the discipline, and to 
the importance of teaching as part of the academic profession. 

‘Teaching Sociology publishes research articles, teaching tips, and apes on 
teaching sociology. It includes shorter reports on class projects and innovations 
that were formerly published in the ASA Teaching Newsletter. A new section 
includes textbook and film reviews. Special issues have been devoted to 
humanist sociology, research methods and statistics, numan sexuality, textbooks, 
and demography. 

Recent articles have included: 

D Teaching the Sociology of Complex Organizations: Issues and Strategies, byT. E. Drabek 
D A Capstone Course in Applied Sociology, by R.C. Wallace 

C Microcomputer Tast-Generation Systems: A Software Review, m V. Sturtevant and B. 
Johnson 

B Moving Our Minds: Studying Women oi Color and Reconstructing Sociology, by M. 
Andersen 

O By Its Right Name: The Relevance of Poetry for Sociology, by J. b. Miley 

D Teaching Sociological Theory Through Video: The Development of an Experimental 
' Strategy, by E. Fails 

: O Intimate Work: Teaching Sociologists to Write, by D. Vaughan ; ! 


D Some Observations on Sociology Textbooks: An Editorial Perspective, by F, Graham 


ASA members, $13; non-members, $30; institutions/libraries, $56 
(Add $5 for subscriptions to be sent outside the U.S.) 


American Sociological Association 
1722 N Street NW, Washington, DC 20036; (202) 833-3410 
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A Puzzle for Sociologists 


(Solution provided) 


Across 
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A THEORY OF MARKET TRANSITION: 
FROM REDISTRIBUTION TO MARKETS 
IN STATE SOCIALISM* 


VICTOR NEE 
Cornell University 


State socialist redistributive economies are characterized by the allocation and 
distribution of goods through central planning. This paper develops a theory of 
market transition which argues that, in reforming socialist economies, the 
_ transition from redistributive to market coordination shifts sources of power and 
privilege to favor direct producers relative to redistributors. The shift improves 
incentives for direct producers, stimulates the growth of private markets, and 
provides to entrepreneurs an alternative path for socioeconomic mobility. A set of 
hypotheses test the market transition theory with household- and village-level data. 


Economic reform provides an unusual oppor- : 


tunity to study the transition of a state 
socialist redistributive economy to a market- 
like economy. Thus far researchers have 
focused on issues relating to efficiency and 
institutional change in analyzing the effects of 
market reform in state socialism (Kornai 
1986; Perry and Wong 1985; Stark 1986). 
Fewer scholars have studied the impact of 
market reform on processes of socioeconomic 
attainment (Szelenyi 1988; Whyte 1986). If 
the transition from redistribution to the 
market mechanism involves changes in the 
mode of allocating and distributing resources, 
the transition will probably change the 
stratification order. I propose to show that the 
shift from hierarchies to markets in a socialist 
economy changes the determinants of socio- 
economic attainment and therefore the sources 
of power and privilege. 


:* Direct all correspondence to Victor Nee, 
Department of Sociology, Cornell University, 
Ithaca, NY 14853. 

Special thanks to Brett de Bary, Louis Putter- 
man, Jimy Sanders, David Stark, Su Sijin, Ivan 
Szelenyi, Frank Young, David Weakliem, and 
Martin Whyte: I also wish to acknowledge the 
stimulating comments of my colleagues in the 
faculty working group and of graduate students in 
the theory seminar of the Department of Sociology 
at Cornell. Earlier versions of the paper were 
presented at the conference on "Social -Conse- 
quences of the Chinese Economic Reforms" at the 
John King Fairbank Center for East Asian 
Research at Harvard University, May 13-15, 
1988, and at the first annual meeting of the Center 
for Society and Economy at the University of 
California at Santa Barbara, May 26-28, 1988. 
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Redistributive economies collect and distrib- 
ute goods through centralized decision mak- 
ing (Polanyi 1957a). Of course, some redis- 
tribution occurs in any society, whatever the 
economy. Past societies with redistributive 
mechanisms include the primitive hunting 
tribe, ancient Egypt, Babylonia, and Peru, 
and the later Roman Empire. In modern 
market economies, redistribution takes place 
through the institutions of the welfare state, 
but in state socialist societies redistribution 
constitutes the integrative principle of the 
economy. These mechanisms in state social- 
ism fundamentally involve a vertical relation- 
ship between redistributor and producer, in 
which a multilevel bureaucratic hierarchy 
allocates resources and redistributes income 
(Kornai 1989). In these societies, redistribu- 
tion occurs within subsocietal units such as ` 
agricultural collectives, where collective cad- 
res provide central direction in allocating 
land, labor, and farm implements, distribut- 
ing income to households. While at the 
societal level, redistribution integrates the 
economy through the institutions of central 
planning, in a market economy, coordination 
occurs through a horizontal relationship 
between legally equal buyers and sellers at 
prices based upon mutual agreement. 

The role of markets in China has steadily 
increased since 1978 and the agricultural 
sector has undergone the most dramatic shift 
in reliance on market coordination. In 1980, 
the state instituted the "household responsibil- 
ity system" which is in many ways similar to 
private farming in a market economy, with 
the household leasing its land and paying the 
delivery quota as rent. "Under this system the 
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farm household has control over-the land it 
uses and can choose what to produce and how 
to market its products as is fhe case in a 


market economy" (Chow 1987, p. 296). This: 
change in ownership structure returned to- 
peasants incentives in line with their tradi: ' 
tional preferences for household production - 


and eliminated the “free rider”: dilemma that 


had plagued collectivist agriculture (Nee 


1985; Nee and Su, forthcoming), This shift 


resulted in dramatic increases in per capita 
income in thé 1980s. From 1978 to 1988, per 
capita income rose from 134 to 520 yuan; 
adjusting for inflation, this represented a 192 
percent increase in. per capita income. (State 
Statistical Bureau 1986a, p..582; People's 
Daily, January "3'and 14, 1989). Whereas 
one-third of rural households had:per capita 
income of less than 100 yuan (4:yuan = 
$1:00) in 1978, only 1 percent remained at 
this subsistence level by 1985. On the other 
end of the income distribution, only 2.5 
percent of: rural households , Téported : per 
capita income above 300 yuan in 1978; by 
1985, 62. percent reported per capita income 
of over 300 yuan, with 22 percent reporting 
over 500 yuan. The.correlation between 
increases ‘in per capita income and the 
transition from redistributive/collective agri- 
culture to household commodity production is 
striking (see Table 1).! In 1978, 66 percent of 
household income was derived. from the 


redistributive sector (agricultural collective): 


and only 27 percent from household produc- 


tion., By .1985, 81 percent of household. 


income came from household. commodity 


duction,. and only 8 .pereent from the ' 


redistributive/collective sector." The -value of 
market transactions approached half of the 
total ‘purchase value by 1985 at .70,500 
milion yuan from 23,500 million yuan in 
1980 (Watson 1988). Thus within a span of 
seven years, China's "rural economy experi- 
enced a rapid transition from a redistributive/ 
collective economy to a marketlike economy. 





! Redistribution takes place both. within the 
collective (through the allocative decisions made 
by cadres) and through the institution of central 
planning (state procurement of agricultural prod- 
ucts at lower than market prices and the monopo- 
listic, 'sale of manufactured products to peasants at 
prices substantially higher than world prices). Both 
aspects of redistribution are still' operative after 
decollectivization (Lardy 1989), but influence a 
deri smaller portion of the total pout 
income.. : E a 


m 
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Indeed, the agricultural sector has: ieu. 
a vast "second" economy, far larger in scope ` 
than the second economies of Eastern Europe 
and the Soviet Union. The second economy 
includes all income-generating activity out- 
side the boundaries of the redistributively 
coordinated and managed economy including ' 
private construction, manufacturing, com- 
merce, handicrafts, repairs, - services, and . 
moonlighting for private gain (Staik, forth- ` 
coming; Gabor 1979). Due to :the rural. 


- character of the Chinese second economy, its ` ` 
. most common units are the peasant household 


farm and firm. Though the household enter- 
prises of peasant entrepreneurs are typically 
very small, in principle théy can expand into 
larger enterprises that hire labor from outside 
the immediate kin group. Chinese private 
sector firms have access to official soutces of - 
credit -and thus have the potential for - 


expansion. Already the rapid growth of’ : 


household’ and joint-stock firms in villages 


. and small towns reflects the enormous growth 


potential of the Chinese second economy. 
Recently sociologists have pointed to the 
unexpected expansion of the informal sector 
in capitalist economies (Portes and Sassen- 
Koob 1987); even more surprising has been 
the rapid growth of the marketlike’ second - 
economy in state socialist economies. Produc- 
ers in the second economy of socialist 
economies conduct their business in a.zone 
relatively autonomous from the state. Like the 


informal sector or “underground” economy in .' 


capitalist societies, the socialist second econ- 
omy operates: in the shadows of the. modern 
and -state-regulated economy. The Chinese 
second economy, however, is both legal and 
subject to staté regulation, though in practice 
it is largely unregulated and untaxed. Produc- | 
ers in the second economy of socialist’ 
economies conduct. their business in a zone 
relatively autonomous from the state. As Stark _ 
aptly puts it; “The boundaries of the second 
economy and the relative proportions of its le- 
gal, illegal, and alegal parts are products ‘of 
contestation between state and society—a con- . 
tinuously. changing outcome of a struggle in : 
which society attempts to create and'maintain a 
sphere of activity relatively autonomous from ,. 
the state” (Stark, forthcoming). A defining fea- 
ture of the second economy in state socialism is 
the lack of fully legitimated and well-defined 
private property rights (which is why the sec- 
ond économy remains a marketlike institution). 
Just as the welfare State did not eke to. Socialist í 
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. Table 1. Transition From Redistribution to Market for Peasant Households* 


: 1978' 
I. Average net income per capita (Rmb) 133.57 
Income from redistributive sector 88.53 
Income from private cooperatives 
Net income from household commodity 
production 35.79 
Other nonborrowing incomes 9.25 
II. Proportion (95) (net income as 100) 
Income from redistributive sector 66.3 
Income from private cooperatives 
Net income from household commodity 
production 26.8 . 
Other nonborrowing income ' 6.9 


1980 198] 1982 1983 1984 1985 
191.33 223.44 270.11 309.77 355.77 397.60 
108.37 116.20 58.09 36.06 3533 33.37 

0.88 2.85 3.69 

62.55 — 84.52 187.55 244.66 285.44 322.53 

2041 22.72 24.47 2817 31.71 3801 

566. 520  2L5 116 100 84 

03 08 09 

3271: 378 (694 | 79.0 803 8l 

1.7. 102 91° 91 89 96 


* Source: Statistical Yearbook of China (State Statistical Bureau 1986", p. 582). Based upon a national sample - 


survey compiled by the State Statistical Bureau. 


transition in capitalist economies, so in state 
socialism, market reform is not likely to pro- 
duce full-blown market economies. Instead a 
mixed economy with a dominant state redistrib- 
utive sector and expanding second economy are 
the more likely outcome of market reform in 
state socialism, at least over the next. decade. 


This paper models the effect of the rapid ex- - 


pansion of the second economy on socioeco- 
nomic outcomes in state socialism. 


SOCIAL INEQUALITIES IN 

STATE SOCIALISM 
Official Marxist views of stratification in state 
socialism either deny the existence of classes, 
arguing that in the absence of private 
ownership of productive property there can be 
no classes, or conceive of society as compris- 
ing only two classes, peasants and workers 
(Parkin 1971). An early theoretical departure 
from the official Marxist position was Milo- 
van Djilas’s (1957) "new' class theory,” 
which built on Trotsky’s view of class 
antagonism between the working class and 
the new communist bureaucracy. According 
to Djilas, the bureaucracy constituted a new 
class because control over the means of 
production must be interpreted as a form of de 
facto ownership of productive property. 
Subsequent empirical studies of stratification 
in Eastern Europe tended to avoid. the 
controversial new class theory by focusing 
instead on analysis of occupational mobility 
and the division of labor characteristic of state 
socialist societies (Hegedus 1977; Ferge 
1979). In China, debate over social inequality 
has revolved around the issues. of the 
persistence of prerevolutionary class labels 


and. Maoist views on.new class formation. 
Western scholarship on stratification in China 
has generally sought to establish an empirical 
baseline (Whyte 1975) and to clarify issues 
raised by the Chinese debate over stratifica- 
tion (Watson 1984; White 1976; Kraus 1977). 
lvan Szelenyi (1978) formulated a theory 
of social inequality in state ‘socialism in re- 
sponse to perceived shortcomings of Djilas's 
thesis. By applying the concept of nonmarket 
trade (Polanyi 1957b) in a substantive analysis . 
of the redistributive mechanism in state social- 
ism; Szelenyi's theory specifies the underlying 
processes through which surplus is appropri- 
ated in state socialist economies. A feature of 
state socialism, argues Szelenyi, is that the price 
of labor is set administratively by the state. Just 
as labor markets are the central institution of 
capitalist economies, so the core institution of 
state socialist economies is the nonmarket trade 
of labor. When salaries and wages are set ad- 
ministratively and not through transactive mar- 
ket relationships, surplus is directly centralized 
in the state budget and redistributed according 
to centrally defined goals and values. Hence the 
State socialist redistributive mechanism appro- 
priates surplus directly from the immediate pro- 
ducer and creates and structures social inequal- 
ities through the processes of its reallocation. 
Szelenyi innovatively argues that the redis- 
tributive mechanism in state socialism differs 
fundamentally from that of the capitalist 
welfare state. In the welfare state, redistribu- 
tion reduces the inequalities produced in the 
marketplace (Wilensky 1975). Though the 
welfare state has a decidedly procapital bias, 
the pattern of state expenditures has resulted 
in modest gains for labor (Devine 1983) and 
in the reduction of poverty, though without 


changing the underlying pattern of wealth 
-concentration (Danziger and Weinberg 1986; 
Gough 1979). Scholars commonly assume 
that redistribution in the socialist state would 
be more progressive than in the welfare state 
‘(Stephens 1986). Contrary to expectations, 
` the redistributive mechanism in state socialist 
economies does’ not give rise to more 
equality, but to greater social inequality 
(Konrad and'Szelenyi 1979; Szelenyi 1983). 
In state socialism, : argues: Szelenyi, the 
expansion of the dominant redistributive 
'sector of the economy adds to the advantages 
of the already privileged and powerful. The 
effect of redistribution’ is more evident in 
higher nonwage compensations for the *'redis- 
tributive class," such as housing, access to 
higher education, subsidies for certain com- 
modities, the health and pension plans, and is 
only partially reflected in salaried income 
(Szelenyi 1983). Redistributors are "selfish" 
in that they "favor 'their own kind' (or more 
.sociologically speaking: the class which is 
organized around the monopoly of redistribu- 
tive power) when they allocate scarce re- 
'sources" (Szelenyi 1978, p. 77). 


THEORY OF MARKET TRANSITION _ 

On.the basis of his substantive analysis of 
social inequalities in state socialism, Szelenyi 
speculated that “the interests of the powerless 
and disprivileged can be best served with 
increasingly transactive (and consequently 
market-like) relationships in the economic 
system" (Szelenyi 1978, p. 63). The dramatic 


increases in peasant household income follow- 


ing market reform in China and the conse- 
quent narrowing of the rural-urban income 
gap are consistent with Szelenyi’s inference. 
Under the redistributive economy, urban 
bureaucrats, professionals, staff, and salaried 
working class benefited from the low purchas- 
ing price of grain and other staples. Following 
market reform,. prices for most agricultural 
products have risen sharply. This has dramat- 
' ically increased the income of rural producers 
and has inflated the cost of food commodities 
to urban dwellers. Thus, whereas rural-urban 
inequality widened under the redistributive 
economy, it narrowed substantially after 
market reform (Ignatius 1988). Such descrip- 
tive analysis, however, fails to distinguish 
‘between ‘redistributors and producers in both 
‘the rural and urban sectors of the economy, 
though it does demonstrate that the transition 
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‘from redistributive ‘allocation to marketlike 


transactive exchange benefits direct produc- 
ers, in this case peasant households, which . 
make up abou: 80 percent-of the population: 
Though Szelenyi identifies the underlying 
processes of the redistributive mechanism in 
state socialism, he does not explain why the 
shift from redistributive to market coordina- 
tior. would benefit immediate producers; nor 
do Manchin and Szelenyi (1987) identify the 
processes underlying the expected transfer of 
power and privilege. By extending the logic 


'of Polanyi's and Szelenyi's analyses of the 


redistributive economy to a reforming social- 


‘ist economy, I propose the theory of market 


D 


"through marketlike exchanges, two things are © 


tranaition in the following interrelated theses: 

(1) The market power thesis: If surplus is 
no longer monopolized by the redistributive 
sector, and more is allocated and distributed 


likely to happen. First, less power— control 


„over resources—is.located in the redistribu- 


‘tive economy and more in marketlike transac- 


tive exchanges. Second, when the price of 
labor and goods is: based upon mutual 
agreement between buyer and seller, and.not 
set' by. administrative. fiat, direct. producers 
have more power over the terms of exchange 
` for their goods and services. Therefore, the 
transition from redistribution to markets 
‘involves a transfer of power favoring direct 
prodtcers relative to redistributors: 

(2) The market incentive thesis: Markets 
provide powerful incentives for immediate 
producers whereas redistributive economies ` 
depress -incentives. In the state’ socialist 
redistributive economy, administratively set 


‘prices for labor (in industry and services) tend 


to lack sensitivity to differential performance 
and (for agricultural products) are set' lower 
than market-determined prices. In market 


transactions, however, producers have the 


right to withhold their product or labor power 
until a mutually agreed upon price is set; as a 
result, a greater share of the ‘surplus is 
retained by direct prodücers.? There are also 
greater incentives for individual effort be- 
cause rewards are more closely related to 
‘individual productivity. This is likely to be 
reflected in higher returns of education, 
which i$ among the best indicators of human 
producivity (Mincer 1958; Schultz 1963). 


1 





? [nequality produced by market action is the 
nee of another paper in progress. ; 
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(3) The market opportunity thesis: In state 
socialism the transition from redistribution to 
matkets results in new opportunity structures 
centered on the marketplace. Rather than 
focusing exclusively on access to and mobil- 
ity within the redistributive sector, market 
opportunities open up alternative avenues of 
socioeconomic mobility. Thus changes in the 


structure of opportunities give rise to entrepre- ` 


. neurship as an alternative to bureaucratic 


advancement in state socialism.. -. 
Together the three theses: constitute a 


theory of market transition in state socialist - 


societies: They specify the central processes 
in the shift from hierarchies to markets that 
involve fundamental changes in the sources of 


. power, and in the structures of incentives and 
‘of opportunities. Changes in distribution ‘will 


flow from changes in power, incentives,.and 
opportunities. The processes are interdepen- 


dent and occur simultaneously. Overall, the- 


theory of market transition predicts that direct 
producers gain in power relative to redistrib- 
utors in the sectors of the socialist economy 


that experience a shift from redistributive. to.. 
` market allocation. 

The market transition theory turns on the, 
decisiveness of the shift to reliance on the: 
, Market mechanism in the allocation and 


distribution. of goods. A corollary of the 


theory is that less market coordination and. 


greater reliance on bureaucratic coordination 
will result in greater power of the class 


,' organized around redistribution. Therefore, in 


sectors and regions of the socialist economy 
where allocation and distribution continue’ to 
be based upon central decision, there will be 


, little or no change in the processes determin- 
ing stratification. For this reason we expect 


more continuity in the distribution of power 


‘and privilege . in less commercialized -hinter- 


land regions, collective enterprises managed 


` by township, county, and municipal adminis- 
trations, and especially. the socialized state 
sector: Another corollary of the market 
transition theory is that following;a shift to` 


market allocation sectors and regions that 
were “contributors” in societal redistributive 
processes prior to market reform will experi- 
ence net gains over those that were beneficia- 
ries of redistribution. For example, prior to 
market reform, rédistribution diverted. surplus 
from coastal industrial cities like Shanghai to 
support development in hinterland provinces. 
Economic reforms now. allow factories to 


retain a substantially greater portion of 


s 
© 


` 
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profits. Thus, in the’ balance of e chases 
between rural and urban-sectors and coastal 
and inland provinces, the rural sector and 
coastal provinces in China have ‘realized 
greater benefits from economic reform. 


THE POLITICAL ECONOMY OF C 


MARKET REFORM IN CHINA 


Kornai (1986) contends that the dilemma E - 


partial reform in the state sector results from’ 
"dual dependence" on bureaucratic and market 
mechanisms of economic coordination. At- 


‘tempts to impose the discipline of markets on ` 


firms in the-state sector have failed because _ 


market forces are constrained and dominated 
by continued bureaucratic micro-interventions 


` (Kornai 1984). As a result, the urban economy 
has remained redistributive despite efforts to` 


introduce more market coordination i in the state. 
sector (Lin 1989). 


By contrast, the rural sector, as area ' 


shown, has experienced a decisive shift from 


redistributive to market coordination. More- - 


over, the structure of opportunities has 
changed rapidly as the rural economy. has 


diversified from a predominant reliance on - 


agriculture to a wide range of other sources of 


income (Watson 1988). This change is : 


evident in tbe growing. proportion of the 
peasant household income derived from 


nonagricultural sources (Table 2): that figuie E 


tripled between 1978 and 1985. 
: Because market reform in urban areas is 


still at an early stage, I focus on the rural ' 


économy to test the market transition theory.? 
Though some agricultural collectives.in some 
regions of China persist as potent organiza- 
tions, that manage and allocate economic 


‘resources, the broader national trend away 


from redistributive to market coordination i in 


the Chinese Countryside is clearly docu- : 


mented in Table 1. 


At the village level "decoljecB visito: 


sharply curtailed the redistributive power of 


“ grass-roots” cadres (village officials). Nev- 


3 Using an ‘urban. sample, Walder (forthcoming) 


shows that in Tianjin return on party membership - 
declined from .1976 to 1985 and income inequality 
narrowed between cadres and workers. This finding 


is consistent with the market transition theory: how- : 


ever, it is difficult to assess from the Walder and. 


‘Ruan study what aspects of structural change resulted 
. in the relative decline in returns on human capital for 


party cadres and ‘the reduction of inequality. 
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Net Income Per Capita by Nature 
Item . Unit 1978 1980 1981 1982 1983 1984 1985 
‘I. Net income per capita Rmb 133.57 191.33 223.44 270.11 309.77 355.33 397.60 

Net income, productive Rmb 122.86 166.39 194.51 237.15 272.91 315.06 350.07 
Agricultural production* Rmb 143.47 149.62 170.58 203.65 221.77 250.36 263.81 
Nonagricultural production" Rmb 9.39 16.77 23.93 33.50 51.14 64.70 86.26 

Net income, nonproductive® Rmb 10.71 24.94 28.93 32.96 36.86 40.27 47.53 

IL. Proportion (Total net income = 100) 

Net income, productive % 92.0 37.0 87.0 87.8 88.1 88.7 88.0 
Agricultural production* % 850  Á 782 76,3 75.4 71.6 70.5 66.3 
Nonagricultural production? * 7.0 8.8 10.7 12.4 16.5 18.2 21.7 

Net Income, nonproductive* % 8.0 13.0 13.0 12.2 11.9 11.3 12.0 


* Referring to net income of peasants engaging in farming, forestry, animal husbandry, sideline occupation, and 


fishing. 


> Referring to net income of peasants engaging in rural industry, construction, transport, commerce, and catering 


trade. 


* Referring to income, both cash and in kind, sent or brought back by those working elsewhere, income received 


from collective accumulation fund and public welfare fund, 


income received from state budget, etc. 


* Source: Statistical Yearbook China (State Statistical Bureau 1986a, p. 583). 


ertheless, cadres still control the allocation of 
remaining collective assets (i.e., collective 
enterprises, orchards, ponds, and other pro- 
ductive properties owned by the collective), 
negotiate household quota production for the 
state, and collect taxes and other fees (Oi, 
forthcoming). In the transition from redistrib- 
utive to market allocation, economic transac- 
tions are often embedded in social networks 
in which officials hold pivotal positions, 
certainly more so than in a market economy 
(Nee 1989). Local cadres frequently act as 
brokers or middlemen in exchanges involving 
economic transactions between peasant entre- 
preneurs and state. agencies. Entrepreneurs 
must mobilize social networks to gain access 
to capital, labor, raw materials, technology, 
and markets, sometimes seeking access through 
bribes or gifts to local cadres.* 

To the extent that hierarchical forms of eco- 
nomic coordination remain dominant, personal 
connections between state agencies, firms, and 


* Local cadres may negotiate with entrepreneurs 
to have relatives included in the ownership of firms 
in exchange for preferential access to collective/ 
state resources (Zhang unpublished). This is a form 
of payoff to political capital that would not be 
picked up in my analysis unless the relative is a 
member of the cadre's household. My analysis of 
the declining significance of political capital 
(reflected in returns to former cadre status) 
suggests, however, that where redistributive power 
of cadres has declined, the payoff in personal 
connections to them is likewise likely to decline in 


significance. 


households continue to provide critical infor- 
mal linkages. But where market reform has re- 
sulted in bypassing hierarchies, the extent of 
vertical segmentation characteristic of socialist 
economies is reduced as horizontal market re- 
lationships establish new social networks be- 
tween private buyers and sellers. The market 
trans:tion theory maintains that the more com- 
plete the shift to market coordination, the less 
likely that economic transactions will be em- 
bedded in networks dominated by cadres, and 
the more likely power— control over resources — 
will be located in market institutions and in 
social networks (guanxi) of private buyers and 
sellers. My study seeks to document this shift 
in the sources of power. 


DATA AND MEASURES 


I derive 10 hypotheses from the three market 
transiton theses in order to provide multiple 
tests (Stinchcombe 1968) of the theory with 
data from peasant households and villages. 
As no:ed earlier, available national data is 
useful only for descriptive analysis. My test 
instead employs survey research data from the 
Fujian Rural Survey Project which was 
conducted during the summer of 1985 under 
the joint sponsorship of Xiamen University 
and the University of California at Santa 
Barbare, directed by the author who orga- 
nized and trained the Chinese research team. 
Thirty-four graduate students and lecturers, 
recruited on the basis of ability to speak local 
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. dialects, participated in the study.5 A cluster 


"sampling procedure selected 30 villages from 
"two periurban counties ‘chosen for their 


diversity in geographical.and socioeconomic 


conditions. For those villages I then randomly ` 


selected a sample of 725 households frem the 
household registers in the township govern- 
ments, resulting i in 624 completed interviews. 

The dependent variable, household in- 
come, is based upon retrospective reporting 
of all sources of household cash income— 
agricultural, sideline production, nonagricul- 
tural, remittance of family members working 
outside thé village. Interviewers carefully 
explained to informants (household heads) the 
importance of accurate estimates of household 
income for the success of the study. Interview- 
ers assured peasants of confidentiality and the 
independence of the survey from the Chinese 
government. The presence of a cadre was 


` ascertained in a separate form prior -to the 


main household questionnaire. The word, 


"cadre" did not appear in the course of a 76-" 


to 90-minute interview. Households with 
cadres should not have been any more 
apprehensive about disclosing a full estimate 


"'of household income than, say, entrepre- 


neurs. But to the extent that households of 
current cadres systematically understated their 
household income, the estimations reported in 
the test of Hypothesis 1 of the market power 


^^ thesis may be misleading.” 


5 I am grateful to these participants and to my 
.Sponsors,at Xiamen University for their support 
and contribution to the Fujian Rural Survey 
Project. I am appreciative of the contribution of 
Lin Bogiang, an economist trained at UCSB, who 
supervised the work of the survey research team. 

ŠI made separate estimations of all the equa- 


‘tions using a measure of household income that 


included the estimated value of, food consumed by 
the household. The results were virtually identical 
to'the estimations using household cash income. 

Because ‘the study focuses on the effect of 
‘cominercialization on stratification, I chose cash 
income for the. dependent variable. When adjusted 
for food produced and consumed by the household, 

the mean household income was 3,417 yuan and 
mean per capita income, 646 yuan for 1984 in the 


„Fujian sample. The mean housthold per capita. 


income for’ China was 467 yuan in 1984. 
7 In general, bias in self-reported income in the 


Chinese interview setting is toward understating ` 


‘income. I note, however, that"the reported mean 
income for all households in the sample is 
substantially higher than the national mean re- 


as s the unit of analysis. 
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` The exogenous variables include estimates |. 
of household income for 1975 and 1980 
reported by.the household head.® The ‘human 
capital variable: includes the pooled educa- 
tional attainment of the household head and 
spoüse,? measured by. whether someone 
attended or graduated from primary, junior 
middle, senior middle, or technical school 
and college. I control for the effects of the age 
of.the hoüsehold head andthe number of 
adult laborers and children in the household. 
The age of the household head was based 
upon the household register and confirmed ' 
during the interview. It is modeled as a` 
quadratic function to account for the interac- 
tion, between weaker: physical capacity that 
comes with aging and enhanced human 
capital acquired through work experience. 
The number of adult laborers'and children 
under 14 years of age was based upon 
self-report and confirmed in reviewing the 
household census. In household commodity 


production, as in collectivized agriculture, the 


" t. 


ported by the State Statistical Bureau. For- groups 
that are more likely to underreport their income — 
cadres and ‘éntrepreneurs—the reported. méan 
income ‘was higher than that of the mean of. the 
sample. The: higher mean income reported by 
cadres both before and after the reform is at least 
consistent with my view that cadre ‘households 
probably gave reliable estimates of their household . , 
income. See Footnote 15. ` 

FA Household. incomes were not adjusted for . 
inflation. Peasants, however, benefit from price o 
inflation in agricultural products, the primary cause . 
of inflation since 1978. According to the official - 
statistics released by the Chinese government, the 
rate of inflation from 1978 to 1985 was 8 percent 
(State Statistical Bureau 1986b, p. 623). I am 
skeptical about the accuracy of these figures and 
assume that the real inflation rate was higher. ' 
Double-digit inflation probably did not occur until ` 
the mid-1980s. Overall, peasants seemed to have a 
ready recall of, their household income for these 
years. Because 1980 was the year of decollectiv- 
ization, the 1980 household income. was .a 
benchmark that peasants seemed to remember 


well. The 1975 income may be more subject to 


estimation error; yet because household income in - 
this period was relatively Stable and was calculated 
according to workpoints, this estimate is plausibly ` 
accurate. Interviewers were instructed to assist 
informants making adjustments for family -divi-. 
sions for estimations of 1975 and 1980 income. 

? Pooling’ the educational attainment of husband 
and wife is appropriate when the household is used 
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number of laborers in a household is expected 
to have a very strong net effect on household 
income.!? Children are the primary source of 
new labor.power for the peasant household. 
Though children under 14 years of age do not 
work at their full adult capacity, peasant 
children typical start to contribute to the 
household economy at an early age, helping 
out in household tasks such as feeding and 
tending livestock, weeding, gathering kin- 
dling, carrying water, taking care of smaller 
siblings, and other lighter tasks.!! 

The use of media by households is an index 
constructed from measures off how frequently 
households listen to the radio, watch televi- 
sion, and read the newspaper, magazines, and 
‘journals. This variable can be interpreted also 
as an appropriate measure for a peasant 
society of the household's cultural capital 
(Bourdieu 1977). 

Interviewers asked household heads whether 
a member of the household had ever served as 
a team, brigade, commune, or county cadre 
or was presently a cadre (village as well as 
higher levels). Household heads were asked 
whether they started up a household enter- 
prise in recent years and how much they 
invested in the business. In order to control 
for the size of investment, and therefore the 
relative scale of the enterprises, I used the 
amount invested in starting up a private 
business as the variable for entrepreneur. The 
mean start-up capital for entrepreneurs was 
1,603 yuan. 


10 Fei (1946) suggested that in household 
production the Chinese peasant family achieved its 
optimal efficiency at a size of about 5 members. In 
1985, the mean household size in the Fujian 
sample was 5.56, and the State Statistical Bureau 
national sample of peasant households reported a 
mean household size of 5.12 (State Statistical 
Bureau 1986, p. 583). The household structure in 
the Fujian sample ranged from nuclear (62 percent) 
to stem (22 percent) and joint (16 percent) family 
composition. The effect of household structure on 
changes in income, however, is not significant. 

.! Because children contribute to the household 
economy at an early age, the control for the 
number of children should show a positive effect 
on household income (see Table 3). Notwithstand- 
ing the net benefits of children's labor contribution 
to the household economy there is an inverse 
relationship between the number of dependents 
supported by a household and its per capita 
income. Thus, though children contribute at an 
early age to the household economy, they do so at 
8 level lower than experienced adult laborers. 
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Institutional or contextual variables were 
drawn from village-level data. Urban proxim- 
ity is measured by kilometers from the riearest 
city. Market access is a factor score that 
combined three locational measures of market 
access: distances to the nearest city, county 
town, and standard marketing town. Per 
capita farmland is based upon the arable land 
assigned to a household divided by the 
household size. The measure of the village's 
educational level is a factor score that 
combined the percentage of school-age chil- 
deen who graduated from primary and junior 
middle school in 1974 and 1984. Factor 
Scores are based on confirmatory factor 
analysis. Lastly, the density of households 
with overseas relatives is based upon esti- 
mates by village cadres. 

First, I examine the independent effects of 
occupational status on household income for 
current cadres, former team and brigade 
cadres, and entrepreneurs. Second, I compare 
the returns on human capital investments for 
the pooled husband's and wife's education on 
household income before and after market 
reform. Third, I estimate the effects on 
household income of the human capital 
variables, age of household head, the number 
of adult laborers and children in the house- 
hold, and media use (or cultural capital). 
Fourth, I examine the independent effect of 
urban proximity on household income before 
and after structural reforms. Lastly, I estimate 
the changing effects on household income of 
the institutional context: urban proximity, 
market access, village educational attainment, 
per capital farmland, and density of village 
overseas connections. 

A lagged income variable is used to absorb 
the determinants of household income not 
specified by the models. When lagged income 
variables are used, the regression analysis 
points to net. returns of the exogenous 
variables on change in the dependent variable 
between Y; and Y2. The structural equation 
model is: Yn m bY + bX; T b2X2 F b3X3 
+ b4X4 + U, where Yp is 1984 household 
income and bY, is the «Jagged income 
variable. 


THE MARKET POWER THESIS 
Derived Hypotheses 


Hypothesis 1 directly extends the market 
power thesis. If power is located more in 
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marketlike transactions and less in the redis- 
tributive sector, a differential growth of 
household income will occur. after market 
reform. In other words, "The distribution of 
rewards in a society is a function of the 
distribution of power, not of system needs" 
(Lenski 1966, p. 63). If Hypothesis 1 is 
supported, market reform will result in no 
higher returns for redistributors relative to 
direct producers and entrepreneurs. (See 
Weakliem [unpublished] and Giddens [1974] 
on the relationship between income and 
power in a market economy.) 

Hypothesis 1: The transition from redistribu- 
tive to market coordination results in changes 
in the underlying processes of socioeconomic 
attainment that favor direct producers relative 
to redistributors. 


I expect that changes in the underlying 
processes of socioeconomic attainment will 
reduce the value of political capital in a more 
generic sense. Not only are the direct 
controllers of the redistributive mechanism 
likely to experience a relative loss, but the 
value of their political capital accumulated 
through prior experience as cadre is likely to 
diminish as well. At the most general level, 
this hypothesis predicts a decline in the value 
of political connections. I can indirectly test 
this hypothesis by examining the returns on 
former cadre status. Former cadres have 
typically accumulated political capital through 
prior service and are likely to have strong 
guanxi ties (personal connections) with cur- 
rent cadres, since as former cadres they are 
likely to be members of the same status 
group. 

The decline in the value of political capital 
must be viewed relative to other groups. Thus 
household income of. cadres should have 
increased in recent years along the general 
trend. But relative to others, and especially 
entrepreneurs, cadres are not expected to gain 
special advantages, net of human capital and 
household characteristics, for their cadre 
status after the. shift to a marketlike economy. 
Social groups closely linked to the market 
rather than the redistributive mechanism will 
experience higher gain in household income. 
If the hypothesis is supported, entrepreneurs 
should have significantly higher returns than 


' cadres. 


Hypothesis 2: The more market exchange 
replaces the redistributive mechanism in state 
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socialism, the less the value of political 
capital relative to market capital. 


A widely held view among China scholars 

is that local cadres have been surprisingly adept 
in their response to market reform by becoming 
entrepreneurial themselves (Zweig 1986; Oi 
1986). According to this view, local cadres 
have used their political capital to establish a 
dominant role in the commercialization of ag- 
riculture to such an extent that entrepreneurs 
come mainly from cadre backgrounds. This . 
interpretation fails to take into account the 
changes in the underlying processes that de- 
termine socioeconomic attainment accompa- 
nying a shift from redistributive to market co- 
ordination. In a market economy, specialists 
in redistribution are not necessarily more likely 
than other households to have the experience 
and orientation required for private sector en- 
trepreneurship. 
Hypothesis 3: In sectors of the socialist econ- 
omy that experience a shift from bureaucratic 
to market coordination, redistributors have lit- 
tle or no net advantage in entering into private 
entrepreneurship. 


Results 


The results reported in Table 3 provide strong 
support for Hypotheses 1 and 2. In equation I, : 
I regressed household income on a dummy 
variable of current village cadres. The regres- 
sion coefficient, though nonsignificant, is neg- 
ative and is consistent with the hypothesis that 
net of human capital and household composi- 
tion, current cadres (CADRE) might not be | 
keeping up with other households. Likewise, 
in equation If, when I regress household 
income on former cadre status, former team 
and brigade cadres do not appear to be getting 
returns on their cadre status. Thus cadre status 
and social network ties (guanxi) built up 
during the tenure as cadres do not seem to 
confer economic advantages in a marketlike 
economy. 

When current and former cadres are 
compared to entrepreneurs in equation III, the 
results are more sharply contrasted. Current 
and former cadres seem to have gained no 
significant returns on their cadre status. On 
the other hand, cadre entrepreneurs are doing 
well and seem to have advantages over other 
entrepreneurs. This finding indicates that 
political power has utility for entrepreneurs. 
However, it may be that the cadre entrepre- 
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neur group includes individuals recruited to a 
cadre position because of their prior success 
as entrepreneurs. In controlling for entrepre- 


neurship, current cadres now show a statisti- . 


cally significant negative return for their 
cadre status. Team cadres were the lowest- 


Table 3. Effects of Cadre Status and Entrepreneurship 
on Household Income 


I rT in 
Education 89.131*** 86.572** 69.107** 
(29.825)? (29.982) (29.470) ' 
131° 27 -101 
Age 97.186* 91.747* 70.743* 
Á (46.758) (47.117) ` (45.980) 
5 .605 : 571 .441 
Age? —.996* —.939* -237* 
(.508) C512) . (.499) 
—.565 —-.533 . —.418 
Labor . 283,599" 279.928*** 266.952*** 
(49.059) (49.244) | (47.953) 
215 272 .259 
Children 126.630* 129.485* 120.440* 
à (63.460) (63.791) (62.451) 
.084 .086 .080 
Income '80 .993*»* .996*»* .990*** 
(.095) (.096) (.094) 
.431 .432 .430 
Media 91.950*** 92.230*"* 90.257*** 
! (23.265) (23.263) (22.747) 
ATi AT2 168 
Cadre — 187.885 — 168.992 —467.683* 
(273.837) (276.985) (290.258) 
— .028 — 025 =] 
Former brigade — 386.493 449.114 
cadre bn: 033) (368.913) 

. 044 051 
Former team — — 8.182 — 80.352 
cadre Cnt. 173) (205.983) 

—.002 — .016 
Peasant 
entrepreneur — _ .156** 
(.052) 
.122 
Cadre 
entrepreneur - - 584** 
(.209) 
.120 
Former 
brigade = = — 062 
entrepreneur (.344) 
— .007 
Former 
team — — 1.040** 
entrepreneur i (.558) 
.076 
Intercept —3314.211*** —3197,.29]*** —2601.119%"* 
dí | x 386 384 380 
P 403 .406 445 


Note: Regression of 1984 household income on 
‘husband and wife's education (Education), age’ of 
household head (Age), age squared (Age*), number of 
adult laborers (Labor) number of children under 14 
(Children), household income in 1980 (Income '80), 
media use (cultural capital) (Media), current cadre 


(Cadre), former production brigade cadre, former team. 


cadre, peasdnt entrepreneur, and interactions. between 
cadre status and entrepreneur. 
* p .05. ** p< .01. *** p < 001. 
* unstandardized coefficients. > standard error. 
* standardized coefficients. 
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level cadre in collectivized agriculture and 
were routinely engaged in agricultural work 
themselves. Thus they are advantaged, not as 


.cadres but as direct producers in a marketlike 


economy. Cverall, these findings provide 
strong evidence that the sources of power 


- have shifted decidedly from the redistributive 


economy to the marketplace. 

The cross--abulation of cadre with entrepre- 
neurs shows that current and former brigade 
cedres have a somewhat higher probability of 
being entrepreneurs than noncadre house- 


.holds, which is consistent with the view that 


cadres have been. adept in their accommoda- 
tion to a marketlike economy. This notwith- 
standing, . the overwhelming majority of 
entrepreneurs come from direct producers, ‘as 
predicted by the theory of market transition. 
Moreover, despite the descriptive evidence of 
a higher. probability, the Goodman and 
Kruscal’s Tau test indicates that knowledge of 
cadre background reduces the error in predict- 

ing entreprereur status by one-half of one 
percent. 2 7 

A growing consensus among political 
sc:entists concurs with the findings reported 
in this analysis: that there has been a 
"redistribution of power to the peasantry" 
(White 1987, p. 418) following market 
reform (Shirk 1989; Latham .1985). I note, 
however, that the results of our analysis 
appear to be at odds with the impressions of 
some scholars who have recently visited 
China. On the basis of case studies and 
anecdotal evidence, they report that rural 
cadres have been extraordinarily adept at the 


-12 Tables 3 and 4 refer to a reduced sample size 
because the cadre status item was added to the 
survey instrument after the survey was begun. For 
this reason, households in 6 of the 30 villages were 
dropped from tke analysis reported in Tables 3 and 
4. But since.the villages were selected randomly, 
the-e should be no sample selection bias caused by 
the 1 reduced sample size. 

© personal communication from Ezra Vogel; 
Louis Putterman, Huang Shu-min, and Dorothy 
Solinger. Voge. reports on the basis of work in 
Guangdong that many cadres have been bought off 


. by entrepreneurz. Huang conducted a village study 


in which the party secretary used his political 
capital to become a prosperous entrepreneur. 
Soknger reports that cadres exert influence through 
their control of mformation and official sources of 
credit. And Putierman suggests that cadres appear 
to control marxet guanxi in the areas he has 
visited. 
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instrumental use of political capital—guanxi 
ties and official position—for private gain. 
These seemingly contradictory findings, how- 
ever, need not be interpreted as such. I 
assume that rural cadres have been vigorous 
in pursuing their material interests with all the 
means available to them, and that guile and 
opportunism have been common among rural 
cadres. After all, if the state exhorts peasants 
to enrich themselves, why should -peasant 
cadres be excluded from the pursuit of private 
gain? Current cadres and former cadres have 
become entrepreneurs (see Table 4). More- 
over, rural cadres have traded official favors 
for gifts or bribes, gained unfair advantage in 
the private use of collective resources, and in 
general attempted to exercise control over the 
commercialization of the agricultural econ- 
omy (Nee 1989). 

The market transition theory maintains that 
redistributors in sectors of the socialist 
economy that have experienced market re- 
form are unable to keep up by drawing on 
their political capital alone, because fundamen- 
tally, the shift from redistribution to markets 
involves a change in the sources of power and 
privilege. Reflecting this change, I cite an 
example of a common attitude conveyed by 
rural cadres to the field research team in the 
course of open-ended interviews. “Nowa- 
days, if you don’t work, you have nothing to 
eat. I have tons of work to do for my family. 
How can I have energy to take care of the 
village's affairs? Besides, if I do a good job 
for my family I'll earn more, but a good job 
for the village earns me nothing. Why 
bother?" Prior to market reform, rural cadres 
maintained near monopoly control over vil- 
lage economic resources. But following the 
reform, village-level cadres no longer have 
that much power, and therefore there is less 
need for peasants to seek favors of them. 
While a cadre may benefit materially from 
taking an occasional bribe or accepting gifts, 
in a market economy the greatest increases of 
wealth come from successful entrepreneur- 
ship, and not from petty bribes and gifts.!4 
That many cadres accept gifts or bribes in 
exchange for official favor in itself reflects 
the shift in the sources of power which our 


1^ The assessment of cadre gains after market 
reform reported by field researchers very often are 
based upon observations made in the redistributive 
sectors where cadres continue to benefit form the 
"gift economy." 
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Table 4. Cross-Tabulation of Cadre with Entrepreneurs* 
Former Former 
Never Team Brigade Current 
Cadre Cadre Cadre Cadre 
Non- 80.7% 83.6% 72.2% 73.5% 


Entrepre 19.3% 16.4% 27.8% 26.5% 
neurs (n=72) (n=12) (n-5 (n=9) 
Total 100% 100% 100% 100% 

(n=373) (n=73) (n=18) (n=34) 


* Reduced sample size (n= 498). 
cal’s Tau = .005. See Footnote 12. 


findings document. It is also important to 
keep in mind that our analysis specifies the 
effect of structural transformation on change 
in income, not the level of household income. 


.In our sample, cadres have a higher mean 


household income before and after market 
reform, but the increase in household income 
is no higher than that of other households, 
and substantially lower than that of peasant 
entrepreneurs. 15 


THE MARKET INCENTIVE THESIS - 
Derived Hypotheses 


During the period of collectivized farming, 
schooling in rural China did not appear to 
serve as an avenue for family prosperity or 
individual social mobility. Though schooling 
was seen as providing useful knowledge, 
parents believed that schooling was not 
indispensable as preparation for farm work 
(Parish and Whyte 1978). Moreover, there 
were no clear benefits associated with more 
schooling. Schooling did not appear to 
provide ambitious young peasants an avenue 
for upward social mobility, such as assign- 
ments to state sector jobs or cadre positions in 
the village and local government. Middle 
school graduates typically had no alternative 
but to return to the village after graduation, 
where they discovered that they were often at 
a disadvantage to village youth who had 
entered farming earlier and had already 
become experienced farmers (Nee 1983). 
More schooling did not result in higher 


15 The mean household income in yuan for all 
households was 1,089 in 1980 and 2,344 in 1984 
(exclusive of cadres and entrepreneurs); for cadres 
it was 1,222 in 1980 and 2,839 in 1984; and for 
entrepreneurs it was 1,381 in 1980 and 3,242 in 
1984. 
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workpoint assignments from the team since 
workpoints were assigned almost exclusively 
on quantity of labor inputs. Nor did more 
schooling entitle a village youth to a cadre 
position, which depended more on the 
accumulation of political capital ‘from village- 
based activism than on educational credentials 
(Madsen 1984).16 

The transition to a marketlike ` economy 
should result in higher returns to human 
capital characteristics. First, markets provide 
powerful incentives for direct producers 
through both positive and. negative sanctions. 


In household production the economic well- ' 


being of households depends entirely on the 
performance of its productive members in a 


more complex and demanding economic 


environment. Rather than simply accumulat- 
ing workpoints based upon labor inputs in 
agricultural production, the household in a 


marketlike economy pursues a strategy of | 


maximizing profits from existing income 
‘streams and developing new, more Profitable 
lines of activity. Rather than’ earning work- 
points on a steady, predictable basis, peasants 
in a marketlike economy are transformed into 
petty entrepreneurs whose relative success is 
based upon the ability to make short- and 
long-term investment decisions based upon 
informed cost-benefit calculations. ‘Educated 


peasants’ who are better able to draw on and 


use available information have a clear advan- 
tage ‘over illiterate peasants (Schultz 1964). 
Second, in a marketlike economy, profit 
making. is associated with the production of 
cash crops, lucrative sidelines, and, nonagri- 
cultural enterprise. The ability to learn and 
acquire the .techniques required for the 
introduction of more profitable undertakings 
typically depends on having some degree of 
formal education, at last.enough to read and 
digest simple technical literature. In the 
post-Mao era of sweeping institutional re- 
forms, the educated peasant has an advantage 


16 Despite negligible returns on education in 
collectivized agriculture, school attendance in- 
creased during the Maoist era from prerevolution- 
ary China. The expansion of mass education in 
state socialism can. be explained by the change i in 
political power and associated ideology in a 
successful social revolution (Easterlin 1981). 
During the Maoist era, sustained ideological and 
mobilizational efforts were directed at increasing 
the: educational attainment of peasant children. 
These have déclined d following decollectivization. 
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in being more able to discern subtle policy 
transitions and political trends transmitted in 
the news media and to understand the 
implications of these changes for the local 
political economy. 

' In collectivized agriculture entire villages 
got along with a few subscriptions to the party 
newspaper and radio broadcasts. Today there 


is a surprisingly wide range of technical and 


market-oriented newspapers and magazines 
available to households, as well as a wider ' 
selection óf radio and television programs. In 
part this reflects the liberalization of the 
cultural sphere in post-Mao China. It also 
reflects the greater demand for. information 
among rural families. Rather than local cadres 
being the principal readers of printed news, 
after market reform peasant households con- 
sume an increasingly diverse range of media 
and printed information. The household's use 
of media (cultural capital) is expected to have 
a- significant net effect on the household's 
earning power in a marketlike economy. This 
is consistent with the revised attainment 
model proposed by DiMaggio and Mohr 
(1985) in which they incorporated cultural 
capital as an independent dererminant of 
socioeconomic outcome. 


Hypothesis 4: For both men and women the - 
transition from state socialist redistribution to 


‘ markets increases the value of education to 


the household. . 
Hypothesis 5: Media use ‘(or cultural capital) 
has a positive effect on household i income in a 
matkeuike economy; 


Results . E 


In Table 5, I do not attempt to estimate a 
complete model of the processes of household 


‘earning power (which is reported in subse- 


queat tables): Rather I assess the change in 
returns on human capital measured by school- 
ing following market reform. The purpose is 
to: determine the effects and significance of 
market reform on household returns to human 
capital investments rather than the parameters 
of the model as a whole. Lagged income 
variables for:1975 and 1980 capture: the ` 
unspecified determinants of household in- 
come. Though the effects of the combined 
husband and wife's education (EDUCA- 
TION) were not significant, its sign is 


- negative. Equation V reveals a dramatic 
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Table 5. Effects of Human Capital on | Household 
; Income 1980 and 1984 


Before After 
Market Market 
Reform Reform 
IV V 
Education — 13.5998 62.325** 
* (9.095) (23.741) 
— .045*. .094 
Income '75 1.011*** QM. 
(.040) ] 
` .759 
Income '80 — 1.208*** 
(.077) 
; : : .561 
Intercept 377.362***. 810.78*** 
df 468 . 524 
Pr 577 0322.7 





. Note; Regression of household income on husband and 
wife's education (Education). 


* p «.05. * unstandardized coefficient. 
** p « 1. > standard error... : 
© standardized coefficient. 


et p < (001. 
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E t 
(Table 6) provides strong support for Hypoth- 
esis 5 showing that media use or the cultural 
capital of a peasant household is very strongly 
associated (p « .001) with the rate of growth 
of household income. A separate analysis 
indicates that husband and wife's education 
(p < .001; standardized regression coefficient . 


' = ,31) and 1975 household income (p < 


.001; standardized regression coefficient = 
.17) are strong and significant predictors of 
media use (cultural capital). This- finding . 
supports the view that media use (cultural 
capital) is a function of educational and 


. household socioeconomic attainment prior to 
. market reform. . 


THE MARKET OPPORTUNITY THESIS 
Derived Hypotheses 


The following hypotheses examine institu- 


-. tional or contextual effects on household 


turnaround in the effect of husband and wife's: 
education (p < .01) on household income. !7 
In Table 6, equation VI is a more complete 
model of household earnings. As predicted by 
Hypothesis 4, the effect of education on 


significant and positive for husband and 
wife's education (p < .001).'® Equation VII 


17 When the husband's and wife's.educational 
attainments are disaggregated, the effect of wife's 
educational attainment is significant (p <:.001), 
whereas the husband's remains nonsignificant. But 
in the complete model used in Table 6, both 
husband's (p « .05) and wife's (p « .001) 
educational attainments are significant when disag- 
gregated. Adult female illiteracy is still very high, 
at 68 percent. Peasant women play a central role in 


household commodity production, whereas grain. 


production to satisfy subsistence needs and meet 
the state grain quota continues to be largely a 
male-dominated activity. 


'* A paradoxical outcome of market reform was. ' 


the sharp decline in school enrollment of peasant 
children. Whereas in the Maoist era, when there: 


was no apparent significant return to education, | 


'school enrollment steadily increased, after market 
reform, peasants withdrew their children from 
school in record numbers. Although my regression 
analysis indicates significant returns to education. 
after market reform, it may take awhile for people 


to recognize the changes in returns on education. . 


' Education is a long-term investment, the benefits 
of which may not accrue directly to the parents, 


+ 


especially. in the case of daughters who marry out 


l 


earning power. These hypotheses suggest that 
the transition from state socialist. redistribu- 
tion to markets results in‘changing the manner 
in which structural variables affect earning 


' power at the community level. If the. market 


: rtunity th t L t that 
household income following market reform is : OPDOIBIRID Diels is comes AREE 


market reform will strengthen the effects of 
market structures on household income. 
: During the Maoist era of collectivist 


` agriculture, rural-urban trade was curtailed 


under the state procurement plan in which 
agricultural surplus was sold directly to the 
state. Rigid constraints barring: free trade 
between peasant producers and urban markets 


‘were enforced (Lardy 1983). All forms of . 


long distance trade involving peasant produc- 
ers came to a virtual standstill (Lyons 1987). 
Villages became increasingly self-sufficient, 
resulting in the, constriction of marketing 
activity within the local marketing system. 
Following market reform, however, rural- ' 


urban trade in China expanded rapidly, -as 


peasant households once again were permitted 
to market their products directly in urban 


of the household and are not expected to support 


. the parents in old age. Thus short-term calculations 


of costs and benefits have led peasants to withdraw 
children from school to meet a perceived labor 
shortage in the household economy after market 
reform. I anticipate, however, that school enroll- 


'ment. and educational attainment for peasant 
` children will eventually increase beyond the level 


‘achieved in the Maoist era. 
A 
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Table 6. Model of Peasant Household Economy 


Media 
Human Cultural 
Capital Capital 
VI VII 
Education 106.002**** 74.377** 
(25.339° (26.291) 
AST H 
Age 82.315* 89.589* 
(40.662) (40.098) 
524 531 
Age? — .868* — .845* 
(.451) (.445) 
— 497 — .483 
Labor 260.988*** 262.086*** 
(39.861) (39.702) 
.275 275 
Children 101.851* 120.638* 
(54.033) (53.576) 
O71 .084 
Income '80 .988*** .928*** 
(.080) (.080) 
.466 .437 
Media — 82,539*** 
(19.386) 
.162 
Intercept — 2228.568*** — 2851.880 
df 482 474 
r .389 413 


Note: Regression of 1984 household income on 
husband and wife's education (Education), age of 
household head (Age), age squared (Age?), number of 
adult laborers in household (Labor), number of children 
under 14 years of age (Children), household income in 
1980 (Income '80), and index of media use (cultural 


capital) (Media). 
* p «.05. * unstandardized coefficient. 
** p « 01. > standard error. 
*** p< .001. * standardized coefficient. 


markets. Not only did rural-urban trade 
expand rapidly, but long distance trade 
resumed as well The rapid increase in 
transactive market exchange opened up vil- 
lage economies to market integration. As a 
result the encysted character of Chinese 
villages (Parish and Whyte 1978) may be 
breaking down as horizontal market ties 
restore rural-urban and inter- and intra- 
regional trade. Hypothesis 6 predicts that the 
restoration of rural-urban trade will be 
reflected in the increased significance of 
urban proximity as a determinant of peasant 
household income. 

Hypothesis 6: In a reforming socialist econ- 
omy, urban proximity has a more positive 
effect on peasant household income than in a 
state socialist redistributive economy. 


In a market economy, locational access to 
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markets is expected to have an increasing 
effect on household income for several 
reasons. First, the more distant markets are, 
the more costly the transport of products to 
the marketplace. Second, market proximity 
provides households with better access to 
market information. Lastly, market proximity 
provides advantages in network terms, in 
developing and maintaining guanxi con- 
nections with brokers and entrepreneurs 
whose activities converge on marketing 
centers. The market access variable takes 
into account the effect of access to the local 
and intermediary markets in which peasants 
participate, and is therefore a more compre- 
hensive measure of market access than 
proximity to the urban market. Though 
rural-urban trade was curtailed during the 
Maoist era, peasants continued to market their 
produce in the standard marketing towns and 
county towns. For this reason, market access 
is expected to have a significant effect on 
household income prior to market reform, but 
a stronger effect is expected after market 
reform. 

Hypothesis 7: The transition from a state 
socialist redistributive economy to a market 
economy increases the effect of market access 
on household income. 


Sustained growth in peasant agriculture 
according to Schultz (1963) depends on the 
acquisition of modern techniques of produc- 
tion by farming households. Thus an impor- 
tant determinant of economic growth is a 
community's investment in schooling. The 
presence in a village of larger numbers of 
literate and educated peasants benefits all 
households in creating a greater receptivity to 
new methods and innovations; likewise, high 
levels of illiteracy in a village are a source of 
constraint on the diffusion of modern produc- 
tion techniques among households in the 
village. Education thus has not only an 
individual-level, but also a community-level 
effect on changes in household income. 
Hypothesis 8: The transition from a state 
socialist redistributive economy increases the 
value of community investments in education. 


` In collectivized agriculture the emphasis on 
self-sufficiency and the constraints on market 
transactions led to a greater reliance on grain 
production as the primary source of income. 
The Maoist policy of self-sufficiency, state 
procurement, and restriction of rural market- 
ing brought about a sharp reduction of 
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sideline commodity production as a source of 
income for peasants, though it continued to 
constitute about 25 percent of household 
income (Fei 1979). Following market reform, 
sideline commodity production has increased 
- sharply. In the Fujian sample, income from 
: sideline production in:1984 made up 55.2 
percent of total household income. Thus after 
reform a larger proportion of household 


income derives from factors of production . 


other than the arable land assigned to the 
household. 

Hypothesis 9: The per capita land holding is 
likely to have a decreasing effect on house- 


hold income in a diversified commodity : 


economy. 


- Fei (1953) pointed to the “social erosion” 


of Chinese villages as a result of out- l 


migration to urban centers in the , prerevolu- 
. tionary period. Out-migration of young men 
and the consequent dependence on remit- 
tances sent back from abroad may cause 
underdevelopment of the village economy. 
Not only is there an outflow of the most 
talented and energetic men, but also the 
long-term effect of dependence on remit- 
tances may be a welfare mentality. Thus 
“social erosion" is likely’ to become more 
pronounced in a market economy where 
restrictions on labor migration are relaxed and 
out-migration of young laborers resumes, 
especially in villages with a tradition of high 
levels of labor migration. 

Hypothesis 10: In reforming socialist econo- 
mies, a history of high out-migration has a 
negative effect on individual household in- 
come in a village. - 


Resulis ` 


The data in Tables 7 and 8 support the institu- 
tional effect hypotheses. Table 7 provides clear 
evidence that in the period prior to decollectiv- 
ization (equation VII), urban proximity did not 
matter to households in terms of growth in their 
earning power.!9 Urban proxirnity, among the 
strongest determinants of rural income through- 
out the world (Schultz 1953), may have dropped 


"P? Skinner (1978) shows that suburban com- 
: munes located within metropolitan boundaries 
benefited from urban proximity prior to economic 
reforms and achieved economic parity. In his view 
the urban/rural divide should be redrawn as “urban 
cum peri-urban islands" in a sea that is more 
smelly tural. 
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Table 7. Effect of Urban Proximity on Household 
1: Income 1980 and 1984 





Before . After 
Market Market - 
Reform, Reform 
VIII IX 
Distance to city —.731" _ —11.089** 
: ? + (.698)h (1.704) 
j _ .032° — .229 
Income '75 1.012*** ; —. 
(.041) ' 
: .758 i . 
Income '80 ` — 1.200"** 
(.076) - 

. : .557 
Intercept 393.288*** 1758.081*** : 
df f 449 | , 504 
r .577 (o 0379 

** p « 0l. * unstandardized coefficient. 
> standard error. 


*** p < .001. 
. * standardized coefficient. 


out as a predictor of household income in col-. 
lectivist agriculture. After structural reform 
(equation TX), urban proximity gained very con- : 
siderably in its effect on the rate of growth of 
household income within a community. For ev-' 
ery kilometer further from an urban center, péas- 
ant households in a village lose about 11 yuan 
in growth in household income after structural 
reform. The results in Table 7 provide strong 
evidence to support the view (Hypothesis 8) 
that decollectivization and market reform have : 


. resulted in the renewal of the traditional rela- 


tionship between urban proximity and the level 
of household participation in rural-urban trade. 
Peasant households in communities located . 
closer to cities are better able.tó market their. : 
products in the more lucrative and -differenti- 
ated urban markets. 

Data in Table 8 show that the independent - 
effect of market access increases as markets 
become more important in economic coordina- 
tion. Growth of household earnings is signifi- 
cantly lower in villages that have a history of 


. overseas emigration. Especially striking is the- 


changing effect of village investment in educa; 
tion, from a very significant negative to an 
equally significant (p « .001) positive effect on 
growth in household earnings. This finding -is 
consistent with the results reported on the effect 
of household educational attainment on income 
before and after market reform. Lastly, the 
change in the size of the coefficient and the 
higher t-ratio (t =, — 1.27) weakly support the 
hypothesis of a decreasing effect of land hold- 
ing after market reform. 
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Table 8. Institutional Effects on Household Income 


Before After 
Market Market 
Reform Reform 
x XI 
Per capital land —-.545* —89.201 
(26.674? (70.456) 
— .001* — 045 
Distance to markets —42.727* — 192.841** 
(25.282) (63.834) 
. ~ 051 —.107 
Village education —81.342*** 303.590*** 
(26.457) (65.208) 
— .096 .170 
Oversea migration — 28.699 — 166.152* 
(32.289) (79.232) 
— .028 —.077 
Income '75 1,031*** ~ 
(040) 
.T]O 
Inzome '80 — 1.235*** 
(.075) 
577 
Intercept 364.428*** 1296.149*** 
df 458 511 
pr .587 .372 


Note: Regression of household income on per capita 
land assignment, distance to markets, village educational 
level, density of overseas Chinese connections, 1975 
household income, and 1980 household income. Distance 
to markets and village education are factor scores based 


upon confirmatory factor analysis. 
* p «.05. * unstandardized coefficient. 
** p « O01. > standard error. 
»** p « .001. * standardized coefficient. 
CONCLUSION 


I began with Polanyi's concept of redistribu- 
tion and nonmarket trade, pointing to the way 
in which Szelenyi applied these concepts in a 
substantive analysis of social inequality in 
state socialism. In state socialist redistributive 
economies, surplus is appropriated from 
producers through the nonmarket trade of 
labor and commodities and redistributed by 
administrative processes. Under the redistrib- 
utive mechanism, the allocation and distribu- 
tion of surplus benefit Szelenyi's new class 
since redistributors tend to favor their own. 
By building on and extending the concep- 
tual framework developed by Polanyi and 
' Szelenyi, I proposed a theory of market 
transition. Three theses explain the effect of 
the transition to transactive markets on the 
distribution of rewards in state socialism. The 
market power thesis argues that as markets 
replace redistributive mechanisms in the 
allocation and distribution of goods, there is a 
shift in the sources of power from the 
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redistributive sector to the marketplace. The 
market incentive thesis argues that markets 
provide more incentives than do redistributive 
economies. The market opportunity thesis 
states that the shift to market coordination 
gives rise to new opportunity structures 
centered on the marketplace, changing the 
mamner in which structural constraints affect 
socioeconomic outcome. A .fundamental 
change in the processes of socioeconomic 
attainment occurs in the transition from 
redistribution to markets, involving not only a 
reduction in the relative transfers of surplus 
from producers to redistributors but changes 
in opportunity structures and incentives result- 
ing irom market reforms. 

Tne fate of market reform depends not only 
on economic efficiency, but ultimately also on 
struggle over the distribution of power and priv- 
ilege. Efficiency and power are intertwined in 
the sense that if market reform results in more 
rapid economic growth, this is likely to make 
the struggle over rewards less contentious. High 
inflation in food prices already has led to wide- 
spread urban discontent and resentment. The 
high rate of economic growth, however, has 
allowed the state to respond by increasing ur- 
ban income and bonuses to salaried employees. 
In an expanding economy, despite the transfers 
of power this paper documents, the greater af- 
fluence produced by markets reinforces the sense 
that che benefits of economic reform are widely 
distr:buted. Though some have gained more than 
others from market reform, even urban dwell- 
ers have experienced net improvements in stan- 
dard of living. If, however, economic growth 
falte-s and declines, there will be intense urban 
pressure to constrain markets and to reimpose 
redistributive power over peasants while cur- 
tailing entrepreneurship in the second econ- 
omy. Who gains and loses in the shift to market 
coordination structures the bureaucratic politics 
of reform (cf. Shirk 1989). The politics of re- 
form cycles is shaped by the interest of redis- 
tributors to limit reform 10 partial measures and 
by the need to deepen reform to achieve effi- 
ciency and growth (cf. Stark and Nee 1989, pp. 
25-29). i 

If in the future there is a decisive shift to 
market coordination in the collective and 
state-owned industrial sectors, the market 
transition theory advanced in this paper can 
be tested on urban data as well. I suspect, 
however, that this is not likely to occur in the 
absence of corresponding political reforms of 
the Leninist party state. As Brus points out, 
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bureaucratic coordination of the state sector is 
virtually inseparable from the way power is 
exercised in state socialism, which depends 
on the “ability to direct and redirect resources 
by commands in order to achieve chosen 
political priorities” (1989, p. 261). 

Furthermore, a transition to markets in the 
urban sector would bring the issue of 
ownership’ to center stage. The quest for 
greater efficiency and competitiveness can be 
expected to give rise to pressures to privatize 
state-owned enterprises, as it has in capitalist 
economies (Ikenberry unpublished). If urban 
reform achieves a breakthrough, high-ranking 
state officials can be expected to use their 
power to attempt to procure a controlling 
share of the equity of industrial enterprises 
through favorable access to credit from 
banks. The prospect of equity ownership by 
high officials opens the distinct possibility of 
high-level redistributors making successful 
accommodations to a market economy. De- 
spite changes in the sources of power. and 
privilege, this may result in a greater initial 
continuity in the stratification order than 
observed in the rural sector. Although any 
trajectory to a market economy will retain a 
strong bureaucratic component, the market 
transition theory predicts that such advantages 
of political power and connections will 
decline following the immediate transition 
period. Selection processes in a market 
economy are not likely to discriminate 
between enterprises managed by former 
redistributors and by nonbureaucratic entrepre- 
neurs. Moreover, as economic action by- 
passes hierarchies to center on transactions 
between private buyers and sellers, the power 
of redistributors becomes less salient. 

Kornai (1986) specifies the consequences 
of partial economic reform in the state sector 
for economic efficiency. In his view, partial 
reform brings out the worst aspects of central 
planning. and markets, thereby exacerbating 
the problems of a shortage economy (Kornai 

- 1989). Neither Kornai nor this paper ad- 
dresses the distributive consequences of 
partial reform. I instead focus only on the 
sector where a transformative shift from 
hierarchies to markets has taken place. While 
redistributors in the marketized sector have 
lost power, in sectors and regions where 
reform has been partial or incomplete, 
positional power in the state bureaucracy 
provides continuing and often lucrative pri- 
vate advantages. Redistributors profit from 
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partial reform to the extent they double dip by 
leveraging power in the redistributive econ- 
omy to gain unfair advantage in the market- 
place. The rapid growth of official corruption 
(quandao) reported in the Chinese press 
reflects the efforts of redistributors to benefit 
directly from market opportunities. í 

The market transition theory has shown that 
the shift from hierarchies to markets in a so- 
cialist economy involve changes in the deter- 
minants of socioeconomic attainment and there- 
fore the sources of power and privilege. This 
knowledge provides new directions of research 
on state socialism: examining the distributive 
consequences of partial reform when redistrib- 
utors double dip in redistributive and market 
opportunities, specifying the dynamics of so- 
cial inequalities created and structured by the 
marketplace; analyzing the effects of market 
reforms on regional and sectoral growth and 
distribution; modeling entrepreneurship and la- 
bor market processes that transfer surplus labor 
into the second economy; analyzing the role of 
the state in establishing the institutional frame- 
work of a mixed economy; and studying how 
changes in the balance of power in reforming 
socialist societies shape class conflict and the 
politics of markets. 
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Both new-left and neo-conservative iheorists argue that poor state performance - 
, can lead to a legitimation crisis, but they neglect distinctions among different forms 
` of political support and objective conditions that citizens evaluate. I propose a 
theoretical model that distinguishes between (a) confidence in institutions and 
legitimation of democracy and (b) state performance and the structure of 
opposition (party systems, governing coalitions). I test the model with data located. 
in an extensive new search of survey archives in six Western countries. The results 
show that poor state performance leads to a decline of confidence, but not a 
softening of support for democracy. Problems in the structure of opposition lead to 
a decline of both confidence and support for democracy. In the conclusion, I 
speculate whether state performance might affect democratic legitimacy in a less 


straightforward fashion. 


Among the voluminous writing on political 
support in Western democracies since World 
War II, the most dramatic were the warnings 
of actual or imminent “legitimation crisis,” 
“crisis of confidence,” “government over- 
load," or breakdown of “governability.” As 
the protest wave of the 1960s crested and the 
economic difficulties of the mid-1970s set in, 
the sharpest cries were raised symmetrically 
on the new left and among neo-conservatives. 
Many empirical indicators also showed an 
alarming decline in citizen support for 
political institutions. But in the quieter 
atmosphere of the 1980s, as empirical evi- 
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dence showed a turnaround on some indica- 
tors, warnings of crisis became fewer. 

In the méantime, critics argued that new- 
left and neo-conservative crisis theories im- 
properly equated declining confidence in 
institutions with a delegitimation of democ- 
racy. And others argued that crisis theories 
focused too narrowly on state “performance” 
as a source of political support and neglected 
the structure of opposition (party systems, ' 


'coalition forms). But most observers agreed 


that the available opinion survey data do not 
permit an adequate analysis of trends and that 
other types of data cannot completely fill this 
gap (see Kaase 1988, for the most comprehen- 
sive survey of data sources). 

The aim of the present study is to develop 
anc test a unified model of the structure, 
sources, and trends of political support in 
Western countries since World War II. I 
corsolidate critiques of crisis theories by 
incorporating in the model distinctions . be- 
tween (a) confidence: in institutions and 
legitimation of democracy and (b) state - 
performance and the structure of opposition. I 
test the model with survey data (much of it - 
previously unknown or neglected) located in _ 
an extensive new search of the archives. 


THEORIES OF POLITICAL SUPPORT 


Most of the crisis theories of the mid-1970s 
build on Schumpeter's ([1950] 1975) classical 
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proposition that political support in a democ- . - 
: picture, but place the blame not so much on’ 


racy depends on the delivery of services by 
the state.!.Schumpeter characterized democ- 
racy as the electoral competition of party 
organizations that offer themselves as rulers 
to the ruled on the basis of electoral promises. 
Governing elites must promise only what they 
can deliver, and they must succeed in deliv- 
. ering that. Too high promises (especially) or 
failure to deliver result in problems in 
political support.. The image is one of a 
balance between citizen demands and state 
capacity to satisfy the demands. A crisis 
consists of an overload on the state, and it can 
only be resolved’ by creating greater state 
capacity or persuading citizens to reduce their 
demands. Since, Schumpeter argues, creating 
greater state capacity tends to undermine civil 
liberties and to encourage a:spiral of greater 
demands, his preferred solution is to hold 
down both demands and state capacity.: 

The image of political support as a balance 
between citizen demands and state capacity 
was generally accepted by crisis theorists on 
‘both the left and'the right. Much of the debate 
centered around the extent to which a 
traditional sector (church, family, commu- 
nity, etc.) has been displaced by a state sector 
in providing services. 

Theorists’ on the left, argue that "late" 


capitalism generates increasing social and 


economic dislocations which must be reme- 
died by the state (e.g:, O'Connor 1973; Offe 
1974, 1975; Habermas 1975; Woife 1977). 


The state must shore up a lagging private. 


economy with investments in infrastructure; 
. and a growing welfare .state must plug the 
holes left by a private economy by providing 
social services it formerly provided. Those 
groups best able to protect their own interests 
(e.g., big business, big labor) do so at the 
expense of the weak,. but all groups clamor 
for more state services. Since the state cannot 
keep up .with. the inflationary spiral of 
demands, it loses legitimacy, especially in the 
eyes of the weakest who suffer most (Thurow 
1980 makes a related argument).? 


! [mportant earlier variants of this proposition 
include Weber (1968) and the elite theorists; later 
variants include Lipset ([1960] 1981). I will 
restrict my discussion to Schumpeter for reasons of 


space. - 
? Linz (19885) inverted this gimeni by 


proposing that, with the growth of international ` 


capitalism, citizens no longer attribute economic 
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Theorists on the right S a similar | 


capitalism as on the left itself for generating ' 
demands on the state which, they claim, 


‘should more properly be left to the private 


realm (e.g., Huntington 1975, 1981; Crozier | 
1975; Hennis, Kielmansegg, and Matz 1977; 
some not on the right make the same point: 
Rose and Peters 1979; Bell 1976; Janowitz 
1978). This lack of traditional restraint and, 
especially, the accompanying protest creates 
conflicts which cannot be regulated by the ` 
state and demands which the state cannot 
satisfy. And the “revolution of rising entitle- ` 
ments” - (Bell), in a ratchetlike fashion, 


‘prevents demands against the state from being 


cut back or displaced elsewhere. 
Still other writers responded to the left and 


‘right primarily by denying that things have 


gotten so bad (e.g., Lane 1979; Dahrendorf 
1980). They argue that traditional orientations 
have not eroded, and expectations regarding 
State services have not escalated, to the extent 
claimed. There is a much gréater resérvoir of 
support (and. traditional orientations) than the 
left and right believe. But even these centrist 
formulations tend to accept the image of the 
balancé formulated by Schumpeter.and perpet- 
uated by the.left and right: of a certain 
quantity of demands against i a certain quantity . 
of state capacity, with crises arising from an 


. overloaded scale. 


Two problems with crisis theory have been 
pointed out in the literature. First, a greater ` 
distinction must be made among various 
levels of political support. In particular, 


' dissatisfaction with political institutions -or 


incumbents cannot automatically be. taken as 
a rejection of existing democracy. Second, a 


'demand overload is not the only source of 
: political dissatisfaction, and possibly not even 
the most important one. Citizens may also be 


dissatisfied by an unresponsive structure of 
opposition— — for instance, a party system that 
generates extremism, or a governing coalition 
that does not reflect voters’ preferences. 
Indeed, 
structure can also block the delivery of state 
Services. Citizens' dissatisfaction may result: 
not from an overloaded state but from an 
impasse or gridlock among actors. 

.Easton's Von 1975; also Parsons 1967) ' 


crises to state incapacity. Thus, the poliea realm 
is more shielded from a legitimation crisis today 
than in the classical age of national capitalism. 


a poorly ,functioning opposition. - 
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distinction between diffuse system support 
and specific evaluation of state outputs or 
performance is probably the most influential 
typology of political support in Western 
democracies.? A large empirical literature 
developed that revolves around this distinc- 
tion. Miller (1974) analyzed the decline in the 
University of Michigan’s “trust in govern- 
ment” survey indicators from the mid-1960s 
to the mid-1970s and argued that the 
legitimacy of the American political system 
was eroding. Citrin (1974) expressed skepti- 
cism on the basis of Easton’s distinction, and 
claimed that the indicators were not adequate 
for measuring diffuse system support. Re- 
search on 1960s-1970s protest supported 
Citrin’s argument by showing that many of 
those with least trust in state institutions 
expressed strongest support for democratic 
norms (e.g., Kaase 1971; Barnes and Kaase, 
et al. 1979). The refinement of indicators 
and, above ali, the rebounding of trust in 
government in the 1980s reinforced the 
plausibility of the distinction (see Abramson 
1983, and Dalton 1988, for literature reviews; 
also the later views of Miller 1983, and Citrin 
and Green 1986). By the time Lipset and 
Schneider ([1983] 1987) published their 
exhaustive study of the "confidence gap" 
which caps the literature, they could take the 
distinction between diffuse system support 
and confidence in specific institutions as 
essentially proven. The theoretical implica- 
tion for a legitimation crisis was clear: low 
confidence in institutions does not necessarily 
entail rejection of existing democracy (see 
Lehman 1987). 

Critics also argued that new-left and 
neo-conservative crisis theories focused on a 
state performance deficit (overload) as a 
source of popular disapproval, but neglected a 
poorly functioning structure of political oppo- 
sition (impasse). The aspects of opposition 
structure that have most impact on political 
support are party-system polarization and 
fractionalization, the configuration of govern- 
ment and opposition, and cabinet stability.+ 


? Most other typologies in the literature tap the 
same dimension. For instance, Held (1987, pp. 
181-82, 238) identifies seven levels of acceptance 
of political institutions, on a continuum from 
coercion to a belief that the institutions satisfy 
one's normative ideals. 

* The literature on opposition structures is 
enormous and disparate, and for the sake of 
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Writers argue that the party system should 
present voters with clear alternatives, each 
capable of rotating into office, and none 
likely to destroy democracy. And governing 
coalitions should be stable and reflect voters’ 
choices as directly as possible. These propo- 


. Sitions follow from basic notions of represen- 


tative democracy. Party systems and coalition 
forms comprise the subconstitutional under- 
pinnings of democracy—the informal "rules 
of the game" —and their functioning affects 
not only confidence in the government but 
also the legitimation of the regime. 
Extremely polarized or fractionalized party- 
systems are said to be “poorly-functioning” 
structures of opposition. Sartori’s (1966, 
1976; also di Palma 1977; Powell 1987) 
model of “polarized pluralism” is the most 
influential. In these systems, a large (but not 
majority) party governs more or less perma- 
nently in unstable coalitions with various 
other parties. And at least one extremist or 
antisystem party is in quasi-permanent oppo- 
sition. Extremist parties are sufficiently unac- 
ceptable to others that they cannot form 
alternative coalitions, but they are strong 
enough to block alternative coalitions that do 
not include themselves. Sartori argues that 
this leads to stagnation and corruption at the 
center, frustration and radicalization at the 
periphery, and instability of governing coali- 
tions. He cites Weimar Germany, Fourth- 
Republic France, and contemporary Italy as 
examples.5 Other writers argue that party- 
system fractionalization also threatens politi- 
cal support (e.g., Rae 1971; Powell 1982; 
Lane and Ersson 1986). It presents voters 
with unclear choices, governing coalitions 
tend to be unstable, and voters’ preferences 
are seldom well represented. However, Sani 
and Sartori (1983) argue that once polariza- 


brevity, I have been selective. For instance, I do 
not discuss neo-corporatism here, although it might 
be relevant theoretically. Schmitter (1981) claims 
that neo-corporatism permits the state to handle 
greater loads, and Scholten (1987) argues that 
many instances of consociationalism (discussed 
below) are really epiphenomena of neo- 
corporatism (see also Schmitter and Lehmbruch 
1979; Lehmbruch and Schmitter 1982). However, 
I was unable to find indicators in the literature that 
I could use in the empirical analyses below. 

* The depolarization of the Italian party system 
in the 1970s and 1980s led some to question 
Sartori’s theory (e.g., Daalder 1983). 


fi 


SOURCES AND STRUCTURE OF. LEGITIMATION pT 


tion is controlled for, fractionalization has 
little independent effect. 

Oversized governing ‘coalitions are aibo 
said to reflect “poorly functioning" structures 
of opposition. they tend to generate antisys- 
tem extremism and- polarization because few 
major prosystem parties are in opposition, 
available to rotate into office or to: criticize 

the government. This undermines political 
support. Minority cabinets create similar 


` problems because they ‘involve informal. 


cooperation of the major opposition parties, 
but with only one side holding cabinet posts. 
Governments usually survive with "floating" 

parliamentary majorities'or Italian-style tras- 
formismo. Lijphart (1977, 1984; also McRae 
1974; Nordlinger 1972) points out an impor- 


tant exception to this pattern, where oversized ` 


coalitions are meant to promote political 
support. In “consociational democracies,” 
. plural societies with high levels of intercom- 
munal’ conflict, parties are closely tied: to 
ascriptive communities and experience little 
change in voting strength. Permanent opposi- 
tion status in such :cases could lead to 
radicalization and heightened intergroup con- 
flict. Grand coalition, governments of. all 
major -parties (combined with elements of 
federalism and proportional allocation of state 
services) attempt to reduce extreme underly- 
ing intercommunal conflict by building trust 


among opposing elites. Successful historical 


examples include the Netherlands, Belgium, 
and Austria; the most spectacular failure. was 
the Lebanese republic. 


Finally, cabinet instability is also said to ` 


undermine political support: constant turnover 
interferes with the. representative function of 


elections and encourages antisystem: senti- . 
: a unified model, along with several additional 


ment. Yet most investigators do not measure 
political support directly (see the reviews in 
Strom 1988; Browne, Frendreis, and Gleiber 
1988). Instead, cabinet stability ‘itself is 
treated as an indicator of. democracy (e.g., 
Powell 1982; Zimmermann 1987, 1988). 


Exceptions to this rule. include: Schmitt (1983) ` 


and Harmel and Robertson (1986), who find 
that cabinet stability has a positive effect-on 
legitimation, as measured - Pindependenity wiih 
survey data. 

“Thus, 
political support as much or more than state 
performance. Yet for a long time, scholars 
did not argue that the overload model was 
wrong, only that the model of impasse works 
: in parallel. Since the mid-1970s, .interpreta- 


opposition structure may affect l 
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tionis have shifted. Researchers noted that fhe 
new ‘wave of transitions to democracy in, 
southern Europe, „Latin -America, and else-. 
wheré, occurred in the midst of a performance 
crisis (e.g., O'Donnell, Schmitter, and White- 
head 1986; Diamond, Linz, and Lipset, 

forthcoming). Others applied these observa- 
tions to a reanalysis of. the causes of 
democratic collapse or survival in the Great 
Depression and other times (e.g., Linz 1978; 


‘Linz’ and Stepan 1978; Zimmermann and 


Saalfeld 1988). Scholars now began to insist . 


: that the structure of opposition is the key: 


variable affecting political support, and that 
‘State ` performance is virtually irrelevant. 
Democracies can survive even severe perfor- 


: mance crises if their structures of opposition 


do not block attempts to address the prob- 
lems. But if there is an impasse, problems in 
political support will escalate in a deadly 
ae until the regime itself is threatened. 


A CONSOLIDATED MODEL OF 
POLITICAL SUPPORT ` 


One can summarize these criticisms of 
new-left/neo-conservative crisis theories as a 
series of testable propositions. Poor state 
performance may lead to a crisis of confi- 
dence, but it is unlikely to lead tova 
legitimation crisis (a rejection of democracy). 
Citizens' evaluation of democracy is likely to 
decline. only if democracy itself is perceived 
not to function as it should. When the 
structure of opposition is unresponsive, de- 
mocracy is perceived to function poorly. 
Thus, a poorly functioning structure of , 
opposition can cause legitimation problems. ` 

Figure 1, consolidates these propositions in: 


propositions. The. model incorporates the 
distinctions among types and sources of 
political support developed above. ; 

The Types of Political Support (subjective 
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evaluations) are adapted from Easton’s distinc- 
tions. (1) “Political Confidence” is a positive 
evaluation of representative institutions: citi- 
zens trust them to represent their interests 
properly. (2) “Legitimation,” or Support for 
Democracy, is a positive evaluation of the way 
the political realm is structured and political 
actors interact, even if one is not happy about 
outcomes. That is, citizens feel that the rules 
of the game and the conduct of the players are 
fair, even if their side does not win. 

Different Types of Objective Conditions 
that affect political support can also be 
distinguished. (1) “Performance” is the 
state's output of services to the population 
that assure economic benefits and maintain 
civil order.5 (2) The “Structure of Opposi- 
tion" concerns the functioning of the political 
realm itself. I focus here on subconstitutional 
factors like the party system, coalition-forms, 
and cabinet stability; but other factors could 
also be included. 

In Figure 1, I classify these elements into 
deeper and more superficial levels. The more 
superficial elements (performance and politi- 
cal confidence) concern particular outputs of 
particular state institutions. The more funda- 
mental elements concern the functioning of 
the whole system: the rules of the game and 
the interactions among the players. 

In Figure 1, I also treat political support as 
a dependent variable, and the objective 
conditions as an independent variable. Other 
types of causality are possible, and I speculate 
about several in the conclusion. The proposi- 
tions summarized above are included, along 
with several others, as causal paths or 
hypotheses, labeled a-d, x and y. Heavy path 
lines indicate that I predict the hypothesis will 
be supported, light path lines that they will 
not be supported.’ 


* Path "a" corresponds to Lipset and Schnei- 
der's theory of a "confidence gap." It says 
that poor state performance will lead to a 
decline of confidence in state institutions 
(or incumbents). It also accepts Schumpet- 
er's image of an overload. I predict that this 
will be supported. 


Economic and political outputs can be distin- 
guished, but I do not examine relations between 
them for reasons of space. 

7 The predictions follow from the theoretical 
discussion above, but in several cases, I cannot 
give full supporting arguments due to lack of 


space 
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* Path "b" is the new-left/neo-conservative 
theory of legitimation crisis, also based on 
the image of a demand overload. It says that 
if the state does not deliver the expected 
services, the population will turn against the 
existing democratic regime. I predict that it 
will not be supported. 

Path "y", together with path "a", is an 
indirect form of the new-left/neo-conser- 
vative legitimation-crisis theory (path “b”). 
It says that low confidence in institutions 
will escalate into a rejection of democracy. 
I also predict that this will not be supported. 
The lack of a causal connection here 
underscores the distinction between deeper 
and more superficial levels of political 
support. It means that the regime has 
"reservoirs" of legitimacy that are not 
easily drained by poor performance. Whether 
a reservoir of legitimacy could be drained in 
the “long run” by persistently poor perfor- 
mance—as with all long runs—is virtually 
imponderable in theory and quite difficult to 
test empirically .® 

Path “c” says that a poorly functioning 
structure of opposition will drive down 
confidence in state institutions. The litera- 
tu-e is relatively silent on this point. There 
are two possibilities. Deeper-level problems 
might lead to dissatisfaction at both the 
deep and superficial levels—or alterna- 
tively, this might not happen because 
causality would have to move across 
“levels.” I lean toward the first proposition, 
but the second also seems plausible. 

Path "d" is a theory of legitimation crisis 
based on the image of a structural impasse. 
It says that if the opposition structure is not 
responsive—especially if the party system 
or coalition structure does not provide 
prosystem alternatives — support for democ- 
racy will decline. The population will 
abandon a game whose rules do not work. I 
predict that this will be supported. 

* Path “x”, together with path "d", is 


x; 


STf Reagan promoted “trust in government" 
only by means of deficit spending, one could argue 
that a future crisis would drive down confidence 
and—in the “long run” —support for democracy as 
well. This implies that citizens will think another 
regime-form could do better. Yet at this writing, 
populations of many debt-ridden Latin American 
democracies do not favor a reintroduction of 
authoritarianism, largely because earlier authoritar- 
ian regimes performed no better. 
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another indirect version of the new- 
left/neo-conservative legitimation-crisis the- 
ory (path “b”). It says that a good 
opposition structure will not hold up under 
poor state performance (e.g., if protest 
voting leads to polarization), and that as the 
structure of opposition collapses, so will 
support for democracy. But recent work on 
the Great Depression suggests that precisely 
this did not happen (Zimmermann and 
Saalfeld 1988): I predict that it will not be 
supported. On the other hand, a bad 
opposition structure might also block state 
outputs (path “x” in reverse), and blocked 
outputs might lead to reduced confidence 
(path “a”). This is an indirect form of path 
“c” and might be more plausible. 


The model in Figure 1 has several clear 
restrictions: it does not exhaust all possible 
types of objective conditions or political 
support; it makes no predictions below the 
aggregate level; it excludes interaction terms; 
and it examines only one causal order without 
feedback loops. But it is parsimonious and 
clear. I mention other factors that I have 
proposed elsewhere, like the age of the 
regime or a “honeymoon” period for new 
democracies (Weil 1985, unpublished), only 
when they seem unavoidable in explaining the 
results. Otherwise, I limit the analysis to this 
model. If the model holds up, it will be 
possible to add complexity in future research. 
In the conclusion, I discuss one variation 
created by relaxing some of these restrictions. 
Other variations are possible. 


INDICATORS OF POLITICAL SUPPORT, 
PERFORMANCE, AND OPPOSITION 
STRUCTURE 


Data were collected for six countries: the United 
States, Britain, France, West Germany, Italy, 
and Spain.? Economic and political perfor- 
mance and the structure of opposition are mea- 
sured at the aggregate, country level. I tried, 
with modifications, to employ indicators widely 
used in the literature. The indicators of polit- 
ical support are derived from opinion surveys, 
and are also aggregated to the country level. 
These data go considerably beyond those pre- 
viously collected. !9 


? Data were also collected for Austria, but the 
time series were not sufficiently long or densely 
populated to be included in the analyses. 

? Data for the dependent variables are reported 
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Three indicators of economic performance 
were calculated. (a) Economic hardship is 
measured by a "misery index," the sum of 
the average annual rate of unemployment plus 
the annual rate of inflation. (b) Opinions 
about the national economy are measured 
with survey data. Some variation occurs in the 
question wording (see discussion below), but 
questions always refer to the national econ- 
omy and not the respondent's personal or fa- 
milial well-being. (c) The rate of economic 
growth is a three-year moving average of an- 
nual real change in gross domestic product, 
measured with constant currency (i.e., con- 
trolling for inflation) (OECD 1988, Table 3.1; 
OECD 1983; Flora et al. 1987, pp. 370—400). 

Political performance is measured by an 
index of civil disorder, a factor score 
computed from annual rates of demonstra- 
tions, riots, political deaths, and governmen- 
tal sanctions (Taylor and Jodice 1983). These 
four elements are highly intercorrelated. The 
first factor captures 64 percent of the variance 
(eigenvalue = 2.6), and the mean loading for 
the variables is .80. The factor analysis was 
conducted over all six countries in order to 
preserve cross-national variation.!! Civil dis- 
order indexes are often used in the literature 
as indicators of the extent of democracy (as 
dependent variables; e.g., Hibbs 1973; Pow- 
ell 1982). I use the index mainly as an 
indicator of state performance (an indepen- 
dent variable), but for comparability to other 
studies, I will also examine it briefly as a 
dependent variable. 

Two features of party systems are em- 
ployed,: system fractionalization and system 
polarization. Fractionalization is Rae's (1971) 
index, 


Fractionalization = 1 — Xj. pf, ` 


where p is the percent vote for party i, with n 
parties. Although its properties are debated in 
the literature, the index is the most widely 


in the analysis section below; data for the 
independent variables are available from the author 
on request. 

H Hibbs (1973) and Powell (1982) use similar 
indicators. They deal with extreme outliers either 
by truncating the data back to the ninetieth 
percentile (Hibbs) or by calculating a log transfor- 
mation (Powell). Much the same effect is achieved 
here by using factor scores, which are measured in 
siandard deviations. 
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used indicator (Pedersen 1980). In this study, 
fractionalization values are carried over from 
each election year, through off-years, to the 
following election year. 

‘Scholars. disagree more on how best to 
measure polarization of a party system. But 
Lane and Ersson (1987, p. 159) show that 
most measures are strongly correlated, and 
recent studies have converged on a weighted 
standard deviation as a suitable indicator. I 
use a slight modification of Powell’s (1987, 
p. 176) index, 


Polarization = Xa PAX: = zÊ 


where: 


x; = the mean left-right self-placement of 
supporters of party i, 


=. the mean left-right self-placement of . 


all party supporters, 
= the percent vote for party i, 
= the number of parties. 


, ud self-placement of party supporters 
and all voters is taken from published 
analyses of opinion surveys conducted from 
the mid-1970s to the early 1980s (Inglehart 
' and Klingemann 1976; Sani and Sartori 1983; 
and Maravall and Santamaria 1985). Since 


these party self-placements probably did not. 


remain absolutely constant over time, the 
following assumptions were incorporated (the 


surveys were used to anchor self-placement at- 


the date of data collection). (a) Parties 
converged somewhat in the post-war period, 
1945-1960, as right parties came to accept 
increased welfare-state measures and nation- 
alization, and left parties softened demands 
' for more radical change. (b) In the following 
period, 1960—1975, left parties moved some- 
what left to capture the emerging new-left 


constituencies. And (c) since the oil crisis, . 


1975 to the present, all parties have generally 
moved moderately to the right, led by 
conservatives like Reagan and Thatcher.!? 


. ?? The following table summarizes these assump- 
tions: . 
Ideological Movements (in standard deviations) . 





1960- 1975- 
; 1960 1975 1988 
" Right parties .5 s.d. 5 sd. 
: Left 0 Right 
-Genter parties ' 0 0 0 
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An index of government coalitions was 
calculated on the premise that no country in 
the sample is a consociational democracy (but 
see Gunther, Sani, and Shabad 1986, regard- 
ing consociational elements in Spain's transi- 
tion to democracy). Oversized or minority 
coalitions are predicted to be unconducive to 
political support. A government-opposition 
pattern is coded 1, whether the goyernment is 
a single majority party or a minimum-winning 
coalition of parties. Oversized and minority 
cabinets, where the major opposition parties 
govern together (formally or informally) are 
coded 0.!3 To apply a concept of coalition 
government to the Presidential systems of the 
United States and France, the index was 
adapted as follows. When the President's 
party is the same as the Congressional/ 
Parliamentary majority, government is consid- ` 
ered majoritarian and is coded 1. When the 
President's party differs from the Congression- | 
al/Parliamentary majority (one or both cham- 
bers in Ámerica), government is considered 


oversized and is coded 0. In both parliamen- 


tary and presidential systems, the coalition 


Ideological Movements (in standard deviations) 


1945— 1960- 1975- 
1960 1975 1988 
Left parties 5 s.d. „5 s.d. .5 s.d. 
Right Left Right 
Exceptions (see explanation below): 
West German 
FDP 1 s.d. 5 s.d. .5 s.d. 
Left Left Right 
Spanish AP | .5sd. 
-— -— Left 


The exceptions are made for the following reasons. 
The West German FDP changed coalition partners 
several times, and the literature suggests correspond- 
ing voter shifts. Empirical survey data shows 
substantial ideological change for the Spanish AP. 
No data were available for several parties. Thus, 
the West German Communists and Greens were 
coded the same as the French Communists. The 
West German far right was coded the same as the 
Italian MSI, but one standard deviation further ` 
Left No further attempt was made to fine-tune 
these party movements, but the results are similar 
to those with an index that assumes no ideological 
change. 

13 The present index is a dummy variable. 
Coding oversized, minority, and minimum- 
winning coalitions ordinaly produces virtually 
identical results. 
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index is calculated as a moving average of 
coalition forms over the previous 10 years. 
Data on governing coalitions are drawn from 
Flora et al. (1983, pp. 153-90), von Beyme 
(1984, pp. 484—509), and recent issues of the 
Economist). 

Finally, an index of government stability 
was calculated as a moving average of cabinet 
duration (in months) over the previous 10 
years. The data come from the same sources 
as those for coalitions. 

Political support (the dependent variables) 
and subjective opinion on the economy (an 
independent variable) are measured with 
opinion survey data, which come from a 
larger compilation covering various aspects of 
political culture. Data collection proceeded in 
several steps intended to assure maximal 
comprehensiveness. First, I searched major 
published sources of survey results for 
relevant items: publications of the survey 
organizations themselves (e.g., the regular 
publications of the International Gallup Poll), 
codebooks from survey archives (mainly the 
ICPSR), compilations of survey results, 
opinion survey journals, and previous studies. 
Second, I commissioned several archives to 
search for relevant items; and I visited about 
20 commercial and scholarly survey institutes 
and archives in six countries between March 
and June 1985 (listed in Appendix A). 
Roughly 30,000 to 50,000 pages of survey 
results (marginals and breakdowns) were 
photocopied, covering the period from 1945 
to the present (earlier in some cases). The 
most relevant data were then selected, 
translated into English if necessary, coded 
and assigned keywords, and entered into a 
computer data base. The resulting political 
culture data base contains about 15,000 data 
points (survey questions asked at given times 
and in given countries) and is the most 
comprehensive one known to the author (see 
also Shapiro and Page 1988). 

Survey questions were sought to measure 
the two theoretical dimensions of political 
support, confidence and support for democ- 
racy. Data were abundant, but they were 
sufficiently irregular that conventional proce- 
dures of index construction had to be 
modified. Few proven scales or indexes could 
be located as time series in different coun- 
tries. Since the data were mostly available on 
an aggregate basis, item-to-item validity 
checks generally could not be conducted at 
the individual level. And standard dimen- 
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sional analysis at the aggregate level had to be 
modified (see below) because many questions 
were asked at irregularly-spaced intervals, 
and the time series often did not overlap. 
But if the data were too irregular for 
conventional methods of index construction, 
graphical evidence strongly suggested that 
summary scales would be appropriate in 
many cases. Items that seemed similar at face 
value, or that approximated scale components 
tested elsewhere, tended to move in parallel. 
The Italian Political Trust Scale in Figure 2 
provides perhaps the best example. When 
visual inspection of the graphed data seemed 
to warrant the creation of a summary index, 
scales were created in one of two principle 
ways. If the index items were asked in the 
same years and were of similar magnitudes, 
the index was simply a mean score of the 
component time series. If the data were 
non-overlapping or of widely-varying magni- 
tudes, some time-points were interpolated, 
and indexes were computed by adding the 
mean year-to-year change among items to the 
previous year's index' value. Indexes were: 
begun at a level that approximated the 
beginning levels of their component items. 
Interpolations in time series were made only 
when the gaps were two years or less, and 
interpolations were avoided over periods of 
extreme stress like the oil crisis of 1974. 
Finally, standard validation procedures were 
used to test the composition of the indexes. 
The scales were correlated and visually 
checked against their component parts; and 
their robustness was tested by correlating and 
visually checking them against indexes calcu- 
lated by different methods. For example, 
alternative indexes were constructed by inter- 
polating all missing data and computing 
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factor scores.!^ These methods of' scale 
construction would not be ‘appropriate for 
-very “clean” data, but they seemed -adapted 
to irregular data when graphs suggest that 
. Summary indicators are appropriate. Appen- 
dix B contains the texts of the scale 
components and' other items. 

Political confidence is measured by “Polit- 
ical Trust Indexes" in the U:S., Italy, Spain, 
and West Germany, and by additional individ- 
ual questions repeated in various countries. 
The Trust Indexes are designed to approxi- 
mate closely the Michigan Election Studies’ 
“Trust in Government” 
Abramson 1983). The indexes include items 
on trust in the government or parliament, on 
the honesty and competence of people in 
government, on the waste of tax money, and 
on satisfaction with the overall performance 


of the state. Additional indicators of political 


confidence are questions whether one cares 
who wins elections (U.S.), 
budget is fair (Britain), whether there are 
important differences between the parties or 
whether they are all the same (Britain), 
whether the Parliament is doing a good job 
‘as representative of the people (Germany), 
and whether Members of Parliament (MPs) 
really represent the interests of the people as 
against special interests or party interests 
(Germany).!5 Support for democracy is mea- 
sured first by a series of identical questions 
taken from the Eurobarometer surveys about 
how well democracy is functioning in the 
respondent's country. Additional indicators 
include indexes in West Germany of support 
for democracy and opposition to Nazism, and 
questions in Spain on whether one person 
should rule (as against elected representa- 
tives) and whether political parties should be 
legally permitted. 

: Finally, the nature of the scales i imposes an 
. important limitation on the sort of analyses 
that can be performed here. No absolute 


levels of political support are reported or . 


- V The factor-based indexes gave stronger results 
than did the year-to-year change indexes. I did not 
use the factor-based indexes because I was 
,uncomfortable with so much interpolation and 
because their trend curve did not visually approxi- 
mate those of the component items as well. 
!5 The additional indicators are less validated in 
the literature-as measures of confidence. They may 
` also tap elements of anomie or efficacy, but they 
differ from validated indexes of the latter. 


scales (see, e.g., . 


whether the - 


the answer seem to be “yes and no” 
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analyzed, but only relative levels. This is 
because scales are sometimes set to arbitrary 
“starting points,”- as described above, and 
because scale compositions vary from country 
to country. For this reason, this study should 
not be considered to be fully historical and 


‘comparative, but rather a study of parallel 


time series in- different countries. Cross- 
national comparisons aze limited to the timing 
of peaks and valleys and the influence of 
several exogenous variables. 


ANALYSES 
Is/Was There a Crisis? 


The decline and subsequent rise of confidence 
in government from the 1950s to the 1980s is 
well documented for the United States (Lipset 
and Schneider [1983] 1987; Citrin and Green 
1986). But was there a crisis of confidence in 
other countries, as well? And was a decline in 
confidence accompanied by a softening of 
support for democracy? 

A glance at the time series of political 
support indicators in Table 1 suggests several 
straightforward answers (also see Figures 
3-T;. Most political confidence indicators dip 
sometime in the late 1960s and 1970s in most 
countries. Yet most indicators of support for 
democracy rise or show no clear trend. Thus, 
: there 
was a decline of confidence, but it has risen 
again. And the decline of confidence did not 
lead to a rejection of democracy. 

The data are fairly consistent on these. 
points. The American Political Trust Index 
declined steadily from the mid-1960s to 1980 
and then rose again (Table 1, Column A). 
The trend in the Spanish Political Trust Index 
(C) was almost exactly parallel, and its course 
in Italy (B) was similar. Political Trust in 
Germany (D) did not decline much, but an 
otherwise steady rise flattened out in this 
period. Most of the single-question indicators 
of political confidence followed roughly 
similar courses: caring who wins elections in 
the U.S. (E), believing that the parties are not 
all the same in Britain (G), and saying that 
politicians act in the interest of the country in 
Germany (I). Two additional questions did 
not follow the same pattern. Belief that the : 
budget is fair in Britain (F) showed.no clear 
trend, and approval of Parliament in Germany 
(H) rose. 

In contrast, support for democracy tended 
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Table 1. Indicators of Political Support: Percentages or Index Values 


Indicators cf Political Support* 
YEAR A B C D E F G HE I J K L M N O P Q 
1946 .30 
1947 22 
1948 22 
1950 21 35 
1951 21 "1 35 25 43 
1952 28 67 59 33 49 24, 
1953 32 46 39 48 26. 
1954 34 49 39 55.31 
1955 36 50 51 4 59 30 
1956 31 63 43 46 38 58 32 
1957 33 42 74 55 36 6 32 
1958 63 29 53 62 4 37 4 60 33 
1959 45 - 56 58 56 60 -34 
1960 40 4l 65 4 52 62 : 36 
1961 46 33 54 55 59 37 
1962 48 46 i 57 37 
1963 35 39 59 50 46 62 36 
1964 65 16 38 66 4 62 51 66 36 
1965 14 “51 59 52 6 37 
1966 59 22 67 65 60 i 39" 35 
1967 23 44 56 50 58 42. 
1968 ^56 29 36 65 d3 51 9 68 42 
1969 59 43 48 
1970 48 60 66 40 53 54 
1971 61 21 
1972 4 1 60 64 47 43 
i973 29 55 36 44 41 60 27 74 .53 37 
1974 37 3 |^ $87 S6 5 78 51 '60 
1975 52 26 51 31 36 64 15 82 49 56 
1976 35 4 43 57 63 48 51 42 379 14 86 50 56 59 
1977 36 42 59 49 78 12 85 51 78 
1978 34 13 43 68 55 S1 49 76 26 87 56 
1979 23 44 53 41 80 21 91 '6 76 
1980 30 7 17 57 51 36 73 21 84 63 #77 
1981 16 40 22 49 53 70 20 81^ 
1982 31 19 37 56 56 6 144 68 21 79 82 
19833 > 52 53 67 64 46 7" 20 82 
1984 44 50 44 64 60 6 4 71 2 & 74 
1985 49 - 41 52 44 73 25 84 B 
1986 , 38 51 49 75 30 86 : 
1987 58 52 75 30 86 





* Indicators of Political Support: 

‘A. U.S.: Political trust index 

B. Italy: Political trust index 

C. Spain: Political trust index 

D. Germany: Political trust index ` 

E. U.S.: Care who wins elections (96 agree) 

` F. Britain: Budget is fair (96 agree) 
G. Britain: Differences between parties (96 agree) 
H: Germany: Approve of Parliament (96 agree) 
I. Germany: MPs represent the people (96 agree) 


to grow or remain relatively stable. The 
German Democracy and Anti-Nazi Indexes 
(N and O) and the Spanish indicators, of 
support for democracy (P and Q) all rose with 
little interruption over the entire period. 
Satisfaction with the way democracy works 
(J-M) generally rose in all countries, but the 


J. Britain: Democracy “works” well (96 agree) 

K. France: Democracy "works" well (9b agree) 

L. Germany: Democracy "works" well (96 agree) 

M. Italy: Democracy "works" well (% agree) 

N. Germany: Support for democracy index 

O. Germany: Opposition to Nazism index 

P. Spain: More than one person should rule (% agree) 
Q. Spain: Permit political parties (95 agree) 


trends are weak. (The mean of regression 
slopes across countries is 0.7 percent per 
year, but the trend is significant at the .05 
level only in Britain.) On the other hand, 
levels of satisfaction vary considerably from 
country to country. 
These findings are consistent with the 
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notion that reservoirs of legitimacy prevent a 
decline of confidence from leading to a 
rejection of democracy (Figure 1, path “y”). 
This hypothesis is tested more directly by 
correlating political confidence indicators 
with support for democracy indicators (Table 
2). The results for the whole period (first 
column) seem at first glance to be ambiguous. 
The British and Spanish results support the 
hypothesis. When falling confidence does not 
lead to a decline of support for democracy, 
the correlations are statistically insignificant. 
And when support for democracy rises 
despite falling confidence, the correlations are 
significantly negative. 

The German and Italian correlations in the 
first column of Table 2 seem at first to 
contradict the hypothesis, but a closer inspec- 
tion reveals that they do not. In both cases, 
positive correlations indicate that political 
confidence and support for democracy were 
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both rising at the same time—in Germany 
during the 1950s and early 1960s, and in Italy 
during the 1980s. The German result is clear 
-when the data are disaggregated by time in 
the last two columns of Table 2 (the Italian 
data are more recent and cannot be disaggre- 
gated). The positive German correlation is 
limited to the earlier part of the time series: 
after the mid-1960s, no significant positive 
correlation appeared between the two dimen- 
sions of political support. One could argue: 
that until the first party rotation in office in 
the mid-1960s, Adenauer’s Christian Demo- 
crats partially succeeded in identifying their 
own popularity with legitimation of democ- 
racy. But by the time the opposition took 
office, this link had been broken, and support 
for democracy became independent of politi- 
cal confidence. Analyses of individual-level 
survey data support this interpretation (Weil 
1981). 


Table 2. Political Support: Relation of Political Confidence to Support for Democracy (Pearson's r) (Path “y” in 


Figure 1) 
All Until After 
Years 1965* 1965* 
Political Trust Inaex l 
Germany: Democracy index Tu 76 1 
N = 27, 16, 11 
Germany: Anti-Nazi index .25 .86** —.65* 
i N = 23, 16,7 
Gesmany: Democracy “works” well .77* — — 
N=8 
Italy: Democracy “works” well .61* — ` ~ 
N=8 
Spain: More than one person should rule —.97** — — 
N=6 ; 
Spain: Permit political parties ~ .28 
= 4 
Approval of Parliament 
Germany: Democracy index .748™ .57* .90* 
N = 19, 15, 4 
Germany: Anti-Nazi index 2" 56* —-.30 
N = 29,15, 5 
MPs Represent the Pzople 
Germany: Democracy index 4*7 84** 90 
N = 12,9,3 
Germany: Anti-Nazi index .61* 80** 34 
N = 13,9,4 
Germany: Democracy “works” well .78 — — 
N23 
Budget is Fair 
Britain: Democracy "works" well -.02 — — 


N= 13 


Differences Between Farties 


Britain: Democracy “works” well 
N28 


** p « 0l * Ol<p< .05 


*05 «p «.10 


* No entry indicates that data were available only after 1965. 


SOURCES AND STRUCTURE OF LEGITIMATION 


Thus, there has been no significant ten- 
dency for a decline of political confidence to 
soften support for democracy. But in Italy in. 
the 1980s and Germany before the mid-1960s, 
support for democracy sometimes rose at the 
same time that confidence was also rising. If 
there was a crisis, it was limited to a decline of 
political confidence: a reservoir of goodwill 
for democracy prevented the "confidence gap" 
from turning against the regime itself. 


The Determinants of Political Support 


If political confidence and support for democ- 
racy rose and fell independently, are they 


Table 3. Relation of Economic and Political Performance to Political Support (Pearson's r) (Paths “a” and “b” in 
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determined by separate factors? the hypothe- . 
ses state that political confidence is affected 
mainly by state performance, and that support 
for democracy is affected mainly by the 
structure of opposition. They further state that 
state performance does not have much effect 
on support for democracy, and that opposition 
structure may or may not affect political 
confidence. I discuss the effects of the two. 
clusters of independent variables separately in, 
the following sections. 

Economic and political performance. The 
effects of the performance indicators (Table 
3), for the most part, strongly confirm the. 


Figure 1) j 
Opinion Ave. Civil 
Misery* on Growth Disorder 
Index Economy of GDP . Index 
. Political Confidence i 
U.S.: Political trust index ` .83*- . .B8** .54* —.49 
N = 14, 13, 14, 8 ; 
Italy: Political trust index .76** .91** .89** 37* 
N = 18, 14, 18, 15 : 
Spain: Political trust index ' .71* .97** .92** — 
N = 7,6,7,2 i 1 
Germany: Political trust index . .40* 24 — .40* ; .52** 
N = 26, 18, 27, 23 3 ' 
U.S.: Care who wins elections .68** 58** 44* —.35 
N = 14, 14, 13, 11 i 
Britain: Budget is fair —.04 .00 .14 —.41* 
N = 29, 26, 31, 23 l 
Britain: Differences between parties .16 .48*. —.03 .33* 
N = 22, 18, 23, 23 
Germany: Approve of Parliament .32+ .62** —.56** .45* 
N = 21, 14, 21, 20 
Germany: MPs represent the people .65** .01 —.56* ^N .62** 
N=13, 7, 13, 12 
Support for Democracy S 
Britain: Democracy “works” well :27. 25 .26 57 
N = 12, 13, 14,5 
France: Democracy "works" well . Al — —.10 .87* 
«e N = 12, —, 13,4 
Germany: Democracy “works” well .30 16 -46* 39 
N = 12,11, 14,5 
Italy: Democracy “works” well 31 —-.01 31 18 
N = 13,9, 14,5 
Germany: Support for democracy index —.05 .36* -.87** Sie 
N = 30, 22, 34, 24 
Germany: Opposition to Naxism index 04 .20 —.80** . 45** 
N = 30,22, 31, 30 
Spain: More than one person should rule -—.96** -.93** 2.929« —.68 
N29,9,9,4*. ' 07 : 
Spain: Permit political parties —.98** —.99** —-.86** —.92* 
N=9,6,9,5 
"pc §* Ol<p< 05 *.05 < p « :10 


* Sign of indicator reversed: positive correlation indicates support for hypotheses. 
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Fig. 3. United States: Economic Performance and 
Political Trust 


hypotheses: performance positively affects 
political confidence but not support for 
democracy. Most of the correlations in the 
upper panel of Table 3 (effects on confidence) 
are strong, significant, and in the predicted 
direction. And most of the correlations in the 
lower panel (effects on support for democ- 
racy) are weak, insignificant, and/or opposite 
the predicted direction. Figures 3-5 dramati- 
cally show the effects of the misery index and 
subjective opinion on the economy on the 
Political Trust Indexes in the U.S., Italy, and 
Spain. Confidence rises and falls with eco- 
nomic conditions, and the tracking is uncom- 
monly close for this sort of data. 

. Although several exceptions to the overall 
pattern appear, most can be incorporated into 
the hypotheses with little adjustment. The 
most important of these are the negative 
correlations between performance and support 
for democracy in Spain. But these results 
actually provide strong support for the 
hypotheses. For they indicate that support for 
democracy grew in Spain despite severe and 
growing problems of performance— rising 
unemployment (over 20 percent), slowing 
growth, and serious levels of terrorism. 
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Fig. 5. Spain: Economic Performance and Political Trust 


Figure 6 displays these trends dramatically. 
The growth of support for democracy in the 
face of poor performance is especially 
impressive because political confidence plum- 
meted at the same time (see Figure 5). Yet it 
makes little sense to speak of a reservoir of 
democratic legitimacy in this case because 
democratic sentiment grew around the time of 
the transition to democracy. It seems more 
appropriate to speak of an extension of credit 
to democracy or of a honeymoon period. 
The negative effects of GDP growth on 
political support in Germany constitute a 
second exception to the main pattern. Disag- 
gregating the time-series shows that the 
negative effects are restricted to the earlier 
period (disaggregation not shown). In the 
later period, with one exception, the correla- 
tions become insignificant. Thus, political 
support appears to rise with falling economic 
growth rates shortly after the transition to 
democracy. One might be tempted to speak 
again here of a honeymoon period. But this 
would not be appropriate because conditions 
in Germany were actually improving rapidly. 
GDP growth fell from an extraordinary 11 
percent in 1950 to 7 percent in 1960— still a 
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very high rate of growth. Thé misery index 
and the civil disorder index; both showed 
improvement in the same period. Thus, 
growing political support in the early decades 
of the Bonn Republic did not come in the face 
of poor performance, but of rapid improve- 
ment, even (if the rate of improvement fell 
somewhat. ‘In this respect, the negative 
correlation of political support with GDP 
growth is something of a statistical artifact. 

A similar‘explanation applies to the nega- 
tive correlation between the .civil disorder 
index and political confidence in America. 
Figure 7 reveals the cause. The civil disorder 
curve in the U.S. resembles a mountain, 
centering on the racial conflicts and opposi- 
tion to the Vietnam War of the 1960s and 
1970s—the ‘same period that confidence 
declined. But the civil disorder curve peaked 
in the mid-to-late 1960s. While it remained 
high by international standards in the 1970s, 
it was now dropping. In a mirror image of the 
previous German example, American condi- 
tions were not good, but they were becoming 
less bad. The improving curve produces the 
negative correlation with confidence, but 
once again, the result is something of a 
statistical artifact.'° Rather, the data suggest 
- that civil disorder was an important catalyst in 
spurring the decline of confidence in govern- 
ment (see also the debate n Miller 1974; 
Citrin 1974). 

Finally, five significantly positive correla- 
tions between state performance and support 
for democracy were not predicted. Two of 
them, involving judgments of the functioning 
of democracy in France and Germany, stand 
as exceptions to the main pattern. But the 
three positive correlations involving the 
Democracy and the Anti-Nazi indexes in 
Germany can be explained by disaggregating 
the time series by period. In the earlier part of 
the Bonn Republic (until 1965), support for 
democracy or opposition to Nazism rose at 
the same time that state performance im- 
' proved. But in the later period, the correla- 
tions became insignificant—or in one case, 
even mildly significantly negative. 





16 In cases where extreme (good or bad), but 
moderating, levels of performance produce an 
unpredicted correlation, causation might be better 
modeled as a cumulative process. This is not done 
here, but the substantive findings underscore the 
importance of examining graphical representations 
of the data. ` 
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Thus, with only a few exceptions that 
cannot be explained, the results in Table 3 
support-the hypotheses i in Figure 1 (paths “a” 
and “b”). The state’s political and economic 
performance affects the level of political 
confidence. But poor performance does not 
weaken support for democracy. On the 
contrary, support for democracy grew in 
Spain despite poor state performance, and 
despite the fact that confidence fell in 
response to poor performance. Thus, we 
again find no evidence that a confidence gap 
caused by poor state performance necessarily , 
leads to a weakening of support for a 
democratic regime. Ordinarily, one would 
argue that this limitation is due to reservoirs 
of legitimacy. But Spain suggests an amend- 
ment to this proposition, for new democracies 
are unlikely to have accumulated such 
reservoirs. In such cases, a honeymoon 
period may serve as a functional equivalent to 
the reservoir of regime support and prevent 
poor performance from leading to a deeper 
legitimation crisis. 

The structure of opposition. The hypothe- 
ses in Figure | state that an unresponsive 
opposition structure will lessen support for 
democracy and may reduce political confi- 
dence (paths “d” and "c", respectively). The 
results in Table 4 broadly support the 
hypotheses, but at first glance, less convinc- 
ingly than was-the case for state performance 
(see Table 3). The structure of opposition 
affects support for democracy as predicted in 
most cases, but there are exceptions. And 
opposition structure often has a positive effect 
on political confidence, although it was 
unclear whether this should be expected. 

The German results account for many of 
the apparent disconfirmations, especially the 
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"Table 4. Relation of Structure of Opposition to Political Support (Pearson's r) (Paths “c” and "d" in Figure 1) 


Polariz.* - Fraction.* Coalit. Cabinet 
Index index Index Stabil. 
Political Confidence va 
U.S.: Political trust index .88** —.01 .66* -.37* 
' N = 8,9, 14, 14 2 . 
Italy: Political trust index .84** —.14 .42* .44* 
N = 18, 18, 18, 18 
Spain: Political trust index 73+ 61 -b = 
|N255 
Gerinany: Political trust index ,52** Sor .05 —.19 
_ N = 27, 27, 27, 27 
U.S.: Care who wins elections .29 44 574 ~ 36. 
N = 11, 12, 14, 14 PON 
Britain: Budget is fair —.08 —.05 — .01 
N = 31, 31, 31, 31 
Britain: Differences between parties -.23 .37* — .03 
^ N = 23, 23, 23, 23 
Germany: Approve of Parliament .38* .63** —.16 —.46* 
N = 21, 21, 21, 21 
Germany: MPs represent the people .68** .67** —.16 — .48* 
N = 13, 13, 13, 13 
Support for Democracy 
Britain: Democracy "works" well 09 —.10 — Al 
N = 15, 15, 15, 15 
France: Democracy “works” well 447 .50* —.48* 12 
"ON = 15, 15, 14, 14 
Germany: Democracy “works” well —.21 — 04 .53* .61** 
N = 15, 15, 15, 15 . ; 
Italy: Democracy “works” well .62** —.53* —-.15 .76** 
, N = 15, 15, 15, 15 
Germany: Support for democracy index 21 .86** ~ .30* — .82* 
N = 34, 34, 34, 34 
Germany: Opposition to Naxism index A79 .87** —.63** — ,86** 
N = 32, 32, 32, 32 
Spain: More than one person should rule .06 .86* — — 
N=5,5 
Spain: Permit political parties .89* .95* — = 
N=4,4 í 
** p < .01 * 01 « p « .05 + 05«p«.10 


* * Sign of indicator reversed: positive correlation indicates support for hypotheses. 


_> Dash indicates Insufficient Variance, 


negative and insignificant effects of the 
coalition index and cabinet stability. Yet as 
before, disaggregating the data helps explain 
the ‘exceptions. The coalition index did not 
vary in Germany before the mid-1960s 
because the only oversized coalition was the 
Grand Coalition government of 1966-1969: 
but after the mid-1960s, almost all correla- 
tions are positive.!7 Likewise, German cabi- 
net stability did not vary much in the earlier 


17 The three insignificant correlations with 
political confidence all become significantly posi- 
tive, and one of the two negative correlations with 
support for democracy becomes (insignificantly 
positive. ; 


period, but all the negative correlations turn 
positive when measured in the later period 
alone (most, significantly). With these factors 
taken into account,’ the structure of opposi- 
tion’s effect on political support is consider- 
ably stronger than it first appeared. 

The Spanish results again highlight these 
findings. While support for democracy rose 
despite poor state performance, it is positively 
correlated with a reduction of polarization and 
fractionalization in the party system. These 
are precisely the features that Juan Linz 
(1979, 1988b) has long stressed in his 
analyses of Spain’s transition to, and its 
consolidation of, democracy. A democratic 
regime may survive poor economic conditions 
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. and social unrest, but if the party system is 
unresponsive, there is a danger that the 
population may look to nondemocratic alter- 
natives. 

Two additional pieces of evidence support 
the proposition that the structure of opposition 
affects support for democracy. First, respon- 
dents in several countries were asked whether 
they saw any danger of Nazism regaining 
strength in West Germany (Figure 8). Since 
1945, increasing numbers have said. there is 
no such danger, but the otherwise rising curve 
dipped during the West German Grand 
Coalition government of 1966—69. (The dip 
appears to come later in Germany, but this 
may be due to the lack of data in key years.) 
This dip corresponds to a resurgence of 
neo-Nazi voting at that time. Although some 
analysts attributed the resurgence to the mild 
economic recession of the mid-1960s (e.g., 
Liepelt 1967), neo-Nazi voting did not rise 
again in the deeper recessions of the mid- 
1970s and early 1980s. Rather, one can argue 
that it was a protest vote whose direction was 
determined by the structure of opposition: 
with both major parties in office, there was no 
responsible opposition to vote for besides the 
Free Democratic Party (which was too small 
to form a government). If the Grand Coalition 
had not ended fairly quickly, it might have set 
in motion a deeper cycle of polarization and 
weakening of support for democracy. 

A series of questions concerning support 
for democracy in France suggest similar 
conclusions (Figure 9). Fourth Republic 
France (1945—58) has long been cited as an 
example of the evils of the polarization and 
fragmentation of the party system and of 
cabinet instability (e.g., Sartori 1966, 1976). 
And DeGaulle's Fifth Republic has been cited 
as an example of.a successful democracy, 
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despite some tendencies toward centralization 
oi power in the Presidency (e.g., Parodi and 
Platone 1985). The survey data are rather 
thin, but most time series suggest that 
evaluation of democracy and support for 
democracy fell during the Fourth Republic 
(before 1958) and rose during the Fifth 
Republic. 

Thus, while state E N has little 
effect on support for democracy (Figure 1, 
path “b”), the structure of opposition does 
(path “d”). Polarization or fractionalization 
of the party system drive down support for 
democracy most.!? But oversized or minority ` 
government coalitions and unstable execu- 
tives also depress support for democracy. 
These findings fit the hypotheses well. If 
democracy is a game among players, the 
players are unlikely to reject the game solely 
because of its outcome, but rather because its 
rules are not working as designed. Although it 
was not necessarily predicted, opposition 
structure affected political confidence (path 
"c").almost as strongly as it did support for 
democracy ("d"). This finding implies that 
one should take the "systemic level" of the 
variables seriously More superficial objective 
conditions like state’ performance only affect 
more superficial evaluations like confidence. 
But more fundamental objective conditions, 
like opposition structure, affect both deeper 
and more superficial subjective evaluations: 
support for democracy and confidence. ', 


15 Sani and Sartori's (1983) argument that - 
polarization is more important than fractionaliza- 
tion is not supported here. However, I analyze time 
series within countries, while they analyze compar- 
ative cross sections. 
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Does State Performance Affect Support for | 
Democracy Indirectly? 


An indirect version of new-left/neo-con- 
servative theories posits that poor state perfor-. 
mance depresses support for democracy by 
disrupting the structure of opposition. It is 
difficult to test such a multivariate hypothesis 
because the available data are so sparse. Still, 
an investigation of the bivariate links should 
give us a hint whether it is true. 

Table 5 shows the correlations between 
state performance and opposition structure. !? 
The pattern is so complex that no simple 
answer emerges. Of 74 correlation coeffi- 
cients, only about half (38) suggest that poor 
state performance contributed to a breakdown 
of à "good" opposition structure. The rest 
show either no significant relationship (20) or 
a disconfirmation of the proposition (16). 
And the only country in which the results are 
especially consistent is Britain, but opposition 
structure did not vary much there. If state 
performance has an indirect effect on demo- 
cratic legitimation, it must be shown: that it 
has a direct effect on opposition structure. 
This cannot be definitely shown here. 

. The correlations in Table 5 are based on 
times series within countries, not cross- 
national comparisons. Cross-national studies 
in the literature do not obtain completely 
consistent results either. Yet most find neither 
indirect effects of state performance on 
democracy nor strong direct effect of state 
performance on opposition structure (e.g., 
Harmel and Robertson 1986; Powell 1982, 
pp. 102-3). European historical evidence is 
somewhat stronger in showing that state 
performance had little direct or indirect effect 
on the survival of democratic regimes. 
Zimmermann and Saalfeld (1988) summarize 
the first author's studies of the collapse of 
democracies in the 1930s. They show that 
economic crisis led to electoral volatility, and 
that polarization and à failure to form a 
democratic national consensus led to the 





19 As noted earlier, the disorder index is 
sometimes used as an indicator of political support 
(a dependent variable). Table 5 shows correlations 
between the disorder index and all other indepen- 
dent variables. Cabinet stability and the misery 
index affect the disorder index as the literature 
predicts in most, but not all, cases. Otlierwise, no 
strong patterns emerge: correlations tend to be 
either insignificant or inconsistent. 
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l collapse of democracy. But there was no link 


between electoral volatility and polarization 
(except in Germany). Therefore, there was no 
link, direct or indirect, between economic 
crisis and the collapse of democracy. Nor was 


there a direct link between economic crisis 


and polarization. It appears that the effects of 
state performance on the survival of democ- 
racy in the 1930s were largely absorbed by 
variation in the structure of opposition. 

Thus, the proposition ‘that state perfor- 
mance affects support for democracy indi- 
rectly through the structure of opposition 
remains plausible but undemonstratéd. The 
weight of the evidence so far suggests that it 
is not true. 


DISCUSSION: RESERVOIRS OF 
LEGITIMATION AND THE SURVIVAL 
OF DEMOCRACY - | 


. This article criticizes legitimation crisis theo- . 
Ties proposed by both new-left and neo- 


conservative writers. They’ argue that the state 
is unable to. satisfy rising demands for more 
services or for more input in decision making. 
As a result, support for the state is eroded: a 
legitimation crisis occurs that threatens democ- 
racy itself. 

I argued that this scenario is logically 


: improbable and historically rare because it 


neglects distinctions that must be made 
among both the sources and types,of political 
support (objective conditions and subjective 


evaluations). When these are made, the likely 


outcomes look much different. In this study, I 
distinguished state performance from the 
functioning of the system as a whole, 
especially the opposition structure in the party 
system and government. coalitions. And I 
distinguished political confidence from legiti- 
mation of democracy. On this basis, I 
prcposed that poor performance can lead to a 
decline of confidence, but not to a softening 
of support for democracy. Yet problems in 
opposition structure can lead to a decline of 
support for dèmocracy. And poor perfor- 
mance does not necessarily lead indirectly to 
rejection of democracy — not because it under- 
mines a responsive opposition structure, nor 
because a, confidence gap escalates to a 
legitimation crisis. 

Broadly speaking, the empirical analyses 
supported the hypotheses. Support for democ- 
racy depends less on good state’ performance 
than on low party-system polarization of 
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fractionalization and (to a` fesser extent) 
minimum-winning coalitions and long-lived 
cabinets. Citizens judge democracy less by 
- what it “gives” them than by whether it 
presents them with real (but.not polarized) 
alternatives and responds to their choices. But 
political confidence depends on both perfor- 
mance and a responsive opposition structure. 
Citizens lose confidence ‘if their representa- 
tives consistently fail to deliver the goods, but 
also if the rules of the game do not: work. 
Indirect links in the model could not be tested 
adequately because the data were too thin to 
use multivariate methods. Still, it seems 
unlikely that state performance has an indirect 
effect on support for democracy. The interme- 
diate links are not strong: lack of confidence 
is mainly unrelated to rejection of democracy, 
and state performance is only ascites 
related to opposition structure. 

The results seem to imply that a ‘“reser- 
voir” of regime legitimation simply depends 
on a long history of responsive opposition 
structure. Much of the evidence points in this 
direction. West German support for democ- 
racy grew over the early decades of the Bonn 
Republic as the party system became less 
polarized and fractionalized. But Spain does 
not follow this pattern. To be sure, as in West 


Germany, support for democracy grew as the ` 
party. system became less polarized and 
fractionalized. But support. for democracy 
was already strong and growing before the 
transition to democracy. Many observers 
argue that the electorate, despite its desire to 
express dissatisfaction with poor government 
performance, avoided voting for extremist 
parties because it feared the polarization that 
had torn apart Spanish democracy in the 
1930s (Maravall and Santamaria 1986; 
Gunther, Sani, and Shabad 1986). Something _ 
like a reservoir of support for democracy 
seems to have existed in Spain even before 
the factors that usually contribute to’ it 
emerged. 

I suggested earlier that a “honeymoon 
period" might sometimes serve as a func- 
tional equivalent to a reservoir of legitimacy. 
But why should citizens extend credit to an 
unproven regime or make an: “advance 
payment" to the reservoir? One possibility 
(not tested here) is that citizens might take 
opposition structures in foreign democracies 
as models, even before they experience them 
in their own country (see Weil 1987, 
unpublished). In such cases, the prestige of 
democracy abroad is already high at a 
country's own point ok transition. Such 


Table 5. Relation of Political and Economic Performance to Structure of Opposition (Person’s r)* (Path “ 


Figure 1) 
U.S Italy 
Misery Polarization 46** 67** 
Index Franctionaliz 29* —.03 
Coalitions 62** 43** 
Govt length -.23* 68** 
Econ Polarization 66** 66** 
Satisf. Fractionaliz 10 17 
Coalitions 41** 14 
Govt length -—.34* 33* 
Growth Polarization 48** 05 
in GDP Fractionaliz .07 .06 
Coalitions .13 —I 
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Disorder Misery index -—.13 .48** 
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“demonstration effects” might be restricted to 
“reference groups” of countries that believe 
their conditions are comparable. Thus, the 
Spanish population might have taken the 
successful construction of democracy in West 
Germany after Nazism as a model, even if 
Anglo-American models were regarded with 
distaste for other reasons. (Demonstration 
effects within reference groups of countries 
may have proceeded in a chainlike manner in 
recent years, as Latin Americans and the 
Filipinos took the Iberian peninsula as a 
model, and South Koreans looked to the 
Philippines.) 

‘Do the present findings mean that state 
performance has absolutely no effect on 
democratic legitimation—or the survival of 
democracy? Do they constitute a disproof of 
legitimation crisis theories? Not necessarily. 
My model does not include every possible 
relationship between performance and legiti- 
mation. Two additional channels of influence 
can be considered by relaxing model restric- 
tions. First, while performance has no direct 
or indirect additive effect on democratic 
legitimation, there may be interaction effects. 
Performance and legitimation might covary 
under an unresponsive opposition structure, 
but be unrelated under a responsive opposi- 
tion structure. For instance, the Great Depres- 
sion might not have “caused” the collapse of 
democracy in all Western countries because a 
responsive opposition structure shielded some 
of chem from legitimation problems. Yet even 
in countries like Germany and Austria, which 
probably had low levels of legitimation?? and 
did have unresponsive opposition structures, 
democracy might have survived and grown if 
not for the jolt of the economic crisis (Bracher 
1970, and Zimmermann and Saafeld 1988 

.each suggest elements of this proposition). 
Analyses of post-1945 survey data might shed 
further light, but my own results are still 
preliminary. The disaggregated time series 
discussed earlier tend to support the hypothe- 
sis, but cross-national time series, like the 
“functioning of democracy," show no clear 
patterns. Interaction effects seem possible, 
but the evidence is not all in. 


20 All propositions about legitimation and confi- 
dence before the advent of opinion surveys must be 
speculative. However, elite statements, the popular 
press, and literature suggest that support for 
democracy in Germany was weak in the 1920s and 
1930s (see, e.g., Baum 1982). f 
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A second factor could be added to 
interaction effects: causal feedback loops. On 
one side, low legitimation and an unrespon- 
sive opposition structure might reinforce each 
other during a performance crisis as citizens 
become increasingly indifferent to democracy 
and cast ever more protest votes for antisys- 
tem parties. The performance crisis might 
aggravate an already vicious circle of delegit- 
imation and unresponsive opposition struc- 
ture. On the other side, high legitimation and 
a responsive opposition structure might rein- 
force each other during a performance crisis 
as citizens pull together to support democ- 
racy, voting only for loyal oppositions to 
“throw the bums out,” not the regime. The 
performance crisis might not be able to 
penetrate a “virtuous circle” of legitimation 
and responsive opposition structure. 

The model with interaction effects and 
feedback loops is considerably more complex 
than the one I tested, though my findings 
seem broadly consistent with it. Historical 
research on the Great Depression again 
supports it more clearly (e.g., Lepsius 1978; 
Zimmermann 1987). The economic crisis of 
the 1930s set in motion two identifiable 
dynamics. In democracies that collapsed (see 
Figure 10), the crisis prompted calls for a 
change in administration, and governing 
parties lost elections. Loyal opposition parties 
were not strong enough to form stable 
alternative coalitions; end low legitimation 
predisposed citizens to vote for antisystem 
parties, who thereby acquired veto power in 
forming new governments. The antisystem 
parties also controlled state performance by 
"controlling disorder." If they were excluded 
from government, they could “deliver disor- 
der" on the streets, and endanger democracy. 
But if they were taken into government (their 
price for "withholding disorder") democracy 
was doomed. Thus, a vicious circle was 
formed. Performance crises led to new 
elections. Because democratic legitimation 
was lcw and prosystem parties were unable to 
form alternative coalitions, citizens voted for 
antisystem parties. The antisystem parties 
further blocked the formation of pro- 
democratic coalitions able to address the 
performance crisis—and a new cycle was 
initiated. Democracy was perceived not to 
have “worked” and was further delegiti- 
mated. In the second dynamic, in democra- 
cies that survived, the crisis also prompted 
calls for a change in administration, and the 
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parties in government also lost elections. But 
with a responsive opposition structure and 
higher levels of democratic legitimation, 
citizens could cast their protest votes for loyal 
oppositions. A rotation in office occurred, an 
alternative prodemocratic coalition addressed 
the problems, and the system was perceived 
to have "worked." Regime legitimation was 
reinforced, and, citizens felt encouraged to 
continue to choose among "system" alterna- 
tives. 

At the current stage of research,, this 
complex model cannot be verified. Research 
on the 1930s cannot measure mass legitima- 
tion directly, and iny analysis is based on a 
simpler model. Indeed, data on postwar 
Western societies —already stretched thin to 
test the simple model—may not be suffi- 
ciently complete, or cover periods of suffi- 
cient strain, to test the more complex model. 
Perhaps data from other countries would help. 
But my findings from the simpler model do 
reduce the scope of the question. They 
indicate that opposition structure affects 
democratic legitimation directly. State perfor- 
mance hardly affects legitimation at all, or if 
it does, only in a bounded and indirect 
fashion. 


APPENDIX A. SOURCES OF SURVEY DATA: 
SURVEY ORGANIZATIONS AND ARCHIVES. 


Britain—(a) Social Surveys Limited (the Gallup 
Poll), London. (b) Market & Opinion Research 
- International (MORD, - London. (c) National 
' Opinion Poll (NOP) Market Research Limited, 
‘London. (d) ESRC Data Archive, University of 
Essex. 
France—(a) Institut Frangais d'Opinion Publique 
(IFOP-ETMAR), Paris. (b) Centre d’etude de la 
` vie politique française (Cevipof) Archive, Paris. 
Italy—(a) DOXA (italian Gallup), Milan. (b) 
Demoskopes, S.R.L, Milan. (c) Eurisko, Milan. 
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(d) GPF & Associati, Milan. (e) Archivio Dati e 
Programmi per le Scienze Sociali, Milan. 

Spain— (a) Centro de Investigaciones Sociologicas 
(CIS), Madrid. (b) Instituto Gallup, S.A, 
Madrid. 


United States—(a) Raper Center Archive, Storrs, 
CT. (b) Inter-University Consortium for Politi- 
cal and Social Research Archive, Ann Arbor. (c) 
Data and Program Library Service, Madison. 

West Germany — (a) Institut für Demoskopie (IfD), 
Allensbach am Bodensee. (b) EMNID-Institut 
GmbH (West German Gallup), Bielefeld. (c) 
INFRATEST Forschung KG, Munich. @ ze 
tralarchiv, Cologne. ~ 


APPENDIX B 
Texts and Components of Political Support Indicators 


I. Political Trust Indexes 


A. United States 
1. How much of the time do you think you can 
, trust the government in Washington to do 
what is right—just about always, most of | 
the time, or only some of the time? [None. 
of the time.] (1958, 64, 66, 68, 70, 72, 74, 
76, 78, 80, 82, 84, 85, 86) >- 

2. Would you say the government is pretty 
much run by a few big interests looking out 
for themselves or that it is run for the 
benefit of all people? (1964, 66, 68, 70, 72, 
74, 716, 78, 82, 84) 

3. Do you think that people in the government 
waste a lot of money we pay in taxes, waste 
some of it, or don't waste very much of it? 
(1958, 64, 68, 70, 72, 74, 76, 78, 82, 84) 

4. Do you feel that almost all of the people 
running the government are smart people 
who usually know what they are doing, or 
do you think that quite a few of them don't 
seem to know what they are doing? (1958, 
64, 68, 70, 72, 74, 76, 78) 

5. Do you think that quite a few of the people 
running the government are a little crooked, 
not very many are, or do you think hardly 
any of them are crooked at all? (1958, 64,:, 
68, 70, 72, 74, 76, 78, 84) 


B. Italy 


1. Do you think that in Italy today things are 
faring well or poorly with regards the 
working of the state (ministries, public 
offices, etc.)? (1963, 67, 68, 74, 76, 80) 

2. Do you think that in Italy today things.are 
faring well or poorly in regard to the 
capability and competence of statesmen? 
Very well, Fairly well, Neither good nor 
-bad, Fairly badly, Very badly. (1967, 74, 
76, 80) 

3. Do you think that in Italy today things are 
faring well or poorly in regard to the 
honesty and seriousness of statesmen? Very _ 
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well, Fairly well, Neither good nor bad, 
Fairly badly, Very badly. (1963, 67, 74, 
76, 80) 

4. Compared to a year ago, do you think the 
situation in Italy today is better, the same, 
or worse? Much better, A bit better, Same, 
A bit worse, Much worse. (1963, 64, 65, 
67, 68, 74) 

5. Personally, are you content or discontent 
with what the government has done this 
year? [More or less content, Neither content 
nor discontent, Somewhat discontent] (1960, 
63, 64, 65, 66, 68, 72, 76, 78) 

C. Spain 

1. Do you think the people that govern Spain 
are capable and know what they are doing, 
or does it seem to you that a lot of them are 
not sufficiently capable? (1975, 80, 81, 82, 
84) 

(variant:) Do you think the people that run 
the government are capable or not capable? 
(1966) 

2. Do you think the government administers 

the majority of the taxes we Spanish pay 
wisely, that it wastes some money, or that it 
throws a large part of it out the window? 
(1966, 80, 81, 82) 
(variant:) Do you think that the government 
spends our tax money well, that it wastes 
some, or that it throws a large part of it 
away? (1975) 

3. Speaking in general terms, would you say 
that the present political situation in Spain is 
very good, good, so-so, bad, or very bad? 
(1975, 79, 80, 81, 81) 

4. Do you think that the political situation [in 
Spain] is better, the same, or worse than a 
year ago? (1975, 79, 80, 81, 82) 

5. And do you think that in one year that this 
political situation [in Spain] will be better, 
the same, or worse than now? (1975, 79, 
80, 81, 82) 

D. Germany 

1. How much of the time do you think one can 
trust the government in Bonn to do what is 
right? Almost always, most of the time, 
only some of the time, almost never. (1974, 
85) 

(variant) One can generaly trust the 
government and be sure that it will do the 
right things. (1969) 

(variant) I am going to read a list of public 
institutions and organizations. Please tell 
me if each one gains your trust or not: the 
Parliament. (1979, 81, 82, 83, 84) 

2. Do you a lot of money is wasted in Bonn, 
or do you think this is greatly exaggerated 
by the newspapers? (1951, 52, 57, 60, 79) 

3. Do you think that one must have great 
ability to become a representative in Bonn? 


f 
1 
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(1951, 52, 53, 54, 55, 56, 57, 58, 59, 60, 
61, 63, 64, 72, 79) p 


. Do you think that officials, in Germany 


generally cannot be influenced and cannot 
be bribed? (1950, 64, 74, 75, 76) 

(variant) Do you think that the officials in 
Germany generally take action uninfluenced 
by relatives, personal relationships, or party 
connections? (1958) 


Il. Additional Political Confidence Items 
A. United States 


1. 


Generally speaking, would you say that you 
personally care/cared a good deal which 
party wins/won the Presidential election 
(elections to the Congress in Washington) 
this Fall, or don’t you care very much 
which party wins/won? (1952, 56, 58, 60, 
64, 68, 70, 72, 74, 76, 78, 82, 84) 


B. Britain 


l. 


2 
“ 


Do you think that there is any really 
important difference between the parties, or 
are they all much of a muchness? (1951, 52, 
57, 58, 59, 61, 62, 63, 64, 65, 67, 68, 69, 
70, 72, 74, 76, 77) 


. Do you think the budget is a fair one or not? 


(1953, 1955~85 annually) 


C. Germany 


l. 


What do you think of the Parliament in 
Bonn as the representative of the people? 
Excellent; Basically good; Fair; Bad. (1951, 
53, 54, 55, 56, 57, 58, 59, 60, 61, 63, 65, 
67, 68, 70, 83) 

Do you believe that the Representatives in 
Bonn first and foremost represent the 
interest of the populace, or do they have 
other interests which are more important to 
them? (1951, 52, 53, 54, 55, 56, 57, 58, 
64, 72, 73, 78) 


Ill. Support for Democracy Indicators 


A. Functioning of Democracy 


l. 


Are you satisfied with the way democracy 
is functioning in [country]? Very satisfied, 
Fairly satisfied, Not very satisfied, Not at 
all satisfied. (1973, 1975-87 annually) 


B. Germany: Democracy Index 


1. 


2. 


Do you think it is better for a country to 
have one party, to obtain the greatest 
possible unity, or several parties so that the 
various opinions can be freely represented? 
Several parties, Not more than two or three 
parties, One party, No parties. (1950, 51, 
52, 53, 54, 55, 56, 57, 58, 59, 60, 61, 67, 
68, 72, 77, 78) 

Two men are discussing how a country 
should be governed. The one says: “I like it 
best when the people place the best 
politician at the top and gives him complete 


governing power. He can then clearly and 
quickly decide with a few chosen experts. 
Not much talking is done and something 
really happens!” The other says: “I prefer 
that a number of people have to determine 
something in the country. They do some- 
times go round and round until something is 
done, but it is not so easy for abuse of 
power to occur.” Which of these two 
opinions is closest to your own view—the 
first or the second? (1956, 57, 58, 60, 61, 
67, 72) 

. If you think about the difficulties which fact 
us: The scarcity of raw materials, food and 
sources of energy, caused by: the rapid 
growth of the population, and the growth of 
the economy.—Do you believe that we can 
control these difficulties with our demo- 
cratic form of state with several parties in 
the Parliament, or do we need a single-party 
system with a strong government at the top 
to take care of these difficulties in the 
future? (1975, 77, 78, 82) 

. Which form of government do you feel is 
best for us as Germans? Democracy; 
Monarchy; Dictatorship/Nazism. [Democ- 
racy] (1953, 54, 55, 56, 57, 58, 60, 62, 64, 
65) 

. Do you believe that the democracy which 
we have in the Federal Republic is the best 
form of state or is there another form of 
state which is better? (1967, 74, 75, 77, 78) 
. Many say: The. democratic relationships 
established in the Basic Law are not 
satisfactory in our state. In order to achieve 
a true democracy, we must change the 
whole system and write a new Basic Law. 
Do you also want this or are you against 
this? (1972, 74, 75, 78) 

. Once again about our Constitution, the 
Basic Law: The Basic Law was in fact 
created almost 30 years ago under the 
supervision and contro] of the Western 
Powers. Therefore, many feel that we 
Germans should write a new Basic Law for 
ourselves and thus make it correspond better 
to our needs and interests. Others say, on 
the other hand, that the current Basic Law 
has lasted this long so well that we do not 
need a new one. Which is your opinion? 
(1968, 74, 78) 

(variant) What do you really have to say 
about our present constitution—I mean our 
Basic Law? Do you find it to be good or not 
good, all in all—or have you not yet 
become so interested in it? [Don't know the 
Constitution] (1955, 56) 


8. What do you think of the form of 


government in the Federal Republic? Is 
there much to object to; is there only a little 
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bit to object to; or is everything in order? 
(1977, 78, 81) 


. Considered completely for a moment from 


the pragmatic view: do we actually need a 
Parliament with all those Representatives in 
Bonn or would it also work without them? 
(1956, 57, 58, 59, 62, 72, 77, 78, 82) 


* C. Germany: Anti-Nazi Index 
1. Do you think National Socialism was a 


good idea that was badly carried out? 
(1945, 46, 47, 48, 68, 70, 77, 80) 


. Everything that was built up between 1933 


and 1939 and much more was destroyed by 
the war. Would you say that Hitler would 
have been one of the greatest statesmen 
without the war? [No] (1955, 56, 59, 60, 
61, 62, 63, 64, 67, 72, 75, 77, 78) 


. Now a question about Hitler and National 


Socialism: Some say that aside from the war 
and the persecution of the Jews, the Third 
Reich was not really so bad. Others say that 
the Third Reich was unconditionally wrong. 
Which of these do you agree with? (1975, 
77, 78, 79) 


. If someone were to say that the National 


Socialist state was an unjust state, a 
criminal regime: According to what you 
know today about or experienced during the 
Hitler era—would you say that this is right, 
no that is wrong, or that not everyone was a 
criminal? (1964, 77, 78, 79) 


. Assuming a new Nazi Party were to try to 


come to power: How would you react? Do 
everything in power to prevent it; Oppose it 
but not do anything in particular; Wouldn't 
care; Support but not do anything in 
particular, Welcome and support it. (1953, 
56, 62, 66, 67, 72) 


. If there were the opportunity now, like in 


1933, to vote again for or against a man like 
Hitler, how would you decide? (1953, 54, 
55, 56, 57, 58, 60, 61, 63, 65, 67, 78) 


. Which form of government do you feel is 


best for us as Germans? Democracy; 
Monarchy; Dictatorship/Nazism. [Against 
Dictatorship/Nazism] (1953, 54, 55, 56, 
57, 58, 60, 62, 64, 65) 


. In your opinion, who was responsible that 


the war broke out in 1939? Germany 
(Hitler, Nazi Party, we); The other states 
(England, France, USA, Poland, Churchill, 
Versailles); Both sides (all statesmen, all 
administrations, Versailles and we); Interna- 
tional capitalism; It was fate that the war 
came. (1951, 55, 56, 59) 

(variant) Who was responsible for the war: 
National Socialist Germany, Both sides, or 
the Other side alone? (1964) f 
(variant) Who do you think was mainly 
responsible for 1939 war? Hitler, Germany, 
Britain, Other. (1969) 


iy 
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D. Spain: Democracy Items 
: 1. Do you think it is better for one person to 

have all the authority and make all the 
decisions for us, or for a group of people 
elected by all citizens to make the political 
decisions? (1966, 74, 76, 77, 79, 80, 82) 

2. Think that the existence of political parties 
would be good, or neither good nor bad. 
(1971) 
(variant) Agree with freedom for political 
parties. (1973) 

: (variant) Believe that political parties should 
be permitted to exist in Spain. (1975) 
(variant) Believe that one of the reforms 
should be the legalization of political 
parties. (1976) 
(variant) Democracy is not just parties, but 
democracy cannot exist without parties 
[agree]. (1984, 85) 


FREDERICK D. WEIL is Associate Profes- 
sor of Sociology at Louisiana State Univer- 
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and political theory. He is completing a 
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STRATEGIES AND STRUCTURAL CONTRADICTIONS: 
` GROWTH COALITION POLITICS IN JAPAN* 


JEFFREY BROADBENT 
University of Minnesota 


The elite coalition that pushes local industrial growth is composed of six actors: 
local state, business, and party and national state, business, and party. The 
question of which dominates the coalition reflects current debates about. the 
structure of power in capitalist society. This paper addresses these debates through 
the detailed study of a growth coalition in a rural industrialization project in 
southern Japan. It traces the strategies and modes of influence of the six actors, 
their structure and process, through the 25-year history of the project. Contrary to 
the “strong state” image. of Japan, national big business dominated the growth 
coalition. The rest of the society, including the state, adjusted around its motion. 
This resulted in policies which favored economic growth over social needs, giving 


some of the other actors legitimation problems. Yet, success at rapid growth . 


prevented these from assuming crisis proportions. The findings run counter to state 
autonomy arguments and suggest closer attention be paid to multiple levels of 


structure and process. 


INTRODUCTION 
Growth Problems and Politics 


A coalition of politically and economically 
active groups form a power bloc that drives 
economic growth. But what groups compose 
and dominate it and what social forces drive it 
remains a matter of debate. This paper 
analyzes the dynamics of a growth coalition 
in industrializing Oita Prefecture on the 
southern island of Kyushu, Japan, from 1958 
to 1980 and uses the data to address 
theoretical issues in the relationship of state 
and business in advanced capitalist societies. 


* The author is indebted to, many whose 
comments improved this paper. They include: 
Charles, Tilly, Jeffrey Paige, Harvey Molotch, 
John Clark, Joji Watanuki, panel members and 
participants in the section on Political Economy of 
Asian Development at the meetings of the 
International Sociological Association in India, 
August 1986, and ASR reviewers Richard Florida, 
Joe Feagin, and Robert Marsh. The remaining 
flaws are my own., The initial data collection 
occurred while on a predoctoral Fulbright fellow- 
ship. I wrote the first drafts of the paper while a 
Fellow of the Michigan Society of Fellows, 
University of Michigan, Ann Arbor. Grants from 
the Graduate School and the Center for Study of 
Conflict and Change, University of Minnesota, 
made further writing and data coding possible. The 
final data collection and writing occurred while on 
a Fulbright fellowship in Japan. 
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Researchers have identified four main 
categories of actors as dominant in growth 
politics: (1) the local business community 
[medium and small capital]; (2) the local - 
government [local state]; (3) the national 
Bovernment [national state]; and (4) the 
national business community [oligopolistic . 
capital]. Typically in the United States, a 
local business-government coalition tries to 
make higher levels of government bear the 
costs of growth, while defining the content 
for themselves (Fleischmann: and  Feagin 
1987, p. 227). In Europe, the state or party 
plays a stronger, more autonomous role 
(Molotch and Vicari 1988). In Oita, national 
and prefectural political parties also played 
important roles (Richardson and Flanagan 
1984, p. 294). 

This paper considers process and structure 
within a growth coalition composed of these 
six groups. A full analysis of power dynamics 
would bring in even more actors, such as 
unions, citizen movements (Broadbent 1986, 
1989a), and mass media (Kabashima and 
Broadbent 1986). But these six compose the 
central “polity,” explication of which suffices 
here. 

Coalition dynamics shed light on theories 
of power in capitalist democracies. All six 
actors have some theoretical cachet. In the 
current debate, the notion of a strong 
capitalist class holds one end of the spectrum 
and that of the strong state holds the other. 


707 


708 


The strong class theory posits an autonomous 
and unified capitalist class that controls the 
state through direct, instrumental pressure 
(Miliband 1969). The strong state theory, on 
the other hand, posits an autonomous state 
which controls classes and the society (Skocpol 
1985). Between these two extremes is the 
notion of a relatively autonomous state which 
services a disorganized capitalist class to keep 
it at maximum productivity despite itself 
(Block 1988). Outside this spectrum, recent 
research argues for the autonomy and influ- 
ence of the political party and local govern- 
ment, especially within a democratic system 
(Przeworski and Sprague 1987; King 1986). 

The six groups use their resources strategi- 
cally for maximum control over each other. 
Their combined pressures structure the auton- 
omy of any single group, providing both 
opportunities and limits. Sometimes their 
pressures contradict each other, as when the 
state or ruling party must respond to both 
public welfare and capital accumulation, 
leading to a loss of legitimacy (Offe 1985). 
The autonomy of an actor, such as state or 
class, depends in part upon its degree of 
integration. Some groups act as a unified, 
integrated entity. Others act through uncoor- 
dinated factions or single organizations, or 
shift between both modes. 

Many Japan scholars agree that a “ruling 
triad” of state, Liberal Democratic Party 
(LDP), and big business dominates Japanese 
politics, but they disagree on which controls 
the triad (Kabashima and Broadbent 1986; 
Fukui 1977). Some argue that the national 
state (the ministries, not the Cabinet and Diet) 
plays a predominant and highly autonomous 
role. It is “plan rational,” they say, giving 
out “fair shares” to all interest groups 
according to its own “higher interests” (see, 
respectively, Johnson 1982; Inoguchi 1983; 
Vogel 1979). Others argue for the autonomy 
of business (Patrick 1977) or the capitalist 
class (Halliday 1975; Miyamoto 1973). The 
three national elites may be so linked by 
ministerial retirees that, breaking down tradi- 
tional Western distinctions, they no longer 
operate as separate and competing actors 
(Taira and Wada 1987). 

The locus of power in capitalist society, if 
it exists in any systematic sense, is subtle, 
complex, and hard to locate. Analytically, 
three main modes of exercising influence 

' have been distinguished: legitimate decisions 
by overt (ie., legislative, etc.) means, 
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behind-the-scenes negotiations between privi- 

leged groups that preempt decisions and set 

limits on what emerges into legitimate public . 
debate (“nondecisions” or agenda-setting), 

and control over public values and opinions 

through image making, mass media, and 

education (“ideological hegemony”) (Lukes 

1976). 

The pressure that an actor can exert is ` 
achieved by using a variety of different modes 
and sanctions in strategic combinations. The 
Six actors exerting pressure on each other 
create a pattern of power. By tracing the flow 
of influence through this pattern, the source 
of bower may be more clearly identified and 
the theoretical questions better addressed. To 
do a6, this paper cuts the political process into 
three-year phases and examines the main 
patterns of power in each phase. 

In Oita, the actors and their pattern shifted 
greatly over time. The shifts occurred mainly 
in response to changing world-market condi- 
tions. In general, big business effectively 
circumscribed and controlled national and 
loca! government growth policy. Other actors 
adjusted to its needs. This supports a version 
of class-structuralist theory, not the autono- 
mous state thesis. But class theory alone 
cannot explain the rapidity and thoroughness 
of that adjustment, compared to other socie- 
ties. 

Several factors prevent simply "reading 
off" the political adjustment process from 
class structure: paternalistic values, corporat- 
istic decision making, relatively low degrees 
of difference in individual wealth, creative 
policy-making by local government, bureau- 
cratic factionalism in the state, and the way 
enacted policies change objective conditions 
and spur the mobilization of new actors. 


Japan and the Oita case 


Japan's economic and political institutions 
superficially resemble those of the Untied 
States and Europe, but because of different 
historical, social, and cultural backgrounds, 
operate quite differently. This makes Japan an 
“outlyer” case useful for testing the general- 
ity of political-economic theories. 

My study of Japanese growth politics in 
Oita Prefecture covers 1958 to 1981, the 
years of most dynamic national growth as 
well as “Oil Shock" recession. The study site, 
Oita Prefecture, fairly represents growth 
conflicts in Japan (Samuels 1983; Miyamoto 
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1973; Kita Nihon Shimbun 1984). Oita 
Prefecture is in Kyushu, the most southerly of 
Japan’s four big islands, at the end of the 
Inland Sea (Setonaikai). It is far from the 
major urban centers of growth that occurred 
in the early post-World War Two years. In 
the 1950s and 1960s, faced with deep 
poverty, the harshness of farming, and 
unemployment, many youth migrated out to 
the large cities in search of work and 
modernity. To influence them to stay in Oita, 
prefectural officials decided to industrialize 
and provide local jobs. Under national 
government guidance, the New Industrial 
City brought both benefits and costs. It (and 
similar projects in other prefectures) increased 
industrial output, but provided only a modest 
number of jobs and, initially, modest taxes. 
Hence, it did not draw population away from 
Tokyo and Osaka (Kita Nihon’ Shimbunsha 
1984, p. 168). And it created considerable 
turmoil over pollution. Such regional growth 
poles failed to halt the overcrowding, poor 
facilities, and subsequent high land prices of 
the central cities and the corresponding 
depopulation and decline of the countryside 
(Koseisho 1988).! 


METHODS 
A case study cannot produce robust generali- 
zations but it can provide insights and new 
hypotheses. To capture the details of process, 
I conducted fieldwork in Japanese in Oita for 
two and one-half years and engaged in over 
500 open-ended interviews, observed elites 
and movements, and perused archival news- 
paper reports, government documents, and 
national census and other statistical data. 
The unit of analysis for this study is the key 
influence relationship (KIR) occurring be- 
tween two politically active organizations. 
Formally defined, an "influence relationship" 
occurs when A furthers its goal by inducing 
or preventing some action by B. The 
relationship is "key" when it affects a growth 
' outcome. Choice of the KIR unit reflects 
exchange theory reasoning that evidence for 
broader structures may be found in the 
microrelations of influence (Homans 1961). 





! The Tokyo metropolitan area currently has a 
total real estate value higher than B alue of all 
United States’ real estate plus all Fortune 500 
companies (Ohmae 1989)! 
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These exchange relations form social network 
patterns which distinguish community power 
structures (Breiger 1979), thus connecting 
micro- and macrolevels. When KIR occur in 
the text, I note them as such (KIR). Some 
important but private forms of influence 
probably escaped my efforts, but the main 
picture emerges. 

After return from the field, I coded my ' 
qualitative data into a format appropriate for 
statistical analysis. The coding categories 
included organizational type (national govern- 
ment, prefectural business, etc.), relational 
and sanction type, and degree of impact on. | 
growth outcomes for each KIR. In order to 
examine central political processes, this paper 
considers only high-impact KIR, 69 out of the 
total 230 KIR recorded. Among these 69, 
progrowth influence relations exerted by. the 
elites accounted for 51. 


DATA ANALYSIS 


Structure and Process in the 
Growth Coalition A 


Sociologists like to characterize society as» 
having "structures," and several of these can 
be constructed from the Oita data. Comparing 
the relative number of KIR exerted by each 
elite produces a static hierarchical structure. 
In it, a "ruling triad" of three actors 
dominates the Oita growth coalition: national ` 
government (21 KIR), national business (16 
KIR), and prefectural government (8 KIR). | 
National LDP (3), prefectural LDP (1), and 
prefectural business (2) had far less KIR. 

An input-output matrix (Table 1).makes the ' ' 
structural image somewhat more dynamic. 
For the same 51 KIR, the senders of influence 
array along the left-hand column, and the 
receivers, along the top. The strongest 
senders and receivers have been moved 
toward the left and top of the matrix, which 
represents the core of the power structure. In 
it, the national state and big business sit at the 
top of the hierarchy, with the local state as a 
junior partner. In addition, the other actors 
sometimes exerted important influence. 

But these structural images are too static 
and monolithic. Without a process analysis, 
one cannot understand how they really 
operate. By compressing 25 years of conflict 
and cooperation, with many modes and 


.sanctions of influence, into a single image, 


they lose sight of their own origins. 


AMERICAN SOCIOLOGICAL REVIEW 


Table 1. Progrowth Influence Matrix (Unit: high-impact progrowth KIR) 

Pref’). Nat'l Nat'l. Nat'l. Pref’). Pref’. i 

Gov't. Bus. Gov't. LDP. LDP. Bus. Other. Total 
Nat'l. 
Gov’t 19 2 0 0 0 0 0 21 
Nat'l. 
Bus. 10 0 2 1 0 1 2 16 
Nat'l. 
LDP 1 0 2 0 0 0 0 3 
Pref’) 
Gov't. 0 0 1 0 0 1 6 8 
Pref'l 
Bus. 2 0 0 0 0 0 0 2 
Pref'l 
LDP 1 0 0 0 0 0 0 1 
Total 33 2 5 1 0 2 8 51 


Source: Data base compiled by and available from author. 


The dominance pattern among the six 
groups in the Oita growth coalition changed 
repeatedly from 1958 to 1980, as shown in 
Figure 1. For each three-year period, the 
percent of key influence relationships (KIR) 
exerted by each elite indicate their relative 
dominance. Five of them exerted dominant 
influence at one phase or another. In the first 
period, the prefectural government took 
initiative with new ideas. In the second, the 
national LDP fashioned the New Industrial 
City (NIC) Law and designated Oita. In the 
next, big business overpowered both national 
and prefectural plans in defining Oita's 
growth. Then, when the prefectural govern- 
ment tried to accommodate the resultant 
objections, the LDP harshly criticized it. 
Finally, when big business lost interest, the 
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Source: Deta base complied by end evelsble from author. 

Fig. 1. Changing Elite Predominance in the Urban 
Growth Coalition (Unit: high impact pro-growth 
KIR) 





NIC project floundered. Nonetheless, the 
prefectural government continued to back it. 
Explaining the dynamic causes of the 
process requires a more detailed look at the 
complex mosaic of phases and modes of 
influence that affect elite mobilization. 


ELITES AND GROWTH POLITICS 
Prefectural Government 


Progrowth policies are usually thought to be 
the product of business-allied governments, 
but in Oita the Japan Socialist Party (JSP) 
took early initiative. In 1957, Oita's JSP 
Governor Kinoshita (served four terms, 1955— 
1971) and earnest “booster” officials like Mr. 
Taiichi Sato announced the ‘Oita Seaside 
Industrial Area Plan.” Governor Kinoshita 
had high hopes that local government could 
create a well-planned model industrial park 
(KIR). The JSP, based in labor unions, was 
the second largest political party in the 
prefecture. Through growth policies, it hoped 


‘to become first by responding to the public 


demand and providing local jobs. As a former 
union organizer, Mr. Sato believed that 
industrialization would expand the size of the 
Oita working class and strengthen the JSP’s 
electoral base. The Liberal Democratic Party 
(LDP), representing local construction compa- 
nies and other business interests, also took a 
strong supporting role. 

Prefectural officials; especially governors, 
enjov a strong leadership role within the 
community (Steiner 1980, pp. 318-19). 
Culturally, given the Confucian and statist 
heritage of Japan, the governor plays the role 
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of political “father figure” (there have never 
been any women governors or mayors in 
Japan). Institutionally, the governor presents 
more than 90 percent of all bills to the 
legislature (Steiner 1980, p. 447). Hence, it is 
the governor’s political party, not that of the 
majority of legislators, which determines the 
political “color” of the administration. 

Japan has a capitalist economy, however, 
and enterprises have considerable freedom to 
build what and locate where they wish. 
Despite his authority, Governor Kinoshita 
could not convince local manufacturers and 
banks to risk building production plants in 
Oita. This forced him to go against the critics 
of “monopoly capital” in his own party and 
seek investment from big business in Tokyo 
and Osaka (KIR). Here too he was initially 
rebuffed. A move to the hinterland would 
require costly shipping to major markets. 

Undaunted, the Governor finally appealed 
to influential members of the Tokyo Ojitans 
Club (Tokyo Oita Kenjinkai) (KIR). These 
included Mr. Hoshino, president of Mitsui 
Trading Co., and other top businessmen, Diet 
politicians, a top MITI bureaucrat and even 
the famous right-wing “fixer” (kuromaku) 
Miura Baien. Mobilized by their common 
Oita roots, they joined forces and induced the 
Fuji Steel Co. to create and finance the 
Kyushu Oil Company (KIR), which built its 
refinery near Oita City in 1961. This acted as 
a magnet to attract other industry. 

Land is expensive in Japan, so new 
industrial sites are often created by landfilling 
shallow coastal waters. At first, Oita officials 
planned a small landfill area, about 100,000 
to 200,000 square meters (51.6 acres). But to 
accommodate these new plants, the project 
quickly ballooned to what is arguably the 
world’s largest landfill project, covering 
2,754 acres. As a public works project with 
related infrastructure (sewers, roads, étc.), 
this required large investment by the prefec- 
ture. 

Oita Prefecture sought national grants to 
cover the landfill and infrastructure construc- 
tion and make up for tax breaks for the new 
industry. Literally translated, the Japanese 
word for local government (chiho jichitai) 
means “local autonomous bodies.” But far 
less financially autonomous than their US 
counterparts, they receive 70 percent of their 
discretionary funds as subsidies, grants- 
in-aid, and tax rebates from the national 
government. This leaves them only “30 
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percent local autonomy” (san wari jichi) 
(Steiner 1980, p. 447). 

Besieged by similar demands for aid from 
many rural prefectures, the national govern- 
ment passed the New Industrial Cities (NIC) 
Law in 1961 (KIR) with aid provisions, and, 
with pressure from the Tokyo Oitans, desig- 
nated Oita as an NIC shortly thereafter. This 
seemed like a blessing at first. But the nature 
of the “aid” largely consisted of permission 
to sell local bonds (chihosai) and some 
repayment for revenues lost through tax 
breaks. 

By the mid-1960’s, Tokyo big business 
took a new interest in Oita, not because of the 
NIC infrastructure, but because Oita’s harbor 
and bay suited the docking needs of large 
refineries which had outgrown big city ports. 
In 1962, the New Japan Steel corporation 
agreed to build a major steel refinery in Oita 
(KIR), and Showa Denko Co. followed suit in 
1964 with a petrochemical refinery (KIR). 

In response, between 1964 and 1974, Oita 
Prefecture invested about $330 million (725 
oku yen) of bond money into the construction 
of landfill, roads, and other industrial infra- 
structure. By 1974, about $266 million of this 
had been paid back: $226 million when 
companies purchased the landfill areas and 
$40 million from the new tax sources. This 
left a debt of $63 million for the prefecture 
and of $21 million for Oita City, mostly for 
roads and other nonlandfill infrastructure 
(Yamada 1979, p. 65). 

As early as 1967, the Economic Planning 
Agency criticized the NICs for pollution, 
strain on local finances, and lack of fit with 
changing economic needs. Yet the prefecture 
continued as a strorig booster. An official of 
Environmental Agency familiar with the Oita 
case said that, by the late 1970s, the main 
impetus driving the prefecture’s NIC policies. 
was the need to recoup its large investment 
debt for landfill and other industrial infrastruc- 
ture, 
Prefectural officials, although disappointed 
at the small amount of employment, take 
great pride that Oita, the “best student” of 
growth, obtained these refineries. They em- 
ployed the latest in automated refinery 
technology. By 1989, with older plants 
shutting down, the steel refinery ran at the 
high rate of 85 per cent of capacity and 
produced 13 percent of the nation’s domestic 
steel. This helped maintain Oita's prosperity 
while other towns fell upon hard times. 
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In 1970, with a booming national econ- 
omy, the prefectural government.still hoped 
to bring in labor-intensive industry. Governor 
Taki (1971-1979), elected by both the JSP 
and the LDP, announced a second phase plan 
- for the New Industrial City extending five 


' kilometers eastward down the coast from the 


first phase of the NIC (KIR). Starting in 
1973, the government built new landfill sites 
nos. 6 and 7 (160 acres), with a harbor site, 
and announced plans for a no. 8. Currently, 
no. 7 and the port landfill are finished, but 
no. 6 has been only half completed and no. 8 
not yet started. 
' The plans for no. 8 raised ` very intense 
- opposition from the villagers living on its 
coast. In a dramatic scenario in 1973, 
'' Governor Taki compromised with their move- 
ment by delaying plans for no. 8 (KIR). In 
response, the prefectural LDP issued harsh 


- criticisms of his “weakness” in the local 


newspaper (KIR). Needing LDP support for 
his 1975 election, Governor Taki quickly 
backed away from his compromise and tried 
to resume landfill plans as quickly as 
possible. - 

.  Mitsui, Mitsubishi, and other big busi- 
. nesses had promised to buy landfills nos. 6 
. and. 7.to set up ship-rebuilding yards and 
-other enterprises (KIR). But the Oil Shock 
and shifts in world market demand changed 
their minds (KIR). By 1980, prefectural 
‘officials doubted that any significant business 
would come there and predicted that Oita 
Prefecture would never recover the full 
amount of its investment. As of 1988, little of 
economic significance had been built on the 
huge sites. 

Despite these dim prospects, in 1981 the 
prefectural government formally proclaimed 
that citizen objections had been resolved and 
that it intended to continue building the 
second phase of the NIC (KTR), including no. 
8 if necessary. It tried to prop up the flagging 
spirits of the local business community by 
urging it to work harder (to adjust to the needs 
of the big businesses) and contribute to 
national growth needs (OGN Dec. 5, 1967). 
By that time, however, the public held great 
: distrust of “technocrats” and their growth 
plans (according to a survey by the local 
NHK [Japan Broadcasting Corporation]) 
(Broadbent 1986). 

Since then, the new Governor Hiramatsu 
'(1979—present) has recognized that heavy 
industrial-style growth is no longer feasible 
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and has vigorously sought new styles of 
growth. His programs, the *one village-one 
product" (isson ippin) movement and a 
silicon-valley style high-tech “Technopolis” 
have made him famous throughout Japan 
(Hiramatsu 1983; Broadbent 1989b) and, 
although having minimal effect on the 
prefectural economy as of yet, may indicate 
the direction of growth in the future. 


Local Business . 

Local business elites, unlike their central 
influence for growth in, the United States 
(Molotch 1976), exercised key influence at 
ony two points in Oita's growth process: in 
the early 1960s when they finally did become 
boosters, and in 1967 when disillusionment 
set in and some factions demanded a more 
locally beneficial type of growth. 

With Oita Prefecture's designation as a 
New Industrial City, Governor Kinoshita 
predicted a "rosy future." By 1970, he said, 
the area would become a great city of 


.700,000 (from a 1961 population of 461,732— 


a 51 percent increase), industrial output 
would increase by a factor of 10, and the 
average income would triple (Mainichi News- 


paper October 22, 1961). 


Hearing this, local business associations in 
manufacturing, commercial, real estate, and 
construction sectors became enthusiastic boost- 
ers. Small manufacturers hoped for subcon- 
tracts from incoming big business. Merchants 
engaged in a feverish building boom to 
prepare for the prospective new customers 
(Nishi Nihon Newspaper November 3, 1961). 
Investor speculation drove up land prices. 

At first, local construction. companies 
entered as subcontractcrs to Mitsui Construc- 
tion Co. and other Tokyo firms. As the 
project proceeded, they demanded continued 
growth (KIR) and more direct contracts from 
the prefectural government. Typically, accord- 
ing to one local construction firm, they held 
secret consultations (dango) to decide the 
amount of their bids and- which firm would 
get the government contract. But because of 
sharper oversight, they could not use dango 
methods wben subcontracting for big Tokyo 
construction firms, and had to cut their 
margins much closer. 

Speculators impeded the smooth progress 
of growth-related land use plans. First, real 
estate companies bought land in the path of 


 Browth or on its edges, such as a planned 
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coastal highway area. Then, local legislators 
pressured the government to buy the land at 
inflated prices. The same politicians de- 
manded that neighborhoods on the edge of the 
plan got the same new roads, sewers, and 
other facilities too. For these services, 
politicians get repaid in votes and campaign 
contributions. Ultimately, the coastal bigh- 
way was abandoned as too expensive. By 
1970, such maneuverings had led Mr. Taiichi 
Sato to reflect bitterly: “In no time at all, my 
hopes have been crushed . . . the secret 
maneuvers (anyaku) of [real estate] brokers 
have caused disorderly growth" (OGN 1970). 

The social and economic effects of the NIC 
were quite mixed. Certainly, bustling Oita 
City with its new city hall, library, and 
expanding network of straight paved streets 
and sewers (kukaku seiri) represents a 
considerable material improvement over the 
early postwar poverty. Given tax breaks, 
significant tax benefits from the project 
started to come in only years after it started. By 
the 1980s, however, these revenues made 
Oita self-financing and independent of govern- 
ment largess in some ways. 

Between 1961 and 1970, the prefectural 
industrial product increased, not the predicted 
tenfold, but still a hefty threefold (from about 
182 million dollars to about 530 million 
dollars). This increase came mostly from the 
vast steel and oil output of the new refineries. 
however, and not a spread of local manufac- 
turing. 

Given the rapid pace of technology, the 
automated NIC factories required only about 
7,000 workers, not the expected 30,000. 
Most of these were brought in from older 
plants, not hired locally. As a result, even 
though the prefectural work force structure 
changed, losing farmers rapidly and gaining 
urban service workers, it only increased by 
12,000 (1960 = 568,622; 1980 = 580,588). 
The service jobs, feeding, clothing, and 
entertaining the NIC workers, provided a 
welcome alternative to marginal farming, but 
not the hoped-for skilled training. 

Unlike auto or shipbuilding plants, the 
refineries did not require many local subcon- 
tractors to manufacture parts, nor local 
distributors of their product. Hence, the NIC 
produced little stimulating "ripple effect" 
(hakyu koka) on the prefectural economy. 

By 1970, the population of the Oita NIC 
area (centering on Oita City) had increased 
only 10 percent, not the predicted 51 percent 
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(to 509,009) (Nomura 1979, p. 43). Among 
the 47 prefectures, Oita's average income 
continued to rank about 41, as poor as ever. 

This relatively poor economic performance 
split the local business community into 
factions which profited and those which did 
not. In 1978, the Oita Chamber of Commerce 
held a public meeting to show support for the 
second phase of the NIC project (KIR). The 
meeting roster lists 72 support groups, mostly 
Oita City merchant associations in retail and 
wholesale trade, as well as construction and 
food manufacture that had prospered with the 
swelling of Oita City’s population. 

But for small manufacturers, the gover- 
nor’s rosy vision of growth turned out to be a 
mirage. Summing up their disappointment, 
one official of the Oita Chamber of Small and 
medium Size Firms confessed in 1978, “for 
small and medium businesses, the [industrial- 
ization project] has not been a big plus.” As 
early as 1967, some business people de- 
manded that the prefecture restrict second- 
phase industry to that which would help the 
local economy, for example, factories or 
shipyards (KIR). They opposed Mitsubishi’s 
plans. for an iron ore storage area, or the 
expansion of the Showa Denko petrochemical 
plant. These problems caused Mr. Sato to 
completely lose his earlier idealistic faith in 
growth—rather than heavy industrialization. 
He urged the revitalization of agriculture as a 
way of making local jobs (OGN Sept. 20, 
1967). 

Their objections had little effect, however. 
Their political ineffectiveness’ reflects the 
dependence of small and medium business 
upon larger business organizations. As in any 
capitalist economy, the percentage of compa- 
nies and jobs controlled by Tokyo firms 
increased over time. A local “ward heeler” 
and small businessman told me that since the 
advent of the NIC, the Chamber of Com- 
merce had come under the control of the 
branches of big Tokyo business and small 
business had to meekly follow their lead. 


National Government 


The strong state thesis implies that Japanese 
national government would exercise a domi- 
nant influence over local growth (Reed 1986). 
Certainly, MITI held the legal power to do 
so, especially in the earlier stages of growth, 
when foreign exchange was scarce (Holler- 
man 1979, p. 719). Government national 
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economic plans helped recovery and growth 
(Matsubara 1977). Despite this; when op- 
posed by the unified will of big business, the 
economic ministries exhibited little autonomy. 

The 1962 New Industrial Cities Law came 
about in response to a vociferous clamor for 
growth from rural, underdeveloped prefec- 
tures, including Oita. The surge of new, 
post-World War II growth had concentrated 
along the “Pacific Coast Belt,” stretching 
from Tokyo along the Inland Sea to northern 
Kyushu, all near the big urban markets. 
* Growth-hungry rural prefectures pressured 
their Diet representatives for government aid 
for decentralized industrialization. To keep 
the loyalty of the crucial rural vote, the LDP 

had to respond. 
` This rural demand for growth, however, 
contradicted big business pressure for central- 
ized industrialization (Uchino 1983, p. 65). 
Business pressure weighs heavily in the LDP 
because it largely funds the immense cam- 
paign expenses (averaging $2 million per 
LDP Diet candidate) (Asahi Newspaper, 
survey of Diet Representative finance sources, 
April 10, 1989. See also Richardson and 
Flanagan 1984, p. 186). In 1960 just before 
the New Industrial Cities law, the LDP’s 
Ikeda Cabinet had already issued its famous 
“Income Doubling Plan” (Baizoron). This 
responded to business pressure and supported 
. population flow into and job creation within 
the big cities like Tokyo. 

These contradictory demands forced the 
LDP into a classic "double-bind" —the need 
to support contradictory ends simultaneously — 
that is, both decentralized and centralized 
industrialization. The scenario unfolded in a 
" number of phases. First, the LDP requested 

ministries to submit plans for the decentrali- 
- zation of industry and population (KIR). In 
. 1961, the National Land Agency (NLA) 
“presented its first National Comprehensive 
Development Plan (Zenkoku Sogo Kaihatsu 
Keikaku), which provided a vision of the 
ideal distribution of industry for best social 
effect. Then various ministries presented 
competing drafts of concrete legislative bills 
designed to implement this plan, each reflect- 
ing their own bias. Illustrating the lack of 
monolithic unity in the Japanese state, the 
ministries split into opposing “subgovern- 
ments” (Campbell 1985). 
Ultimately, dissatisfied with all of these 


` drafts, the LDP designed and passed its own: 


bill, announced in 1962 as the New Industrial 
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Cities Law (KIR). This law left the amounts 
cf aid and number of sites undetermined. . 

The ministries then competed to define 
amounts of aid and numbers of sites. The 
Ministry of Local Autonomy pushed for 
medium-size subsides to many sites, MITI 
wanted large and effective grants to a few 
sites, and the Ministry of Finance wanted to 
give out very little to anyone (OGN, Decem- 
ber 6, 1963). Since the real purpose of the bill 
was to appease the demands of the rural 
areas, the LDP allowed the designated sites to 
mushroom to 15 (KIR) The NIC Law 
predicted great results, so this defused 
demands and preserved LDP legitimacy. But 
as a result, the amount of grant per site 
dwindled and the amount expected from 
prefectural bonds grew, diluting the Law's 
petential for economic stimulus. The New 
Industrial Cities Law gave symbolic appease- 
ment to the rural electorate but had little real 
efrect on industrial location. 

Despite the weakened state of the NIC law, 
big business adamantly opposed it, resisting 
any support at all for the decentralization of 
industry. To negate what they saw as a 
misuse of aid, big business demanded the 
same kind of grants for areas nearer the big 
cities. In September of 1962, business 
representatives on the Regional Industrial 
Development Advisory Council, chosen by 
the Federation of Economic Organizations 
(Keidanren), objected to the hinterland loca- 
tion of the NIC sites and proposed alterna- 
tives. By March of 1963, the LDP Diet had 
passed a rider to the NIC law setting up 6 
Special Industrial Consolidation Areas to 
receive the same benefits. These 6, unlike the 
original 15, all lay within the Pacific Coast 
Belt, close to the markets. This directly 
countered the original purpose of the NIC 
Law. A senior official from the National Land 
Agency commented ruefully, 


The big business leaders wanted to put all the 
real investment into the Pacific Coast Belt area. 
. . . In actuality, most of the growth did take 
place in that area. . . . MITI had guidance 
power, but did not use it. 


In November of 1963, the Advisory Council 
on Regional Industrial Development, attached 


to the Prime Minister's Cabinet and headed ` 


by Mr. Kojima, a prominent businessman and 
leader of the Federation of Business Organi- 
zations, approved all the NIC and Special 
Areas. In this case, the state ministries, MITI 
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included, exhibited little effective: autonomy 
in pursuing their preferred policies. Instead, 
big business instrumentally redefined the 
implementation of policy, and in so doing 
effectively foreclosed public choice— creating 
a kind of "nondecision." One government 
official expressed the tenor of this state- 
business relationship succinctly: 


Industry makes the broad framework of the 
economy, but the bureaucrats adjust industry’s 
desires to that of the whole populace in order to 
avoid severe internal struggle between labor and 
capital, and to keep the country peaceful and 
Beveloping:: 


In contrast to its weakness regarding big 
business and the LDP in top policy matters, 
the national government has considerable 
formal and informal regulatory power over 
prefectural government. It regulates such 
issues as, harbor and landfill construction, 
certain types of industrial location, and the 
issuance of construction bonds. It requires 
prefectural government to set up local advi- 
sory councils, consult constantly with offi- 


. cials from the appropriate national ministries, 


and submit growth plans repeatedly. Central 
ministries regularly station young officials in 
analogous prefectural -departments, where 
they play a liaison. role. Governors them- 
selves, such as Governor Hiramatsu, are often 
retired central ministry officials. In addition, 
a wide range of informal and semiformal 
meetings and consultations occurs constantly 
between all levels of officials, politicians, and 
business people. 

After NIC designation, the prefectural 


government was swept into the orbit of 


national ministries and lost the autonomy to 
define its own growth policy. This occurred, 
not because of the formal provisions of the 
NIC law, but through informal "guidance" 
from the Ministry of International Trade and 
Industry (MITT). MITI's representation of big 
business took precedence in industrial loca- 
tion policy over more decentralizing propos- 
als by the National Land Agency or the 
Ministry of Local Autonomy. The ministry 
that "rationalizes" the long-range interests of 
business achieves intrastate dominance (Fried- 


. land, Piven, and Alford 1977, p. 458; Jessop 


1982, p. 248). 

Joint discussions in the Industrial Location 
Advisory Council and other related. councils 
affiliated with MITI, between representatives 
of big business, government ministries and 


academic experts, led to the realization that, 
given the crowding of the central cities and 
ports, hinterland location for basic refinery 
industries was now a logistic necessity. In - 
1965, MITI told Oita Prefecture to give up its 
plans for a machinery factory and settle for 
more oil production (KIR) (OGN March 24). 
Again, in 1967, MITI brought.a group of 
industrialists to survey the proposed expan- 
sion of the NIC landfill area (KIR). This: 
group included a vice president of Mitsubishi 
Heavy Industries, the vice-chair of the Japan 
Machine Industries Association; and the 
Director of Matsushita Electric's administra- 
tive department (kanzaibu). They also discour- 
aged hopes for manufacturing industry and 
pushed for oil refineries and storage facilities 
(bichiku kichi) (OGN, Jan. 18. 1967). . 

- This guidance was beneficial for rapid 
capital accumulation at the national level, 
which the MITI big business complex defined 
as the quickest path to national’ welfare. But 
on the prefectural level, by replacing a , 
labor-intensive manufacturing industry with a 
highly automated one, it eliminated a strong’ . 
ripple effect.. At that time, manufacturing . 
industry was expanding elsewhere in Japan, 
so with stronger "guidance," MITI might ' 
have been able to send it to NIC sites. 
Eliminating it from the Oita plans (as well as 
the other NIC sites) hence reflected not only a 
rational accommodation to market and loca- 
tional factors, but also big business predomi- 
nance in national growth policy decision 
making. 

The prefectural government did not meekly . 
acquiesce to this redefinition of its hopes. 


Several times it tried to avoid or subvert ' 


national “guidance.” One NLA official com- . 
plained to me about prefectural compliance: 


They only give us what they want us to know, so ` 
the NLA is frustrated. . . . There are a lot of 
contradictions in the process. that they don't 
want to reveal to me. 


Yet, in the end, at the cost of many its 
original hopes, Oita had little choice but to 
adjust to the vision of MITI and big business 
as a convenient port for the expansion of 
basic materials refineries. 


Big Business 


Class theory predicts strong big business. 
influence over state policy, but this can occur 
via different modes, such as instrumental and 
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structural. The preceding section showed 
business’ direct, instrumental influence over 
the creation of the NIC law. If consistent at 
the local level, Oita Prefecture’s growth 
policies should be equally shaped by big 
business interests. 

Instrumental influence is more effective 
when a class is highly concentrated and 
organized. Japan’s oligopolistic sector is 
much more so than that of the United States. 
Large oligopolies (keiretsu) control more than 
90 percent of Japanese mining and manufac- 
turing (Halliday 1975, p. 275; Caves and 
Uekusa 1976, pp. 499—500). Rather than the 
individual-owner centered notion of class, 
Japanese business more closely approximates 
an interlocking “corporate capitalism" con- 
trolled by managers. Banks and other compa- 
nies in the same conglomerate (keiretsu) own 
about 70 percent of stocks, while the rest is 
finely divided among many tiny individual 
owners. 

Businesses are closely organized in indus- 
trial groups (gyokai) that in turn cluster under 
class representative organizations such as the 
Federation of Economic Organizations (Kei- 
danren), the "Foreign Ministry of Japanese 
Business" (Vogel 1979, p. 113). Keidanren 
coordinates opinions, collects political contri- 
butions, and appoints business representatives 
to government advisory councils. Big busi- 
nesses also accept many retired ministerial 
officials into their top positions. This concen- 
tration, sharp organizational focus and inter- 
lock with government helps big business as a 
whole exert effective political influence. 

Following their own logic of capital 
accumulation on a national firm scale, big 
Tokyo-based businesses urged the prefecture 
to support NIC industrialization. Local growth 
provided profitable contracts for Tokyo-based 
companies like the Kashima Construction Co. 
and Mitsui Real Estate Construction Co. 
(Mitsui Fudosan Kensetsu Kaisha). These 
exceed the largest of local construction 
companies by 40 to 1 in product value 
(Teikoku 1987). Their vast size and superior 
technical skill, as well as intense political 
pressure, brought them the general contractor 
role. They built most of the landfill and other 
industrial infrastructure, giving some subcon- 
tracts for roads and other smaller projects to 
Oita construction companies. 

After the construction of the refineries on 
the first phase, the second phase was 
supposed to receive shipyards or other 
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labor-intensive industry. However, the col- 
lapse of the world currency system, the Oil 
Shock of 1973, and shipyard competition 
from Korea and Taiwan made big business 
lose interest in Oita. Both the Mitsui Corp. 
and Mitsubishi Shipbuilding Company tried 
to withdraw from their agreements to pay for 
and build on landfill areas no. 6 and 7. The 
Mitsubishi group companies, using an escape 
clause written into their contract, cancelled 
buying the eastern half of no. 6 landfill and it 
was never completed (KIR). The Prefecture 
sold the completed western half to Kyushu 
Oil Co., Kyushu Electrical Power Co., and 
Snowa Denko petrochemical complex, for 
expansion of their facilities (OGN Jan. 27, 
1981). | 

Sometimes, a paternalistic sense of respon- 
sibility committed big business to an unprof- 
itable investment. The home-prefectural ties 
between Mr. Hoshino of Mitsui Bussan and 
Gov. Kinoshita, having given birth to Kyushu 
Oil Co., cemented a long-term relationship 
between the Mitsui conglomerate and Oita 
Prefecture. Mitsui companies did most of the 
later construction of its New Industrial City. 
According to a Mitsui official, one Oita vice- 
governor in his eagerness to further Oita’s 
growth, appealing to their paternalism, regu- 
larly urged Mitsui leaders to make Oita City a 
“Mitsui town" (kigye joka machi). 

Consequently, the Mitsui companies proved 
more willing to commit themselves to a tight 
contract for No. 6 landfill with no escape 
clause. After the recession of 1973, Mitsui 
Shipbuilding Co. could not renege like 
Mitsubishi, although it wanted to (KIR). 
Now, regretting its earlier largess, Mitsui 
Zosen is searching for a use for the land. One 
plan is to build a golf course. But this is 
hardly a profitable use for land which in 1980 
cost $480,000 per acre. (In Japan, that was a 
bargain, however. Land at the Mitsui Ship- 
yards in Chiba near Tokyo costs six to eight 
times that.) Given the recent revival of orders 
for Japanese ships, a new plan is to move the 
Chiba Shipyards there. This would be exactly 
the kind of labor-intensive industry Oita had 
hoped for in the beginning. 

Despite their success at manipulating the 
prefectural government, big businesses man- 
agers complained about its uncooperative- 
ness. They felt reluctant to locate in Oita, 
they said, because it compromised with 
opposition citizen movements. One top man- 
ager complained, “The prefectural govern- 


ment's basic interest is fesident’s welfare. 


They . must listen to the residents. If [their 
objections] . 


depended heavily upon prefectural govern- 
ment support for ‘projecting an acceptable 
public image and quieting disabling opposi- 
tion. In one case, the prefectural government 
rejected a Showa Denko Co. proposal to build 
an aluminum refinery on no. 8 landfill (KIR). 
An official of that company commented, 


' The prefecture refused the company because of 
the citizens’ movement (objections). 
Governor Kinoshita néeded full agreement from 
the residents. 


They may be overstating prefectural reluc- 
tance, but these rueful comments are hardly 
what one would expect from a self-confident 
"ruling class.” , 


DISCUSSION 
Modes of Influence 


Behind the formal decision processes of the 
state are layers of negotiations and maneuver- 
ing among actors, giving full room for the use 
of ideological, nondecisional, and legitimate 
influence. Are these mixed in a chaotic 
jumble, or is there a certain order and strategy 
to their use? ` 

Ideological manipulation had little effect on 
elites, who are by profession highly sensitive 
to the implications of policies. It did occur 
between the national LDP' and the rural 
governments over the NIC law, at a time 
when considerable ignorance about such 
matters prevailed. But if nonelite groups had 
been considered, this would have played a 
stronger role. 

Nondecisions, which set the limits of 
public debate, occurred between major elites 
in corporatistic advisory councils, LDP re- 


_ search committees; and informal negotiations 


(nemawashi). Contrary to the notion that 


advisory councils simply “rubber stamp". 


bureaucratic draft bills Johnson 1982, p. 47), 
those with big business representatives on it 
prove a convenient forum for collaborative 
national planning. MITI's original advocacy 
of a few intensive NIC sites, for instance, was 
changed. through this process. This set the 


.framework for the negotiations between 


individual firms and local governments. 


~ get strong, the legislature will, 
Say ‘no’ to the company.” Such refusal meant 
.more than legal sanction. Big business 


n 
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That framework, once set, helps define 
what is “legitimate” influence. The power of 
big business to invest or not in an area, its 
“refusal power,” takes shape against that 
framework. Given that framework, in order to 
achieve any growth at all, the prefectural 
government had to provide favors (yugusochi) 
such as tax breaks; industrial parks, and other 
infrastructure. This made the prefecture 
subservient to big business demands and also 
put it into heavy debt. Landfill purchase by 
companies and tax repayments by the national’ 
government lightened the debt, the cost of 
roads, port, and other infrastructure remained 
a prefectural burden. This pushed the prefec- 
ture to keep trying to get industry to locate on 
the second-phase landfill, even after little " 
realistic possibility remained, in hopes that 
further growth would cover accumulated 
losses. 
` ' But big business did not hold all the reins 
of power. The social ethic of interpersonal, . 
obligations led the-Mitsui Shipbuilding Co. to 
sign a rigid contract for no. 7 landfill with no 
escape clause. Also, prefectural government 
Sometimes gave priority to citizen welfare. 
over business interests. Businesses depends. 
heavily upon prefectural government support: 
in projecting a good public image and. 
quieting opposition, so this effectively ` pre- 
vented entry. In addition, interministerial 
fights over policy jurisdiction at times in- 
jected new possibilities into preter! growth 
prospects. ; 


Patterns of Power 


As these events show, public policy has 
difficulty controlling the centralizing tenden- 


' cies, of capitalism. Yet this principle works 


out in different ways through the political- 
economic institutions of a society, with 
important implications for policy outcomes. 
The Japanese growth coalition differs sharply 
from the U.S. pattern of local business 
leadership and local government representa-. 
tion of it. In Japan, this business-centered ' 
relationship is transferred to the national 
level. 2 
Over the long run, within the six elite 
power structures, big business had the 
greatest autonomy in pursuing ' its - growth 
interests. It overrode other interests and . 
policies, shaping national growth policy so 
that capital accumulation occurred in a highly 


, centralized fashion. The case of the NIC . 
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Special Areas illustrates how big businesses 
acted in concerted fashion to ensure a political 
framework favorable to that end. Individual 
businesses freely made use of the benefits that 
framework provided. 

This political structure produced highly 
efficient capital accumulation at the national 
level, but not for local-level firms. Fortu- 
nately for the national elite, however, the 
resultant rapid rate of economic growth and 
diffusion of its benefits generalized public 
approval of big business and LDP growth 
policies. This softened the local delegitimiz- 


ing effect of less than expected NIC economic ° 


performance. 

Surprisingly, the national state, while 
relentless in its bureaucratic procedures, had 
little effect on the content of growth policy. 
Even MITI, with formidable legal powers and 
its own policy preferences for a small number 
of NIC sites, gave in and represented LDP 
and big business interests rather than control- 
ling them. Other ministries with more welfare- 
oriented ideas had even less control. 

The national LDP defended its dominance 
by seeking both business contributions and 
voter allegiance. At times, this put it into 
acute contradictions, as around the NIC law. 
To resolve these, the national LDP give 
rhetorical concessions (the NIC program) to 
the rural. constituencies and actual conces- 
` sions (the Special Areas) to big business. A 

structural contradiction between two opposing 
constituencies pushes elected actors like 
- parties and local governments into “symbolic 
politics" (Edelman 1977), or what the 
Japanese often note as a gap between stated 
(tatemae) and real intentions (honne) (Lebra 
1976, p. 136). 
In sum, within this ruling triad, the national 
state, even MITI, one of the "strongest" state 
ministries, achieves even a relative autonomy 
only when it flows with the predominant 
business opinion. When with that flow, it can 
guide a few recalcitrant businesses back into 
the main current. Together, big business and 
the LDP hem the state in tightly. Accord- 
ingly, the idea of a state-led "plan rational" 
image of the Japanese political economy 
seriously underestimates the self-organizing 
capacity and political potency of big business. 
While good for national capital accumula- 
tion and growth, this structure discouraged 
social infrastructure investment and weakened 
local political institutions. The resultant 
growth policies imposed severe regional 


themselves. The 
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disparities in population concentration and 
social services (kamitsu kaso mondai), abet- 
ted local government inability to control land 
acquisition and reduce real estate speculation 
through zoning and other regulations, and 
weakened local entrepreneurial business un- 
der the oligopolistic dominance cf big Tokyo 
businesses. 

As a “buffer,” elected elite, prefectural 
government tries paternalistically to define 
growth according to local needs. But other 
elites often stymie its attempts. This generates 
a contradiction between its administrative 
(keiei) and control (shihai) functions. On the 
ore hand, it wants to build and administer 
public works (like industrial parks) for local 
public benefit, on a fully consultative and 
representative basis. On the other, outside 
pressure bends growth policy to other ends 
(Funahashi, Hasegawa, Hatanaka, and Kajita 
1988, p. 83). This damages the legitimacy of 
prefectural government (Offe 1985). 

Local businesses played an opportunistic 


: role, highly dependent upon public works 


contracts defined elsewhere, but often making 
their local implementation more difficult 
through speculation. But even there, high 
degrees of class organization gave it surpris- 
ing power in relation to prefectural govern- 
ment. 


Dynamics of Change 


Why did the pattern of the growth coalition 
change so much over the 25-year span? Why 
did actors mobilize and disperse, enter and 
exit the stage of active politics? They did so 
in response to shifting contexts of opportunity 
and threat (Tilly 1978). These include deep 
institutional transformations, changes in broad 
political and economic conditions, and new 
corditions imposed by the growth policy 
outputs, 

Among the six actors, the entry of big 
business had the strongest effect. It moved 
in response to changes in the world economy. 
The other actors on stage adjusted accord- 
ingly, reconstructing the hierarchy among 
increasing success of 
Japanese goods in Western markets, the 
Nixon “dollar shock” of 1972, the Oil 
Sheck of 1973, and economic competition 
from the East Asian NIC’s (New Industrial 
Countries) all affected business investment 
dec:sions. 

Big business responses affected other 
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actors through their investment decisions and 
supporting government policy. Prefectural 
business also responded to economic opportu- 
nities, but these were more defined by 
government contracts and threats, such as the 
lack of NIC ripple effect. Likewise, the LDP 
responded strongly and effectively when 
government policies threatened the interests 
of its political constituency —such as rural 
growth policies or the governor's compromise 
with residents’ movements. The prefectural 
government continued to advocate growth 
whatever the objective possibilities. To pre- 
serve legitimacy, it had to project a dedication 
to public welfare, even while internalizing 
market discipline. 

The democratizing reforms of the Occupa- 
tion strongly affected the first steps of Oita’s 
growth. Governor Kinoshita was the first 
elected governor, his predecessor having 
originally been a wartime appointee. The 
political ambitions and social conscience of a 
newborn Socialist Party, struggling to realize 
the potential of the new democratic system 
(which allowed for unions and opposition 
parties), shaped the. initial growth policies. 
The shifts in national institutions after that 
time, such as growth of the middle class, 
decline of union strength, loss of state power, 
expansion of business power, all resulted 
directly from the success and rapid pace of 
national growth. This growth diffused wealth 
throughout the country, albeit unevenly, but 
enough to moderate public support for 
opposition parties. In Oita, this moved the 
prefectural government increasingly into the 
LDP orbit and moderated public dissatisfac- 
tion with the specific faults of the NIC 
project. 

The feedback from prefectural growth 
policy also changed the influence pattern. Its 
outcomes changed material conditions, both 
economic and environmental, which activated 
new groups, changing the party and social 
class constituency of prefectural government. 
Flows of issues through the changing 
public provided opportunities for different 
parties to win office and impress their 
agendas on policy (Broadbent 1988). 
Ironically, JSP government growth policies 
spurred local business into boosterism. Then, 
its infrastructure debts pushed it to keep 
chasing the mirage of heavy industrial growth 
even as it receded. Governor Taki’s compro- 
mise with protesters raised LDP protest, 


leading the governor into. another contradic- 
tion. se 


CONCLUSION 


In conclusion, an analysis of Oita's re- 
gional growth politics reveals a multileveled 
and contradictory process of adjustment 
around big business priorities. Both instrumen- 
tal and structural strategies and modes of 
influence supported this. The logic of capital . 


.accumulation as understood by big business 


and conveyed by individuals with experience 
in both worlds, persuaded the government 
and shaped its growth policies. This set the 
legitimate parameters of business investment 
freedom very broadly, but gave the prefecture 
little more than supplicatory influence. Its 
ramifications caused recurrent legitimation 
crises for both LDP. and local government, 
only resolved by a generalized public satisfac- : 
tion with growth and a lack of viable 
alternative political possibilities. In broad 
outline, the data support a view of Japanese 
society as more sensitively rationalized around 
the priorities of oligopolistic capital accumu- 
lation than is the case for other industrialized, 
capitalist democracies. 

The theoretical implication of the finding is 
to weaken the case for the full or relative 
autonomy of the state. Several examples of 
seeming national and local state autonomy in 
growth policy were observed in this study. 
But when considered in context, as part of a 
longer process and within a multileveled 
structure, they shrink in significance com- 
pared to the autonomy and influence of big 
business. This would warrant greater attention 
to contexts of process and structure in future 
studies in this field. 
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MURDER AND CAPITAL PUNISHMENT: 
A MONTHLY TIME-SERIES ANALYSIS OF 
EXECUTION PUBLICITY* l 
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In a recent analysis of the effect of execution publicity on homicides, Stack (1987) 
challenged the consensus of most social scientists that capital punishment does not 
effectively deter murder. He found thar publicized executions have a very 
significant deterrent effect. Stack reports that 16 highly publicized executions may 
have saved as many as 480 innocent lives during 1950-1980. The present 
investigation attempts to shed additional light on the execution publicity/deterrence 
question. Our review of Stack’s investigation shows that it suffers from a number of 
conceptual and methodological limitations. Correcting for these difficulties, we . 
find no evidence that execution publicity influenced the rate of homicide during the 
1950-1980 or 1940-1986 period. Some evidence suggests that higher levels of 
execution are associated with lower murder rates. However, the apparent deterrent 
effect is very slight and short term. Indeed, the cumulative effect of capital 
punishment on homicides during the execution and subsequent months is 


essentially zero. 


INTRODUCTION 


` The United States is the only Western nation 
retaining capital punishment for civil homi- 
cide (Zimring and Hawkins 1986). The death 
penalty is authorized in 37 of the 50 states. 
Moreover, this country's commitment to 
capital punishment is not nominal; the federal 
(and most state) appellate courts have ruled 
that capital punishment is constitutionally 
acceptable as long as it is practiced properly 
(Gregg v. Georgia 1976). And practiced it is. 
Since the mid-1970s, persons have been 
sentenced to death in all but three retentionist 
jurisdictions resulting in over two thousand 
persons on death row (NAACP 1988). 
Further, the annual number of executions has 
‘increased steadily throughout the 1980s. 
Since the Gregg (1976) decision, 106 persons 
have been executed. 
A variety of arguments have been posed for 
this country’s commitment to capital punish- 
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ment, including: general deterrence, revenge, - 
retribution, normative/moral validation, per- 
manent incapacitation, scapegoating, racism 
and classism, and various other "demand for 
punishment" arguments. Most of these argu- 
ments fall outside the realm of empirical 
inquiry. For example, the basis of the 
retribution claim is that convicted murderers 
deserve to die because of their antisocial 
behavior. This argument rests upon a matter 
of belief and cannot be demonstrated or 
refuted empirically. In contrast, some death 
penalty issues are amenable to empirical 
inquiry. Deterrence arguments are testable, 
and much evidence has been assembled 
pertaining to the claim that capital punish- 
ment discourages would-be-killers. The fol- 
lowing analysis examines an important deter- 
rence and death penalty issue: the impact of | 
execution publicity on homicides in the 
United States, 1940—1986. 


The Status of the Deterrence/Death 
Penalty Question 


Until the mid-1970s most social scientists 
held the view that capital punishment does not 
provide an effective deterrent to murder. This 
consensus was based upon analyses dating 
back to the early writings of such figures as 
Beccaria (1764), Bentham (1830), and Tarde 
(1912), and more recently, research findings 
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from comparative dialys ‘of homicides. in 
retentionist and abolitionist jurisdictions (Sellin 
1955, 1961, 1967; Schuessler 1952; Baldus 
and Cole 1975). In 1975, the publication of 
an important paper by Isaac Ehrlich shattered 
this consensus. He applied econometric mod- 
eling in examining national homicide and 
execution rates from the mid-1930s through 
the late 1960s. Controlling for a variety of 
sociodemographic and law enforcement fac- 
tors, he found a highly significant inverse 
relationship between the certainty of execu- 
tion and annual homicide rate. Over the 
30-year period, Ehrlich reported that. each 
'execution was associated on average with 
seven to eight fewer killings. 

Ehrlich's novel findings were greeted with 
considerable skepticism. Scholars who at- 
tempted to replicate his results usually did not 
support Ehrlich's findings (Bowers and Pierce 
:1975; Passell and Taylor 1975; cf. Yunker 
1976; Layson 1985). Rather, they showed 
that his work suffered from a variety of 
theoretical and methodological limitations, 
which, when corrected, yielded the more 
familiar pattern of a chance association 
between homicide rates and the provision for, 
and level of use of, capital punishment (see 
Bowers and Pierce 1975; Passell and Taylor 
1975; Nagin 1978; Friedman 1979; Brier.and 
Feinberg 1980 for discussions of Ehrlich's 
methodology and findings). 

A decade of post-Ehrlich time-series and 
cross-sectional research again brought a 
general consensus that capital punishment 
does not effectively deter murder. Based on a 
recent survey of the issue, Zimring and 
Hawkins (1986) conclude that the evidence is 
clear and abundant that deterrence justifica- 
tions for capital punishment are groundless.: 

The views of Zimring and Hawkins and 
most social scientists notwithstanding, once 
again, new evidence purports to: demonstrate 

„a substantial’. deterrence trade-off between 
capital punishment and homicides. This most 
recent challenge is provided by Stack’s 
(1987) examination of the relationship be- 
tween execution publicity and homicide. 


Deterrence Analyses of 

Execution Publicity 

Research on the effect of execution publicity 
on homicide dates back to the 1930s. It is 


based upon the argument -that criminal 
sanctions, regardless of their certainty, sever- 


a 23 
ity, and celerity, cannot influence. the percep- 


tions and behavior of potential offenders 


unless they are publicly-communicated (Gibbs 


1975, 1986). Recognizing this fundamental - 


point, Dann (1935) examined the number of 


. apparent capital homicides for the 60-day 


period before and after five highly publicized 
Philadelphia executions in 1927, 1929, 1930, 
1931, and 1932. He found an increase, ‘not a 
decline, in killings following each execution. 
Also contrary’ to deterrence predictions, - 
examining four highly. publicized death sen- 


‘tences in Philadelphia in 1944, 1946 (two), 


and 1947, Savitz (1958) found no change in 
homicides during the eight-week periods 
before and after eüch event. These two 
investigations, and similar contrary results 


‘reported by Graves (1956) for three California 


counties for 1946-1955, and by King (1978) 
for the state of South Carolina for 1950-1963, 
led most criminologists to conclude that even 
highly publicized executions do not influence 


.homicides. ' 


Phillips questioned .this ETS in a’ 
1980, investigation of weekly homicides in 
London, England, before and after 22 highly 
publicized executions between 1858 and 
1921. Phillips found that the number of 
homicides declined” by approximately 35 


“percent , during the two weeks immediately 


following each execution. He accounted for 
this pattern in terms of deterrence, but 
Phillips also reports a “rebound effect” 
whereby homicides returned to their baseline 


"levels during the third through the fifth weeks 


after executions. Essentially this rebound in, 
homicides cancelled the immediate decline in 
killings. Phillips cautioned that his findings 
may not be generalizable to other eras and 


locations. 


Indeed, a partial replication of the Phillips 
study-in this country by’ McFarland :(1983) 
produced no evidence of deterrence. Follow- 
ing a 10-year moratorium on executions, 
Gary Gilmore was put to death in Utah on 
January 17, 1977, followed by John Spinkelink 
(May 25, 1979 in Florida), Jesse Bishop i 
(October 27, 1979 in Nevada), and Steven 
Judy (March 9, 1981 in Indiana). Each 
execution received extensive media attention, . 
but McFarland’s weekly, national homicide 
analysis showed that none were followed by a 
significant decline-in killings that could be 
attributed to deterrence. In brief then, several 


‘investigations of the effect of execution ' 


publicity on hómicides consistently demon- 
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strated a lack of support for the deterrence 
proposition. 

Stack’s (1987) recent investigation chal- 
lenges this conclusion. Controlling for the 
level of unemployment and population aged 
16-34, Stack examined the relationship be- 
tween execution publicity and monthly homi- 
cide rates for the United States for 1950— 
1980. In contrast to previous investigations, 
he treated execution publicity as a variable. 
Executions recorded in Facts on File (a 
comprehensive national index of major news 
Stories) and the New York Times were 
classified as receiving high levels of public- 
ity. Those appearing in the New York Times, 
but not in Facts, were classified as receiving 
moderate media attention. Executions not 
receiving coverage in either source were 
considered as low-publicity cases. Stack 
. reported a significant decline in homicide 
rates for months with highly publicized 
executions, but only a chance association for 
cases receiving moderate or little publicity. 
Over the three decades, he estimates that “16 
[highly] publicized executions may have 
saved as many as 480 lives," for an average 
of 30 persons per execution (Stack 1987, p. 
538). 


Adequacy of the Stack Investigation 


Like McFarland (1983), Stack examines the 
effect of execution publicity on homicides on 
a national scale, but for a much more 
extended period, 1950-1980. Importantly, 
Stack measures variation in execution public- 
ity by differentiating among high-, moderate-, 
and low-publicity executions. To isolate the 
effect of execution publicity on homicides, 
Stack introduced percent unemployment and 
percent population 16—34 years of age (along 
with yearly and monthly dummy variables) as 
control factors. Including these factors is 
important since unemployment (3 to 8 
percent) and the population age 16 to 34 (26 
to 37 percent) varied considerably during the 
1950-1980 period. These innovations make 
Stack's analysis the most comprehensive 
study of its kind to date. Yet, the analysis 
suffers from a number of important problems 
that render its conclusions suspect. 

First, the dependent variable in Stack's 
investigation is the number of monthly 
homicides per 100,000 persons 16 years of 
age and older. Since the numerator of the 
homicide index includes all homicides (regard- 
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less of the age of victims), it would have been 
preferable to compute homicide rates per 
100,000 total population. Using this more 
conventional measure avoids the problem of 
now changes in the age structure of a 
population may artificially influence the 
number of potential homicide victims and 
offenders. If age (16 plus years) is an 
important contributor to homicide, then it can 
be included as a control variable in a model of 
homicides and executions. 

Second, the numerator for Stack’s depen- 
dent variable is the number of monthly 
bomicides as reported by the U.S. Depart- 
ment of Health, Bureau of Vital Statistics. In 
accord with the International Classification of 
Diseases, the Bureau (1967, p. 9) defines 
homicide as "a death resulting from an injury 
purposely inflicted by another person." The 
Bureau considers legal executions as purpose- 
ful deaths, and therefore, includes them in 
their homicide figures. To avoid bias, it 
would have been preferable for Stack to 
subtract the number of executions from the 
vital statistics in computing monthly homicide 
rates. 

Third, although Stack's inclusion of the 
unemployment rate and age as control vari- 
ables in the analysis is important, they do not 
exhaust the structural variables associated 
with homicide rates. Previous investigations 
have found that homicide rates are associated 
with such factors as poverty, income inequal- 
ity, race, the divorce rate, urban population, 
and region. Although significant changes 
occurred in some of these sociodemographic 
factors during 1950-1980, they are uncon- 
trolled variables in the Stack analysis. 

Stack's model also excluded two additional 
factors found to be important in previous 
bomicide investigations: (1) the availability of 
firearms, and (2) the homicide arrest clear- 
ance rate. Kleck (1979, 1984) found a very 
strong link between the level of gun owner- 
ship and homicide rates over the last few 
decades, and some studies have reported a 
strong inverse relationship between the cer- 
tainty of arrest and homicide rates (Ehrlich 
1975; Bailey 1976; Logan 1982). Of note, 
FBI reported arrest clearance rates for homi- 
cide ranged from 72 to 94 percent over the 
1950-1980 period. These theoretically impor- 
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tant variables are uncontrolled and possibly 
confounded in Stack’s analysis.! 

Fifth, Stack failed to control for changes in 
the proportion of the U.S. population legally 
subject to capital punishment for murder, that 
is, the proportion of the U.S. population 
residing in retentionist versus abolitionist 
jurisdictions. Although we would not expect 
execution publicity to significantly deter 
killings in abolitionist states, Stack’s model 
makes this assumption. This would be of only 
minor consequence had the proportion of the 
two types of population been roughly constant 
over 1950-1980. However, measured at each 
year’s end, the proportion of the U.S. 
population residing in jurisdictions without 
capital punishment for murder varied widely 
from approximately 10 to 45 percent. 

Sixth, deterrence theory predicts an inverse 
relationship between homicide rates and the 
levels of execution, but the frequency of 
monthly executions for murder (which ranged - 
from zero to 16 during the period) is an 
uncontrolled variable in Stack’s investigation. 
As a result, his study provides no insight 
about the relative deterrent effect of (1) the 
sheer volume of executions versus (2) those 
executions receiving major media attention. 

Seventh, Stack examined only the relation- 
ship between execution publicity (month /) 
and homicide rates (month f) within the same 
month. He did not consider the possibility of 
(1) a continued deterrent effect over future 
months, or (2) a “rebound effect” for 
homicides during proceeding months (t+ 1, 
t+2, .). Recall that Phillips (1980) 
reported the latter pattern for highly publi- 
cized executions in London. Unfortunately, 
lagged values for the execution publicity 
variables were not included in Stack’s analy- 
sis. 

Finally, over 1950-1980, Stack (1987, 
Appendix A, p. 539) classified 23 executions 
(occurring during 16 different months) as 
receiving high levels of publicity. However, 
using his classification scheme, our examina- 
tion of Facts and the Times reveals 26 


! By including a lagged (t— 1) monthly homi- 
cide rate in the regression model (to control for 
problems of serial correlation), Stack provides a 
type of control for the variables excluded from his 
model. However, as we shall see from the 
multivariate analysis to follow, a lagged homicide 
rate variable does not control properly for a variety 
of factors that influence homicide rates. 
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high-publicity executions over 19 months. 
These include one execution in June 1954, 
March 1957, and July 1958.2 We will show 
later that simply correcting for this oversight 
alters Stack’s findings dramatically. 

In sum, Stack provided an additional step 
towards a better understanding of the effect of 
execution publicity on homicides. However, 
the noted shortcomings may well account for 
his findings being at odds with the bulk of 
previous research. 


METHODS AND PROCEDURES 


In this investigation we extend Stack’s 
analysis in a number of important ways. Stack 
examined the years 1950-1980, a period 
when executions were generally declining. 
We extend the time series to include the 
1940s and the very recent period of 1981- 
1986.3 This more extended period provides 
much greater variation in levels of execution 
and homicide. For example, executions were 
more common for murder during the 1940s, 
leveled off during the 1950s, and declined 
throughout the 1960s, culminating in a 
10-year de facto moratorium between 1967 
and 1977. The number of executions has been 
rising throughout the 1980s. Homicides 
generally fluctuated in the opposite direction. 
During the earlier years homicide rates were 
comparatively low, rising in the 1960s and 
1970s, and falling throughout the' 1980s. Our 
investigation of the 1940 to 1986 period will 
permit examination of whether the shifts in 
homicide are related systematically to the 


Stack overlooked the executions of (1) Edward 
Grammer, hanged in Maryland on June 11, 1954 
for killing his wife, (2) Thomas Abraham, put to 
death in Kansas on July 24, 1958 for the murder of 
four persons, and (3) Burton Abbott, executed in 
California on March 15, 1957 for the kidnap- 
murder of a 14-year-old girl. 

? Our preference would have been to extend the 
time-series to include 1987 and 1988, as well as 
the pre-1940 period. Unfortunately, required me- 
dia (Facts on File) data are available from October 
1940 only. In addition, vital statistics have not yet 
been issued by the U.S. Department of Health for 
1987 and 1988 on the proportion of suicides and 
homicides that are gun-related (two key control 
variables in the multivariate analysis). Accord- 
ingly, the time-series is restricted to October 1940 
through July 1986, the latest mid-year estimate 
available on the proportion of gun-related suicides 
and homicides. 
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' corresponding variation' in executions and 
their associated publicity as deterrence theory 
predicts. (See Appendix A for the distribution 
of homicides and executions over the 1940— 
1986 period.) 

To examine the relationship between exe- 
cution publicity and homicides, we employ 
'two types of analysis. First, in the compara- 
tive analysis tradition (Dann 1935; Savitz 
1958; Phillips 1980), we examine rates of 
monthly homicides for periods leading up to 
and following highly publicized executions 
(n — 61) during the 1940-1986 period. (Exam- 
ining homicide rates rather than the frequency 
of killings controls for changes in the size of 
the population between the comparison 
months). A series of sign and paired t-tests 
are used to test for.a significant downturn in 
homicides during execution months, and the 
months that follow. Second, like Stack, we 
employ a monthly regression analysis of the 
correspondence between executions, execu- 
tion publicity, and homicide rates. Here, we 


consider execution publicity and the number - 


of monthly executions for lag periods ranging 
through +12 months. Together, the two 
methodologies provide a more comprehensive 
picture of the relationship between execution 
publicity and homicide than either alone 
permits. 

In the first stage. of the regression, we 


replicate the Stack analysis using his variables, 


and operationalizations for 1950-1980, but 
correcting for the three “high”-publicity 


executions that he overlooked. We also repeat ` 


this analysis using the conventional homicide 
` rate (the number of total homicides per 
100,000 population) in place of Stack’s 
measure (the number of homicides per 
100,000 population age 16 years and over). 
These analyses allow us'to determine whether 
Stack's supportive findings for the high 
execution publicity variable result from tech- 
. nical coding errors, and/or his selection of an 
unconventional dependent measure. 

Next we conduct parallel analyses for the 
1950-1980 and 1940-1986 periods, using 
models that include’ the unemployment and 
age control variables, along with the follow- 
ing additional structural factors: (1) percent of 
the U.S. population residing in the south, (2) 
the divorce rate per, 1000 population, (3) 
percent black population; (4) percent urban 
population, and (5) percent of families 
receiving Aid to Families with Dependent 
Children (AFDC) Beoeum, Unfortunately, we 


2 
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are upable to ‘eins as control variables 
three. socioeconomic factors found to be 
important in the general homicide literature 
(level of poverty, general income inequality, 
and racial income inequality). Until the 
1960s, income figures required to construct 
the inequality measures are available only for 
the decennial census years, and poverty data 
are available only for the post-1960 period. 

The regression model also includes two 
indicators of the availability of firearms— 
percent of homicides attributable to firearms, 
and percent of suicides resulting from fire- 
arms. There are no reliable nationwide figures 
for the 1940-1986 period on the number of 
firearms in circulation, but.our firearm proxy 
variables have been shown to be associated 
positively and significantly with homicide 
rates (Kleck 1979, 1984). Further, we 
examine two sanction variables omitted from 
tbe Stack investigation but considered in other 
deterrence analyses: the homicide arrest 
clearance'rate, and the number of monthly > 
executions for homicide. Deterrence theory 
predicts a significant inverse relationship 
between homicide rates and the certainty of 
arrest, and levels of execution.4 


Dcta Sources and Codes 


Hemicides. Like Stack, we operationalize the 
dependent variable as the rate of total monthly 
homicides. Unfortunately, police, prosecu- 
tion, judicial, and correctional data do’ not 
permit differentiating capital from noncapital 
homicides over 1940—1986. The inability to 


differentiate capital and noncapital homicides 


* We wished to include a homicide conviction 
variable in the analysis, but this was not possible. 
Anrual figures for the percent of apprehended 
homicide offenders that are convicted of murder 
(anc lesser crimes) are available from.the FBI for 


. 1940-1971, but not for the last 15 years of the 


time-series. According to an FBI spokesperson, the 


. Bureau discontinued issuing conviction data in 


1971:due to the small number of homicide cases 
reaching a judicial outcame during the reporting 
year. This is unfortunate in view of the importance 
of the certainty of conviction for deterrence theory. 
Over the period where annual conviction data are 
available (1940—1970), there is a: substantial 
inverse relationship (r= —.683) between annual 
murder conviction rates and monthly homicide 
rates However, there is also a substantial positive . 
relationship (r — .466) between homicide arrest and 
conviction rates during the period. 
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: has plagued deterrence and: death penalty 
investigations for two centuries. However, 


students have argued that capital punishment . 


might reduce killings in general (not just 
capital killings) through deterrence as well as 
through ‘more subtle processes of education, 


moralization, and normative validation (Ande- . 


naes 1974; Gibbs 1978; van den Haag 1975; 
van den Haag and Conrad 1983). If this 
argument has merit, it should become evident 
by examining gross monthly homicide rates. 
(See Peterson; and Bailey 1988 for a more 
extended discussion of the issue of capital 
punishment and general homicides). 

Monthly homicide figures are drawn from 
annual mortality statistics issued by the 
United, States Department of Health, Bureau 
of Vital Statistics. The homicide figures, with 
executions subtracted, are used to compute. 
monthly rates per 100,000 population, and 
per 100,000 population age 16 and over. Vital 
statistics are also used to compute the percent 
of firearm-related homicides and suicides. 

Execution and punishment measures: We 
use three punishment variables in the analysis: 
number of executions, the capital punishment 
status of the country over the 1940-1986 
period; and the. arrest clearance rate for 
murder. Monthly execution figures for homi- 


cide are taken from Bowers (1984) and Espy . 


(1986). The U.S. Department of Justice's 
annual Capital Punishment series furnishes 
the abolitionist/retentionist status of jurisdic- 
tions over the period. States that provided for 
. capital punishment for.murder as of the last 
. day of each year are regarded as retentionist 
. jurisdictions. Data from the Capital Punish- 
ment series, along with available population 
figures, were used to compute the percent of 
the U.S. population residing in jurisdictions 
` without capital punishment for murder. Arrest 
clearance rate data were derived from Layson 
(1985) for 1940-1977, and from the FBI's 
.. Uniform Crime Reports for 1978-1986. 
Execution publicity. As in -the Stack 
investigation, execution publicity is differen- 
tiated as high, moderate, and low. Highly 
, publicized executions are operationalized as 
* those that appear in Facts on File and in:the 


«New York Times. Executions that are not . 


_ listed in Facts but appear in the Times ‘are 
regarded as receiving moderate publicity; 
those not listed in either Facts or the Times 
are considered as low-publicity executions. 
‘We concur with Stack (1987, p. 533) that in 
combination, Facts and the Times provide the 
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best available. indicator of stories likely. to 
have received national media dissemination. 
Facts is well recognized as the most.compre- 
hensive index available of the daily press over 


i 


the period of this study. The Facts staff - 
regularly surveys dozens of major daily- 


newspapers in all regions of the’ country 
(Orlofsky 1988). The New York Times is 
recognized as the “paper of record” in this 
country: Over the 1940-1986 period, 61 
executions qualify by our definition., as 
receiving high levels of publicity. : 
The execütion publicity. factors are treated 
as (0/1) dummy variables in the regression 


analysis. Following Phillips (1974) and Stack 


(1937), execution stories appearing between 
the first and the 23rd of the month are coded 
as appearing within that month. Eighteen 
execution stories that appeared after the 23rd 
of the.month are coded as.affecting homicides 
the next nionth.5 The assumption is that 
execution stories appearing at the end of a 


month will have their greatest deterrent effect. 


on homicides the next month (t+ 1). 

The analysis of execution publicity is 
limited to the printed press because alterna- 
tive media data are not available for the 
1940-1967 period; the Vanderbilt Television 
News Index and Abstracts of ABC, CBS, and 


NBC evening news broadcasts are available 7 
since 1968 only. In addition, over 1940- ' 
1986, no major network indices or abstracts ` 


are available to gauge radio news attention to 
executions. 

. To assess how changes over 1940-1986 in 
the role of print versus the electronic media 


may have biased: our findings (and Stack's) 


for the print media variables, two additional 


media control factors were considered for the 
multivariate analysis: per capita circulation of - 


daily newspapers which ranged from approx- 


imately .31 in 1940 to .26 in 1986; and the . . 


percent of households with television sets, 
which ranged from less than one percent 
(from.1940—1948) to over 98 percent through- 


out the 1980s (figures were compiled from thé. ' 


annual Statistical Abstracts of the United 
States and Historical Statistics of the United 
States, Colonial times to 1970 (1975). To the 


5 These executions occurred on: 4-29-1943 
(two), 1-26-1956, 7—24—1958, 6-25-1959, 5- 
25-1979, 11-30-1983, 1-26-1984, 2-29-1984, 
3-31-1984, 10-30- 1984, 11-26-1984, 12- 
28-1984, 1-30-1985, 5-29-1985, LU 
6-25-1986, 7-31- 1986. ] 
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extent that newspaper readership declined, 
and television displaced newspapers as a 
major source of news over the period, we 
might predict that newspaper coverage of 
executions would provide a less effective 
deterrent to murder. 

Including these factors in the regression 
models did not alter the pattern of findings for 
any of the death penalty variables, including 
the number of monthly executions, and those 
receiving varying levels of print media 
attention. These findings suggest that the 
results for the capital punishment variables 
are not biased seriously due to the shift in the 
prominence of the print versus electronic 
news media from 1940 to 1986. 

Control variables. Monthly population, 
age,’ and unemployment figures,? and annual 


$ Unfortunately, there are no reliable data on the ' 


relative importance of various sources of news 
over the 1940—1986 period. Roper polls show the 
growing significance of television news, and a 
decline in the relative importance of newspapers 
over the past two and one-half decades. Periodi- 
cally, between 1959 and 1982, respondents - were 
asked: "Where [do] you usually get most of your 
news about what's going on in the world 
today—from the newspapers or television or 
magazines or talking to people or where?" The 
most frequent news sources mentioned (multiple 
Sources could be cited) were newspapers and 
television. The respective percentages of respon- 
dents designating each of these as their major news 
source are: 1959 (57% vs. 51%), 1961 (57% vs. 
52%), 1963 (53% vs. 55%), 1964 (56% vs. 58%), 
1967 (55% vs. 64%), 1968 (49% vs. 59%), 1971 
(48% vs. 60%), 1974 (50% vs. 64%) 1975 (47% 
vs. 65%), 1976 (49% vs. 64%), 1978 (49% vs. 
67%), 1980 (44% vs. 64%), 1982 (44% vs. 65%). 
In general, over the period, newspapers lost 13 
percentage points and television gained 14 points. 
Radio and magazines declined by 16 (34% to 18%) 
and two (8% to 6%) percentage points, respec- 
tively, over this same period. Because comparable 
data are not available for the 1940-1958 and 
post-1982 periods, it is not possible to introduce 
the relative importance of newspapers versus other 
sources of news as a control variable in the 
time-series. 

7 Stack operationalized the age variable as the 
percent of persons age 16-34 years. We use the 
same operationalization in our analysis of the 
1950-1980 period. However, for most of the 
1940s estimates are not available from Bureau of 
Census publications for the 16-34 year old 
category. Therefore, for the 1940-1986 analysis, 
we consider the age category 15-34 years. The 
slight variation in the two age variables is of little 
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figures for the sociodemographic control 
variables were taken from various U.S. 
Department of Commerce sources, including 
the yearly Statistical Abstracts of the United 
States, Current Population Reports, Histori- 
cal Statistics of the United States, Colonial 
Times to 1970 (1975), and The Population of 
the United States (Bogue 1985). For 1940- 
1986, monthly figures are not available for 
arrests and the other sociodemographic con- 
trol factors. In instances where only annual 
figures are available, linear interpolation was 
employed to estimate monthly figures for the 
control variables.? 

Appendix B provides descriptive statistics 
and zero-order correlations for homicide, 
levels of execution, execution publicity, and 
the control variables for the 1940-1986 
period. In the multivariate analysis, we 
consider models that include the number of 
executions and the level of execution public- 
ity for months f through t-12. The correlation 
matrix in Appendix B excludes the lagged 
execution and execution publicity variables 
(data available upon request). 


FINDINGS 


Comparative Results 


If execution publicity deters killings, signifi- 


consequence, since the two are correlated very 
highly (r—.993) for the period where figures are 
available for both age-groups. 

* Monthly figures (as used in the Stack investi- 
gation) are available for percent unemployment for 
the 1950—1986 period, but only annual figures are 
available for most of the 1940s. Accordingly, for 
the more extended time series (Tables 5 and 6 to 
follow), we estimate monthly unemployment rates 
through linear interpolation. We do not view this 
as a serious limitation given the high correlation 
(r=.915) between the interpolated and actual 
monthly unemployment figures for the period 
wheze data are available for the latter. 

? In our view, using interpolated rather than 
actual monthly figures for the control variables 
does not pose a major limitation. We are not 
concerned directly with estimating the effects of 
the arrest and sociodemographic factors on monthly 
homicide rates. Rather, they are included in the 
analysis to contro] for changes in theoretically 
important conditions over the 1940-1986 period, 
and to avoid possible spurious results for factors of 
primary interest—the effect of executions and 
execution publicity on homicides. 
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cantly lower rates of homicide should be 

expected for months with (month 4), and 
immediately following (e.g., month t+1, 

t+2) highly publicized executions (month 1) 

than for pre-execution months. Table 1 

reports monthly homicide rates for the 

six-month period before, the six-month period 

after, and the month of the highly publicized 

executions. Comparison of rates for highly 

publicized execution months (month f) with 

the preceding months (t— 1) shows a decline 

in killings in 28 cases, and an increase in 33.. 
Similarly, a comparison of rates for the. 
months preceding (t — 1) and following (t+ 1) 

highly publicized executions shows’ an in- 

crease in 31 cases and a decline in 30. 

When more distant “before and after” 
periods are compared, a similar picture 
emerges. For the lag and lead months of t—2 
and t--2, respectively, homicide rates in- 
creased in 32 cases, and declined in 29. For 
months t—3 and r+3, rates declined in 29 
cases, remained at the same level in one case, 
and increased in 31. The same type of pattern 
holds when comparisons are extended to 
months 7*4, t+5, and t+6. Indeed, applying 
a series of sign tests (Siegel 1956, pp. 68-71) 
in comparing monthly homicide rates for 
various pre- and post-periods over the 13 
months, we are not able to reject the null 
hypothesis of no change in killings Poprad 
with highly publicized executions. 

While the sign-test assesses the direction of 
change (increases or decreases) in the rates of 
killing, it is not sensitive to the magnitude of 
such changes. Rather, it treats large and small 
changes as equally important. To address the 
magnitude of changes in homicides, Table 2 
reports the results of a series of paired t-tests 
(Blalock 1979) examining the rates of killing 
for high execution publicity months and 
various pre- and post-periods ranging through 
six months. The results are consistent with the 
earlier analyses. The average rate of homicide 
does not decline significantly during high 
execution publicity months (month r) or the 
months that follow. In short, the comparative 
analysis provides no reason to question the 
conclusion that capital punishment and homi- 
cide are unrelated. 


Regression Analysis: The 1950—1980 Period 


Above we noted a number of shortcomings of 
the Stack investigation, including errors in 
coding the execution publicity variables, an 
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unusual operationalization of the dependent 


‘variable, omission of other aspects of capital 


punishment and levels of arrest from the 
model, and the omission of a number of 
structural and other control variables in the 
analysis. To assess how Stack's findings for 
execution publicity may have been affected 
by these problems, Table 3 presents a 
replication and extension of his analysis for 


“1950-1980. We first correct his coding errors 


for highly publicized executions, and regress 
his homicide rate measure (the number of 
total homicides per 100,000 population age 
16 years and over) against the execution 
publicity and five control variables (the 
unemployment rate, the percent population 
16-34 years of age, the one-month lagged 
homicide rate, yearly and monthly dummy 
variables) included in his model.!? For 
purposes of comparison, we repeat the 
analysis using the conventional homicide rate 
measure—the number of total homicides per 
100,000 total population. Our objective is to 
determine how Stack's findings for the high 
execution publicity variable were affected by 
the technical coding errors noted and/or his 
selection of an unusual dependent variable. 

To gain insight into how omitting possibly 
significant sanction and structural factors may 
have affected (biased) Stack's findings, Table 
3 also presents analyses where Stack's 
original model is extended to include first the 
number of monthly executions, and then the 
additional sociodemographic and other con- 
trol variables. The analyses are ordinary least 
squares (OLS) regressions. The left-hand 
panel of the table reports results for Stack's 
homicide measure; the right panel reports the 
analysis for homicide rates per 100,000 
general population. 

Following the procedures Stack reports and 
correcting for the three coding errors, we do 
not observe a significant inverse relationship 
between the levels of execution publicity and 
homicide rates. Over the' 1950-1980 period 
there is a tendency for executions receiving 
high or moderate levels of publicity to be 
associated .with' lower, monthly homicide 
rates, but the relationships are very slight, and 
are only at a chance level. Also, our 
replication of Stack's analysis shows signifi- 


10 | ike Stack, for reasons of brevity and clarity, 
we do not report in the tables (3—6) to follow the 
regression results for the monthly and annual 
dummy variables. 


s 
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"Table 1. Homicide Rates for High Execution Publicity Months, and Pre- and. Post- Six-Month Periods 
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Table 2. Summary of Difference of Means Analyses For Execution Publicity and Homicides Rates . - 





Comparison of Months* 


month t—1 vs. month t 

month t— Í vs. month t+1 
month t—1 vs. month t+2 
month t— 1 vs. month t+3 
month t- 1 vs. month t+4 
month t—1 vs. month t-- 5 
month t— 1 vs. month t+6 


. month t vs. month t+1 
month f vs. month t+2 


month f vs. month t+3 
month t vs. month (4-4 
month t vs. month 1+5 


‘month f vs. month f-- 6 


month t—- 1 & 2 vs. month t-1 &2 . 


: month r~1, 2 & 3 vs. month t-- 1, 2 & 3 


month t—1, 2, 3 & 4 vs. month t+1, 2, 3 & 4 
month t—1,2, 3, 4 & 5 vs. month t-1,2, 3, 4 & 5 


month t— 1, 2, 3, 4, 5 & 6 vs. month +1, 2, 3, 4, 5 & 6 


month f vs. $ month t—1 & —2 

month t vs. X month t—1, —2 & —3 

month t vs. X month t-1, —2, —3 & —4 
month t vs, & month t—1, —2, —3, -4 & —5 : 
month ; vs. X month t—1, —2, —3, —4, -5 & —-6 - 


month t-- 1] vs. X month :t—- 1 & —2 

month t+ ] vs. X month r- 1, -2 & —3 

1, -2, -3& -4 

month t+1 vs. X month t-1, —2, 7-3, -4& —5 
month /+1 vs. & month t—-1, —2, —3, -4, -5 & —6 


mean 1 _ inean 2: t value 
5.95 6.02 1.12 
5.95 6.04 1.13 
5.95 6.00 59 
5.95 6.09 1.66 
5.95 6.05 1.23 
5.95 . 6.06 1.11 
5.95 6.01 63 
6.03 6.04 19 
6.03 6.00 —.39 
6.03 6.09 82^ 
6.03 . 6.05 29 
6.03 6.06. 33 
6.03 . 6.01 416 
11.88 12.04 1.17 
17.89 18.13 1.21 
23.86 24.18 1.28 
29.83 30.24 1.38 
35.83 3625 . 1.29 
6.03 5.94 -1.52 
6.03 5.96 — 1.10 
6.03 6.03 — 1.03 
6.03 5.97 —1.01 
6.03 -6.03 —,89 
6.04 5.94 — 1.34 
6.04 5.96 t= 1,04 
6.04 5.97 -— .99 
6.04 5.97 — .98 
6.04 5.97 —.90 





, * Month : refers to the execution publicity month. 


cant (p<.05) first-ordet autocorrelation. Anh 
statistic of 1.96 or larger calls-for rejecting. 
the null hypothesis (p<.05) of no first-order 
serial correlation (Durbin 1970).9, With 


‘significant autocorrelation, the least squares 


estimators are unbiased, but they: are not 
efficient, and t statistics tend to be exagger- 
ated (Maddala 1988). Accordingly, the statis- 
tically significant and positive associations 


. between homicide rates and the age, unem- 


ployment, and lagged (month 1— 1) homicide 
rate variables should be viewed with caution. 

The contrary results, for the high execution 
publicity variable suggests that Stack's find- 


ings are a consequence of technical errors in 


. P The Durbin’ Watson d statistic commonly is 
used to test for first-order autocorrelation in a 
time-series.: However, when a model includes a 
lagged value for the dependent variable as a 
predictor, Durbin's A (rather than d) is an 

iate measure of first-order autocorrelation. 
A value of 2.0 for the Durbin Watson measure, 
and 0.0 for Durbin's h, -reveal no first-order 
autocorrelation. 


his analysis. Indeed, when we overlook the 
three high execution publicity cases in- 
question and reestimate the equations, we are 
able to replicate very closely Stack's reported 
findings.!? Extending Stack’s analysis to: 
include the frequency of monthly executions 
and the sociodemographic and other control ` : 
variables does not change the pattern ‘of 


- findings for the execution publicity variables., 


In each case, there remains only a slight, and . ` 
chance, association between homicide rates 
and execution publicity. . 

For the additional capital punishment 
factors, the figures in Table 3 suggest that 
homicide rates (Stack’s measure) do not 


> 


7 Ror example, when we reestimate Stack's 
model treating the three highly publicized execu- . 
tions in question as receiving only a low level of , 
publicity, we observe a near-significant (p<.051) 
and more substantial (b= — .22) trade-off between 
his homicide rate measure and the high execution 
publicity variable. Stack reports an OLS estimate 
of b= — .27 (p<.05) for this factor (See his Table 
1). 
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respond to either the number of monthly 
executions or the percent of the country 
residing in abolitionist jurisdictions. Coeffi- 
cients for the former variable are in the 
predicted negative direction but do not reach 
statistical significance. For the latter factor, 
the coefficient is nonsignificant and in the 
opposite direction of what deterrence theory 
would predict. 

When the conventional homicide rate 
measure is considered the results for the 
sanction variables are similar. Again, homi- 
cide rates do not respond significantly to 
percent abolitionist population, the number of 
monthly executions, or the amount of public- 
ity that they received during the 1950-1980 
period. The trade-off between offense rates 
and executions receiving high or moderate 
levels of publicity is in the predicted 
direction, but the coefficients do not reach 
statistical significance regardless of whether 
additional predictor/control variables are added 
to the equation. The Durbin h values for the 
right hand panel of Table 3 indicate that 
first-order serial correlation is also a problem 
(all A values exceed 1.96) for this analysis. 

If autocorrelation problems extend beyond 
the first order, the analysis in Table 3 is even 
more suspect since higher-order autoregres- 
sive processes in a time-series are no less 
troublesome than first-order serial correlation 
in deriving efficient parameter estimates. 
Higher-order autoregressive processes are 
possible and have been observed in at least 
one previous analysis of monthly homicide 
and execution patterns (Bailey 1984). We 
have explored the autoregressive process for 
the 1950-1980 period from t through :— 12 
months for each model, excluding the lagged 
homicide rate variable. For both dependent 
variables, and for Stack's original model and 
the model that adds only the execution 
frequency variable, there are significant 
fourth-, tenth-, and eleventh-order autocorre- 
lations. For the models that include the 
sociodemographic and other control vari- 
ables, the fifth-order autocorrelation is also 
statistically significant. 

Table 4 reports a replication of the earlier 
multivariate analysis where the significant 
(p<.05) autoregressive parameters are fit, 
and Yule-Walker parameter estimates are 
derived (via the SAS AUTOREG procedure). 
For the sanction variables, the pattern in Table 
4 is very similar to the earlier OLS analysis. 
Regardless of which dependent variable is 
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considered, there is a slight negative associa- 
tion between homicide rates and both the 
number of monthly executions and the two 
execution publicity variables. Contrary to the 
deterrence argument, however, in most cases 
the trade-off is not statistically significant. An 
exception occurs when Stack's original model 
is considered using the conventional homicide 
rate measure. Here, highly publicized execu- 
tions are associated significantly (b= — .143, 
s.e.=.070, p<.05) with homicide rates. 
However, this pattern does not hold when the 
models are extended to include monthly 
executions and the control factors. 

Like the OLS analysis, the autoregressive 
analysis also indicates that the larger the 
percent of the U.S. population residing in 
abolitionist jurisdictions from 1950-1980, the 
lower the homicide rate, and significantly so 
(b= —.024, s.e.=.012, p<.05) when the 
conventional homicide rate measure is consid- 
ered. This finding is consistent with the 
results of a number of comparative analyses 
that show that average homicide rates are 
higher for death penalty than abolitionist 
states (Sellin 1955, 1961, 1967, 1980). 

Importantly, the contrary findings reported 
in Tables 3 and 4 for the execution variables 
are not due to problems of multicollinearity. 
For example, over the 1950-1980 period the 
relationship between the high and moderate 
execution publicity variables is very slight 
(r= — .030), as are the associations between 
the execution frequency variable and the high 
(r= .222) and moderate (r= .064) execution 
publicity measures. Further, when each exe- 
cution variable is regressed against the other 
predictors in the full model, the resulting R? 
values (.128 for the high execution publicity 
variable, .216 for the moderate execution 
publicity factor, and .705 for the execution 
frequency variable) provide no indication of 
serious collinearity problems for the death 
penalty measures. 


Regression Results for the 
1940-1986 Period 


Extending the time-series to include the 1940s 
and 1981-1986 permits us to consider a 
period. with much greater variation in both 
the levels of execution and homicide. (See 
Appendix A and the discussion on page 9.) 
Table 5 reports OLS (left panel) and 


xYule-Walker (right panel) estimates for the 


1940-1986 period when homicide rates per 
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100,000 population are regressed against the 
execution and other predictor variables. 
(Because of a lack of clear theoretical and 
practical justification, we see no further 
utility in examining Stack’s homicide rate 
measure.) The Durbin-Watson d statistic falls 
in the uncertainty range in the OLS analysis, 
and the fourth-, fifth-, and tenth-order 
autocorrelations are statistically significant 
for the 1940-1986 time-series. Accordingly, 
the standard errors (and significance levels) 
reported in Table 5 for the OLS analysis 
should be viewed with caution. However, for 
the purpose of comparison with the earlier 
analysis, we continue to examine the OLS 
estimates. 

No evidence for 1940-1986 supports the 
conclusion that highly (or moderately) publi- 
cized executions significantly deterred homi- 
cides. The coefficients for the high execution 
publicity variable are negative in the OLS and 
autoregressive analyses, but in each case, the 
value is very slight (b< — .03) and falls short 
of statistical significance.!? In contrast, in 
each analysis, a significant inverse relation- 
ship exists between the number of monthly 
executions and the monthly homicide rate. 
Over the period, each execution was associ- 
ated, on average, with a 1.4 to 1.7 one- 
hundredth of a person decrease in the 
homicide rate. Although slight, this trade-off 
is in the predicted direction. 

Contrary to deterrence theory, however, we 
again observe a significant negative relation- 
ship between the percent of the U.S. 
population residing in abolitionist jurisdic- 
tions and the homicide rate. Here too the 
relationship is slight (b— « —.03), but it 
challenges the argument that persons living in 
death penalty states are afforded an added 
measure of protection against murder. To the 
contrary, it appears that residents of jurisdic- 
tions with capital punishment ran a slightly, 
but significantly, greater risk of being killed 
during the period. li 

The analysis reported in Table 5 does not 
address the possible longer term impact of 


13 The contrary results for the high execution 
publicity factor are not due to collinearity prob- 
lems. When this variable is regressed against the 
full list of predictors considered in Table 5, the 
resulting R? value is only .441. Similarly, for the 
moderate execution publicity variable, the result- 
ing R* value (.169) also provides no indication of 
collinearity problems. 
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executions on homicides, for example, during 
month t+1, month t+2, and the like. As 
noted, the deterrent effect of executions may 
be confined primarily (or exclusively) to the 
execution month, or it may carry over to 
future months. Conversely, executions may 
simply displace homicides to the months that 
follow. Recall that Phillips (1980) observed 
such a pattern for highly publicized execu- 
tions in London. This type of displacement 
may account for the slight, but statistically 
significant, negative relationship between 
monthly executions and monthly homicide 
rates noted in Table 5. 

The results reported in Table 6 explore the 
possible delayed effect of executions on 
homicides for the period from month t 
through month t+ 12. Here we regress month 
t homicide rates against the number of 
executions for month t, month z— 1, month 
t—2, . . ., month £—12, and the control 
variables. Because highly publicized execu- 
tions also may have a delayed deterrent 
effect, we consider &n additional model that 
includes the high execution publicity variable 
for month ż through t— 12 months.!^ Because 
of significant fourth-. fifth-, and tenth-order 
autocorrelations in the time series, only 
Yule-Walker estimates are reported in Table 
6.15 

For both analyses, the patterns reported in 
Table 6 are familiar. First, the percent 


14 An anonymous reviewer of an earlier version 
of this paper expressed concern that the lagged 
execution frequency and high publicity variables 
possibly were measuring the same thing. This is 
not the case. For 1940-1986, over the period from 
month f thrcugh ż— 12, the correlations (r values) 
between the execution frequency and high public- 
ity variables are: monta r=.057, t—1=.061, 
t—2—.061, t—3=.063, t—4= .064, t— 5 = .066, 
t—6= .070, t—7= .070, 1—8 = .073, t—9 = .078, 
t— 107.082, ż— 11 =.086, t—12=.085. 

3 When the model reported in Table 6 is 
estimated via OLS regression, the patterns parallel 
closely the results of the autoregressive analysis. 
There is (1) a significant (p<.05) inverse relation- 
ship between homicide rates and the percent of the 
population residing in abolitionist jurisdictions, (2) 
8 chance-only association between rates and each 
execution publicity variable lagged through 1— 12 
months, and (3) a significant negative relationship 
between the execution frequency variable for 
month 7 (p<.01) and month t—1 (p<.05), but a 
chance-only association for the other execution lag 
variables. 
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Table 6. Lagged Yule-Walker Regression Analysis, 1940-1986 


Predictor Variable b 
Percent south — .009 
Percent urban — .006 
Percent black — 2.659 
Divorce rate —.067 
Percent AFDC .400 
Percent firearm homicides .198 
Percent firearm suicides .218 
Arrest rate —.126 
Unemployment rate —.057 
Percent 15-34 yrs. of age 71 
Percent abolition population ~ 027 
Number of executions 
month f — 016 
month t—1 .005 
month t—2 -—.001 : 
month t—3 .003 
month ż— 4 .002 
month t- 5 .001 
month t — 6 — .004 
month t—7 — 001 
month t—8 —.013 
month t—9 .001 
month t— 10 O11 
month t—11 .001 
month t— 12 .001 
High publicity executions 
month t 
month t— 1 
month t—2: 
month t—3 
month t—4 
month 1—5 
month t—6 
month t—7 
month t—8 
month 1—9 
month :— 10 
month :— 11 
month t— 12 
Intercept 17.766 
,979*** 
* p< .05. 
** p < 0l. 
*** p < 001. 


abolitionist population variable continues to 
be negatively and significantly associated 
with homicide rates. Clearly, residents of 
death penalty jurisdictions are not afforded an 
*added" measure of protection by the provi- 
sion for capital punishment. Second, only a 
chance association is observed between levels 
of killing and the high execution publicity 
variable for month t through the 12 lag 
periods. Third, a slight but significant deter- 
rent effect (b=.—.016 and —.018) appears 
for the number of month t executions. In 
addition, both analyses show a significant 
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$.€ b Se. 
357 .166 .367 
.086 — .032 .086 
1.564 —3.419 1.623* 
.243 — 089 .249 
.262 .467 .263 
0484+ .200 .050*** 
.070** .205 .072** 
.054* —.148 .055* 
.040 — 074 045 
.138 .079 .145 
.O11* — 027 .O11* 
.006** — .018 .006** 
.006 .004 .007 
.006 .001 .006 
.006 .004 1006 
.006 ~ .001 .006 
006 .001 .006 
.006* —.012 .006* 
.006 .002 .006 
.006 .010 .006 
.006 .003 .006 
—.037 .056 
.089 .059 
— .050 .060 
.017 .061 
~ 048 .058 
—.091 .056 
—.106 056 
— .096 .055 
— .095 .056 
— .019 .058 
.009 .057 
—.065 .058 
—.093 .056 
19.586 26.236 19.994 
98]*** 


negative relationship between monthly homi 
cide rates and the frequency of execution: 
lagged by :—8 months (b= —.013 anc 
— .012). 

The significant result for the month r—& 
lagged execution variable is puzzling. Foi 
example, if executions have a lingering 
deterrent effect on homicides, why would the 
delayed effect be felt only eight months afte: 
executions? During months t+1 througl 
t+7 the execution frequency coefficients are 
very slight, and mixed in sign. For the 
periods t+9 through ¢+12 months, the 
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execution coefficients are all positive, and the 


positive coefficient for month t+ 10 comes - 


very close to achieving statistical significance 
(b=.011, s.e.=.006, p<.052) when the 
execution publicity vatiables are not included 
in the analysis. 

Examination of the raw data and analysis of 
ithe residuals do not clarify why the signs and 
significance levels of the execution frequency 

coefficients differ over the 13-month period. 


Nor is there any evidence that the unexpected - 


pattern for month t+8 (and ż+ 10) is a result 
of collinearity problems. When the number of 
executions for month 7, and the lag periods, 
are regressed against the other. execution 
frequency variables, the execution publicity 
variables, and the other predictors in the 
model, the resulting R? values do not exceed 
.77. When the same procedure is followed 
with each of the monthly execution publicity 
variables, the highest R? value is .56. 
‘Although we cannot provide an adequate 
explanation for the mixed results for the 
lagged execution frequency variables, it is 
still appropriate to ask: what was the overall 


effect of executions on homicides over the: 


period? Was the net effect of executions to 
decrease homicides, as .proponents of deter- 
rence claim; to increase killings, as suggested 
_by the brutalization argument; or was there 
essentially a zero net effect over the period? 
"The most straightforward and defensible 
manner to assess the net effect of executions 
on homicides ‘is to sum the execution 
coefficients for month : through month 
"t+ 12.16 For example, we might ask, what is 
the summary/cumulative ` effect of ‘capital 
punishment on homicides for the periods of 
say 3, 6, 9, and 12 months after executions. 

‘For the analysis (Table 6) where the 
execution publicity variables are excluded, 
the execution frequency coefficients sum as 
follows: month t through month t—-3- — 009, 
month f through month t—6-— — .010, month 
t through month ;-9— —.023, and month t 


through month 112 — .010. For the analysis’ 


that includes the execution publicity vari- 


‘16 Because deterrence arguments do not address 
systematically the form of immediate and possible 
lingering effects of executions on homicides, we 
have not attempted to model the possible summary/ 


: . cumulative effect of executions via a geometric — 


Koyck—lag, or 8 polynomial distributed lag 
structure. And it makes no sense at this “after the 
fact" point to conduct such analyses. 
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ables, the respective cumulative figures are: 
—.009, —.004, —.013, and .004. Note that 
for the cumulative period ranging through 12 
months, the model with the execution public- 
iy variables excluded suggests that a one- 
person increase in the number of executions is 
associated on. average with a .01 person 
decrease in the homicide rate per 100,000 
population. And when the full model ‘is 
considered, a one-person increase in the 
number of executions is associated with a 
.004-person increase in monthly homicide 
rates. 

In short, both analyses lead to the same 
conclusion. Clearly, over the 1940-1986 
period, the net effect of executions neither 
significantly: decreased nor increased homi- 
cide rates.. Whatever deterrent effect that 
might have been realized during month t (and 
during month t+ 8) wash out over the course 
of the year. 


CONCLUSIONS 


This investigation has attempted to extend our 
understanding of a theoretically important 
dimension of capital punishment—the deter- 
rent effect of execution publicity on homi- 
cides. As a communication theory, deterrence 
suggests that more highly publicized execu- 
tions should have.a greater deterrent effect on 
murder than executions receiving less public- 
ity. Despite its plausibility, the execution 
publicity hypothesis has not received much 
empirical support. In contrast, a recent 
analysis by Stack suggests a very significant 
deterrent effect for highly publicized execu- 
tions in the United States over the iE ie 
period. 

"We have reviewed the Stack investigation 
and found that it has a number of serious 
limitations, including technical coding errors, 
an unusual dependent measure, and omission 
of important deterrence, sociodemographic, 
and other factors associated with homicide 
rates. In addition, Stack ignores the possibil- 
ity that the effect of executions may extend 
beyond execution months. To'better assess 
whether highly publicized executions have an 
immediate and lingering influence on homi- 
cides, we replicated and extended the Stack 
investigation, correcting for shortcomings in 
his analysis. ‘The time seties was also 
extended to include a period of much greater 
variation in the levels of both execution and 
homicide. Our results diverge substantially: 
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from Stack's. First, once the technical coding 


errors are -corrected,- we are not able to 
replicate his finding. of a significant inverse 
relationship -between the levels of execution 
publicity and homicide rates. For both the 
1950-1980 and the: more extended (1940— 


: 1986) period, the null hypothesis prevails for 


the high execution publicity variable regard- 
less of how homicide rates are operational- 


' ized, and whether the regression model is 


expanded to include the frequency of execu- 
tions, sociodemographic and other control 
variables. This pattern holds when lag periods 
through 12 months are considered. 

Although the publicity variables consis- 
tently were unrelated to homicides, this was 
not the case for.the remaining capital 
punishment measures. Homicide rates tend to 
be associated positively with the provision for 
capital punishment, suggesting that residents 
of retentionist jurisdictions run a slightly, but 
significantly, greater risk of falling victim to 
homicide than their counterparts in apalan: 
ist states. 

When the analysis is extended to include 
the 1940s and 1981-1986, findings for: the 
execution frequency measure provide some 
support for deterrence theory, A slight, but 
statistically significant inverse relationship 


appears between monthly executions and the 


monthly homicide rate. The deterrence trade- 
off should not be overstated, however. 
Throughout the analysis, a one-person in- 
crease in monthly executions was associated, 
on average, with about a one and one-half 
one-hundredth of a person reduction in the 
homicide rate. Put differently, a 10-person 


"increase in monthly executions would be 
associated with approximately a 
reduction in the homicide rate. The correspond- 
‘ing figure for a 20-person increase in monthly 


.15-person 


executions would be.a .30-person drop in the 
homicide rate. During the 1940—1986 period, 


+ the number of monthly executions ranged 


from zero to a high of 21. 
Regarding the possible lingering/delayed ef- 
fect (beyond the execution month) of execu- 


` tions, the analysis shows that there was only a 


chance association between the levels of exe- 
cution and monthly homicides for most. lag 
periods. However, consistent with deterrence, 
there was a significant negative relationship 
between homicide rates and the number of ex- 
ecutions lagged by eight months. By contrast, 
there was a near-significant (p = .052) positive 


trade-off between rates and the frequency of 
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executions lagged by 10 months. The: reasons 
for these contrasting exceptions are not clear. 
What is clear, however, is that when followed 
over a one-year period, the impact of execu- 
tions on homicides was negligible. 

' In sum, this analysis provides no indication 
that national media attention to executions had 
the hypothesized deterrent effect on homicides 


during 1940-1986. However, it should be noted: 


that the publicity variable included in this (and 
Stack's) analysis is limited. We do not con- 
sider the actual volume of monthly execution 
publicity in,the printed press. Nor is the elec- 
tronic media systematically examined. Fur- 
ther, no attention is given to the nature of 
media attention devoted to executions. Since 
the deterrent or other impact of execution prob- 
ably depends upon the type of message'com- 


municated, as well as the volume and source ` 


of media coverage, future research should ex- 
amine these aspects of execution publicity. 


Available print and electronic media data per- ` 


mit such analyses for the 1970s and 1980s. 
In contrast to execution publicity, the 
frequency of executions' appears to have had a 
slight, but significant, immediate (month t) 
and delayed (month t-- 8) deterrent effect on 
homicides. However, whatever deterrent ef- 


fect that might have been realized is of minor . 


significance by comparison with the cumula- 


tive effect of executions. For periods ranging . 


through one year after executions, the overall 
effect of executions on homicide rates was 
essentially zero regardless of whether the 
execution/homicide relationship was positive 


or negative. Therefore, this analysis provides . 


little reason to question the consensus reached 


over the last two centuries by most criminol- ' 


ogists that capital : punishment does not 
provide an effective deterrent to murder. 
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THE BIOGRAPHICAL CONSEQUENCES OF ACTIVISM 
Douc McApAM 


University of Arizona 


Using survey data collected in 1983-84 on 212 participants in the 1964 
Mississippi Freedom Summer project and 118 individuals who applied, were 
accepted, but did not take part in the project, the author seeks to assess the short- 
and long-term political and personal consequences of high-risk activism. Using 
both descriptive and inferential statistics, the author demonstrates a strong effect 
of participation on the subsequent lives of the volunteers and “no-shows.” The 
volunteers were more politically active throughout the sixties than the no-shows 
and remain so today. In addition, the volunteers are much less likely to be married 
and to have significantly lower incomes at present than are the no-shows. Besides’ 
reporting these basic findings, the author seeks through path analysis to explore 
the specific factors and processes that mediate the impact of participation in 


Freedom Summer on the later lives of the volunteers. 


INTRODUCTION 


Historically, two principal research questions 
have dominated the sociological study of 
social movements. The first concerns the 
origins of collective action and constitutes the 
longest and most coherent research tradition 
in the field (see, e.g., Gurr 1970; McAdam 
1982; Skocpol 1979; Smelser 1962; Tilly 
1978). Recently, however, the “micro ques- 
tion" of recruitment to activism has begun to 
command as much attention as the issue of 
movement emergence. The former question 
focuses on psychological, attitudinal, or 
social-structural factors that account for the 
entrance of the individual into activism (see, 
e.g., Klandermans 1984; McAdam 1986; 
Oliver 1984; Snow, Zurcher, and Ekland- 
Olson 1980). 

Although different in emphasis, these two 
questions share a common focus on the 
earliest stages of collective action. Regretta- 
bly, scholars have paid much less attention to 
the later stages of a social movement at both 





* I would like to thank Ron Aminzade, Gerhard 
Arminger, James Fendrich, Roberto Fernandez, 
Neil Fligstein, John McCarthy, Susan Olzak, 
David Snow, and Verta Taylor for their extremely 
helpful comments and suggestions on various 
drafts of this paper. I am very grateful to Kelly 
Moore and Ronnelle Paulsen for their help with 
various analyses reported in the paper. The 
research was supported in part by a Guggenheim 
Fellowship, a grant from the National Science 
Foundation (SES-8309988), and various forms of 
support through tbe University of Arizona's Social 
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the macro- and microlevels of analysis. At the 
macrolevel we are only beginning to explore 
such topics as movement-countermovement 
interaction (see, e.g., McAdam 1983; Zald 
and Useem 1987), movement outcomes (see, 
e.g., Gamson 1975; Snyder and Kelly 1979), 
and state-movement relations (see, e.g., 
Burstein 1985; Gale 1986). At the microlevel 
we know a great deal about the factors that 
make for individual activism, but much less 
about how movement participation ebbs and 
flows over time or the political and personal 
consequences of movement participation. The 
latter is the subject of this paper. Using 
survey data collected in 1983-84 on 212 
participants in the 1964 Mississippi Freedom 
Summer project and 118 individuals who 
applied, were accepted, but did not take part 
in the project (“no-shows”), I assess the 
short- and long-term political and personal 
consequences of high-risk activism (see 
McAdam 1986). Specifically, I examine the 
activist, occupational, and marital histories of 
both groups and seek to determine what 
effect, if any, participation in the summer 
project had on the subsequent lives of the 
volunteers and “no-shows.” Before turning to 
thes: analyses, however, I address the 
theoretical issue of the biographical conse- 
quences of activism. 


THE CONSEQUENCES OF ACTIVISM: A 
CONCEPTUAL OVERVIEW 


Although only a few scholars’ studies have 
focused on the biographical impact of activ- 
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ism, the media have paid considerable 
attention to the contemporary ‘lives of 60s 
activists. Based on countless newspaper, 
newsmagazine, and television news stories, 
many in the general public feel certain they 
“know” what happened to the 60s activists. 
And in knowing what happened to the 60s 
activists, they presume to know something 
more general about the consequences of 
movement participation. What they "know" 
can be gleaned from popular media portraits 
of former activists. i , 

What emerges from these stories is the 
image of the former activist as opportunistic 
Yuppie. The contemporary lives of former 
activist “stars” such as Jerry, Rubin and 
Eldridge Cleaver are routinely offered as 
“evidence” to support this view. Rubin, now 
a stockbroker, and Cleaver, the born-again 
clothier, represent reassuring symbols of a 
kind of moral and political maturation claimed 
to be typical of many 60s “radicals.” So often 
have stories on these two appeared in the 
popular press that their lives now serve as a 
general account of the contemporary biogra- 
phies of yesterday’s activists. Thus the 
collapse of the Movement in the early 70s 
allegedly triggered a period of wholesale 
generational sellout that found the lion’s share 
of former radicals embracing the politics and 
lifestyles of the “Me Decade.” 

Given that Rubin and Cleaver are virtually 
the only former activists that the popular press 
publicize, why do these images of genera- 
tional sellout persist? The answer may lie in 
the larger depoliticizing function of the 
account. If most of the 60s radicals grew up 
to become Yuppies, then their earlier radical- 
ism can be attributed to immaturity or 
faddishness. By growing up to espouse 
mainstream values and hold conventional 
jobs, figures like Rubin and Cleaver reassure 
the public that it need not take their earlier 
radical politics seriously. The “kids” were 
just sowing a few wild oats before they settled 
down. Properly chastened, the 60s radicals 
are now finding fulfillment in commodity 
futures and gentrified urban housing. From 
this perspective, the long-term biographical 
consequences of 60s activism appear to be 
modest. 

Despite the popular appeal of the contem- 
porary media account, there are several 
reasons for doubting its generalizability. 
First, after Rubin and Cleaver, it is hard to 
identify any other prominent 60s activists 


who fit the account. Second, the account rests 
on a dubious interpretation of the shifting 
patterns of cultural and political allegiance 
within the baby boom generation. Probably 
no more than 2 to 4 percent of the generation 
took an active part in any of the movements 
of the mid-to-late 60s. It, therefore, seems 
likely that today’s Yuppies are drawn, not 
from the activist subculture, but from the 
other 96-98 percent of the generation. Third, 
the popular account is inconsistent with the 
growing number of studies of former activists 
and the theoretical literature dealing with the 
processes of conversion and alteration. 


Conversion, Alternation, and 
High-Risk Activism 


Conversion is defined as a radical transforma- 
tion of a person's life, including their 
self-conception, network of associations and 
larger worldview. The most common exam- 
ples of conversion tend to focus on entrance 
into cults or insular religious groups (cf. 
Lofland and Stark 1965; Richardson and 
Stewart 1978; Seggar and Kunz 1972; Snow 
and Phillips 1980). These studies clearly 
highlight the potential for long-term biograph- 
ical effects as a result of the conversion 
process. While some "converts" "lapse" or 
"stray from the flock," many others are 
permanently transformed. 

Distinct from conversion are forms of 
personal change that Travisaro (1981) calls 
alternation. These are identity changes that 
are not as drastic as conversions, but 
"relatively easily accomplished changes of 
life . . . which are a part of or grow out of 
existing programs of behavior" (p. 243). 
Travisaro cites such examples as a high 
school student becoming a college student 
and a husband becoming a father. The crucial 
difference between conversion and alternation 
centers on the degree to which the change is 
continuous with the individual's previous life 
and conception of self. Unlike conversion, 
alternation does not entail a radical break with 
the past or the construction of an entirely new 
self. This does not suggest that alternation is 
an insignificant social process. On the 
contrary, it is associated with most of life’s 
key turning points. While the transition from 
high school. to college student may entail no 
radical break with the past, it has profound 
and enduring implications for the individual’s 
future. All aspects of a person’s life are 
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subject to significant change as a ‘result of 
becoming a college student. Moreover, the 
effect of these changes is expected to persist 
throughout life. Though less dramatic thàn a 
true conversion experience, this example 


highlights the powerful and enduring changes 


that accompany alternation. 

"The relevance of this literature for the study 
of individual activism comes from recogniz- 
. ing certain similarities between conversion 

: and alternation and the personal changes that 
often accompany activism. The degree to 
which activism is experienced either as 
conversion or alternation depends on the level 
.and forms of activism in which the individual 
engages. For most movement participants, 
nothing in their experiences even: hints of 
conversion or ‘alternation. Quite simply, the 
fleeting low-cost, low-risk forms of activism 
(e.g., giving money, writing letters, signing 
petitions) in which most individuals engage 
do not require ongoing interaction with other 
activists. This requirement and the gradual 
immersion into a new subculture set the stage 
for either conversion or alternation. What 
distinguishes conversion from alternation is 
the degree to which the group or subculture in 
question is exclusive and organized in oppo- 
sition to the rest of society. Conversion tends 
to occur in groups that demand the exclusive 
loyalties of its members and maintain a 
hostile stance toward mainstream society. 
Alteration generally takes place in the 
context of a group that is relatively more 
inclusive and tolerant of the other techies 
of its members. 

Virtually all forms of high-risk/cost activ- 
ism are organized through and therefore 
involve the individual activist in one of these 
two types of groups. By necessity, revolution- 
ary movements tend to create groups that are 
exclusive, highly insular, and hostile to the 
society they seek to change. Therefore, 
something akin to conversion would seem to 
be a likely outcome of entrance into and 
absorption into such groups. Reform move- 
ments, on the other hand, tend to spawn 
groups that are more inclusive and tolerant 
of mainstream society than revolutionary 
movements. Nonetheless, they can be very 
demanding of a person's time, energy, and 
loyalties.! Such groups, then, constitute an 


! In her autobiographical account of participa- 
tion in a local chapter of the Congress of Racial 
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ideal setting. within which alternation | can 
occur. 

Thus, theoretically, we should speci that 
those involved in high-risk/cost activism will 
be very susceptible to either conversion or 
alternation and to the long-term behavioral 
and attitudinal effects that -accompany these 
processes. Empirically, there is even an 
emerging literature that suggests as much. 


Follow-Up Studies of Activists 


Though far less developed than the literature 
on individual recruitment to- activism, there 
does exist a small body of studies on the 
personal consequences of movement partici- 
pation (see Table 1 for references). These 
studies are hampered by a host of methodo- 
logical problems,- which I examine later. 
Nonetheless, they have consistently yielded 


results strongly at odds with the popular 


account reviewed above. Taken together, 
these studies suggest a powerful and enduring 
eifect of participation on the later lives of the- 
activists. Unlike Rubin and Cleaver, the 
subjects in these studies display marked . 
continuity in their values and politics over the 
past 10—20 years. Specifically, the former 
activists were found to have: 


— continued to espouse leftists political attitudes 
: (Demerath, Marwell, and Aiken 1971, p. 184; 
Fendrich and Tarleau 1973, p. 250; Marwell, 
Aiken, and Demerath 1987; Whalen and 
Flacks 1980, p. 222); 

— remained active in contemporary movements . 
or other forms of political activity (Fendrich 

» and Krauss 1978; Fendrich and Lovoy 1988; 
Jennings and Niemi 1981); 

— concentrated in the teaching or other "helping 
professions" (Fendrich 1974, p. 116; Maiden- 
burg.and Meyer 1970); and 

-—continued to'define themselves as "liberal" or 

' "radical" in political orientation (Fendrich 
and Tarleau 1973, p. 250). 

Taken together, it is hard to reconcile these 
findings with the image of the activists as 
depicted in the contemporary media account. 
Hcwever, before we embrace these studies as 
the final word on the subject, it is necessary 
to amplify the note of caution touched on 
earlier. The works cited earlier are beset by a 
number of methodological shortcomings (see 
Table 1). 


Equality (CORE), Inge Powell Bell (1968) pro- 
.vides a striking example of the potential for 


absorption within just such a group. 


1 
bg 


' Investigator(s) 


‘Jennings and Niemi 





Yearof ` Year of 

Participation 

Demerath, Marwell, ` 1965 1969 
and Aiken =~ NE 


Fendrich 1960-1963 4970 


Fendrich and Lovoy ' 1960-1963 - 1986. 


1964-1972 1973 .. 


Maidenberg and Meyer 1967 ` 1969 


. Marwell, Aiken, and .” 1965 1984 


Demerath ' f 
Nassi and Abramowitz 1967 . 1977 


Whalen and Flacks 1970 - 1980 


Activists 
Follow-up ,in Sample (N) Group? 
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"Table 1. Follow-Up Studies of Movement Participants" aa 


Before and 
After Data? 
40° No ., Yes - 


' Control Resulting 
Publications 

Demerath 
et al. 1971. 
Fendrich 
1974, 1977; ` 
Fendrich and 

, Krauss 1978; 
Fendrich and 
Tarleau 1973. 
Fendrich and 
Lovoy 1988. 

a, Jennings and 

! Niemi 1981;. 

i Jennings 1987. 
Maidenberg and 
Meyer 1970. 
Marwell et. 
al; 1987. 

. . Abramowitz and 
` Nassi 1981; 
Nassi and 
Abramowitz 

1979. 
Whalen and 
Flacks 1980. 


28  . — Yes No 


H^. No No 


* Table was adapted from Table 1 in DeMartini 1983, p. 198. 
> Fendrich's:1977 article ‘is based on comparative data on 28 white and 72 black activists., 
¢ Nassi and Abramowitz relied on 15 subjects; Abramowtiz and Nassi, on 30. ` 


The first problem concerns the timing of 
the studies. Only four of the studies (Fendrich 
and Lovoy 1988; Marwell et al. 1987; Nassi 
and Abramowitz .1979; .and Whalen and 
Flacks 1980) were conducted after the close 
of what is popularly conceived of as the 
recent activists era (see Table 1). For the 
other studies, it is hard to know whether the 
reported continuities in political thought and 
‘action were an effect of an era in which the 
follow-up studies were .conducted or an 
enduring consequence of the subjects’ earlier 


‘activism. For instance, Maidenberg and 


Meyer: (1970) reported that a sample of 
former Free Speech Movement demonstrators 


: living in the Bay Area in 1969 remained 


active in leftist politics. Given the high level 
‘of activism in the Bay Area in that era, the 
finding may be a simple function of time and 
place, rather than the impact of the Free 
Speech Movement. 

A second timing issue centers on the basic 
question of whether sufficient time..had 
.elapsed to allow for an adequate assessment 
of. the impact of movement participation. In 
half of the studies, no more than five years 
had passed. between activism and follow-up 
investigation. Moreover, only a small number 


] 


of jec are involved in these Studies. Only 
the Jennings and Niemi (1981), Maidenberg 
and Meyer (1970), and Marwell et al. (1987), 
studies involved more than 40 subjects. ? And 
the first two of these studies share the timing 
deficiencies touched on earlier. Both were 
conducted while popular protest was still 
widespread in the United States, no more than 
five years after the subjects’ initial activism. 

Third, most of the studies draw subjects 
from only a narrow geographic area, some- 
times a single city (cf. Whalen and Flacks 
1980). This makes it difficult to generalize 
the results especially when the geographic 
areas are as atypical as the Bay Area (cf. 
Maidenberg and Mayer 1970; -Nassi' and 
Abramowitz 1979) or Santa Barbara (cf. 
Whalen and Flacks 1980). : ` 

Another weakness of these studies is their . 
failure to make use of nonactivist control 
groups. Without such groups, one cannot 
M Pie S ba . 

? Most of James Fendrich’s research has been 
based on data. gathered on 28 white activists 
involved in civil rights activity while students at 
Florida State University. In his 1977 article, 
Fendrich.employs comparative data on 100 activ- 
ists, 72 of them black. 
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establish a behavioral or attitudinal baseline 
to judge the effects of activism. Four of the 
seven studies in Table 1 failed to employ a 
contro] group. Even when control groups 
were used, the characteristics on which 
activists and nonactivists were matched often 
left the issue of consequences unresolved. For 
example, Fendrich matched his subjects on 
the basis of college attendance at Florida State 
between 1960 and 1963. While clearly an 
advance over the other studies, even this 
research design fails to allow for a clear 
explication of the impact of activism. What is 
at issue is the salience of the characteristic on 
which the subjects were matched. Matching is 
done to hold constant variables that might 
otherwise confound the interpretation. of 
causal effects. Fendrich held constant the 
effects of time and place but left unexamined 
the influence of prior attitudinal differences. 
It may be that these differences account for 
both the activists' involvement in the move- 
ment as well as the later differences between 
activists and nonactivists turned up by these 
studies. 

Finally, the earlier studies also lack “be- 
fore" and "after" data on the activists. The 
usual procedure has been to gather contempo- 
rary information on the former activists and to 
then infer the effects of participation from the 
data collected. But without prior information 
on the subject, it is hard to determine the 
extent and significance that changes in 
participation may have bought about. 

Correcting all of the deficiencies noted 

' above yields a prescription for a very different 
type of study than has been completed to 
date. It would be national in scope, involve a 
much larger sample of activists, employ 
“before” and "after" on activists and nonac- 
tivists alike, and be conducted well after the 
activists episode in question. The research 
reported here generally approximates this 
ideal. 


THE STUDY 


This paper reports the results of a follow-up 
study of 330 applicants to the 1964 Missis- 
sippi Freedom Summer project. That project 
brought hundreds of primarily white, northern 
college students to Mississippi for all, or part 
of, the summer of 1964 to help staff freedom 
schools, register black voters and dramatize 
the continued denial of civil rights throughout 
the South. As instances of activism go, the 
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summer project was time-consuming, physi- 
cally demanding, and highly newsworthy. 

The project began in early June with the 
first contingent of volunteers arriving in 
Mississippi fresh from a week of training at 
Oxford, Ohio. Within ten'days, three project 
members, Mickey Schwerner, James Chaney, 
and Andrew Goodman, had been kidnapped 
and beaten to death by a group of segregation- 
ists led by Mississippi law enforcement 
officers. That event set the tone for the 
summer as the remaining volunteers endured 
beatings, bombings, and arrests. Moreover, 
most did so while sharing the grinding 
poverty and unrelieved tension that was the 
daily lot of the black families that housed 
them. 

‘ Prior to their participation in the campaign, 
all prospective volunteers were asked to fill 
cut detailed applications providing informa- 
tion on, among other topics, their organiza- 
tional affiliations, college activities, reasons 
for volunteering, and record of previous 
arrests. On the basis of these applications 
(and, on occasion, subsequent interviews), 
the- prospective volunteer was accepted or 
rejected. Acceptance did not necessarily mean 
participation in the campaign, however. In 
advance of the summer, many of the accepted 
applicants informed campaign staffers that 
they would not be taking part in the summer 
effort after all. Completed applications for all 
three groups—rejects, participants, and with- 
drawals—were copied from the originals, 
which are now housed in the archives of the 
Martin Luther King, Jr. Center for the Study 
of Nonviolence in Atlanta and the New 
Mississippi Foundation in Jackson, Missis- 
sippi.? In all, 1068 applications were copied, 
with the breakdown as follows: 720 partici- 
pants, 239 withdrawals, 55 rejections, and 54 
applicants whose status in the summer project 
was unclear. The applications were then 
coded and the data used as the basis for an 
earlier study of recruitment to high-risk 
activism (see McAdam 1986). 

The applications were also used as the 





? My deep appreciation goes to Louise Cook, 
the former head librarian and archivist at the King 
Center, and to Jan Hillegas—herself a Freedom 
Summer volunteer—of the New Mississippi Foun- 
dation for all their help in locating and copying the 
application materials used in this project. Without 
their help, this research would have been impossi- 
ble. 
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methodological starting point for the present 
study. Specifically, several application items 
(alma mater, parents’ address, major in 
school) functioned as crucial leads in my 
efforts to obtain current addresses for as many 
of the applicants as possible. The information 
on alma mater allowed me to contact alumni 
associations for help in tracking the appli- 
cants. Failing that, phone directories were 
searched to ascertain whether their parents 
were still living at the addresses listed on the 
applications. Their help was then enlisted in 
contacting the subject. Using the information 
on college major as a guide, academic 
directories were scanned for possible matches. 
Finally, once contacted, the applicants were 
often helpful in locating other subjects. 

The result of these efforts yielded current 
addresses for 556 of the 959 participants and 
withdrawals for whom I had applications. Of 
these, 382 (of a total of 720) had been 
participants in the project, while another 174 
(of 239) had withdrawn in advance of the 
summer. Separate questionnaires were then 
prepared and sent to the participants and to 
the “no-shows.” Participants were questioned 
about their experiences during Freedom 
Summer, their activist histories, and the 
broad contours of their lives, personal as well 
as political, post-Freedom Summer. The 
questionnaire sent to the no-shows dealt with 
these latter two topics as well as the reasons 
why they withdrew from the project. In all, 
212 (or 56 percent) of the participants and 
118 (or 68 percent) of the no-shows returned 
completed questionnaires. In turn, the com- 
pleted questionnaires yielded the data on 
which the findings reported here are based. 

This study closely approximates the ideal 
research design described earlier. It involves 
a large number of subjects (330) drawn from 
all over the country. The research itself was 
conducted nearly 20 years after the instance 
of activism in question and employs data 
gathered prior to the summer project as a 
baseline against which to judge any subse- 
quent changes in the subjects’ lives. Perhaps, 
most importantly, subjects were drawn not 
only from a group of former activists, but 
from the perfect comparison group: other 
applicants to the same project. This feature of 
the study allows for an unambiguous resolu- 
tion of the troublesome issue of prior values. 
The earlier study of recruitment showed 
clearly that the no-shows did not differ 
significantly from the participants in the 
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values they brought to the project (McAdam 
1986, pp. 72-76). That analysis also indi- 
cated that "the volunteers and no-shows 
appear as essentiall alike on a list of 
variables: race, social class, type of neighbor- 
hood (urban, suburban, rural), home region, 
type .of school, and major in school" 
(McAdam 1988). Holding so many important 
variables constant increases confidence in 
attributing subsequent differences between 
the no-shows and participants to the effects of 
participation in the summer project. The 
subjects in both groups looked very similar 
going into the project. The key question is, to 
what extent do their biographies diverge after 
the summer? To what extent can we speak of 
Freedom Summer as having constituted an 
instance of alternation in the lives of the 
volunteers? 


RESULTS 


One of the most distinctive aspects of the 60s 
was the twin emphases on personal liberation 
and social action. While it remained for 
radical feminists to give explicit voice to the 
notion that "the personal is political," the 
idea had, in fact, informed New Left politics 
almost from the outset. Accordingly, any 
assessment of the consequences of participa- 
tion in the summer project must focus not 
only on the applicants' later political activities 
but on their personal lives as well. 


Political Consequences 


The original project applications included 
several questions that provided information 
about prior political activities. Not surpris- 
ingly, both participants and no-shows appear 
to have been reasonably active politically 
before the summer. This was especially true 
in regard to the civil rights movement. Fifty 
percent of the participants and 40 percent of 
the no-shows were already members of civil 
rights organizations. Seventy-nine percent of 
the participants and 65 percent of the 
no-shows reported some form of prior civil 
rights involvement on their applications. 
Similar differences were also found in regard 
to other forms of political activity. Thus, 
while both groups were politically active 
before the summer, participants were slightly 
more so. 

Did this pattem hold following Freedom 
Summer or did the summer mark a significant 
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divergence in the political lives of the two- 
'groups? In answering this question, we will 


` want to distinguish the short- and long-term 


summer project. 


political consequences of participants in the 
Short-term refers to the 
subject's political involvements between the 
fall-of 1964 arid the National Guard shootings 


at Kent State University in May of 1970. 


These dates were selected because they define 
the peak years of New Left activism in the 
United States. The long-term consequences of 
involvement in Freedom Summer refers to the 
contemporary political attitudes and activities 
of the participants and no-shows. 

` Short-term political’ consequences. The 
evidence from the follow-up questionnaires 
would appear to support two conclusions. 
First, the Freedom Summer volunteers were 


far more politically active than: the no-shows. 
‘between 1964 and 1970. More importantly, 


` the differences in the activity levels between 


the two groups appear to be directly related to 


' the participants' involvement in the summer 


project. Evidence for the first conclusion is 
summarized in Table 2. 

By a margin of 90 to 74 percent, the 
volunteers were significantly more likely to 
have remained active in the civil rights 
movement following the summer. Just as 
important, the forms of civil rights activism 
the participants engaged in tended to be much 
more intensive than those of the no-shows. 
Thus the ratio of participants to no-shows 


Table 2. Percent Differences in Political Activism 1964— 
1970, by Status on the Summer Project 
(Percents) 


Volunteers No-Shows 





Type of Activism (n = 212) (n 118) 

Active in the, civil rights 90** 74 
movement following ' 
Freedom Summer 

“Very” active in the 46** 28 
antiwar movement 

Returning students who report 40* 28 
being "very" active in the 
student movement 

Females who report 66e* 45 

being, "very" active 

in the, women's movement i 

Full-time paid activist 36* 24 
employment 

' Mean number of member- 3.1** . L5 


ships in political organi- 
zations, 1964—1970* 

* Mean rather than percent. 
*= p< 05. : 
wees p< 01. 
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mvolved in southern voter registration activi- 
ties. was eight to one. Three times as many 
participants as nonparticipants helped orga- 
nize civil right-related boycotts of northern 
schools. Twice as many returned to the South 


in connection with later civil rights cam- 


paigns. In contrast, the proportion of no- 
shows listing “on campus civil rights activi- 
ties” as their principal form of activism was 
triple the comparable figure for participants. 
With the rise of “black power,” the role of 
whites in the movement grew ever more 
problematic. Consequently, the focus of 
activism for the white New Left. shifted to 
cther targets. The three movements to benefit 
most from this displacement of activist 
energies were the student, antiwar, and 
women’s liberation movements. Not surpris- 
ingly, both groups of applicants report high 
levels of involvement in all three movements. 
As shown in Table 2, however, the volunteers 
were more active in each than were the 
no-shows. 
» This ‘characterization holds for one final 
comparison between the two groups. The 
follow-up questionnaire asked all subjects to 
p-ovide a detailed employment history sine 
Freedom Summer. What was surprising was 
tke number of subjects who had worked as 
full-time paid activists at some point during 
the 60s. Thirty-six percent of the volunteers 
included such a job on their questionnaires as 
compared to 24 percent of the no-shows. 
Altogether, these findings support two 
conclusions. First, the volunteers were signif- 
icantly more active throughout the 60s than 
were the no-shows. Second, the gap in the 
activity levels of the two groups was more 
pronounced than before the summer. This 


- suggests that the participants’ higher post- 
summer activity levels were a function of 


their experiences in Mississippi. Fortunately, 
more systematic data are available to test this 
implicit proposition. 

` Using information from the follow-up , 


“questionnaire, a measure of the applicant’s 


level of movement activism between 1964 
and 1970 was constructed.4 The continuous 


. * The activism scale was constructed using three 
questionnaire items. The first asked respondents to 
report the forms of civil rights activism, if any, 
they were involved in following’ the summer. A 
numeric value was then assigned to each of these 
forms of activity based on its intensity relative to 
all others.’ For example, joining the staff of one of 


p 
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variable ` ranges from zero to 59. ' Not . Table 3. Results of Regression ‘Analysis Assessing the 


surprisingly, .the volunteers’ mean score 


(21.3) was significantly higher than that of ' 


the no-shows (13.6).5 The real interest here, 
however, is in identifying those independent 
variables that best predict variation in 60s 
activism. A simple ordinary least squares 
regression was preformed regressing [15 


` independent variables on the dependent vari- 


able of 60s activism (see Table 3). The - 


resulting equation produced a multiple r of 
.64 and an r? of .35. 


As shown in Table 3, only five of the 


variables in the model were significantly 
related to the dependent variable. Three of 


these support the causal importance implied : 


by the simple bivariate relationships reviewed 
earlier. However, the single strongest predic- 
tor of high levels of subsequent activism was 
participation in the summer project. What 
makes this finding all the more important 


theoretically is the presence in the model of: 


four variables (age, gender, level of activism 
prior to the summer, and number of organiza- 
tion affiliations prior to the summer) which 


were. earlier shown to bear a strong relation- 


ship to participation in the summer project 
(McAdam 1986, p. 88). However, with one 
exception, the significance. of these variables 
disappeared..when entered into the present 
equation. Clearly, the subjects’ later activism 


tion in the summer project. The suggestion is 
that the Summer served as an instance of 
alternation in the. lives of the volunteers and 
was largely responsible for the shape of their 
subsequent activist histories. 

What was it about Freedom Summer that 
encouraged the subjects' later movement 





the major civil rights organizations was assigned a 
numeric value of 5. Participating in civil rights 
fund-raising was accorded a value of 1. Second, all 
subjects were asked to indicate their level of 
involvement in the major movements of the period: 
the antiwar, student, and women's liberation 


movements, among others. Defining oneself as, 
“very involved". in any of these movements was ' 


assigned a point value of 3; "moderately" or 
“somewhat involved," 2 and 1 point, respectively. 


Finally, all instances of full-time paid activist 


employment were assigned a value of 5. The 
individual's activism score was the:sum of the 
points on these three items. ] 

5 Using a standard F-test, the difference of 
means is significant at the .01 level. 


. Independent variable ` b SE(b) 
Ties to other volunteers : : soc 

in 1966 S 201 212: 

in 1970 . 5 |^ c1476** — .632^.— 
Political orientation prior to ‘ 

, Freedom Summer , i —.935 .683 
Change in orientation pre- and' ^ — 

post-Freedom Summer 534* 223 

Participation in Freedom 
Summer? , 

(yes/no) 5.008** 1,848. 
Gender [1.787 71.885 
Age | .139 38 . 
Family income prior to Freedom i 

Summer .0002  ..0003 
- Level of activism prior to . 

Freedom Summer .213 .136 
Organizational affiliations prior " 

to Freedom Summer (N) 1.306* 4.575 

: Years married during the 60s —.106 .403 
* Years as parent during the,60s —.326** ` .606 
. Years employed full-time , 

during the 60s —1.136**  ..440 
Years in college during the 60s 4.708 448 
Attend college post-Freedom m Ti 

Summer? . . Rr - 
(yes/no) ` 2.452 1.965 
Constant 10.263 ` 8.431; 
N = 189. 
*p< .05. 
** p< .01. 


', cannot be attributed to the same mix of ` 
background factors that led to their participa- . 


' Effect of Various Independent Variables on 
-" Level of Activism 1964-1970 
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involvements? Two variables in Table 3- 


suggest some answers to this question. One is 
- the subject’s own estimate of the change in 


z 


his or her “political stance" pre- and 
post-Freedom Summer. ,Political stance . was 
measured by means of a ten-point. scale 
ranging from “1” for radical left to “10” for 
radical right. Table 3 indicates that a sharp 


- leftward shift in the subject’s political orien- 


tation following Freedom Sunimer is signifi- 
cantly related to levels of 60s activism. This : 
suggests that participation in the summer 


' project radicalized the volunteers and encour- 


aged higher levels of activism. 
' A second independent variable that bears a 

strong relationship to the’ activism measure is 
the subject’s estimate of the number of ' 
Freedom Summer volunteers he or, she 


- remained in contact with in 1970. The greater 


the number of ties, the higher the level of 
activism in the: postproject period. This 
finding adds a structural coinponent to the 
attitudinal interpretation. advanced above. 


z Freedom Summer did more than radicalize 
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the volunteers. It also put them in contact 
. with like-minded people. Thus the volunteers 
left Mississippi not only more attitudinally 
disposed toward activism, but embedded in a 
set of relationships and an emerging activist 
subculture ideally suited to reinforce the 
process of personal change begun in Missis- 
sippi. 

Long-term political consequences: the vol- 
unteers today. Have the volunteers remained 
politically active? How do the no-shows and 
volunteers compare on various measures of 
contemporary activism? Have the volunteers 
remained significantly more active or has the 
gap between the two groups narrowed? Let 
me take up the first of these questions. 

One item on the follow-up questionnaire 
asked the volunteers whether they were 
“currently active in any social movements.” 
Nearly half of the volunteers reported they 
were, while a quarter said they remained 
“very active” in at least one current social 
movement. 

How do the volunteers’ current rates of 
activism and political attitudes compare to 
those of the no-shows? The data in Table 4 
show that the volunteers remain significantly 


more active and more leftist in political. 


orientation than the no-shows. 


Table 4. Percent Differences in Rates of Contemporary 
Activism and Current Political Attitudes by 
Status on the Summer Project 


Volunteers No-Shows 


(n = 212) (n = 118) 

Currently active in any so- 4g** 33 
cial movement 

“Very” involved in antinu- ll 6 
clear movement 

“Very” involved in the nu- 10 5 
clear freeze movement 

“Very” involved in the envi- 11* 3 
ronmental movement 

Mean number of member- 2.1** 1.4 
ships in political organiza- 
tions 

“Leftist” in current political 60** 41 
stance? 

Agreeing that “tax structure 56** 40 


should be modified to re- 

duce the income disparity 

between the rich and the 

poor" 

* The designation “leftist” was reserved for those 
subjects who used the numbers 1, 2, or 3 to designate 


their current “political stance” on a 10-point scale 
ranging from 1 for “radical left" to 10 for “radical 
right.” 

*p< 05 
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What are the causal dynamics that may 
account for the differences between activists 
and no-shows. To get at these dynamics, a 
summary measure of contemporary activism 
was constructed combining three items from 
the follow-up questionnaire. Comparable to 
the summary measure of 60s activism, the 
contemporary activism variable is continuous 
and ranges from zero to 37. The volunteers 
also tend to score higher on the variable just 
as they did on the measure of 60s activism.” 
The ultimate value of the measure, however, 
stems from its role as the dependent variable 
in a path analysis (see Figure 1) designed to 
shed light on the causal dynamics that shape 
contemporary activism. The model consists of 
variables drawn from four time periods. Time 
I represents the period immediately preceding 
Freedom Summer; Time II, the summer itself; 
Time III, 1964-1970; and Time IV, 1983- 
1984. 

The goal of the analysis is to assess the 
strength of the relationships between the 
variables at these four points in time. À 
detailed analysis of the relationship of the 
variables at Time I and participation in 
Freedom Summer has been reported else- 
where (McAdam 1986, p. 88). Three of the 
four Time I variables (gender, # of orgs. 
presummmer, and level of activism presum- 
mer) were among the strongest predictors of 
participation in that earlier analysis. 

The four presummer variables were in- 
cluded in the model to assess the strength of 
association between them and level of activ- 
ism, the principal dependent variable at Time 
III. If participation in Freedom Summer was 
as personally transforming, then the subjects’ 
level of 60s activism should be more a 


$ The applicants’ current level of political 
activism was measured by a scale constructed from 
their responses to three items on the follow-up 
questionnaire. First, they received one point for 
answering "yes" to a question asking them 
whether they were currently involved in any social 
movement. For all those movements they reported 
being "actively involved in," in response to a 
second question, thev received three additional 
points. Finally, if they were currently employed 
full-time in an activist capacity, they received five 
more points. The person's score on the contempo- 
rary activism scale was the sum of these point 
totals. 

7 The mean scores on the contemporary activism 
measure are 7.4 for the no-shows and 9.5 for the 
volunteers. 


function of their summer status than of their 
characteristics prior to the summer. Specifi- 
cally, I anticipate a significant direct effect of 
participation on the subject’s level of 60s 
activism, as well as indirect effects mediated 
through the other Time MI variables: political 
stance postsummer, # of orgs. 1964—1970, 
and ties in 1966. I hypothesize that participa- 
tion in Freedom Summer encouraged later 60s 
activism both by radicalizing the volunteers 
(political stance postsummer) and by increas- 
ing their integration into political organiza- 
tions (# of orgs. 1964—1970) and activist 
networks (ties in 1966) that supported their 
latter involvements. ; 

A similar set of dyusmics is expected to 
link variables at. Times III and IV. In 
particular, the level of the subjects' activism 
during the 60s is expected to bear a strong 
direct relationship to their level of contempo- 
rary activism. We should also see indirect 
effects of level of 60s activism on current 
activism as mediated through the other Time 
IV variables. That is, a high level of activism 
during the 60s is expected to have laid a 
strong foundation for activism today, both by 
cementing the subject's links to other activists 
(current ties) and movement organizations (# 
of current orgs.) and by reinforcing a set of 
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leftist political values that have persisted into | 
the present (current political stance). 

For the most part, the findings reported in 
Figure 1 are consistent with these hypotheses. 
Taken together, they provide strong evidence 
of an enduring political impact of participa- 
tion in the Freedom Summer project. For 
many volunteers, the project seems to have 
initiated an important process of personal 
change and political resocialization and the 
beginnings of a kind of activist career. The 
importance of Freedom Summer to this alter 
career is reflected in the strength of the paths 
leading to level of 60s activism. While 
participation in Freedom Summer does not 
bear a significant direct relationship to level 
of 60s activism, it is linked indirectly to the 
latter variable by several intervening vari- 
ables. Participation in the summer project is 
positively related to both the number of 
political organizations and ties to other 
volunteers the subject was involved in during 
the period 1964-1970. In turn, these variables 
were significant predictors of the subject's 
level of activism during these same years. In 
addition, a strong association exists between 
the subject's participation in Freedom Sum- 
mer and his or her political orientation at the 
close of the project. That orientation, in turn, 





political 443*» political 428** current 
stance political 
pre-summer post-summer f stance 
i .166* 
er 
gender -.457 *«p«.05 


** ep«.01 


Fig. 1. Results. of Path Analysis of Long-Term Political Consequences of Participation in Freedom Summer 
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is a strong predictor of the sùbject’s activism 
between 1964 and 1970. 

The level of a subject’s activisrn during the 
60s stands in a similar relationship to current 
activism as participation in the summer 
project did to it. The one difference is that the 
level of a subject’s activism during. the 60s 
seems to have exerted direct as well as several 
indirect effects on level of current activism. 
The direct path between 60s and current 
activism constitutes the single best predictor 

. of variation in the dependent variable. But in 

> addition, the two ‘intervening variables, num- 
ber. of current organizations and current 
political stance, provide strong indirect links 
between the two variables. This suggests that 
activism during the 60s is related to present 
organizational affiliations and current politi- 
cal orientation and that, in tum, these 

'. variables bear a significant relationship to 
level of current activism.® : 

The picture that emerges is one of 
persistent differences in level of activism 

. between the volunteers and no-shows. These 
differences are evident prior to Freedom 
Summer (McAdam 1986) and continue to 
exercise some independent influence over the 
volunteers' later activist careers. At the same 
time, it is clear that participation in Freedom 
` Summer plays a particularly decisive role in 

, shaping the applicants’ later political values 
and involvements. In addition, these data 
confirm a certain self-perpetuating quality to 
individual activism. Activism, by ‘its very 
nature, broadens the base of the activists’ 
links to movement organizations. and other 

' activists. In turn, these links make it more 
likely that the activist will be drawn into 
subsequent activist episodes, thereby deepen- 
ing his or her commitment to activist values 
and perpetuating the process of personal 
change that initial forays into activism have 
set in motion. 


* The only remaining variable that is signifi- 

: cantly related to level of current activism is gender, 

` with female applicants more likely to be active. I 

haye refrained from discussing this rélationship or 

the general issue of gender because the topic merits 

systematic attention in its own right. I am currently 

. at work on an empirical paper focusing exclusively 

"Gender Differences, in the, Causes and 

' Consequences of Activism" using the data set 
employed in this paper. 
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Table 5. Short-Term Differences in Occupational His- 
tory by Staus on the.Summer Project 








i "Volunteers No-Shows, 
39w. 59% 





Working full-time before age 
25 (96) 

Mean number of full-time 
jobs held 1964-1970 
(means) : 

Mean years of full- time em- `` 
ployment 1964-1970 e, 
(means) ; 

Employed as full-time paid 
activists sometime be- 
tween 1964—1970 (96) 

"Strong agreement" with 
statement: "My participa- 

, tion in social movements , 

` strongly affected'my 
choices about work" (96)' 


* p< .05. 
** p « 0l. 


3.9 2.0 
2.2* 3.0 
33%** 


4697*. 3496 





Personal Consequences 


The ideological imperative of the 60s called 
for activists to recognize the political signifi- 
cance of their personal lives and to make. 
choices about work, family, and relationships 
that reflected their politics. Any complete 
accounting of the impact of participation in 
Freedom Summer, then, must examine these 
ostensibly nonpolitical aspects of a person; s 
life. 

Short-term personal consequences. In as-, 
sessing the effects of activist participation, I 
will again distinguish short- from long-term 
consequences. The period 1964-1970 seems 
especially .appropriate for examining short- 
term differences in work and marital histories 
because it demarcates the span of years within 
which most applicants would have been - 
expected to, begin careers and/or marriages.’ 
Did they fulfill these traditional expectations 
and, if not, can we see the imprint of 


Freedom Summer on their personal and `` 


professional lives? 

Work histories. Based on the data pre- ` 
sented in Table 5, it seems clear that the 
volunteers and no-shows were as different in 
their work as their activist histories during the 
late 60s. The volunteers entered full-time 
employment. later, changed jobs more fre- 


? The mean age of the two groups on the eve of 
the summer project was 23.6 years for the 
valunteers and 21.8 foz the no-shows. By 1970 the - 
average volunteer was in His late 20s or early 30s 
with the no-shows only slightly younger. 


1 
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quently, and worked fewer ‘years during the 
later 60s than the no-shows. 
work-related measure on which the volunteers 
‘scored higher than the no-shows is percent 
engaged in full-time activist employment. 
This finding suggests that the roots of the 
occupational differences between volunteers 
and no-shows lie in the different political 
values and commitments motivating each 
group. On the questionnaires, all subjects 
were asked to respond to the following 
statement: “My participation in social move- 
ments affected my choices about work.” 
Nearly half of the volunteers (46 percent) 
‘expressed “strong” agreement with the state-- 
ment as compared to 34 percent of those who 
withdrew from the project. A chi-square test 
shows this percentage difference to be 
significant at the .05 level. 

Marital histories. Between 1964 and 1970 
the volunteers’ were just as likely to wed, 
were married just as long, and got married at 
almost the same.age as the no-shows.!° But it 
is. possible that the volunteers’ -criteria for. 
selecting a mate were more often influenced 
by their politics than wás true for the 
no-shows? On the questionnaires, the appli- 
cants were asked to express their level of 
agreement with the following statement: *My 
participation in social movements affected my 


‘choice of mate(s).” Nearly two-thirds of the 


volunteers but not quite half of the no-shows’ 
agreed with the Statement. This percentage 
difference is statistically significant at the .05 
level. It must be remembered -that the 
volunteers averaged nearly 24 years of age at: 
the beginning of the summer. That meant that 


‘the majority of them had grown up in the 


1950s and early 1960s, during one of the 
more romanticized and conservative eras of 
domestic life in this country’s history. Even 


.as they were challenging much of. this 


socialization, the volunteers couldn't help but 
be affected. by it as well. They were no less 
interested in getting married than their peers, 
but their vision of the ideal mate appears to 
have been politicized by their experiences in 





' P The proportion of each group to marry 
between 1964 and 1970 was 59 percent for the 
volunteers and 56 percent for the no-shows. 
During thése years, the volunteers averaged 2.3. 
years of marriage, and the no-shows, 2.1. Mean 
age at first marriage was 26.9 for:the volunteers 
and 25.9 for the no-shows. None of these 
differences was statistically significant. i 


The only 


Mississippi: They were now looking for.a 
partner who shared their commitment to the 


‘struggle. : 


Current occupation/income. "Ote of -the. 
most consistent findings from the previous 
follow-üp studies is the concentration of 


. former activisís in the teaching or other 


“helping professions" (cf. Fendrich 1974, p. 
116; Maidenburg and Meyer 1970). Data in 
Table 6 confirm ‘the applicability of this 


'occupational profile to. the volunteers. How- 


ever, the table also shows that, relative to the 


comparison group, there are few significant 
“differences between the volunteers and no- 


shows in terms of their distribution into broad 
occupational categories, suggesting that par- 
ticipation in the summer project had little. 
impact on the fields in which the volunteers 
have chosen to work. 

However, one work-related difference be- 
tween the volunteers and no-shows appears to * 
bear the imprint of Freedom .Summer. The 
no-shows have significantly higher incomes 
than the volunteers, The. modal income ' 
category for the no-shows i is $40,000, while it 
is $20-29,999 for the volunteers. At the other 


‘end of the income scale, nearly. 50 percent 


more volunteers than no-shows. earn under 
$10,000. This.disparity would appear to 
derive from the different political histories of. 
the two groups. Perhaps the income gap has 
its roots in the volunteer's later entrance into. 
full-time employment, a difference that previ- 
ously -has been linked to their greater 
willingness to subordinate work to politics. 

Excluding those who were employed prior to 
Freedom Summer, the modal years of en- 
trance. into full-time employment was “1969. 

for the no-shows and 1972 for the volunteers. ` 
In effect, the volunteers postponed the start of , 
their careers, thereby sacrificing the market. 
advantage they could have enjoyed as babies 
born during World War II or the early 
postwar years. Had the volunteers entered the 
work force on schedule in the mid-to-late 60s, 
they would have benefited not only from a. 


‘boom economy, but from the relative paucity 


of competitors for an expanding number of 
jobs. Instead, by waiting until the early 70s, ’ 


; they confronted the same stagnant, competi- 


tive job market as their younger brothers and 
sisters. 

But this remains mere speculation. While 
the different work histories of the volunteers 
and no-shows may explain the disparity in 


their incomes, it remains for me to link those 
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Table 6. Volunteers, No-Shows, and Comparison Group” in Selected Occupational Groups (percents) 


Comparison 
Volunteers No-Shows Group 
% N % N % N 

Selected professional occs. 
College professors 17 (34) 18 (18) (108) 
Lawyers or judges 8 (16) 12 (12) 2 (83) 
Teachers 7 (15) 5 (5) 11 (465) 

(except college) 
Health practitioners 3 (6) 0 (0) 0 (4) 
Social workers 2 (5) 4 (4) 1 (40) 
Clergy 2 (4) 2 (2) 1 (29) 
Physicians and dentists 2 (4) 1 (1) 2 (91) 
Nurses 1 (3) 2 (2) 2 (64) 
Psychologists 1 (2) 4 (4) 0 (13) 
Engineers 0 (0) 0 (0) 3 (115) 
Accountants 0 (0) 0 (0) 2 (78) 
Writers, artists, 11 (22) 7 (7) 2 (106) 
entertainers, athletes 

Other professionals 10 (21) 5 (5) T (316) 
(including technicians) 

Executive, managerial 15 (30) 21 (21) 17 (723) 
and administrative occs. 

Sales occupations 2 (4) 3 (3) 9 (395) 

Adminis. support occs., 1 (2) 4 (4) 7 (283) 
including clerical 

Service occupations 1 (3) 0 (0) 5 (229) 

Farming, forestry and 0 (0) 1 (1) 2 (90) 
fishing occupations 

Precision production, 1 (3) 3 (3) 4 (178) 
craft and repair occs. 

Operators, applicators, 1 (3) 1 (1) 8 (320) 
and laborers 

Unemployed or 12 (25) 8 (8) 13 (534) 
not employed : 

97 Q02) 101 (101) 101 (4264) 


* The comparison group is composed of all subjects from the Census Bureau's 1984 Current Population Survey 


Annual Demographic File who share the same age and general educational level as the volunteers. 


different histories to their earlier patterns of 
activism. I attempt to do this in a path 
analysis which features current income as the 
principal dependent variable (see Figure 2). 
Of particular interest is the series of paths 
linking participation in Freedom Summer to 
current income. I expect that participation 
will bear a strong indirect relationship to 
income as mediated by the subjects' later 
activism and work history. 

The results provide strong support for a 
link between participation in Freedom Sum- 
mer and current income. Specifically, the 
analysis. documents two dynamics linking 
these variables. The first centers on the 
familiar positive relationships between the 
activism variables and the significant negative 
relationship between current activism and 
income. Participation in the summer project 
clearly encouraged many of the volunteers to 


pursue "activist careers." In turn, the nega- 
tive relationship between income and current 
activism suggests that the pursuit of such a 
^career" encouraged the subordination of 
work to politics, resulting in significantly 
lower incomes for the volunteers. 
Participation is also linked to lower in- 
comes through the mediating effect of the 
various work variables. Data in Table 5 show 
that the project veterans started to work at a 
larger age, worked less during the 60s, and 
were enrolled in college fewer years after the 
project than were the no-shows. In turn, all of 
these variables were significantly related to 
the total] number of years the applicants have 
been employed. Finally, the more years the 
subject has worked, the higher his or her 
income. These results, then, clearly confirm 
the political or activist roots of the disparity in 
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Project (y/n) 
-.045** 
current 
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165** 
years current 
employed income 
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pre-summer 
years of 
college E Ei : P 
post-summer LET: 
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level of 


343** 
60s activism 













Fig. 2. Results of Path Analysis of Effect of Participation in Freedom Summer on Current Income 


current income between the volunteers and 
no-shows. 

Current marital status. Earlier I reported 
that the marital histories of the volunteers 
were not appreciably different during the late 
60s from, those of the no-shows. Yet, only 
half of the project veterans were married as of 
1984. This compared to 72 percent of the 
no-shows, and 79 percent of the matched 
comparison group. What accounts for this 
striking difference? Table 7 reports the results 
of a logistic regression analysis that bears on 
this question. The analysis reported in Table 7 
was designed to measure the degree of 
association between 15 independent variables 
and the applicants' current martial status (0 — 
not married; 1 = married). Significantly, 
participation in Freedom Summer is clearly 
the best predictor of the subject’s current 
martial status. Indeed, it overwhelms all the 
other variables in the model. Without being 
able to assess the strength of the various 
causal paths linking participation to current 
martial status, one cannot know what it was 
about Freedom Summer or the biographical 
path it set the volunteers on that has made 
them so much less likely to be married. One 
can speculate, however. Marriages fail for a 


variety of reasons. One of the chief culprits is 
change (cf. Houseknecht, Vaughan, and 
Macke 1974; Scanzoni 1978). Marriages are 
partnerships founded on certain assumptions 
about the world and the partners themselves. 
Should these assumptions be rendered obso- 
lete, the likely result is a marital crisis and 
quite often divorce. Given the importance 
accorded politics by the volunteers and the 
rapid pace of change within the New Left in 
the late 60s and early 70s, it is likely that'their 
marriages foundered on the political instabil- 
ity characteristic of the era. 


CONCLUSIONS 


‘It would be hard to imagine a set of findings 


that would contradict the popular image of the 
60s activists more than the one presented 
here. Unlike the personalities profiled in the 
popular press, the summer volunteers have 
evidenced a remarkable continuity in their 
lives over the past 25 years. They have 
continued not only to voice the political 
values they espoused during the 60s, but to 
act on those values as well. They have 
remained active in movement politics. More- 
over, in a variety of ways they appear to have 
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Table 7. Logit Regression Predicting “Current Marital 


Status (Married/Unmarried) of the Freedom 
Summer Applicants 





_ Independent variable b SE(b) 
Gender 241 .340 
Age ` 2 3 —.031 .038 
Political stance prior to Free- — 477 .132 
` dom Summer. 

Choice of mate(s) affected —.114 .097 
by movement involve- z 
ment? (yes/no) 

Currently employed? (yes/ .033 , .581 
no) P 

Income .130 .127 

Participate in Freedom Sum- —1.254** .328 
mer? (yes/no) 

Years of full-time employ-, —.032 .038 
ment ] 

. Current political stance —.071 207 
Level of current activism —.015 .027 
# of current political affilia- .143 .096 

tions í 
# of years college atten- 022 ' .061 
dance post-Freedom Sum- : 
mer 
Level of activism, 1964-70 —.016 .017 
Family income prior to Free-  —.00001 .00005 
dom Summer 
Scale of current political , .042 "097 
attitudes 
Constant 2.055 1.410 
N= 217. , ; 
*p<.05. — f 
** p< 0l. 


remained faithful to that New Left imperative 
to treat the personal as political. Indeed, both 
„their work and marital histories appear to 
have been shaped, to a remarkable degree, by 
their politics. 

The findings reported here confirm- the 
results of the earlier follow-up studies of the 
60s activists. What makes the consistency of 
these findings all the more significant is the 
size.of the sample involved and the span of 
years that have elapsed since the subject's 

initial activism. National in scope and con- 
ducted 20 years after the instance of activism 
in question, this study provides strong and 
consistent evidence of the enduring impact of 
participation in activism. 

Theoretically, the results reported here 
provide a`firm basis for two principal 
conclusions. Activism, at least of the duration 
and intensity of Freedom. Summer, does 
indeed have the potential to trigger a process 
of alternation that can affect many aspects of 
the participants' lives. Secondly, the conse- 


quences of this process may be lifelong or at . 
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least long-term. The results also shed light on 
the specific social processes that appear to 
account for the transformative ‘potential of 
high-risk activism. One effect is-attitudinal, 
the other structural. Attitudinally, high-risk 
activism is likely, as a consequence of the 
events participants are exposed to and their 


‘immersion into an activist subculture, to 


result in a radical resocialization of those 
involved. Participants‘ are likely to emerge 
from: the experience more committed to 
activism than ever before, thus laying the 
attitudinal foundation for ongoing involve- 


. ment. The significant positive effect of the 


"political stance" variable on the various. 
measures of subsequent activism attests to the 
credibility of this interpretation. 

But the effects of high-risk activism are not 
merely attitudinal. The activist “careers” of 
the Freedom Summer volunteers also attest to 
the mediating effects‘ of certain structural 
consequences of the Summer Project. That 
project left many of the volunteers tied to 
networks of organizational and personal 
relationships that helped sustain their activ- 


. ism. The series of positive relationships 


linking organizational or personal ties to 
subsequent activism suggests the critical role 
of structural embeddedness in sustaining 
activist careers. 
Future research might explore in more 
detail the mediating effects of attitudinal 
change and subcultural integration on sus- 
tained activism. For now it is enough to alert 
researchers to these processes and to docu- 
ment the role they appear to have played in 


: accounting for the activist “careers” of many 


of the Freedom Summer. veterans. For these 
volunteers, the summer marked a watershed 
in their lives, a point in time around which 
their biographies can be seen in “before” and 
“after” terms. The summer left them attitudi- ' 
nelly more disposed and structurally more 
available for subsequent activism. For many, 
New Left politics became the organizing 
principle of their lives, personal as well as 
pelitical. In effect, the summer set them on 
course for a kind of activist career that has 
Continued to shape all aspects of their lives 


.dawn to the present. Far from being the 


fleeting, faddish activity often depicted in thé 
popular: press, activism—at least of the 
high-risk variety— would indeed seem to 
highlight the potential for personal transfor- 
mation embodied in intense and sustained 
social action mediated through integration 


- DeMartini, Joseph R. 1983. 
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into organizational and personal networks of. 
individuals. 
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SOCIAL MOVEMENT CONTINUITY: 
THE WOMEN’S MOVEMENT IN ABEYANCE* 


VERTA TAYLOR 
The Ohio State University 


This article uses social movement and organization theory to develop a set of 
concepts that help explain social movement continuity. The theory is grounded in 
new data on women’s rights activism from 1945 to the 1960s that challenge the 
traditional view that the American women’s movement died after the suffrage 
victory in 1920 and was reborn in the 1960s. This case delineates a process in 
social movements that allows challenging groups to continue in nonreceptive 
political climates through social movement abeyance structures. Five characteris- 
tics of movement abeyance structures are identified and elaborated: temporality, 
purposive commitment,. exclusiveness, centralization, and culture. Thus, social 
movement abeyance structures provide organizational and ideological bridges 
between different upsurges of activism by the same challenging group. 


INTRODUCTION 


Scholars of the social movements of the 
1960s have by and large held an “immaculate 
conception” view of their origins. These 
“new social movements” (Klandermans 1986) 
seemingly emerged out of nowhere and 
represented a sudden shift from the quiescent 
1940s and 1950s (Flacks 1971; Touraine 
1971; McCarthy and Zald 1973; Jenkins 
1987). Recent empirical work, however, 
challenges this view, suggesting that the 
break between the sixties movements and 
earlier waves of political activism was not as 
sharp as previously assumed (e.g., Isserman 
1987; McAdam 1988). The overemphasis on 
movement origins and on new elements in the 
sixties movements has blinded students of 
social movements to the “carry-overs and 
carry-ons” between movements (Gusfield 
1981, p. 324). What scholars have taken for 
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“births” were in fact breakthroughs or turning 
points in movement mobilization. 

This paper develops a framework that 
specifies the processes of movement continu- 
ity. The framework is grounded in research 
on the American women’s rights movement 
from 1945 to the mid-1960s. Most accounts 
trace its origins to the civil rights movement 
(Freeman 1975; Evans 1979). Yet the wom- 
en’s movement, like the other movements 
that blossomed in the 1960s, can also be 
viewed as a resurgent challenge with roots in 
an earlier cycle of feminist activism that 
presumably ended when suffrage was won. 
My approach relies heavily on the central 
premises of resource mobilization theory: 
political opportunities and an indigenous 
organizational base are major factors in the 
rise and decline of movements (e.g., Ober- 
schall 1973; McCarthy and Zald 1977; 
McAdam 1982; Jenkins 1983). The paper 
makes a new contribution by elaborating 
certain abeyance processes in social move- 
ments and by specifying features of social 
movement abeyance organizations. The term 
“abeyance” depicts a holding process. by 
which movements sustain themselves in 
nonreceptive political.environments and .pro- 
vide continuity from one stage of mobiliza- 
tion to another.  . ' 

After discussing data sources, the analysis 
briefly describes the history of the, American 
women’s movement and the persistence of a 
small band of feminists who, in the 1940s and 
1950s, continued to remain faithful to the 
political vision that had originally drawn them 
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` into the suffrage movement nearly a half 
century earlier. Because the cultural and 
political climate had changed, these women 


found that their ideals and commitment to, 


feminism marginalized and isolated them 


‘from the mainstream of American women. I’ 


.argue that their form of activism is best 
understood as a social movement abeyance 
structure. Finally, I delineate the features of 
abeyance structures that were a source of 
. movement continuity by tracing the: conse- 
. quences of postwar activism for the contem- 
porary women's movement. I conclude by 
-exploring the implications of the abeyance 
hypothesis for understanding the organiza- 
tional and ideological bridges between earlier 
_activism and the development of other 
movements of the 1960s. 


. ABEYANCE PROCESSES IN 
SOCIAL MOVEMENTS 


The term “abeyance” is borrowed from 
Mizruchi (1983) and is central to a theory of 
social control. Abeyance structures emerge 
when society lacks sufficient status vacancies 
to integrate surpluses of marginal and dissi- 
dent people. The structures that absorb 
marginal groups are abeyance organizations. 
They temporarily retain potential challengers 
' -to-the status quo, thereby reducing threats to 
the larger social systems. Abeyance organiza- 


tions have certain properties that allow them. 


to absorb, control, and expel personnel 
according to the number of status positions 
available in the larger society (Mizruchi 
1983, p. 17). 

~ Although Mizruchi recognizes the social 
change potential of abeyance organizations, 
he does not address this aspect systematically 
(Kimmel 1984). I both challenge and extend 
Mizruchi's thesis to hypothesize that social 
movement abeyance organizations, by provid- 
ing a measure of continuity for challenging 
groups, also contribute to social change. I 
hold that the abeyance process characterizes 
` mass movements that succeed in building a 
support base and achieving a measure of 
influence, but are confronted with a nonrecep- 
tive political and social environment. A 
central tenet of resource mobilization theory 
‘concerns the role that changing opportunity 
structures play in: the emergence and. the 
attenuation of collective action (McCarthy 
and Zald 1973; Barkan 1984; Jenkins 1983). 
As a movement loses support, activists who 
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had been most intensely committed to its aims 
become increasingly marginal and socially 
isolated. If insufficient opportunities exist to | 
channel their commitment into routine sta- 
tuses, ‘then alternative structures emerge to 
absorb the surplus of people. These. structures 
both restrain them from potentially more 
disruptive activities and channel them into 
certain forms of activism. In short, a 
movement in abeyance becomes a cadre of 
activists who create or find a niche for 
themselves. Such groups may have little 
impact in their own time and may contribute, 
however unwillingly, to maintenance of the 
status quo. But, by providing a legitimating 
base to challenge the status quo, such groups 
can be sources of protest and change. . 

The following factors are relevant to the 


` abeyance process. First, certain factors exter- 


ral to a movement create a pool of marginal f 
potential activists. These include changes in 
Gpportunity structures that. support and con- 
strain the movement and an absence of status 
vacancies to absorb dissident and excluded 
groups. Second, there are internal factors or 
organizational dimensions of social movement 
abeyance structures: temporality, commit- : 
ment, exclusiveness, centralization, and cul: 
ture. Since these dimensions were inductively 
derived, I elaborate them with the case at 
hand. The significance of abeyance lies in its '. 


‘linkages between one upsurge in activism and 


another. I delineate three ways that social 
movement abeyance structures perform this 
linkage function: through promoting the 
survival of activist networks, sustaining a 


. repertoire of goals and tactics, and promoting 
. collective identity that offers participants .a 


sense of mission and moral purpose. 


DATA 


Most accounts describe the American wom- 
en's movement as peaking in two periods 
(Chafe 1972; Freeman 1975; Klein 1984). 
The first wave, generally referred to as the 
suffrage movement, grew out of the abolition- 
ist struggle of the 1830s, reached a stage of 
mass mobilization between 1900 and 1920, 
and declined after the passage of the suffrage 
amendment. The second wave emerged in the 
mid-1960s, reached a stage of mass mobiliza- 
tion around 1970, and continued into the 
1980s (Carden 1974; Evans 1979. Ferree and 
Hess 1985). 

: Curiosity about what happened to the 


wo 


"r 
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: organizations and “networks of:women who | 


participated in the suffrage campaign led to 
the research described here. There are two 


' . reasons for.focusing on the period from 1945 


a 


to the mid-1960s. First; other researchers 


have explored the period from 1920 to 1940. 


(Lemons 1973; Becker 1981; Cott 1987). 
Second, most researchers see the civil rights 
movement .as the major predecessor to the 
contemporary women’s movement (e.g., Free- 
man 1975; Evans 1979; McAdam 1988). By 
examining feminist activity in the decades 
just prior to the resurgence of feminism as a 
mass movement, I hoped to shed light on the 


: accuracy. of this view. 
The data for this-.study .come from-: 


documentary material in public and private 
archival. collections and .interviews with 
women who, were activists from 1945 to the 
1960s. Fuller description of the movement in 
this period and complete documentation are 
available in Rupp and Taylor (1987): 

(1) Archival data included the papers of the 


` National Woman's Party and the League of 


Women Voters, the two major factions of the 
suffrage. movement, and the papers of the 
President's Commission on the Status of 


' Women: (1961-63), whose activities facili- 
:fated the resurgence of the contemporary 
women's'movement. Other material exam- ` 


ined were unofficial and official: organiza- 
tional documents, publications, personal let- 
ters, and memos in public ‘and private 
collections, most of which are housed at the 
Schlesinger Library at Radcliff College or the 
Library of Congress. The papers of individual 
women provided an important source of 


information, not only about their organiza- 


tional careers, but also about the activities of 
diverse women's organizations." 

(2) The second source of data was 57 
open-ended, semistructured, tape-recorded in- 


terviews, conducted between 1979 and 1983," 


with leaders and core members of the most 
central groups involved in women's rights 
activities. Twelve of the women were 'active 
at the national level and thirty-three at the 


' local level. Twelve other transcribed inter- . 


views conducted by other researchers and 
available i in archival cóllections were. used. 


THE WOMEN'S MOVEMENT IN THE 
POSTSUFFRAGE DECADES: THE 
TRANSFORMATION OF FEMINIST 
ACTIVISM . 


Feminism activism continued in the yea. 


D 


763 


_after the suffrage victory but was transformed 


as a result of organizational success, internal 
confiict, and social changes -that altered 


' women’s common interests (Lemons 1973; 


Becker 1981; Buechler 1986; Cott 1987). 
Deradicalization and decline of the women's , 
movement left militant feminists limited 
avenues through which to pine their 
political philosophy. , 

: In 1920, with the vote won, the women’s 
movement was left with no unifying goal. 
Moreover, tactical and ideological differences . 
divided militant from moderate suffragists 
and those who saw winning the vote as a 
means from those who viewed it as.an end. ` 
As-a result, the major social movement 
organizations of the suffrage. movement , 
evolved in two opposing directions. . 

The militant branch of the movement, the . 
National Woman's Party (NWP), launched a 
relentless campaign to pass-an Equal Rights 
Amendment (ERA) to the constitution. The . 
NWP was never a mass organization but saw 
itself as a feminist vanguard or elite (Lunar- 
dini 1986). Hoping to enlist the support of, 
former. suffragists, NWP leader Alice Paul 
instead alienated both socialists and moderate 
feminists by her dictatorial style and the 
decision to focus on the ERA. The vast 
majority of 'suffragists feared that the ERA. 
would eliminate the protective labor, legisla- 
tion that women reformers had earlier strug- 
gled to achieve (Balser 1987)... ' 

- The mainstream branch of the. movement,. 
tke National American, Woman Suffrage. 
Association, formed the nonpartisan League 
of Women Voters. It spearheaded the opposi- 
‘tion to the ERA, educated women for their 
new citizenship. responsibilities, ‘and advo- ` 
cated a broad range of reforms. ‘Other 
activists in the suffrage campaign channeled 
tneir efforts into new or growing organiza- , 
tions that did not bave an explicitly feminist 
agenda but promoted a vast range of ‘specific 
causes that, in part, grew out of the expanded 
‘role options available to women (Cott 1987). 
Thus, even though the women’s movement | 
was rapidly fragmenting, feminist activism 
continued throughout the 1920s ‘and 1930s. 
‘But in the face of increasing hostility between ' 


` the two camps of the suffrage movement, 


cooperation developed on only a few issues. - 

In addition to goal attainment and internal 
conflict, a third factor contributed to the 
dissipation of the mass base of the women's 
movement. Ironically, the role expansion for 
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which the movement had fought fractured the 
bonds on which the solidarity of the move- 
ment had been built. As women’s lives grew 
increasingly diverse, the definition of what 
would benefit women grew less clear. 

As a result, the NWP—which alone 
continued to claim the title “feminist” —had 
become increasingly isolated from the main- 
stream of American women and even from 
women's organizations. With the demise of 
. the large mass-based organizations that pro- 
pelled the suffrage movement, the more 
radical feminists sought out the NWP. When 
the NWP captured the feminist agenda, 
however, the broad program of emancipation 
narrowed to limited goals and tactics pursued 
by an elite group of women (Cott 1987). This 
spelled the final demise of feminism as a 
mass movement. 


FEMINIST ACTIVISM FROM 1945 TO 
THE 1960’S: THE WOMEN’S 
MOVEMENT IN ABEYANCE 


From 1945 to the 1960s, women's rights 
activists confronted an inhospitable political 
and social environment. Women who advo- 
cated equality .found few outlets for their 
activism and became increasingly marginal 
and isolated from the mainstream of Ameri- 
can women. Two social processes had this 
effect: first, advocates of women's rights 
lacked access to and support from the 
established political system; and, second, the 
cultural ideal of "the feminine mystique" that 
emerged after World War II affirmed the 
restoration of "normal family life" and 
discredited women who protested gender 
inequality. 


Changing Opportunity Structure: 
Nonreceptive Political Elites 


Despite an increase in the female labor force 
and the female student body in institutions of 
higher education, support for women’s rights 
and opportunities declined sharply following 
the Second World War. By 1945, the 
women’s movement had further fragmented 
into three overlapping interest groups, each 
with a different political agenda (Harrison 
1988, p. 7). Because women’s issues were 
not generally salient, the three groups lacked 
political access and influence. Just as impor- 
tant, when they did gain access to political 
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elites, they often canceled out each other’s 
influence. 

One interest group consisted of a coalition 
of women’s organizations associated with the 
Women’s Bureau of the Department of 
Labor. Throughout the 1940s and 1950s, this 
coalition sought to improve women’s working 
conditions and defeat the ERA. Despite its 
governmental status, the Women’s Bureau 
bad little political clout, and the coalition 
used much of its influence to fight supporters 
of the ERA. 

A second group consisted of a network of 
women in politics, including women active in 
the women's divisions of both the Democratic 
and Republican parties. They advocated the 
election and appointment of women to 
policy-making positions despite a dramatic 
decline in women's political opportunities 
after the Second World War. For the most 
pert, the selection of women for policy- 
making positions was done by party women 
without regard for their position on women’s 
issues (Harrison 1988, p. 64). Since women 
ofricials generally had no policy role and little 
influence on women's issues, advocating 
token appointments of women diverted atten- 
tion from major policy questions such as the 
ERA 


A third group, the National Woman's 
Party, remained furthest outside the estab- 
lished political order. By 1944, the NWP had 
begun a major campaign to get Congress to 
pass the ERA and had managed to garner 
support from a few women's organizations. 
Confronted with the establishment of the 
National Committee to Defeat the Unequal 
Rights Amendment, spearheaded by the 
Women's Bureau, the NWP sought the 
support of both political parties. Presidents 
Truman and Eisenhower endorsed the ERA, 
both party platforms advocated it, and 
Congress considered it in 1946, 1950, and 
1953. Yet such support was more a nod to 
women than a serious political consideration 
(Freeman 1987). 

None of these three groups made much 
progress in attaining their goals in the 1940s 
and 1950s. Although women's organizations 
succeeded in having 236 bills pertaining to 
women's rights introduced into Congress in 
the 1950s, only 14 passed (Klein 1984), p. 
18). This reflected not only organized wom- 
en's lack of political access and their 
conflicts, but also the exaggerated emphasis 


SOCIAL MOVEMENT CONTINUITY 


on sex roles that emerged on the heels of the 
Second World War. 


Status Vacancy and Marginality 


Following the war, a variety of social forces 
helped to reinstitutionalize traditional family 
life supported by rigid sex role distinctions 
(Friedan 1963; Breines 1985; May 1988). 
Women whose roles did not center on the 
home and family were considered deviant. In 
1957, 80 percent of the respondents to a 
national poll believed that people who chose 
not to marry were sick, immoral, and neurotic 
(Klein 1984, p. 71). As a result of the 
pressure, fewer married women remained 
childless in the 1950s than in the 1900s—only 
6.8 percent compared to 14 percent (Rupp 
and Taylor 1987, p. 15). Indicative of the 
tide, in 1945 only 18 percent of a Gallup Poll 
sample approved of a married woman’s 
working if she had a husband capable of 
supporting her (Erskine 1971). 


In addition to criticizing women who did: 


not conform to the cultural ideal, the media 
denounced feminism, discredited women who 
continued to advocate equality, and thus 
thwarted the mobilization of discontented 
women (Rupp 1982). The most influential 
attack came from Ferdinand Lundberg and 
Marynia Farnham’s popular and widely quoted 
book, Modern Woman: The Lost Sex (1947), 
which denounced feminists as severe neurot- 
ics responsible for the problems of American 
society. In the face of such criticism, only the 
NWP continued to claim the term "feminist." 
In fact, the core group of women in the NWP 
differed in major respects from the cultural 
ideal. Àn analysis of the 55 leaders and most 
active members of the NWP indicates that, by 
1950, the majority were white, middle- or 
upper-class, well educated, employed in 
professional or semiprofessional occupations 
(especially law, government, and higher 
education), unmarried or widowed, and older 
(in their fifties, sixties, or seventies).! Specif- 





-| This analysis of the leadership and core 
membership is based on a careful reading of 
archival material, particularly correspondence, as 
well as research in biographical sources. For 55 
women identified as leaders and core members, 
information was recorded about race, class, 
education, occupation, age, place of residence, 
political affiliation, political views, marital status, 
presence and number of children, living situation, 
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ically, 71 percent of the women were 
employed; 97 percent were over the age of 
50; and 60 percent were unmarried or 
widowed. In short, the lifestyles of the 
women, while relatively advantaged, were 
not normative. Feminists were largely unat- 
tached women with time, money, and other 
resources that facilitated their activism. Yet 
the retreat of a broad-based women’s move- 
ment left few outlets to express their views 
either inside or outside the established 
political arena. 

In summary, as the political and cultural 
wave that had once carried feminism forward 
receded, members of the NWP paid for their 
lifelong commitment with a degree of alien- 
ation, marginality, and isolation. Neverthe- 
less, the NWP provided a structure and status 
capable of absorbing these, intensely commit- 
ted feminists and thus functioned as an 
abeyance organization. 


Dimensions of Social Movement 
Abeyance Structures 


The abeyance process functions through 
organizations capable of sustaining collective 
challenges under circumstances unfavorable 
to mass mobilization. Properties of abeyance 
organizations help an organizational pattern to 
retain potentially dissident populations. My 
analysis of the women’s rights movement in 
the postwar period suggests that the most 
relevant variables with respect to the abey- 
ance process are: temporality, purposive 
commitment, exclusiveness, centralization, 
and culture. Since these variables are derived 
from a single case, each dimension is treated 
as a hypothetical continuum with respect to 
other cases. 

Temporality. By definition, of course, an 
abeyance structure persists throughout time, 
but temporality refers to the length of time 
that a movement organization is able to hold 
personnel. Activism provides a community 
that is an alternative source of integration 
and, thus, can have an enduring effect beyond 
a particular period in an individual’s live 
(Coser 1974; White 1988). 

During the 70-odd years of the first wave 


and time of first involvement in the women's 
movement. In addition, any comments made by 
participants about the social characteristics of the 
membership were noted. 
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of the women’s movement; a number of 
' women’s rights groups emerged and provided 
~ alternative status vacancies for large numbers 
. of mainly white and middle-class women 
-Flexner 1959; Buechler 1986; Chafetz and 
Dworkin 1986). Among the 55 leaders and 
core activists of the NWP, 53 percent had 
been recruited at least four decades earlier 
during the suffrage campaign. . 

For.NWP members, early participation in 
high-risk activism (McAdam 1986), including 
picketing the White House, engaging in 
, hunger strikes while imprisoned, and burning 


‘President Wilson's speeches, kept them ` 


involved long after the suffrage victory. 
Lamenting the passage of that period, Flo- 
rence Kitchelt asked a fellow suffragist in 
1959 whether she ever felt "as I do that the 
modern' woman is missing something very 
thrilling, uplifting as well as unifying in not 
being able to take part in a suffrage 
campaign? Those were the days!"? Katharine 
Hepburn, mother of the actress, in a speech to 
"women's rights activists, described her experi- 
ences in the suffrage struggle. "That whole 
period in my life I remember with the greatest 
delight,” she said. "We had no:doubts. Life 
was a great thrill from morning until night. ”3 
Involvement in the suffrage movement had a 
: powerful and enduring effect on participants, 
,So much so that they continued even into the 
^, 1950s to promote women’s rights in a society 

antagonistic to the idea. The strong and 
. lasting effects of participation in high-risk 
activism is supported by McAdam's (1988) 
study of participants in the 1964 Freedom 
Summer project. 

By the 1940s and 1950s, a core of women 
in the NWP had devoted a major portion of 
their lives to feminist activity. Typical 
participation patterns are reflected ‘in the 
' comments of two members. In 1952, one 
woman wrote, “Since 1917 I have devoted all 
‘my spare time to feminism. "^ Another 
woman asked in 1950 for a "cure for giving 


? Florence Kitchelt to Katharine Ludington, 
August 14, 1950, Florence Kitchelt papers, box 6 
(175), Schlesinger Library (SL), Radcliffe Col- 
` lege, Cambridge, Massachusetts. 

Katharine Hepburn, speech to the Connecticut 
Committee, n.d.[1946], Kitchelt papers, box 6 
(153), SL. 

4 Betty Gram Swing to Ethel Ernest Murrell, 
October 3,- 1952, National Woman’ s Party (NWP) 
papers, reel 99. 
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too much of one's time to one thing,” 
although she still continued to devote herself 
to passage of the ERA.5 Not surprisingly, the 
most striking characteristic. of the NWP 
membership was advanced age. Isserman 
(1987, p. 24) found a'similar age structure in 
another organization that provided continuity 
between two stages of mass mobilization, the 
American Communist Party in the 1940s and 
1950s. Constant numbers—even if small— 
are better for morzle than steady turnover, so 
temporality enhances the likelihood that an 
organization will continue to endure. ' 
Purposive commitment. Commitment refers 
to the willingness of people to do what must 
be done, regardless of personal rewards and 
sacrifices, to maintain a collective challenge 
and is essential for holding an organizational 
pattern alive between stages of mass mobili- 
zation. Research on social movement involve- 
ment has focused primarily on the types of 
incentives that induce activists to make an 
initial commitment to a movement. (e.g., 
Pinard 1971; Fireman and Gamson 1979; 
Oliver 1983; McAdam 1986). In exploring 
movement continuity, we: must pose a differ- 
ent question: why do individuals maintain 
radical or unpopular convictions over time? 
The few studies that have explored this 
question suggest that the nature of and 
incentives for commitment depend on a 
movement’s stage in the mobilization pro- 
cess. Kanter’s (1972) research on American 
communes concludes that groups character- 
ized by high commitment are more likely to 
retain participants and to endure. Other 
research suggests that, although individuals 
may become activists through solidary or 
material incentives, continued participation 
depends upon high levels of commitment to 


. group beliefs, goals, and tactics (Hirsch 1986; 


White 1988). 
From its inception, the NWP appealed to 


"women with strong feminist sympathies. By 


the 1950s, continued participation depended 
largely on the singleness of members’ devo- 
ton to the principle of sexual equality 
embodied in the Equal Rights Amendment. 
Rejecting all other proposals for a feminist 
program, NWP leaders insisted that ideologi- 
cal integrity and the dogged pursuit of legal 


_ equality, not membership gain, would guaran- 


tee triumph. 


5 Mary Kennedy to Agnes Wells, July 12, 1950, 


. NWP papers, reel 97. 
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A dedicated core of NWP members worked 
for the ERA’ by lobbying Congress and the 
President, seeking endorsements from candi- 
dates for political office and from other 
organizations, establishing coalitions to. sup- 
port the amendment, ánd educating the public 
through newspaper and magazine articles, 
letters to the editor, and radio and television 
appearances. Commenting on the persistence 
- of feminists’ lobbying efforts, one Represen- 
tative from Connecticut wondered in 1945 
“whether or not the Congress meets for any: 
‘other purpose but to consider the Equal Rights 
Amendment:”® Since the NWP depended 
entirely on. the volunteer work of members, 
commitment: was built on sacrifices of time, 
energy, and financial resources. Recognizing 
the impact of such high levels of commit- 
ment, one new recruit commented that “the 
secret of the ability of the group to do so 
much in the face of such odds is that it can 
attract just such devotion and loyalty."? 
Commitment, then, contributes to the 
abeyance process by ensuring that individuals 
continue to do what is necessary to maintain: 
the group and its purpose evén when the odds 
‘are against’ immediate success. Moreover, 
' such intense commitment functions as an 
“obstacle to participation in alternative roles. 
and organizations. 
Exclusiveness. Organizations vàry uoi- 
ing ‘to their openness to members, some 


having more stringent criteria than others. - 


Mizruchi (1983, p. 44) hypothesizes that the 
expansion-contraction of.an abeyance organi- 
zation’s personnel occurs in response to 
changes in the larger, social system's require- 
ments for absorption, mobility, or expulsion 
of marginal populations. To absorb large 
. numbers of people who are unattached to 
other structures requires organizations to be 
inclusive, as happens during the peak mobili- ` 
zation of social movement organizations. In 
cycles of decline, however, when challenging 
groups lack widespread attitudinal support, 
organizations become exclusive and attempt 
to-expel or hold constant their membership. 
Zald and Ash (1966) contend that exclusive 
movément organizations are more € to: 
endure than inclusive ones. 

At the peak of me suffrage struggle, the 





$ Joseph E. "Talbot to Florence Kitchelt, Febru- l 


ary 12, 1945, Kitchelt papers, box 8 (234), SL. 
'" "Mamie Sydney Mizen to Florence Armstrong, 
October 25, 1948, NWP papers, reel 94. 
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NWP was inclusive across the class and 
political. spectrum (Cott -1987, pp. 53-55). It 
attracted: wage-earning women from a ‘variety 
of occupations as well as elite women social 
activists. Its members had ties to political 
parties, government, and industry, as well as 


` to the socialist, peace, labor, and antilynching : 


movements. But when the NWP launched its. 
ERA campaign, many bodies organized on 
occupational, religious, .and racial grounds . 
and devoted to other policy issues- began to 
absorb, women from mainstream suffrage 
groups and siphon off NWP members. This | 
left the NWP with a small and relatively 
homogeneous permanent core of feminists. 

By the end of World War II, the NWP had 
lost-most of its members and was not 
attracting new ones. Compared to its 60,000 


. members in the last years of the suffrage ' 


campaign, the NWP had about 4000 “gen- 
eral" members by 1945 and only 1400 by 
1952. More revealing, it listed 627 "active". 
miembets in 1947 and 200 by 1952. Although 
the NWP. also lost members as. a result of an 
internal conflict over whether to expand 
membership in 1947 and again,in 1953, the 
leadership preferred to keep the organization . 
a small elite vanguard. AS one member put it, . 
“no mass appeal will ever bring into the Party 
that type of woman who can best carry. 
forward our particular aims. We are an ‘elect 
body’ with a single point of àgreement. "5 
Just as important, the membership of the. 
NWP 'also grew increasingly homogeneous 
and socially advantaged over the decades. Of 
55 core activists, 90 percent of the employed 
held professional, managerial, or technical ` 
positions. Several researchers have noted that 
intellectuals and other privileged groups are 
likely to be overrepresented among the 
leadership and supporters .of neo-liberal and 
commünal movements., Some have attributed 


' this to the risks and resources tliat participa- 


tion entails (Lenin 1970; McCarthy and’ Zald 
1973; Oberschall 1973, p. 161), while others 
look to the unique political culture of 


intellectuals and professionals (Pinard and 


Hamilton 1988). 

Despite the. fact that the NWP lala 
preferred a small homogeneous membership, 
they recognized the significance-of size and 
REM for public impact. In ad to RS : 





j Open letter” from: Ernestine Bellamy: to Ethel 
Ernest Murrell, | May 24, 1953, NWP papers, reel 
99, 5 
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the appearance of a mass constituency, the 
NWP devised certain strategies. Members 
maintained multiple memberships in women’s 
organizations in order to win endorsements 
for the ERA; they established coalitions to 


give the impression of a large membership; 


they financed a “front” organization to give 
the appearance of cooperation between femi- 
nists and labor women; and they recruited 


leaders of the National Association of Col-: 


ored Women in order to obtain its endorse- 
ment of the ERA. Yet, despite attempts to 
appear inclusive, the NWP did not seriously 
try to build an indigenous base of support. 

Organizational exclusivity is closely related 
to the commitment variable. 
that insist upon high levels of purposive 
commitment and make stringent demands of 
time and financial resources cannot absorb 
large numbers of people. They are, however, 
good at holding constant those members that 
they have. Thus, exclusiveness is an impor- 
tant characteristic of abeyance organizations 
because it ensures a relatively homogeneous 
cadre of activists suited to the limited 
activism undertaken. 

Centralization. Organizations vary in their 
centralization of power. Some operate through 
a“single center of power,” whereas decentral- 
ized groups distribute power among subunits 
(Gamson 1975, p. 93). Although centraliza- 


tion contributes to a decline in direct-action' 


tactics (Piven and Cloward 1977; Jenkins and 
Eckert 1986), it has the advantage of 
producing the organizational stability, coordi- 
nation, and technical expertise necessary for 
movement survival (Gamson 1975; Wehr 
1986; Staggenborg 1989). 

By the end of World War II, the NWP 
functioned almost entirely on the national 
level with a federated structure in which local 
and state chapters had little control. State 
branches, which had been active in the 1920s, 
consisted in most cases of little more than a 
chairman and served the organization primar- 
ily as letterheads to use in lobbying senators 
and representatives.’ 

A national chairman headed the NWP. The 


Party's founder and leading light, Alice Paul, 


however, directed and kept a tight reign on its 
activities, even though she formally occupied 
the chair for only a brief period from 1942 to 


°T use the term “chairman” because that was 
the term used at the time. It seems mforically 
inaccurate to change this usage. 


Organizations 
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1945. As one member described it, Paul 
“gave the chairman all deference. But if you 
were a wise chairman, you did what Alice 
Paul wanted, because she knew what was 
needed.” !° The chairman headed a national 
council that met peziodically at the Washing- 
ton headquarters. There was also a national 
&dvisory council composed of prominent 
women who lent their names to the group’s 
work. 

Paul, reputedly & charismatic leader com- 
mitted to the point of fanaticism, maintained 
tight control over the ERA campaign. She 
cecided when it wes time to press Congress 
and when to maintain a low profile and, 
according to members’ reports, worked from ' 
six in the morning until midnight. On at least 
two occasions serious conflict erupted over 
the lack of democratic procedures in the 
Party. It focused specifically on Alice Paul's 
autocratic leadership style and on the refusal 
af the national leadership to allow state 
branches to expand membership. A letter, 
circulated in 1947, contained charges typical 
of those directed against Paul: 
made it clear that you consider yourself and 
the small group around you an elite with 
superabundant intellect and talents, and con- 
sider us, in contrast, the commonfolk.”!! 
Thus centralization of leadership, like exclu- 
siveness, had the potential to provoke conflict 
among members. But it also had advantages . 
in a nonreceptive political environment. 

Paul used her influence to direct a small 
g-oup of activists with highly specialized 
skills—lobbying and researching, testifying, 
and writing about policy issues— who viewed 
themselves as an embattled feminist minority. 
The NWP was able to finance its activities 
with some invested funds, dues, contributions 
from members, and revenue from the rental of 
rcoms in. its Washington property. As a 
result, activists did not have to expend energy 
generating resources to maintain the organiza- 
tion. 

This kind of central direction allowed the 
NWP to sustain the feminist challenge 
through the years by concentrating on a single 
strategy that could be carried out by a 
dedicated band of activists with highly 





1° Interview no: 2. 

! Laura Berrien and Doris Stevens, “An Open 
Letter to Miss Alice Paul,” Committee on 
Information, Bulletin No. 4, July 30, 1947, 
Katharine A. Norris papers, box 2 (7), SL. 
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specialized skills. Thus, centralization contrib- 
utes to the abeyance process by ensuring the 
maintenance of organization and at least 
minimal activity during periods when condi- 
tions do not favor mass mobilization. 

Culture. The culture of a social movement 
organization is embodied in its collective 
emotions, beliefs, and actions. Although all 
social movements create and bear culture, 
movement organizations vary in the character 
and complexity of their cultures (Lofland 
1985). 

The effectiveness of an organization with 
respect to its abeyance function depends, in 
part, on its capacity to motivate persons to 
assume certain positions. As the larger 
political and cultural atmosphere becomes 
less hospitable to the social movement, 
recruitment of personnel becomes difficult. In 
order to make participation more attractive, 
organizations must elaborate alternative cul- 
tural frameworks to provide security and 
meaning for those who reject the established 
order and remain in the group. Previous 
research suggests that the more highly 
developed an organization’s culture, the more 
it offers members the ‘satisfaction and other 
resources necessary for its survival (Kanter 
1972; Lofland 1985). 

The NWP developed an elaborate and 
expressive culture through activities at the 
Alva Belmont House, its national headquar- 
ters in Washington, D.C. Belmont House 
served not only as an office for national 
council meetings, but also as a center where 
lobbying efforts were coordinated and where 
the monthly newsletter was published. It also 
created the kind of female world essential to 
the maintenance of feminism (Freedman 
1979; Rupp 1985). A few women lived at 
Belmont House and in two other Party-owned 
houses, while lobbyists stayed there from a 
few days to several months. In addition, 
Belmont House was the site of feminist events 
and celebrations: teas to honor women 
politicians or sponsors of the ERA, victory 
celebrations, and parties on Susan B. An- 
thony’s birthday or on the anniversary of the 
suffrage victory. The activities and relation- 
ships women formed at Belmont House 
provided both ideological and affective sup- 
port for participation in women’s rights work. 

Although NWP members believed in the 
pervasiveness of discrimination against 
women, the Party did not develop and 
advance a well-articulated ideological and 
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theoretical position. Rather, feminism was 
defined principally through a culture that 
promoted a feminist worldview. One mem- 
ber expressed her world view, complaining of 
“Masculinity running rampant all over the 
earth!” and rebelling at the “utter man- 
mindedness” she saw all around her.!? Alice 
Paul characterized women’s rights advocates 
as sharing a “feeling of loyalty to our own sex 
and an enthusiasm to have every degradation 
that was put upon our sex removed."!? 
Despite occasional conflict over whether men 
should be brought into the movement, the 
NWP retained a separatist strategy. To ensure 
that the Party remain committed to its original 
vision—collective action by women for 
women— wealthy benefactor Alva Belmont 
included a clause in her bequest revoking her 
legecy if men ever joined or participated in 
the organization. . 

In addition to reinforcing feminist beliefs, 
the culture harbored at Belmont House 
fulfilled expressive and symbolic functions 
that contributed to the survival of feminism. 
Women who lived and worked at the house 
became, for some, the “Woman’s Party 
family.” Many who could not live at the 
house, because of family, work, and financial 
constraints, made regular pilgrimages in order 
to remain a part of the women's community. 
One member wrote that she was "looking 
forward with joy to my return home, and 
Home to me now, means the dear Alva 
Belmont House. "!^ In fact, bringing friends to 
Belmont House was the primary way that 
women recruited new members. 

Personal ties of love and friendship among 
members were an important cultural ideal. A 
willingness to shape personal relationships 
around the cause was, in large measure, what 
made possible the intense commitment of 
members. NWP members described their ties 
as mother-daughter or sororal relationships, 
and members' letters to one another were 


2 Rose Arnold Powell, diary entry, Nov. 2, 
1960, Powell papers, box 1, v. 8, SL; Rose Arnold 
Powell to Mary Beard, June 23, 1948, Powell 
papers, box 2 (27), SL. 

3 Alice Paul, “Conversations With Alice Paul: 
Woman Suffrage and the Equal Rights Amend- 
ment,” an oral history conducted in 1972 and 1973 
by Amelia R. Fry, Regional Oral History Office, 
University of California, 1976, p. 197. 

14 Mary Alice Matthews to Alice Paul, March 
24, 1945, NWP papers, reel 85. 
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filled with expressions of intimacy and 
friendship. Ties among members took the 


form of close friendships, intense devotion to 
Alice Paul, and couple relationships. Having 


,another woman as life partner seemed to 


facilitate feminist work because these wom- 


''en's personal lives meshed well with their 


political commitments. 


Movement organizations that cultivate and’ 
„Sustain rich symbolic lives, then, enhance the 


` abeyance function by helping to hold mem- 
bers. This finding is consistent with other 
research that demonstrates that commitment 
` to peers and tò a shared political community 
- promotes sustained involvement in social 
movements (Rosenthal and Schwartz forthcom- 


ing; McAdam 1988; White 1988). 


In summary, I have described the NWP in 


„the post-1945 period as an organizational. 


pattern characterized by high longevity of 
attachment; intense levels of individual com- 
mitment to movement goals and tactics; high 
exclusiveness in terms of membership; high 
centralization that ensures a relatively ad- 
vanced level of specialized skills among core 


. activists; and a rich political culture that 


promotes continued involvement in the move- 
ment. This appears to be the ideal combina- 
tion of factors necessary to hold a movement 
in abeyance until the external forces make it 
possible to resume a more mass-based 
challenge. 


CONSEQUENCES FOR THÉ 
RESURGENT WOMEN'S MOVEMENT 


. However movement success is measured, the 
: women's rights movement from 1945 to the 


mid-1960s was not successful in its own time. 
what 
if any, 
feminists in this period have for the revital- 


: ized movement for gender equality in the late 
‘1960s? The founding of the National Organi- 
' zation for Women (NOW) in 1966 serves as a 
‘useful signpost marking the rise of the 
‘contemporary women's movement. 


NOW. 
brought together labor union activists, govern- 
; ment employees, and longtime feminist activ- 
ists and took leadership of the liberal branch 
. of the movement (Freeman 1975). At about 
the same time, younger women involved in 
the civil rights and New Left movement 


-formed the more locally organized radical 


branch. - 
Studies have not generally .recognized 


did the actions of, 
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- connections between the existing women's 


rights movement and the resurgent ' one. My ` 
analysis suggests three ways in which the 
activism of the NWP shaped the feminist 
challenge that followed. It provided preexist- 
ing activist networks, an existing repertoire of 
goals.and tactics, and a collective identity that 
justified feminist opposition. These elements ' 
constitute the most important consequences of. 
abeyance structures for future mobilization 
around persistent discontents. 


Activist Networks 


A substantial body of research documents the 
significance of preexisting links and organiza- 
tional ties among individuals for the rise of 
collective action (e.g., Snow, Zurcher, and 
Ekland-Olson 1980; Freeman 1979, 1983; 
Kosenthal et al. 1985). The feininist network 
of the 1940s and 1950s affected the resurgent 
movement of the 1960s in two ways. First, 
activism by NWP members played a crucial 
role in two key events: the establishment of 
the President's Commission on the Status of 
Women, convened by President Kennedy in 
1961, and the inclusion of sex in Title VII of 
the Civil Rights Act of 1964, which forbade 
discrimination in employment. Second, many 
women who participated in the struggle for 
women’s rights in the 1940s and 1950s 
became active in the resurgent women’s 
movement, especially the liberal branch. 
NWP members were among the founders and 
charter members of NOW. Of the 10 
individuals who signed NOW’s original 
Statement of Purpose, 4 were members of the 
NWP (Friedan 1976). In her account of the 
early years of NOW, founder Betty Friedan 
(1976, pp. 110-17) describes an “under- 
‘ground network” of longtime committed 
feminists who provided crucial resources 
necessary for the formation of NOW. Even 
Alice Paul joined NOW, although she criti- 
cized NOW members for acting “as if they’ve 
discovered the whole idea. "5 

Although less common, a few NWP 
members established ties to the radical 
branch. One member met with the women's 
caucus of the National Conference on New 
Politics in Chicago in 1967, a conference that 
helped spark the formation of the radical 
branch. Another member attended a speech 


13 Interview no. 12. 
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‘ “iy Kate Millett at Purdue University in the 


early 1970s and handed out ERA literature to 
the crowd. Contrasting vividly the feminists 
of her generation with those of the 1970s, she 
noted that she was the only one thére in a hat 
and that everyone else, including Millett, had 


long hair. ^ Thus a committed core of 


activists helped to provide resources for a 
resurgent more mass-based movement. _ 


Goals and Tactical Choices 


Tilly’s (1979) concept of repertoires of 


collective action provides the greatest insight 


: into the ways that actions of a challenging 
‘group at.a given point in time can affect 


actions of a subsequent group. Thus, the 
forms of action available to.a group are not 
unlimited but are restricted by time, place, 
and group characteristics. Movement goals 
and strategies are learned, and they change 
slowly, Extending Tilly’s hypothesis, the 
array of collective actions that a movement 
develops ‘to sustain itself should influence the 
goals and tactics adopted: by the same 
movement in subsequent mass mobilizations. 

This is indeed the case with respect to the 
American women's rights movement. - Al- 


though-the NWP abandoned disruptive and 
. Militant strategies after the suffrage victory, it 


retained the same goal—a constitutional 
amendment. Largely as a result of NWP 
pressure, NOW voted at its second confer- 
ence in 1967 to endorse the ERA, which 
became the most unifying’ goal of the 
movement by the 1970s (Ferree and Hess 
1985; Taylor 1989). Further,, NOW adopted 
many of'the NWP's institutionalized tactics, 


" such as lobbying, letter writing, and pressur- 


ing the pelitical parties. NOW even made use 


‘of the NWP's political connections $ and its list 


of ERA sponsors. 
The ERA example illustrates the ways ‘that 
existing repertoires of action can both facili- 


‘tate and _constrain a movement. The ‘final 
.campaign for the ERA in the late 1970s and 


early 1980s mobilized massive numbers of 


participants, swelling the ranks of ‘(NOW to, 


almost 200,000 and its budget.to nearly 3 
million dollars (Gelb and Palley 1982; 


-Mueller unpublished). During its early years, 
. with its equal rights emphasis which appealed 


‘© Mary: Kennedy to Alice Paul, February ll, 
1971, NWP papers, reel rhe 


‘771 


mainly’ to white and. middle-class ‘women, 
NOW alienated. black and union + women 


(Giddings 1984). Thus, the liberal branch of . 


the contemporary women’s movement, by 
adopting the goals and strategies of earlier 
feminists, found it difficult to shake the class 
and race limitations of its predecessors. 

For a.movement.to,survive periods of 
relative hiatus, it must develop a battery of 
specialized tactics that can be carried out by 
an activist cadre without the support of a mass 
base (Oliver and Marwell 1988) These 


become a part of a group's repertoire of ` 


collective action and influence the subsequent 
range of actions available to future chal- 
lenges. : cd sis 


Collective Identity "s * 


Collective identity is the shared definition ofa 
group that derives from its..members com- 
mon interests and solidarity. Although re- 
source mobilization theorists minimize the 
importance of group identity and conscious- 
hess in the rise of social movements (McCarthy 
and Zald 1973, 1977; Jenkins and. Perrow 


1977), these factors are central to theorists of ` 
the “new social: movements” (e.g., Pizzorno 


1978; Cohen 1985; Melucci 1985; Klander- 
mans 1986). They suggest that, by definition, 
social movements create a collective opposi- 
tional consciousness. Mueller’ s: (1987, p. 90) 
research on the women’s movement suggests 
that: changes in consciousness can have 


long-term significance because they can serve ` 


as a resourcé for. future mobilization. « 

The creation of a shared collective identity 
requires the group to revise its history and 
develop symbols to reinforce movement goals 
and strategies (Gusfield 1970, p. 309-13). 
For the 1960s women’s movement, the NWP, 
because ofits. ties to suffrage, became an 
important ` symbol of the long history of 
women’s oppression and resistance. As a 
result of its historical significance and prime 


location, Belmont House was used throughout : 
the 1970s for celebrations of women's > 
movement history, as ‘a temporary residence - 


for scholars and ‘students engaged in feminist 
research, and'as a place for ERA lobbyists to 


meet. Moreover, Alice Paul, who earlier had - 
sparked so' many conflicts, became the . 


quintessential symbol of feminist. commit- li 
ment. In 1977, NOW ‘sponsored a birthday- 


benefit for her ‘at Belmont. House that was 
attended by members of'a wide range of 
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feminist organizations. Even after Paul’s 
death in 1977, the NWP continued to list 
her as founder on its letterhead and to 
advertise “Alice Paul Jail Jewelry,” a replica 
of the famous jailhouse door pin proudly 
worn by imprisoned suffragists. 

The significance of the NWP grew as 
younger and more radical women discovered 
the legacy of militant feminism. Even in 
1981, the NWP’s symbolic importance re- 
mained great enough to inspire an attempted 
takeover by a group of younger feminists, led 
by Sonia Johnson, who claimed the militants 
who first formed the NWP as their foremoth- 
ers and even adopted the original name of the 
Party. Ironically, as the contemporary wom- 
en’s movement grew stronger and more 
militant, the actual heirs of the early militants 
grew increasingly isolated and less central in 
the struggle for women’s rights. 

In an abeyance phase, a social movement 
organization uses internally oriented activities 
to build a structure through which it can 
maintain its identity, ideals, and political 
vision. The collective identity that it con- 
structs and maintains within a shared political 
community can become an important sym- 
bolic resource for subsequent mobilizations. 


CONCLUSION 


This paper presents new data that challenge 
the traditional view that no organized feminist 
challenge survived in the 1940s and 1950s. I 
have used the NWP case to highlight the 
processes by which social movements main- 
tain continuity between different cycles of 
peak activity. I analyze the factors associated 
with adaptations of Mizruchi’s (1983) abey- 
ance process. Abeyance is essentially a 
holding pattern of a group which continues to 
mount some type of challenge even in a 
nonreceptive political environment. Factors 
that contribute to abeyance are both external 
and internal to the movement. Externally, a 
discrepancy between a surplus of activists and 
a lack of status opportunities for integrating 
them into the mainstream creates conditions 
for abeyance. Internally, structures arise that 
permit organizations to absorb and hold a 
committed cadre of activists. These abeyance 
structures, in tum, promote movement conti- 
nuity and are employed in later rounds of 
mass mobilization. 

Although any theory based on a single case 
is open to challenge, recent research points to 
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the utility of the abeyance model for under- 
standing other movements of the 1960s, 
particularly the civil rights (McAdam 1988), 
New Left (Gittlin 1987; Isserman 1987; 
Hayden 1988), and gay rights (D’Emilio 
1983) movements. But this work has not yet 
had major impact on revising theory about the 
sixties movements or on social movement 
theory in general. 

Why have scholars of social movements 
neglected sources of continuity between 
cycles of movement activity and, instead, 
preferred an “immaculate conception” inter- 
pretation of social movements? First, scholars 
generally are more interested in movements 
undergoing cycles of mass mobilization and 
have done little research on movements in 
decline and equilibrium. Second, the limited 
conceptualization of movement organization 
in the literature has perpetuated classical 
conceptions of social movements as numeri- 
cally large and mass-based. Research on a 
variety of organizational forms, including 
becalmed movements (Zald and Ash 1966), 
professional movements (McCarthy and Zald 
1973), movement halfway houses (Morris 
1984), elite-sustained movements (Rupp and 
Taylor 1987), and consensus movements 
(McCarthy and Wolfson unpublished), is now 
challenging the classical view by suggesting 
that these types of movements are capable of 
sustained activism in nonreceptive political 
climates (Staggenborg 1988). Third, existing 
approaches overlook social movement conti- 
nuity by neglecting to think about outcomes 
(Gamson 1975). Focusing on sort-term gains 
ignores the possibility that social reform 
proceeds in a ratchetlike fashion, where the 
gains of one struggle become the resources 
for later challenges by the same aggrieved 
group (Tarrow 1983). 

The research presented above specifies the 
ways that organizational and ideological 
bridges span different stages of mobilization. 
Most movements have thresholds or turning 
poirts in mobilization which scholars have 
taken for “births” and "deaths." This research 
suggests that movements do not die, but scale 
down and retrench to adapt to changes in the 
political climate. Perhaps movements are 
never really born anew. Rather, they contract 
and nibernate, sustaining the totally dedicated 
and devising strategies appropriate to the 
external environment. If this is the case, our 
task as sociologists shifts from refining 
theories of movement emergence to account- 
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ing for fluctuations in the nature and scope of 


omnipresent challenges. 
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This paper examines the process by whick married men and women form and 
balance work and family identities. Hypotheses derived from alternative 
conceptualizations of the commitment process are tested with data from the 1977 
Quality of Employment Survey. We find that for both men and women, engagement 
in work and family roles leads to identification with those roles. However, the 
process of identity formation differs for men and women in ways that correspond to 
gender-based differentiation in household and workplace activities. Married 
women employed outside the home give precedence to family in balancing work 
and family identities, while married men may have the discretion to build 
identification with work and family roles without trading one off against the other. 
Despite differences in the process of commitment formation, our results suggest 
that when men and women engaged in similar work and family roles mey are 
almost equally committed to those roles. 


INTRODUCTION 


Over 30 years ago, Myrdal and Klein (1956) 
observed that for most women, paid labor was 
a role to be fulfilled over and above their 
primary responsibility for household labor. 
Since then, the proportion of wives working 
outside the home has nearly doubled, from 
just over 30 percent in 1960 to nearly 55 
percent in 1985. The traditional family of a 
married couple with children and a wife not in 
the paid labor force, which represented 38 
percent of all families in 1960, accounted for 
just. 15 percent of all families 25 years later 
(Merrick and Tordella 1988). Yet women 
continue to be primarily responsible for most 
domestic labor and child care, despite slight 
increases in men’s contributions to those 
activities (Walker and Woods 1976; Hart- 
mann 1981; Fox and Nichols 1983; Coverman 
1985). As women increasingly play dual roles 
in the “work-family role system” (Pleck 
1977), issues of commitment that may once 
have been taken for granted are now very 
much on the minds of both husbands and 
wives. 

The descriptive studies cited above docu- 
ment how couples spend time on household 
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and labor force activities, while work in the 
human capital tradition attempts to understand . 
the decision-making process that leads to an 
allocation of time and effort to work and 
family roles (e.g., Mincer and Polachek 
1974; Becker 1981, 1985; England and 
Farkas 1986). Sociologists have examined the 
consequences of women's dual roles in terms 
of role strain or “overload” (e.g., Rapoport 
and Rapoport 1969; Scanzoni 1978; Geerken 
and Gove 1983). However, the personal 
bases ‘upon which individuals choose to 
allocate time and other personal resources 
between work and family spheres has re- 
ceived far less attention (Aldous 1982; 
Johnson and Firebaugh 1985; Kanter 1976; 
Plezk 1983). As individuals allocate time and 
energy to work and family roles, they come to 
identify with those roles. Labor force and 
family behaviors build commitments to work 
and family identities. Those commitments in 
turn provide the personal bases for attributing 
meaning to dual roles, identifying conflict 
between them, and forming intentions regard- 
ing future role behaviors. dcdit. in this 
paper, we examine the process by which 
married men and women form and balance 
work and family identities. 


CONCEPTUALIZING COMMITMENT 


Commitment has been defined as the binding 
of an individual to behavioral acts (Kiesler 
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1971; Salancik 1977), and is often analyzed 
as a process of retrospection about a cuimula- 
tive line of activity (Becker 1960).! Individu- 
als adjust their preferences and subjective 
investments to conform to past behaviors and 
become bound to those behaviors to the extent 
that they are explicit, irrevocable, public, and 
volitional (Salancik 1977). Commitment makes 
subject behavior less changeable, thereby 
accounting for stability in subsequent behav- 
ior. 

Applied mostly to the study of organiza- 
tional commitment (e.g., Pfeffer and Lawler 
1980; O'Reilly and Caldwell 1981), this view 
of commitment is equally applicable to paid 
work and family roles. Ít implies that as 
individuals find themselves engaged in a 
particular pattern of employment and family 
responsibilities, they change their subjective 
attachments to be consistent with those 
engagements. Thus, according to this perspec- 
tive, commitments to paid work and family 
roles are functions of one's past and current 


- experiences, responsibilities, and statuses at 


work and in the family, respectively. 
However, one is "committed" to the extent 
that role behaviors become a source of 
meaning or identity (Rosenfeld and Spenner 
1988). Thus, one establishes "work identity" 
or "family identity" as behavior in these 
Spheres becomes a source of meaning and 
contributes to a sense of self, which in turn 
predisposes one to persist in a line of activity. 
Implicit in this definition of commitment is a 
distributional dimension whereby individuals 
distribute or trade off commitments among 


. alternative activities. Thus, activities differ in 


their relative importance as sources of iden- 
tity, and intentions regarding behavior are 
formed with respect to an allocation of time 
and effort across activities  (Safilios- 
Rothschild 1971). Understanding commit- 
ment to work and family requires attention to 
both identity and distribution. 

Research on work and family commitment 
often begins from the premise that an 
individual can only build a strong commit- 
ment to a work identity by relinquishing a 
commitment to family, and vice versa. 
Assuming scarcity in the personal resources 


! See Bielby and Bielby (1988) for a discussion 
of alternative theoretical perspectives on the 
commitment process and a longitudinal research 
design that allows perspectives to be tested against 
one another. 


that sustain commitments, conceptualization 
and measurement schemes typically attempt 
to locate individuals along a dimension of 
commitment to work versus family (Bailyn 
1978; Lopata and Norr 1980). Whether 
individuals in fact trade off commitments 
among alternative activities is not treated as 
an empirical issue from this perspective. 

In contrast, Marks (1977) offers an alterna- 
tive to the "scarcity" view of commitment, 
time, and energy. He argues that individuals 
are able to form strong commitments to 
multiple roles and are almost infinitely 
capable of sustaining numerous, diverse 
involvements. In short, according to this 
"multiplicity" view of commitment, individ- 
uals make time and generate energy to engage 
in role behaviors to which they are commit- 
ted. From this perspective, whether men and 
women trade off commitments is an empirical 
issue, since, under some circumstances, 
individuals might form strong commitments 
to both work and family. Recent empirical 
research on women's allocation of effort to 
work and family roles provides modest 
support for this speculation (Bielby and 
Bielby 1988). 

In our view, neither the "scarcity" nor the 
"multiplicity" views are capable of ade- 
quately explaining how men and women 
distribute commitments to work and family 
roles. Behaviors in these realms are shaped by 
a sex-based division of labor, and the values 
placed on those behaviors are prescribed by sex 
role norms. Because of the demands of 
household responsibilities and the expecta- 
tions surrounding the roles of "wife" and 
"mother" (Johnson 1988), we expect that 
wives employed outside the home balance 
dual role identities by trading one off against 
the other. In contrast, for men, contemporary 
normative expectations for the husband" and 
"father" roles still do not include fully shared 
responsibility and involvement in household 
and child-care activities. Furthermore, a 
husband's role in the workplace is consistent 
with his family obligations as "provider." 
Accordingly, married men may not trade one 
identity off against the other. In short, we 
expect that the differential structural and 
normative constraints on husbands and wives 
allow men to sustain dual work and family 
identities but constrain women to forgo one to 
sustain the other. 

The balance of commitments across work 
and family has implications for occupational 
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and earnings disparities between men and 
women. If an individual's commitment to a 
‘family role precludes a strong identification 
-with a career, then the traditional household 
division of labor could be largely responsible 
for gender inequities in the workplace 
(Polachek 1976). On the other hand, many 

' women may sustain high levels of commit- 
ment to work ‘and family roles, yet find 
themselves disadvantaged in the workplace 
because of assumptions employers make 
about women's commitment to work roles. In 
analyses below, we allow for a reciprocal 
relationship between work and family. iden- 
tity, and test empirically whether women and 
men differ in their propensities to "trade off" 
one form of identity against the other. 


MODELS AND HYPOTHESES 


Table 1 summarizes a model of work and 
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family identity, based on the conceptual 
review described above. The model has the:’ 
following structure. First, work. and family 


identity are influenced by characteristics of an 


individual's work and family roles, réspec- 
tively. Second, a set of worker traits, 
including human capital, labor supply, and 
life cycle characteristics, affect both work and 
family identity: Finally, work and family 
identity are reciprocally related to one an- 
other. Hypotheses about specific relationships 
are indicted in Table 1. ; 

Family traits. Time devoted to household 
responsibilities and length of time in a marital 
relationship are hypothesized to contribute to 
a strong family identity. Since women are 
primarily responsible for child-rearing, the' 
effects of children are hypothesized to be, 


stronger for women. 


Job traits. Work that is’ interesting, utilizes 


a worker’s skills, and allows for individual 


Table 1. Determinants of Work and Family Identity: Hypothesized Relationships* 


WORK IDENTITY 


- Males 
Family traits 
Children under 6 
Children six and older 
Responsible for child 
Child-care hours 
House chores hours 
Years married | 
Job traits 
Job autonomy 
Interesting work 
Skill utilization . 
Self-employed 
Stake in job 
Job security 
7. Afraid to quit 
Worker traits 
' Age 
Spouse NILF 
' Part-time 
Part-year 
. Work continuity 
Education 
Spouse's income 
Occ. % female 
- ' Identity 
. Work identity PIS 
Family identity 0 
*+ positive relationship 
— negative relationship 
0’ no relationship 
? no hypothesis . : 
++ positive relationship stronger among women . 
: -- negative relationship stronger among women 
no applicable UC 


oooooo 


URERA 


P+t#E lit 


inverted U 


Females 


oOoooooo 


I++++++ 
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initiative should facilitate strong work iden- 
tity, as should running one’s own business. A 
feeling that one has a “stake” in one’s job is 
likely to reflect past investments in. the work 
role and also contribute to. work identity. 
Workers secure in their jobs are likely to feel 
stronger attachments to their employers and 
therefore identify more with the work role. 
The “external justification hypothesis" ‘sug- 


gests that individuals are less likely to commit ` 


to behaviors that are motivated by external 
constraints (Aronson 1980). This hypothesis 
has received partial support in studies of 
' organizational commitment (Pfeffer and Lawler 
: 1980; O'Reilly and Caldwell 1981). Applied 
to work identity, we should find that a worker- 
who persists on a job because he-or she is: 
afraid of the consequences of quitting has 
lower commitment than one who is motivated 
by intrinsic features of the job. 

Worker traits. We expect work identity to 
‘increase early in the life cycle and to begin to 
decline shortly after age 40, paralleling life - 
cycle patterns of human capital investment: 
(Mincer 1974). However, we have no specific 
hypothesis regarding life cycle variation in 
family identity, net of age-related changes in 
family characteristics. The consequences , of 
having a spouse who is not in the paid labor. 
force should be different for men and women. 
For a man, the spouse is likely to ‘be. 
specializing in household activities in ways 
that facilitates his identification with a “job. 
For a woman, the spouse is more likely to be 
unemployed than to be specializing in house- 
hold activities. Accordingly, we hypothesize 
that a spouse not in the paid labor force 
facilitates a man’s career identity (Gould and 
Werbel: 1983) but has no impact on a 
women's. For traditional males, identification 
: with family roles may be greater when the 
spouse specializes in household activities. In 
contrast, an unemployed husband is likely to 
have a negative impact on family.dynamics. 
Thus, we hypothesize that having a spouse 
who is, not in the paid labor force. increases 
family identity for men and decreases it for 
women. 

Part-time.and part-year allow for more time. 
in family roles relative to work roles and 
should. therefore decrease work identity: and 
increase family identity. For women, the 
choice of part-time versus full-time work is 
likely to be based on a concern for balancing 
work and family responsibilities. For men, 
the choice is more likely to be shaped by 


‘flexible scheduling, or, 


Tm 
external constraints sich as seasonal and 
business cycle variation in employment oppor- 
tunities. Accordingly, we expect the impact. 
of part-time and part-year work to be greater 
among women. Following the same logic, we 
expect that among women, the effects of 


'continuous versus disrupted work histories 


will be opposite in sign to those for part-time 
and part-year work. However, for men, a’. 
disrupted work history is-unlikely to reflect a 
strong family orientation. Indeed, it is more 
likely to reflect instability in a man's social 
situation and erode commitments to family. ' 
Accordingly, we hypothesize a positive effect 
of work continuity on family identity among 
men. 

Educational credentials can be a resource 
for improving career prospects, and thus we 
expect individuals with more schooling to 
have stronger work identification. We find no 
compelling rationale for a hypothesis regard- 
ing the effect of schooling on family identity. 

. For both.men and women, having a 

well-paid spouse reduces the extent to which 

financial constraints motivate work behav- 

iors. Accordingly, in another variation on the 

"external justification hypothesis," we argue 

that an individual who pursues a 'career 

despite a spouse's high earning power is. 
likely to develop greater work identification ` 
than someone who is constrained to work 
because of the spouse’s lack of earning 
potential. Since the “breadwinner” role is. 
culturally prescribed for husbands, work 
identity for males should be less sensitive to. 
spouse’s income than it is for women. We 
also hypothesize that having a well-paid 
spouse enhances the financial resources avail- : 
able to sustain a family and should therefore 
facilitate family identification. Again, be- 
cause of culturally prescribed role expecta- 
tions, this effect should be greater among "s 
women than among men.’ : 

Drawing on. human capital theory, we 
hypothesize that employment in a female- 
dominated occupation should reduce work . 
identity and increase family identity. Human 
capital economists like Polachek (1976) argue 
that certain jobs are female-domiriated be- 
cause they facilitate fulfillment of family 
obligations. Such jobs may allow for more ` 
as suggested by 
Becker (1985), may require less energy or 
effort (but see Bielby and Bielby 1988). 
Thus, if the human capital approach is 
correct, individuals who pursue work in 
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female-dominated occupations have orga- 
nized their work lives in a way that does not 
require a strong commitment to a career.? 

Work identity/family identity trade-offs. 
Finally, our model allows for a reciprocal 
relationship between work and family iden- 
tity. That is, we allow for the possibility that 
net of particular work and family situations, 
men and women reconcile their commitments 
to work and family by trading one off against 
the other. 

The hypotheses posed in Table 1 reflect 
traditional sex-role arrangements. That is, we 
hypothesize that working wives are embedded 
in a set of role structures that require them to 
sacrifice a strong work identity if they are to 
identify with a traditional family role (and 
vice versa). In part, this is because a women’s 
identification with the family role is closely 
linked to a set of responsibilities in the 
household. ‘However, for traditional males, 
the extent to which they identify with the 
family role is only loosely linked to a 
constellation of family responsibilities. As a 
result, we hypothesize that men have greater 
freedom to develop strong levels of identifi- 
cation with both work and family roles. 
Accordingly, Table 1 shows negative relation- 
ships between work and family identity for 
women, reflecting a trade-off of identities, 
and no direct causal line between the two 
dimensions of identity for men. 

There are two plausible alternatives to the 
hypotheses regarding reciprocal effects posed 
in Table 1. A strict “zero-sum” or “scarcity” 
view of identity suggests negative relation- 
ships between the two dimensions for both 
men and women, while Marks’s (1977) 


? Our theoretical perspective conceptualizes 
commitment as a consequence of behavioral acts, 
and thus we model identity as a function of labor 
supply variables. It is, of course, possible that 
labor supply is influenced by work and family 
identity: individuals who have formed strong 
family identities and weak work identities may 
subsequently choose part-time work in female- 
dominated occupations. However, as measured 
here, identity is assessed by perceptions at the time 
of the survey and labor supply characteristics 
pertain to either prior work experience or “usual” 
work situation. Therefore, the temporal ordering of 
measurements of identity and labor supply corre- 
sponds to the causal direction in our models. 
Nevertheless, estimates of the effects of labor 
supply are likely to be somewhat overstated due to 
simultaneity bias. 
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“multiplicity” view suggests no relationship 
or possibly even positive reciprocal relation- 
ships between the two dimensions of identity. 
Our models allow us to test which of these 
alternative explanstions is most consistent 
with the data. 

Who is more committed, men or women? 
So far, we have posed no hypotheses about 
overall or net differences between men and 
women in commitment to work and family. 
Other research has shown small zero-order 
differences, with men somewhat more com- 
mitted to work roles and women somewhat 
more committed to family roles (Sekaran 
1983; Pleck 1985). Our model of the process 
cf identity formation, which draws upon the 
retrospective approach to commitment, sug- 
gests that any overall differences are likely to 
be attributable to differences between men 
and women in prior work and family 
experiences and behaviors. That is, if men are 
more committed to work roles, it is because 
they have engaged in more public, volitional, 
explicit, irrevocable work behaviors that lead 
to commitment. Thus, we would expect 
comparable levels of work identity among 
women and men with comparable histories of 
work behaviors. Similarly, comparing men 
and women with similar family roles and 
experiences, we should find no sex difference 
in family identity. 


DATA AND MEASUREMENT 
Data 


Data are from the 1977 Quality of Employ- 
ment Survey (QES), a representative sample 
of adults living in households, 16 years of age 
and older, and working at least 20 hours per 
week (Quinn and Staines 1979). Results 
below pertain to samples of 270 married 
women and 761 married men. Detailed 
descriptions of independent variables appear 
in Appendix Table 1, and descriptive statis- 
tics for all measures are reported in Table 2. 
These data are publicly available from the 
Inter-University Consortium for Political and 
Sozial Research. 


Work and Family Identity 


Our dependent measures, work and family 
identity, are modeled as unobserved con- 
structs, imperfectly measured by two survey 
items for each construct. The two indicators 


v 


té.. 


BALANCING COMMITMENTS TO WORK AND FAMILY 781 
Table 2. Descriptive Statistics, 1977 Quality of Employment Sutvey, by Sex (V = 761 Males, 274 Females)* 


MALES 
Variable (range) Mean 
Family traits 
Children under 6 (0-1) 28 
Children six and older (0-1) .49 
Responsible for child (0—1) .04 
Child-care hours (0—8) 1.01 
House chores hours (0—8) 1.14 
Years married (1—55) 15.78 
Job traits 
Job autonomy (1-4) 3.02 
Intrinsic rewards (1—4) 2.95 
Interesting work (1—4) 3.33 
' Skill utilization (1~4) 2.98 
Self-employed (0-1) 16 
Stake in job (1-4) 2.70 
Job security (1-4) 3.16 
Afraid to quit (1-4) 2.68 
Worker traits 
Age (18-78) 41.07 
Spouse NILF (0-1) i .50 
Part-time (0-1) 04 
Part-year (0-1) .09 
Work continuity (0-1) 91 
Education (6-18) 12.65 
Ln other family 
income (0-11.0) 5.55 
Occ. % female (0-100) 14.84 
Ln earnings (6.2-11.5) 9.62 
Identity and role conflict* 
Main sat. from work (1-4) 2.37 
Job most important (1-5) 3.25 
Family most important (1—5) 4.38 
Think of family (1-4) 2.72 
Work/family interfere (1-4) 2.18 
Won't relocate (0-1) WS 


FEMALES 

$.p.> Mean S.D. % NA 
- 24 pn 0% 
= .48 = 0% 
= 44 eH 1% 
1.39 2.03 2.61 2% 
1.10 3.38 2.06 3% 
11.31 15.69 12.99 2% 
64 2.80 78 4% 
60 2.99 76 496 
86 3.26 1.06 1% 
75 2.80 91 3% 
= 13 = 0% 
74 2.40 1.05 2% 
74 3.27 n 5% 
8S 2.31 1.09 3% 
11.67 38.30 13.55 0% 
E 09 = 0% 
— 18 = 1% 
= .19 = 5% 
14 .68 39 096 
2.76 12.58 2.83 1% 
3.98 8.88 2.56 8% 
19.72 66.52 35.65 0% 
55 8.81 81 6% 
79 2.06%** 88 3% 
1.17 2.72*** 1.41 1% 
7 4.47** 81 2% 
67 2.87*** 75 3% 
1.09 2.26* 1.30 2% 
= 59*** ~ 0% 


* Correlation matrices are available from the authors upon request. 


> Standard deviations not reported for binary variables. 


° Significance levels for sex differences on identity and role conflict measures (one-tailed tests): 


* p «X .10. : 
** p < 05. 
wt p « Ol 


of work identity are: (1) “My main satisfac- 
tion in life comes from my work”; and (2) 
“The most important things that happen to 
you involve your job (1 = “strongly disagree 
to 5 = "strongly agree").? The two indicators 
of family identity are: (1) “The most important 
things that happen to you involve your hus- 
band or wife [and your children] (1 = “strong- 
ly disagree" to 5 — "strongly agree); and (2) 
“How often do you think about your husband 
or wife [and your children] when you’re busy 


3 The former item, scaled 1 to 4, has no neutral 
category; the latter includes a middle category, 3 
= “neither agree nor disagree.” 


2 


doing other things” (1 = “rarely” to 4 = 
“always”). On each dimension of identity, the 
items described above emerged as strong first 
factors in exploratory factor analyses includ- 
ing other potential measures. Modest, statisti- 
cally significant sex differences exist on each 
of the four measures of identity (see Table 2). 
Men are somewhat more likely to identity with 
their work roles and women with their family 
roles.4 Statistical models presented below show 
how these differences can be attributed to dif- 


* Pleck (1985) provides a more detailed discus- 
sion of sex differences on these and related 
measures. 
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ferences between husbands and wives in their 
activities, responsibilities, and statuses within 


families and workplaces. 
Analyses below are based on models that 


.allow measurement parameters to vary: by 
' sex. That is, we allow for the possibility that 


the link between underlying identity and the 
survey items is stronger for working husbands 
than for working wives, or vice versa. 
Metrics are established for the unobserved 


„constructs by fixing to unity the first indicator 


` *of each construct. - 


Independent Variables 


` Family traits include whether the respondent 


.has preschool-age or school-age children, 
whether the respondent has primary responsi- . 


bility for child care, hours devoted to child 


'care and household chores, and years mar-. 
ried. Job traits include job autonomy, intrin- ` 


sic work rewards, skill utilization, self- 
employment status, and respondent's percep- 


. tions of having a stake in her or his job, job 


security, and fear of quitting. The intrinsic 


-rewards scale is included as a predictor of the 


role conflict measures. However, the scale is 
not included in the model of work and family 
identity, since "importance" and "meaning" 
of work are perhaps. more correctly viewed as 
reflections of work identity than as determi- 
nants of it. For the model of work and family 
identity only one component of the intrinsic 
rewards scale, interesting work, is included 


among the predictors. 


Worker traits include the following: age, 


'spouse's labor force status, whether the 


respondent works part-time or part-year, work 


.continuity, ‘schooling, family income, sex 


composition of the respondent's occupation, 
and respondent's earnings. All analyses 
control for occupational categories correspond- 
ing to major census groups: professional, 
managerial, sales, clerical, craft, operatives, 
transportation workers, laborers, farmers (in- 
cluding farm laborers and farm managers), 


` and service workers. Nine binary variables: 


control for the ten occupational categories. 
Since we pose no hypotheses about the effects 
of occupational category, coefficients for the 
nine binary variables are not reported in the 
tables below. 

Maximum likelihood LISREL analyses and 
OLS regressions are computed from pairwise- 


. present correlations weighted to adjust for the 


number of eligible respondents per house- 
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kold. Comparisons of results computed pair- 
wise, listwise, weighted, and unweighted 
revealed no appreciable differences. 


RESULTS 


Table 3 reports LISREL maximum likelihood 
estimates for our model of work and family 
identity. Coefficients are constrained to be 
equal across sexes except where we hypothe- 
size sex differences in the strength of the 
relationships (see Table 1).5 Measurement 
parameters are not constrained to be equal 
across sex.$: 
Family traits. As hypothesized, activities, 
responsibilities, and statuses within families. 
and workplaces shape the work and family 
identities of husbands and wives. Additional 
hours devoted to child care and household 
chores are associated with higher levels of 
family identity for both men and women.’ 
Presence of a school-aged child has a small: 
effect on work identity for males and a large 
effect for females. However, holding constant 
hours spent in child care, the presence of a 
younger child has little effect on family 
identity for either sex. Finally, contrary to our 


5 Nested chi-square tests among alternative 
models support our judgments about coefficients 
constrained to'be equal across sex. The improve- 
ment in fit between the model estimated in Table 3 
and one that makes no constraints on coefficients 
across sex is not statistically significant (x^ = 
24 25, 22 df, p > .25). Moreover, the deteriora- 
tion in fit between the model estimated in Table 3 
and one that-constrains all coefficients to be equal 
across sex is statistically significant (x? — 32.68, 
16 df, p « .01). Thus, these tests suggest that in 
the population, coefficients vary across sex much 
in the ways we have hypothesized. 

© The hypothesis that measurement parameters - 
are invariant across sex is firmly rejected in tests of 
nested models (x? = 23.22, 6 df, p < .005). 

7 Of course, the amount of time spent on 
hovsehold tasks by men and women may not 
reflect their respective efforts and responsibilities. 
To test whether a given hour of household work is 
more "committing" for a woman than for a man, 
we allowed the effects of child care and house 
chores hours to differ by sex. The effect of an 


additional hour of house chores was greater for 


wives and the effect of an additional hour of child 
care greater for husbands. However, these differ- 
ences were not statistically significant (x? = 1.27, > 
2 df, p > .50). 


y 


F 
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Table; 3. Determinants of Work and Family Identity among Working Spouses, LISREL Maximum’ Likelihood : 


, Estimates (V = 761 Males, 274 Females) 








' l NEUE WORK IDENTITY . | FAMILY IDENTITY 
Predetermined Variable Males ` Females ^ ` Males Females 
Family traits ' t gi 

Children under 6 0 0 055* . 029 
Children six and older 0 0 .068** 0024999 
Respónsible for child* 0 0 056. . . .056 
Child-care hours" “0 0 ' 022** .022** 
: House chores hours* 0' 0 .030*** .030*** 
. Years married x 10% 0 0 — 51*«b — .051**° 
Job traits ] y > ] 
Job autonomy* ` 034 .034 0 | 0 
Interesting work? .205*** 205*«* 0 0 
Skills utilization* | 166908 .166**» 0 0 
Self-employed* .129** .129** 0 0 
Stake in job* s gore .0g0*** 0 0 
Job secure" — 042 r — 042 0 0 
Afraid to quit* — .. .035 í .035 0.. 0 
Worker traits `, b 
Age x 10 | 047 |] — .094**. — .009 — 062* 
(Age—40)? x 100° ` 016 .033 .020* .034 
Spouse NILF 0 0 .26** -.112 — .032 -.337** 
Part-time .— 049 — 257% .086 ~ 108 
Part-year : —.116 d 110 * .090** Ag4** 
Work continuity 440 . 262 .330*** — .079 
Education" ` m 021% = 021" — .023*** — .023"** 
` La other income l .008 — 021 : —.003 ~ 015 
- Occ. % female x 10° ` 011 —.014 0 0 
Identity C 
Work identity R Ea 048 — .002 
Family identity 2 304 Sat a sacs 
RO. 392 ` ' 642 |. 484 .363 


* Effects constrained to be equal across sex. 


> Significant effect, not in nynenis tee direction (two-tailed test). 


Significance levels: 
*p«.10. 
** p < .05. 
CO*** p « OL. 


hypothesis, length of time in a marriage is. 
associated a lower levels of family iden- 


tity. 


Job traits. Individuals who ‘find their work | 


interesting, utilizing their skills, and giving 
them a stake in their jobs are much more 
likely to. see the work role as a source of 
identity than those whose jobs lack these 
features. Also, as hypothesized, individuals 
who run their own businésses view their work 
role as a source of personal identity. How- 
ever; job autonomy, which is a one of the 
strongest predictors of both job satisfaction 
(Mortimer, Finch, and Maruyama 1985) and 
the amount of effort devoted to the job 
(Bielby and Bielby 1988), has no net impact 


‘on the degree to which the work role is a. 
source of identity. Neither a sense of per 


security nor a fear of quitting are associated 


. with work identity. 


Worker traits. Net of other traits that vary | 


_over the life cycle, the effects of age are quite 


small. The negative effect of age on work 
identity among women'is more likely due to. 
cohort differences than changes over the life 
cycle, and there is no evidence of the © 
hypothesized inverted-U shaped. relationship 
for work identity. In contrast, the pattern of 
effects for the presence of a spouse not 
employed outside the home corresponds 
closely.to our hypotheses. À spouse not in the 
paid labor force contributes to stronger work 
identity for men but not for women. Con- 


'versely, wives with unemployed husbands 


have significantly lower family identity, 
while a wife’ s labor force status has no effect 
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on a man’s family identity. In other words, as 
recently as 1977, a traditional household 
division of labor enhanced men’s identifica- 
tion with their work, while a husband's 
unemployment reduced substantially the de- 
gree to which working wives identified their 
families as central sources of identity. 

: The effects of labor supply are partially 
consistent with our hypotheses. Among 
women, part-time work is strongly related to 
lower levels of work identity. Individuals 
working part-year (mostly teachers) have 
higher levels of family identity (especially 
among women). However, only one of our 
hypotheses about the continuity of labor force 
experience are supported by the data. The 
effects on work identity are in the predicted 
direction, but not statistically significant. 
Although the effect of work continuity on 
family identity among women is not signifi- 
cant, among men, the hypothesized negative 
effect of a disrupted career is surprisingly 
strong. 

The effects of schooling, spouse’s income, 
and occupational sex composition fail to 
support our hypotheses. Net of other variables 
in the model, men and women with more 
schooling have lower levels of family and 
work identity. However, schooling indirectly 
enhances work identity, since the job traits 
that are strong determinants of work identity — 
interesting work, skill utilization, stake in 
job—are positively associated with schooling 
and might be viewed in part as consequences 
of schooling. Contrary to or hypotheses, the 
amount of money earned by a spouse has no 
effect on work and family identity. Thus, 
spouse’s labor force status (see above), not 
his or her earnings potential, directly shapes 
one’s degree of identification with work and 
family roles. Finally, contrary to the hypoth- 
esis derived from human capital theory, 
individuals working in female-dominated oc- 
cupations appear no different from those in 
male-dominated lines of work in terms of 
work identity. 

Balancing work and family identity. Accord- 
ing to results in Table 3, married women 


* Although men and women in clerical occupa- 
tions have lower levels of work identity, it is 
actually slightly lower among craft workers. Net of 
other variables in the model, work identity is 
highest among transportation workers, followed by 
professionals, and lowest among craft and clerical 
workers (coefficients not reported in Table 3). 
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balance work and family identities in a way 
that gives causal priority to identification with 
the family role. Estimates from the nonrecur- 
sive model suggest that balance is not 
achieved by a simultaneous accommodation 
or adjustment between the two sources of 
identity. Instead, a women’s level of work 
identity varies largely as a function of her 
family identity. Working wives who identify 
strongly with the family role avoid identifying 
strongly with their careers, and, equivalently, 
women who form weak commitments to the 
family role in turn tend to identify strongly 
with their work outside the home. In the 
metrics of the model in Table 3, a one point 
increase in family identity leads to a decrease 
in work identity of over one-half point.. But 
the corresponding impact of work identity on 
family identity is virtually zero. In short, 
women adjust their work identities to accom- 
modate their family identities, but not vice 
versa. 

Estimates in Table 3 suggest that men make 
no such trade-offs in establishing identities 
toward work and family. Men appear able to 
form strong (or weak) work identities irrespec- 
tive of commitments to their families, and 
vice versa. Thus, it appears that for men, 
commitment to dual roles is not a zero-sum 
process. This finding is consistent with a 
traditional household division of labor. For 
men, identification with family as a source of 
meaning and identity is not closely linked to 
responsibilities and time commitments within 
the household. Men in traditional families 
have the freedom to commit or not to commit 
to family and work roles without confronting 
the issue of balancing the behavioral and 
psychic demands of activities in those two 
spheres. For women, however, balancing 
identities is not insulated from competing 
responsibilities in the two realms. Our results 
indicate that commitment to family received 
primacy in for typical working wives in 1977. 

We note one caveat regarding this conclu- 
sion. The results for men are not quite as 
definitive as the point estimates in Table 3 
suggest. Although we cannot reject the 
hypothesis that work and family identity 
among men are independent of one another 
(apart from spurious sources of association), 
neither can we reject the hypothesis that the 
reciprocal relationships are the same for men 
and women. That is, constraining the trade- 
offs between work and family identity to be 
equal across sex does not lead to a serious 


LE 
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deterioration in the fit of the model (x? = 
1.27, 2 df, p > .10)? The 95 percent 
confidence interval for the effect of family 
identity on work identity among men ranges 
from —.528 to +.423, whereas the corre- 
sponding interval for women is —.951 to 
— .105. In sum, we can safely conclude that 
family receives priority in the balancing of 
identities among women, but given the 
available data we cannot reach strong conclu- 
sions about the process among men. 

Who is more committed and why? Descrip- 
tive statistics in Table 2 show modest sex dif- 
ferences, on average, on the four measures of 
work and family identity. Women tend to iden- 
tify more strongly with family, and men with 
their careers. Our model of the identity for- 
mation process suggests that part of the gender 
disparity in commitment can be attributed to 
differences between men and women in the 
actual roles they fill within families and work- 
places. Table 4 reports net differences be- 
tween men and women on two of our mea- 
sures evaluated at four points. The first line 
shows the mean difference between married 
working men and women on the items “my 
main satisfaction in life comes from my work" 
and “the most important things that happen to 
you involve your husband or wife [and your 
children]."!9 The next three lines show net 
differences predicted from the reduced form of 
the model in Table 3, evaluated at the grand 
mean of the exogenous variables, at the male 
mean, and at the female mean. 

Of particular interest are net differences in 
work identity evaluated at the male mean and 
net differences in family identity evaluated at 
the female mean. The former captures pre- 
dicted differences in work identity between 
men and women who have job traits, labor 
supply, and human capital endowments of the 
average male in the sample. The result in Ta- 
ble 4 shows that the sex difference in work 
identity disappears when we compare men and 
women with traits of the typical male (line 4 of 
the first column). In fact, compared to men 


? In the constrained model, the estimate for the 
effect of family identity on work identity is 
-— 324, p< .05. 

? These two items are the reference indicators 
of the LISREL model. By setting their respective 
measurement model loadings to unity, we assume 
that the metrics of these indicators are comparable 
across sex. For a more detailed treatment of this 
issue see Bielby (1986). 


Table 4. Decomposition of Mean Sex Differences on 
Reference Indicators of Work and Family 











Identity 
Main Family 
Satisfaction Most 
from Work Important 
Zero-order difference — .306 + .087 
Net effect," at grand mean —.010 —.100 
Net effect, at male mean + .057 —.145 
Net zffect, at female mean —.198 + .030 





* Computations are based on reduced form of model 
reported in Table 3. 


with the same traits, the net difference of .057 
(on a five-point scale) shows women slightly 
higher in work identity. 

Results for family identity evaluated at the 
female mean capture predicted differences 
between men and women who have family 
responsibilities, labor supply, and human 
capital endowments of the average female in 
the sample. Predicted family identity remains 
slightly higher for women (.030 on a 
four-point scale; see last line of second 
column in Table 4) but is two-thirds lower 
than the zero-order difference (4.47 for 
females versus 4.38 for males; see Table 2). 
In short, these results suggest that: (1) when 
women have work statuses and experiences 
similar to men’s, they identify as strongly 
with the work role as do men; and (2) when 
men have household responsibilities similar to 
women’s, they are almost as strongly commit- 
ted to the family role as are women.!! 


SUMMARY AND CONCLUSIONS 


Our research examined commitments to work 
and family in a way that incorporated the 
trade-offs dual-earner couples manage in 
everyday life. Our conceptualization of com- 
mitment suggested that as individuals become 
engaged in role behaviors, they develop 
identities linked to those roles. Accordingly, 
we expected men and women in dual-earner 
marriages to form work and family role 
identities that were consistent with the 
gender-based roles they engage in over the 


!! Note, however, that these results pertain only 
tc comparisons of married men and women in the 
paid labor force, working at least 20 hours per 
week. Estimates are therefore subject to, sample 
selection bias (Berk 1983). Unfortunately, the QES 
sampling frame precludes ae for the 
eifects of selection bias. 
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life course. We also expected sex differences . 


in the process of identity formation to reflect 
traditional sex role differentiation. . 

The results were largely consistent with our 
hypotheses. For both men and women, a 
strong engagement in work arid family roles 
in terms- of .time demands, responsibilities, 


and the like leads to identification with those’ 


roles. However, the process. of identity 
formation differed for men and women in 
ways corresponding to gender-based differen- 
tiation in the roles husbands and wives play in 
. the family and in the paid labor force. For 
. example, time out of the labor:force has 
different consequences for the identities of 
husbands and wives. Labor force interrup- 
tions eroded family identity for males but not 
for females, while having an employed 
spoüse increased family identity for females 
- and decreased work identity. for males. Thus, 
how time out of the paid labor force is used to 
serve family demands differs by sex in ‘ways 
consistent with traditional sex roles. 

Given our-conceptualization of commit- 
ment, an especially important difference in 
the process of identity formation is in how 
: men and women balance identification with 
' work and family roles. Married working 
women give precedence to family in balanc- 
ing work and family identities. In contrast, 
married men may have the discretion to build 
identification with work and.family roles 
. without trading one off against the other. 


-. These results fail to fully support either 


"scarcity" or "multiplicity" notions of com- 

mitment and.show instead how the gender- 

based structural and cultural context shapes 

. „the identity formation process. 

. In families with a traditional division of 
labor, wives take responsibility for household 


' roles that obligate them to engage in specific , 
behaviors. Thus, it is not surprising that the ` 


family is given priority in the distribution of 
role identities among working wives. In 
contrast, for husbands in traditional families, 
a strong family identity obligates them to very 
limited responsibilities outside of the "provid- 

er" role. This suggests that husbands and 
wives form different kinds of attachments to 
the. family role. Future research should 
disaggregate the conceptualization and mea- 
surement of family identity in order to capture 
identification with various aspects of roles in 


dual-earner families that might vary by. sex;. 


work context, and family situation. Future 
, research might also differentiate among single- 
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earner families, more and less traditional 
dual-earner families, single-parent families, 


and single adults to better understand varia- . 
tion and emerging trends in the link between ' 


work and family identity. 
Differences between men and women in 
the roles they play at home and in the 


workplace largely explain sex differences in- 


family and work identities. Sex differences in 
work identity are negligible between working 
Lusbands and, wives with typically "male" 
work and family situations. Further, married 
men who take on the family responsibilities of 
the typical working wife form family identi- 
ties not much different than their female 
counterparts. These results support the con- 
ceptualization of commitment posed above: as 
individuals become engaged in role behav- 
iors, they develop identities linked to those 


roles. The findings also suggest that job 


segregation in the workplace and inequality in 
tbe household division: of labor generate sex 
differences in commitment. Thus, increased 
parity between men and women in their 
workplace and household roles should contrib- 


ute to stronger work identity among women 


and family identity among men. 

: Whether the dynamics of identity formation 
in dual earner couples has changed over the 
pest decade is, for now, a matter of 
speculation. On the one hand, the increase in 
female labor force participation and in the 
number of female-headed households may 
make the dual roles of "mother" and 
“provider” culturally acceptable and the dual 
identity more easy to sustain for working 
women. On the other hand, there has been 
only modest reduction in sex segregation in 
the workplace and perhaps even less change 


in-the household division of labor. We leave -' 


it xo future research to discover whether the 


structural and cultural context of work and ^ — 
family roles have changed enough to alter the .. 


identity formation process. 

Another. important area for future research 
is exploration of the consequences of work 
and family identity. In our own research, we 
are using the QES data to examine how role 


identities shape the resolution of conflicts over 


work and family demands. However, cross- 
sectional research designs have significant lim- 
itations that can be only partially overcome 
through statistical modeling. A definitive study 


of the link between identity formation, role 
conflict, and role integration would follow in-,- 
dividuals over the life course and examine the ` 


LS 
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ideals they develop about how role behaviors 
should evolve sequentially. It would then ex- 
amine the overall pattern of choices individu- 


als make regarding work and family activities - 


across the life course and also disentangle the 
over-time relationships between work and fam- 
ily identity formation, role conflict, and role 
integration. 
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Appendix Table 1. Descriptions of Independent Variables 


Variable Metric 
Family traits : 
Any children under six binary 
Any children in six or older binary 
Responsible for a child binary 
Child-care hours hrs 
Household chores hours brs 
Years married yrs 
Job traits 
Job autonomy 1-4 
Intrinsic rewards 1-4 
Skill utilization 1-4 
Self-employed binary 
Stake in job 1-4 
Job security 1-4 
Afraid to quit 1-4 
Worker traits 
Age yrs 
Spouse not in labor force binary 
Part-time work binary 
Part-year work binary 
Work continuity proportion 
Education yrs 
Ln other family income Ln $ 
Occupation sex composition percent 
Earnings Ln $ 





Description 


Whether at least one child under six lives in household. 

Whether at least one child six or older lives in household. 

“If someone has to be home with your child(ren) or do 
something for (him/her/them) when you are both supposed 

, to be working, which of you is more likely to stay home?” 
(1 if respondent; 0 if spouse, “it depends,” spouse not in 
labor force, or childless.) 

"On average, on days when you're working, about how much 
time do you spend taking care of or doing things with your * 
children?" (0 for childless respondents.) 

*On average, on days when you're working, about how much 
time do you spend on home chores—things like cooking, 
cleaning, repairs, shopping, yard work, and keeping track of 
money and bills?" 

Years married to current spouse. 


Average of four items: (1) “I have the freedom to decide what 
I do on my job;" and (2) “The work is interesting to me;"' 
(3) "It is basically my responsibility to see how my job gets 
done;" and (4) “I am given a lot of freedom to do my own 
work.” 

Average of three items: (1) “What I do at work is more 
important to me than the money I eam;” (2) "The work I do 
is meaningful to me;" and (3) “The work is interesting.” 

“My job lets me use my skills and abilities." ' 

Primary employment from own business. 

“I have to much stake in my job to change jobs now.” 

Average of two items: (1) “The job security is good; and (2) 
“How likely is it that during the next couple of years you 
will lose your present job and have to look for a job with 
another employer?” 

“I am afraid of what might happen if I quit my job without 
having another one lined up.” 


Age at time of survey. 

Spouse unemployed or not looking for work. 

1 if respondent works 30 hours or less per week. 

1 if respondent works 44 or fewer weeks. 

Proportion of years worked since age 16. 

Years assigned to survey category midpoints. 

Natural log of total family income minus respondent's 
income. (Coded 0 if no other income.) 

Percent female in respondent's three-digit occupation, based 
on April, 1971 CPS. 

Natural log of total compensation from job, before taxes, other 
deductions. Includes overtime pay, bonuses, commissions, 
etc. 
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study of industrial change and the employ- 
ment relationship in television production. 
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RACIAL IDENTITY AMONG CARIBBEAN HISPANICS: 
THE EFFECT OF DOUBLE MINORITY STATUS 
ON RESIDENTIAL SEGREGATION* 


DouGcLas S. MASSEY 
University of Chicago 


Nancy A. DENTON 
University of Chicago 


Within the Caribbean region, racial identity forms a multicategory continuum from 
white to black, whereas in the United States it is a dichotomy of black versus white. 
Many Caribbean Hispanics, therefore, reject a strict racial dichotomy and select 
some category intermediate between black and white when asked to identify 
themselves racially on the U.S. Census. Using 1970 and 1980 census tract data, 
we show that these people display a lew degree of segregation from white 
Hispanics and a high degree of segregation from both black Hispanics and 
non-Hispanic blacks. They are highly segregated from Anglos, however, 
suggesting that people of mixed racial ancestry are accepted by white Hispanics on 
the basis of shared ethnicity but are rejected by Anglos on the basis of race. Black 
Hispanics are highly segregated from all groups. These findings have changed 
little over time and persist despite socioeconomic controls. Although both race and 
ethnicity remain potent factors in American life, results underscore the special 


salience of race in U.S. society. 


The relative impact of race versus ethnicity on 
a group’s ability to assimilate spatially has 
long captured the attention of sociologists. 
Studies typically have compared levels of 
racial and ethnic segregation and tried to 
explain why blacks fare so poorly (e.g., 
Taeuber and Taeuber 1964; Lieberson 1980). 
The findings generally show that blacks 
experience a high degree of segregation that is 
not explained by racial social and economic 
differences (Farley 1977; Massey 1979). 
Ethnic segregation, in contrast, is more 
moderate and is largely explained by social 
and economic factors (Massey 1985). Asians 
closely follow the patterns of European ethnic 
groups, suggesting that it is black race, not 
nonwhite race per se, that matters (Langberg 
and Farley 1985). 

Caribbean-origin Hispanics provide a unique 
opportunity to study the effects of race and 
ethnicity on segregation. because this group is 
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both ethnically distinct from the Anglo- 
American majority of the United States and 
racially diverse. Caribbean Hispanics are 
clearly distinguished from the majority popu- 
lation by their use of Spanish, their Spanish 
colonial heritage, and their Latin Catholicism. 
At the same time, they may be black, white, 
Asian, American Indian, or some mixture 
tnereof. This racial diversity permits a natural 
experiment wherein the effects of race on 
segregation are examined holding ethnicity 
constant. A crucial contrast is between black 
and white Hispanics, who equally differ from 
the majority in their Spanish culture, but who 
in turn differ in race. 

By focusing on "Hispanics," we do not 
underestimate the social, economic, and 
cultural differences between Mexicans, Cu- 
bans, Puerto Ricans, and other Spanish origin 
groups (Bean and Tienda 1987). Rather, our 
analysis is limited by the availability of data. 
For small-area tabulations, which are required 
to measure segregation, the U.S. Census 
Eureau does not cross-tabulate specific Span- 
ish-origin groups by race; racial breakdowns 
are presented only for Hispanics as-a whole. 
Although we examine the racial composition 
of origin groups at the national level using 
special tabulations of microdata, we cannot 
study patterns of racial segregation separately 
for Mexicans, Cubans, and Puerto Ricans. 
We confine our attention to the total “His- 
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panic" population and study urban areas that 
contain large numbers of Caribbean Hispan- 
ics, a group which displays a unique racial 
diversity that stems from the colonial heritage 
of the Spanish Caribbean, which includes the 
Spanish West Indies plus the coastal regions 
of Mexico, Guatemala, Honduras, Nicaragua, 
Costa Rica, Panama, Columbia, and Venezu- 
ela. This region contains relatively large 
numbers of black inhabitants tracing their 
origins to African slaves imported in the 18th 
century to work in the Caribbean plantation 
economy. 

We begin our analysis with a brief history 
of race relations in the Caribbean, focusing in 
particular on Puerto Rico and Cuba. Next we 
describe the racial composition of the U.S. 
Hispanic population using special tabulations 
prepared from the public use microdata 
sample of the 1980 Census. This exercise 
shows how current U.S. racial categories are 


employed by Hispanics, particularly those: 


from the Caribbean, to identify themselves 
racially. Next we consider census classifica- 
tion procedures and the technical problems 
involved in identifying black Hispanics using 
small area data for specific metropolitan 
areas. Finally, we examine how racial 
identity affects the process of spatial assimi- 
lation by computing segregation indices by 
race among Hispanics in selected metropoli- 
tan areas. 

This analysis provides a critical test of the 
relative importance of race versus ethnicity in 
determining the level of spatial integration in 
U.S. society. We address the issue of whether 
black race is simply an end point on an 
ethnicity continuum, or a qualitatively differ- 
ent dimension. If the former, black Hispanics 
should not be spatially distributed differently 
from white Hispanics. If the latter, black race 
will add an additional degree of segregation to 
that stemming from Hispanic ethnicity alone. 
If the latter case, Hispanics who do not 
identify themselves as either black or white, 
but as something in-between, are hypothe- 
sized to provide a transition between the 
moderate segregation of white Hispanics and 
the high segregation of black Hispanics. 


RACE RELATIONS IN THE CARIBBEAN 


The race relations literature reveals a funda- 
mental cleavage between the Caribbean no- 
tion of race and that prevalent in the United 
States. Race in the United States is a binary 


concept: one is either black or white. This 
fact is reflected historically -in the legal 
system that governed slavery and in later Jim 
Crow legislation, which explicitly defined 
who was black and who was white (Motley 
1966; Higginbotham 1978). Persons with 
even a small fraction of black ancestry have 
traditionally, and until recently legally, been 
regarded as black (Gordon 1949). In sharp 
contrast, race in the Caribbean is perceived as 
a spectrum running from white to black, with 
many people falling in between. This funda- 
mental difference follows from a complex 
history that differs importantly from that of 
British North America. 

As in the United States, after colonization 
the Caribbean was populated by three groups: 
American Indians, white Spanish colonizers, 
and black slaves imported from Africa to 
work on plantations (Curtin 1969; Babín 
1971; Fitzpatrick 1981; Morales 1987). Inter- 
marriage was extensive and gradually led to a 
significant mixture of the races, especially in 
Puerto Rico where there was a scarcity of 
Spanish women and a need to increase the 
number of plantation workers (Gordon 1950). 
It was less prevalent in Cuba, where laws 
against intermarriage were stricter (Knight 
1974). 

This blending of the races was encouraged 
by three distinctive features of the Spanish 
colonial experience. First, the Spanish view 
of race sprang from an Iberian vision of it that 
differed substantially from that in northwest- 
ern Europe (Hoetink 1967). Lying between 
Europe and Africa, Spain had a longer history 
of contact with members of other racial 
groups and for several centuries was under 
Airican occupation. The Moslem religion of 
the occupiers also helped to impart toleration 
of racial differences under the aegis of a 
universal religion. As a result, being Spanish 
came to be defined as being non-Moor, 
non-Jewish, or simply Christian, rather than 
being of a certain color. Moreover, the 
perception of "whiteness" was considerably 
darker than the ideal in northwestern Europe 
(Hoetink 1967; Rogler 1946). 

A second factor is the nature of Spanish 
slavery. Under Spanish law, slaves were not 
viewed simply as chattel, but as persons with 
souls and certain rights, including the possi- 
bility of manumission (Hoetink 1967, p. 9; 
Masferrer and Mesa-Lago 1974). Although 
this position may have positively influenced 
black-white relationships culturally, in prac- 
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tice laws protecting slaves’ rights were 
seldom enforced and their lives differed little 
from slaves in the British colonies (Betances 
1972; Gordon 1950). Slaves were particularly 
harshly treated in places where the plantation 
economy was highly developed, as in Cuba. 
Sugar plantations came later to Puerto Rico, 
which was more of a white colony than a 
plantation economy (Betances im wiles 
1945; Knight 1974). ' 

A third factor was the Spanish Catholic 
church: On the one hand, the Church insisted 
' on the humanity of slaves and took steps to 

convert and baptize them; slaves were encour- 

aged to attend mass with their masters. On the 
other hand, the church itself owned slaves, 
-and the: clergy largely ignored them and 
ministered primarily to whites. A frequent 
duty of priests was to attest to the racial purity 

of white children at baptism (Betances 1972); 

Hoetink 1967). Thus the impact of the 

: Colonial: church was analogous to that of the 

. Spanish legal code— promoting a positive 

cultural attitude toward persons of color in 

theory but failing to implement the ideal in 

practice. l 

Growing out of these foundations, the 

Caribbean ` view of race evolved into a 
`: variegated concept that included a diversity of 

categories rather than just two. Numerous 
` words in Caribbean Spanish describe different 

combinations of racial characteristics and 
` reveal the complexity of the racial taxonomy. 
'In addition to the common words for white 
and black, a variety of terms indicate different 
combinations of Negro and Caucasian charac- 
‘teristics. Often these terms are ‘overlaid with 
social class connotations. According to Gor- 
don (1949), raja (the Spanish word for stripe) 
.is-used to indicate a tinge of Negro blood, as 

is pinta (derived from the word for paint). A 

grifo is someone who is light complexioned, 

sometimes with blue or gray eyes, but with 
kinky hair (pelo malo); and an indio generally 
has copper skin with high cheekbones (Mills, 

Senior, and Goldsen 1950). A jibaro is 

someone who is light skinned, but of low 
. socioeconomic background, and typically 

from an inland or rural area. Trigtefio refers 
to darker-skinned persons and is frequently 
substituted for mulato as socioeconomic 
status rises (Rogler 1944). Moreno is often 
used to describe darker-skinned people and in 
many contexts has a pejorative connotation. 
Given this nomenclature and the. ideology 
' that underlies it, racial prejudice is expressed 
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in ways very different than, in the United ` 
States. Overtly, many. deny that racial prob- 


‘lems even exist on the islands, particularly in 


Puerto Rico (Betances .1972; Gordon 1949, 
1950; Babín .1971), an attitude Betances 
(1973) calls “the prejudice of having no 


prejudice." Although racial prejudice may be 


denied, parents frequently urge their children 
zo adelantar la raza (literally “to advance the 
race" or marry whiter), make negative 
comments about Negroid hair ("bad hair") 
and lips (Jorge 1979), and discourage their 
sons and daughters from marrying blacks, 
either because of fear of mistreatment or to 


. avoid “spoiling the race" (Betances 1973). 


. In addition, a long-standing association 
exists between race and social class in Puerto 
Rico and Cuba (Mathews 1974; Betances 
1972). Race is a criterion for employment 
Ciscrimination (Masferrer and Mesa-Lago 
1974) and a barrier to the achievement of high 


‘social status (Bonilla 1961). The upper 


classes have traditionally emphasized white- 
ness and pay careful attention to ancestry. 
They can identify people who have colored 
ancestors, even when’ those people are 
apparently white (Mills et al. 1950). Those of 
darker color also have greater difficulty - 
achieving political office (Betances 1973). As 
a result of ‘these subtle prejudices, the 


. proportion of blacks has declined in succes- 


1 


sive Puerto Rican and Cuban censuses, 
reflecting the social pressure to identify as 
white whenever possible (Masferrer and 
Mesa-Lago 1974; Mathews 1974; Bonilla 
1961; Rogler 1946). f 
Denials of prejudice are often expressed in 
péternalistic terms. They refer to lower 
classes who "know their place" (Megenney 
1974) and argue that "white". refers not to 
skin color but to class (Williams 1945, 1955). 
The denials serve important social ends and 
they distinguish Puerto Rican racial prejudice 
from the more blatant racism of the United 
States (Betances 1973; Jorge 1979; Bonilla ' 
1961). They prevent those who would make 
an issue of racial discrimination from having 
to acknowledge their own mixed racial 
ancestry (Betances 1973). Black Americans 
who have visited the island to promote racial 
equality and-explain the struggle on the 
mainland generally have not received wárm 
receptions (Bonilla 1961; Betances 1972, 
1973; Mathews 1974). . i ' 
‘Most Caribbeans know about racial atti- 
tudes in the United States and have been 
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affected by them (Mills et al. 1950). Both 
Puerto .Rico and Cuba were under U.S. 
influence in 1898 when they were seized 
during the Spanish-American war. Military 
officials stationed on the islands were usually 
southern and commonly espoused the view 
that blacks were unable to govern themselves, 
a sentiment that was shared by U.S. Congress 
at the time (Kennedy 1971; Christopulos 
1974). Although Cuba was eventually granted 
independence, Puerto Rico became a United 
States commonwealth. As soldiers during 
World War II, many Puerto Ricans experi- 
enced discrimination (Betances 1973). Exten- 
sive migration back and forth between the 
island and the mainland after the war also 
spread knowledge about U.S. racial prejudice 
(Mills et al. 1950; Morales 1987). 

In spite of this contact with the mainland, 
however, Caribbean racial attitudes remain 
distinctly different from those prevalent in the 
United States. On the surface, Caribbean race 
relations are tolerant and relatively open, 
consistent with a racial ideology that distin- 
guishes categories of whiteness or blackness 
and attributes manifestations of prejudice, 
especially in Puerto Rico, to social class 
differences. To the extent that it exists, racial 
prejudice remains buried in “damp and 
unhygienic obscurity” (Betances 1973). In 
coming to the United States, therefore, 
Caribbean migrants face a set of race relations 
vastly different from their own, and the 
definition of race has profound implications 
for personal identity, group cohesion, and 
collective behavior. 


HISPANIC RACIAL IDENTITY AND THE 
U.S. CENSUS 


We first investigate the racial self-definition 
of Hispanics by examining their reaction to 
the racial taxonomy of the U.S. Census. Race 
and Spanish origin are identified separately 
on the census questionnaire (U.S. Bureau of 
the Census 1982a). The concepts are nowhere 
defined for respondents, however, and recent 
censuses have been conducted almost entirely 
by mail. Thus race and ethnicity necessarily 
represent self-definitions, and they provide a 
good measure of how people view them- 
selves. ‘ 

In 1980, Hispanics were identified by a 
question that asked of all household mem- 
bers, “Is this person of Spanish/Hispanic 
origin or descent?” and then provided five 


possible answers: No (not Spanish/Hispanic); 
Yes, Mexican, Mexican-Amer., Chicano; 
Yes, Puerto Rican; Yes, Cuban; Yes, Other 
Spanish/Hispanic. The word "race," in con- 
trast, was not mentioned anywhere on the 
census questionnaire. Question four simply 
asked, “Is this person— " and then followed 
the query with a list of 15 possible categories 
that boiled down to five basic groups: white, 
black, American Indian, Asian, and Other 
Race. Ignoring the latter three categories for a 
moment, the wording of the question and the 
procedures used to code it closely adhere to 
the black-white dichotomy typical of the 
North American racial taxonomy. Categories 
such as mulatto, Creole, African, or Afro- 
American do not appear on the questionnaire, 
and if they are written in, they are recoded as 
black (U.S. Bureau of the Census 1986). 

Since race and Spanish origin are deter- 
mined by separate questions, Hispanics are 
free to place themselves in one of several 
racial categories. The cross-classification of 
race by Spanish origin provides the basic data 
for a study of racial self-definition. Such a. 
study is aided by a recent change in census 
coding procedures. In 1970, Hispanics who 
wrote entries such as Cuban, Puerto Rican, 
Chicano, or some other Spanish identification 
in response to the race question were recoded 
as whites. In 1980, however, these people 
were coded as “Spanish Race" and tabulated 
separately under the “Other Races" category. 
Only if Hispanics specifically stated that they 
were white or Caucasian were they coded into 
the white racial group; and likewise only a 
specific mentioning of black, Negro, African, 
or mulatto was coded as black (U.S? Bureau 
of the Census 1986). Although we cannot 
know the reasons why respondents select one 
category over another, the coding change 
allows Hispanics, for the first time, to place 
themselves in.a racial classification outside 
the strict black-white dichotomy. We assume 
that Hispanics of racially mixed origins are 
most likely to describe themselves with a 
write-in term and are recoded as "Spanish 
race." à 

The cross-tabulation -of race and the 
Hispanic groups in Table 1 was prepared 
using data from the 5 percent Public Use 
Microdata Sample (PUMS) of the 1980 
Census. Overall, a clear majority of Hispan- 
ics (about 58 percent) regard themselves as 
white, an only a tiny minority (about 3 
percent) see themselves as black. Roughly a 
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Table 1. Race by National Origin among Those Who Responded to 100-Percent Spanish-Origin Question 


Puerto 
Race: Mexican Rican 
United States v 
White 55.4 48.3 
Spanish 37.5 43.1 
Black 1.9 3.5 
Amerindian 0.7 0.2 
Asian 0.3 0.6 
Other 4.2 4.3 
Number 438,593 101,751 
60 SMSAs in data set 
White 54.4 47.8 
Spanish 38.8 44.1 
Black 1.3 3.6 
Amerindian 0.7 0.1 
Asian 0.3 0.2 
Other 4.6 4.2 
Number 80,424 


283,423 
SOURCE: 1980 59» PUMS file. 


third did not find a census category that 
corresponded to their racial self-definition, 
and wrote in a response that was recoded 
"Spanish race." Very few Hispanics identi- 
fied themselves as Asians or American 
Indians. 

‘Substantial differences appear in the racial 
self-identification of the three principal Span- 
ish-origin groups. Cubans are most likely to 
see themselves as white and least likely as 
outside of the black-white dichotomy. Only 
10.5 percent reported themselves as of 
"Spanish race," whereas 84 percent identified 
themselves as white, and 3 percent as blacks. 
This relatively close adherence to the bipolar 
classification of the United States reflects the 
lower degree of intermarriage on the island of 
Cuba (Knight 1974), and the highly selective 
racial immigration: of Cubans to the United 
States. Cubans who arrived as refugees from 
Castro's Cuba were drawn principally from 
the middle and upper classes, where blacks 
were underrepresented. According to Portes 
and Bach (1985, p. 88), blacks constituted 
less than 5 percent of Cuban immigrants prior 
to the Mariel boatlift of May 1980. During 
the boatlift, the percentage jumped to about 
30 percent, which is closer to the percentage 
of blacks on the island, but this migration 
occurred after the 1980 Census was taken. 

Puerto Ricans appear to reflect more 
closely the multicategory Caribbean racial 
identity. Since Puerto Ricans are U.S. 
citizens, movement between the island and 
the mainland is unrestricted and migrants are 


Cuban Spanish Total 
83.8 63.4 57.7 
10.5 22.8 33.7 
2.9 4.5 2.7 
0.1 1.1 0.2 
0.2 4.7 12 
2.6 3.5 4.0 

41,106 157,309 738,759 
83.8 6.6 . 512 
10.7 24.2 34.5 
2.7 4.8 2.5 
0.1 ' 0.9 0.6 

0.2 3.8 1.0 
2.5 3.6 ` 4.2 

35,572 110,203 “509,622 


mcre representative of the island as a whole. 
If anything, the flow is skewed toward the 
lower classes where black racial traits are 
overrepresented. Although the. percentage of 
blacks among Puerto Ricans is small (3.5 
percent) and does not differ much from that of . 
Cubans, the fraction who state they are of ` 
“Spanish race” is four times greater. Some 43 
percent of Puerto Ricans see themselves as 
neither black nor white, but- something 
in-between, the highest percentage for any 
Hispanic group. 

Mexicans likewise specify a “Spanish 
race" quite often, but for them "Spanish: 
race” is likely to mean something quite 
different. The black racial presence in Mexico 
is negligible, and to the extent that it appears 
it is isolated geographically away from the 
main migrant-sending regions (Parkes 1950). 
In Mexico, “Spanish race” probably means 
mestizo, a mixture of European and Indian 
ancestries. There the racial continuum runs 
from white to Indian, not from white to black. 
Mexico’s official ideology is that it is 
a mestizo country (see Russell 1977), and 
some observers believe that 90 percent of 
Mexicans fall into this category (Riding 
1985). Nevertheless, only about 38 percent of 
Mexizans classified themselves as “Spanish 
race,” whereas 55 percent called themselves 
white. MAN 

It is difficult to interpret the figures for the 
Other Spanish category, since it is a compos- 
ite of many origin groups. Nonetheless, 
several of the largest origin groups are from 
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Caribbean nations that contribute black mi-' 


grants, including the Dominican Republic, 
Colombia, Guatemala, Nicaragua, and Pan- 


ama. Accordingly, the percentage of blacks is ` 


relatively high (4.5 percent) and the percent 
identifying themselves:as "Spanish race”: is 
about 23 percent. Given the heterogeneous 
composition’of this origin category, however, 

“Spanish race” probably includes both mesti- 
zos and mulattos. . 


-In sum, the relative number’ of blacks is 


suite small among all Hispanic origin groups, 
never exceeding 5 pércent, but a large 
proportion of Hispanics seem to have a racial 
identity that does not correspond to either of 
tlie two U.S. polar categories. The case is 
clearest. among Puerto Ricans where the 
Indian’ racial heritage is minimal and racial 
identity focuses on the continuum between 
. White and black.-Among Puerto Ricans, no 
racial category represents a majority and the 
sizes of the white and “Spanish race" 
. categories are similar. Clearly, Hispanics in 


.. general, and Puerto Ricans in particular, do 


not understand race as do most people in the 
United States. 


BLACK HISPANICS AND 
BLACK MEXICANS 


We have reasons to believe that even the 
small percentages of blacks in Table 1 are 
overestimates, especially amiong Mexicans. 
According to a special Census Bureau report 
(U.S. Bureau of the Census 1982b), in some 
regions of the country relatively large num- 


bers of blacks mistakenly identified them- 


selves as Hispanics. This error produced a 

substantial overcount of black Hispanics and 

apparently stems from the intention of many 

native blacks to identify themselves as 
“American.” 

, The desire to identify oneself as "Ameri- 
can" is quite strong and this creates special 
problems for the enumeration of Hispanics. 
The problem was first recognized in 1970, 
when the Spanish-origin question included a 
category, "Central or South American." This 
category induced large numbers of non- 
Hispanics from the midwest and south to 


check the Spanish-origin box in order to be. 


counted as “Americans” (U.S. Bureau of the 
"Census 1973; Siegel arid Passel 1979). This 
category was eliminated in 1980, and the 
Mexican-American category was abbreviated 
to read "Mexican-Amer.," 


so-that the word 
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“American” would not actually appear on the 
questionnaire. These actions were still not 
enough to prevent many non-Hispanics from 
seeking out the Mexican-American category 
to identify themselves as “Americans.” The ` 
problem was especially severe among blacks 
in the south; where levels of education are 
relatively low and .there are very few 
Mexicans (U.S. Bureau of the Census 1982b). 

The gravity of the problem is indicated by 
Table 2. Prior work we have done on 
residential segregation focused on a data set 
of 60 metropolitan areas that includes the 50 
largest SMSAs plus 10 others with large 
Hispanic populations. For each SMSA in this 
data set, Table 2 presents the percentage of 
black Hispanics within each Hispanic group 
(the four columns. on the left) and the 
percentage of all black Hispanics who are in 
each of the national origin groups (the four 
columns.on the right). The SMSAs are ranked 
in descending order of the percentage of black 
Mexicans. 

The most accurate indication of the true per- 
centage of black Mexicans comes from places 
with large Mexican populations and relatively 
small black.populations. SMSAs fitting this 
description cluster at the bottom of the table 
and generally display black percentages rang- 
ing from 0.1 to 0.5 percent. Los Angeles has 
the largest Mexican origin population in the 
United States, and even though it has a sizable 
black population, blacks there apparently re- `. 
alize they are not Mexican-Americans. Thus 
the percentage of black Mexicans is only 0.4. 
These figures suggest that the true percentage 
of blacks among Mexicans is less than one- 
half of one percent. 

The enormity of the over enumeration 
problem is indicated in the SMSAs clustered 
toward the top of the table. In 10 SMSAs, the `. 
percentage of blacks among Mexicans is 
above 30 percent; and in 20 it is above 15 
percent. These places tend to be SMSAs with 
large black populations and few Mexicans, 
which are concentrated heavily in the south. 
Even allowing for random error because of 
small sample sizes, it is obvious that the 
overenumeration of black Hispanics. was 
substantial in most of the SMSAs in our data 
set. It is highly unlikely that more than one 
percent of Mexicans could be black given the 
fact that only a tiny percentage of Mexican 
migrants comes from the Caribbean region 


(Dagodag 1975; North and Houston 1976; 
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Table 2. Percentage of Black Hispanics among Different Sparish Origin Groups and the National Origin Composition 
of Black Hispanics: 60 SMSAs Ranked by the Percentage of Black Hispanics among Mexicans 








% of Black Hispanics among: % of All Black Hispanics: 
Puerto Other Puerto 
SMSA Mexicans Ricans Cubans Spanish Mexican Rican Cuban Other 
Birmingham 58.1 10.5 13.3 18.3 81.6 1.8 1.8 14.9 
Memphis 51.6 19.2 0.0 25.0 79.4 3.5 0.0 17.0 
Baltimore 47.1 5.2 8.9 15.3 61.9 4.2 2.1 31.8 
Atlanta 41.2 10.8 2.9 6.0 75.9 8.9 32 12.3 
Cleveland 40.9 2.0 3.1 9.4 68.1 12.1 1.4 18.4 
Greensboro 40.7 17.4 0.0 17.9 75.1 6.1 0.0 18.2 
Ft. Lauderdale 35.5 0.9 L5 5.2 52.6 4.4 7.9 35.1 
Norfolk 32.3 15.3 3.7 7.5 58.1 17.1 0.9 23.8 
Philadelphia 31.5 2.8 11.5 5.6 41.7 30.9 7. 20.2 
Newark 30.9 3.3 1.2 3.5 18.5 34.6 4.7 42.3 
Rochester 29.7 2.3 20.4 23.0 25.0 17.1 13.2 44.7 
Cincinnati 29.5 13.3 0.0 18.2 60.0 4.8 0.0 35.3 
New Orleans 25.9 7.8 2.8 6.7 44.9 3.9 3.9 41.3 
New York 24.8 3.9 10.8 9.8 . 6.8 34.7 8.3 50.2 
Nashville 20.9 6.7 0.0 12.5 68.1 2.1 0.0 29.8 
Columbus 19.0 7.8 4.2 11.2 51.7 8.6 1.7 37.9 
Tampa 17.9 1.3 3.6 L6 47.6 4.8 25.9 21.8 
Washington 17.0 6.7 5.1 6.2 33.1 9.9 4.3 52.8 
Louisville 15.2 150 . 6.7 6.6 62.5 12.5 4.2 20.8 
Miami 14.8 2.1 0.8 3.7 14.1 9.5 33.8 42.7 
Buffalo 13.9 3.4 10.5 90 26.9 32.7 3.9 36.5 
St. Louis 13.8 11.9 11.1 5.0 73.2 4.6 2.8 19.4 
Pittsburgh 12.8 8.7 10.0 8.2 44.1 6.8 3.4 45.8 
Jersey City 9.8 2.1 1.8 27 3.1 35.4 26.7 34.8 
Nassau-Suffolk 8.4 2.9 4.5 6.8 5.6 29.9 5.6 59.0 
Dayton 8.1 29.6 7.1 7.2 32.0 32.0 4.0 32.0 
Detroit 7.9 5.4 11.6 9.4 59.2 6.7 3.6 30.5 
Boston 74 5.9 6.1 12.3 4.7 31.5 5.4 58.3 
Paterson 6.6 2.8 4.8 3.7 4.7 47.2 9,4 38.7 
Albany -5.7 4.4 11.8 2.2 23.1 38.5 15.4 23.0 
Indianapolis 4.1 13.0 28.6 6.6 28.1 18.8 25.0 28.1 
Seattle 3.0 3.2 7.3 0.8 67.7 5.9 8.8 17.7 
Oklahoma City 2.9 0.0 0.0 IC 60.0 0.0 0.0 40.0 
Chicago 2.2 1.6 3.9 56 56.0 14.9 4.9 242 
Portland 1.7 0.0 0.0 2C 55.0 0.0 0.0 45.0 
Milwaukee 1.5 0.7 15.2 9.2 32.0 6.0 14.0 48.0 
San Francisco 12 3.4 4.1 1.6 42.2 11.9 3.7 42.2 
Gary 1.2 3.1 0.0 3.8 41.5 37.5 0.0 15.0 
Kansas City 1.1 12.8 18.2 12.3 28.3 9.4 15.1 47.2 
Minneapolis 1.0 0.0 2.9 5.7 28.0 0.0 4.0 68.0 
Dallas 0.9 2.5 1.7 2.2 75.9 2.2 2.2 19.7 
Houston 0.8 8.5 0.0 2.3 72.0 1.6 0.0 20.4 
Salt Lake 0.8 4.6 0.0 1.4 42.3 11.5 0.0 46.2 
Sacramento 0.7 4.6 5.6 1.7 52.9 11.8 2.0 33.3 
Denver 0.6 7.6 4.6 0.7 46.9 1¢,1 4.1 34.4 
Austin 0.5 9.8 33.3 1.2 61.1 11.1 16.7 11.1 
Riverside 0.5 4.5 7.9 0.8 63.6 13.6 9.1 13.6 
Fresno 05 , 4.5 0.0 0.4 91.7 2.8 0.0 5.6 
San Diego 0.4 4.4 1.5 2.4 45.1 11.5 0.9 42.5 
Bakersfield 0.4 0.0 0.0 1.2 81.0 0.0 0.0 19.1 
Los Angeles 0.4 5.2 1.8 24 39.0 11.6 5.4 44.0 
Albuquerque 0.4 0.0 5.3 0.3 44.8 0.0 3.4 51.7 
Phoenix 0.3 2.6 13.8 L1 65.2 4.4 8.7 21.7 
Corpus Christi 0.3 0.0 0.0 0.4 95.5 0.0 0.0 4.6 
San Jose 0.3 1.0 0.0 0.6 59.1 6.8 0.0 34.1 
El Paso 0.2 0.0 0.0 3.3 59.2 0.0 0.0 40.8 
San Antonio 0.2 4.0 2.7 0.6 68.6 11.8 2.0 17.7 
Anaheim 0.1 2.4 0.7 0.8 36.6 14.5 4.9 43.9 
Tucson 0.1 4.1 0.0 0.4 50.0 25.9 0.0 25.0 
Providence 0.0 7.0 21.7 3.3 0.0 33.3 15.2 51.5 








Source: 1980 5% PUMS file. 


Jones 1982). Yet in 39 of the 60 SMSAs the 
percentage of black Mexicans is over 1.0. . 
" The degree of bias introduced into the 


* census tract data by this overenumeration is 


likely to be substantial. Since black Hispanics 
are a small population, the addition of large 
numbers of incorrectly identified black Mexi- 


. cans. substantially inflates the count in most 


SMSAs. In 31 of the 60 metropolitan areas, 


' black Mexicans constitute a majority of all 


black Hispanics; and in 51 cases they make 
up at least a quarter of this population. Since - 


` U.S. blacks are highly segregated from 


virtually all other racial and ethnic groups, 
their erroneous inclusion in the Hispanic- 


_ population grossly inflates any segregation - 
. measure computed for black Hispanics. 


Obviously the data set we have used in 


' prior work cannot simply be employed, to ` 


study the segregation of black.and white 
Hispanics without modification. Since the 
error is confined to black Mexicans, the 
simplest solution would be to eliminate them 
from the counts of black Hispanics: Unfortu- 
nately, aggregate census tract data do not 
permit this adjustment, since they only give 
racial data for the entire Hispanic population, 
not for individual origin groups. We therefore 
chose the next simplest modification — 
eliminating SMSAs where the true number of 
black Hispanics is likely to be quite small and 
where black Mexicans represent an inordi- 
nately large share of the DIE Hispanic 

population. 

‘In order to make this cut, we applied three 
criteria. First, we compiled a list of SMSA 
where Puerto Ricans or Cubans constituted at 
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least 30, percent of all black Hispanics, 
reflecting . the fact that black Hispanics. are 
drawn largely from these two Caribbean 
islands. Second, from this list we eliminated 
any SMSA that was not in a state on the 
eastern seaboard, since this is where most 
Caribbean „immigrants have settled. Finally, 
we added to this list two SMSAs that did not ` 
satisfy these criteria but nonetheless had 
relatively large numbers of Puerto Ricans and 
were therefore likely to contain black Hispan- 
ics (Los. Angeles and Chicago). At the same 
time we eliminated two SMSAs where the 
number of black Hispanics was too small to 
sustain’ meaningful analysis (Albany and 
Buffalo). This three-phase selection yielded'a 
set of 10 SMSAs ‘(see Table 3). 

Although this procedure yielded SMSAs. 
where the overenumeration problem is mini- 
mized, it is not eliminated. Therefore, we 
made additional adjustments at the census 
tract level to improve the count of black 
Hispanics. Again, since Hispanics are not 
cross-tabulated by race and national origin 
within census tracts, black Mexicans cannot 
simply be eliminated from tract-level counts 
of black Hispanics. Our adjustment procedure 
goes through the data tract by tract and 
identifies four situations suggesting a system- 
atic misstatement of Hispanic ethnicity by 
non-Hispanic blacks. The adjustments and 
their effect on the counts of black Hispanics 
are summarized in Table 3. 

The first dubious situation occurs when the 
total number of Hispanics equals both the 
number of black Hispanics and the number. of 
Mexicans (i.e., R In.this case 


Table 3. Summary of Tract- Level Adjustments to the Black Hispanic ime 10 SMSAs Containing’ Significant 


Concentrations of Cubans and Puerto Ricans 


Total Black . Total 


` SMSA Hispanics Deleted 
;. Boston ` 5,886 106 
Chicago fs 12,486 ‘4,579 
Jersey City 3,518 51 
Los Angeles ; 17,225 881 
Miami 10,57 ' 193 
< Nassau-Suffolk 4,529 (46 - 
New York 97,028 769 
Newark “> 6,320 362 
Paterson * 1,770 - 81 
‘Philadelphia 7,868 °° 1,260 : 
Total’ , - 166,787 8,328 


‘Deletions ^, Deletions‘ T Black 
as % of ` as % of capi dm 
Black ^. | Black MS zi trs NE pM 

Mexicans Hispanics 1970 ' 1980 
34.6 > 1.8. 14.2 ` 8.7 
58:5" 36.7 ' $8 > 1d 
41.8: L4 c 2.6 2.4 
13.9 ` 5.15: 2.1 0.8 
14.7 “19, 16: 1.7 
16.7 1.0 . 0761 - 44 
13 7 0.7 10.9 ` 65 
32.8 v 5.6 5.8 4.5 
39.8 4.6 2.9- 2.7 
480 : 16.0 12.6 . 5.8 
45.3. ` 5.0 8.5 3.0 


Source: U.S. Census, 1980. STF4-A, 1970 Fourth Count Summary Tapes. 
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it is likely that ali Hispanics are incorrectly 
enumerated black Mexicans and the Hispanic 
population for such tracts is set to zero 
(TH=0). The second suspicious circum- 
Stance occurs when the total number of 
Hispanics equals the total number of black 
Hispanics (TH= BH), and it is corrected by 
deleting all Mexicans from the tract’s black 
Hispanic population (BH*=BH-MX). The 
third questionable condition occurs when the 
number of black Hispanics is greater than the 
number of non-Mexican Hispanics (BH>TH- 
MX). Since most black Hispanics are non- 
Mexican this should rarely be the case, and 
the figures are corrected by subtracting from 
the black Hispanic population the difference 
between the number of black Hispanics and 
non-Mexican Hispanics [BH* — BH-(TH- 
MX)]. The last condition indicating a need for 
adjustment occurs when the number of black 
Hispanics equals the number of Mexicans 
(BH = MX), in which case all black Hispanics 
in the tract are eliminated (BH — 0). 

This algorithm eliminated some 8,328 
black Hispanics from a total population of 
166,787 in the 10 SMSAs under study, 
representing 5 percent of the original count of 
black Hispanics and about 45 percent of the 
count of black Mexicans. The percentage of 
black Mexicans eliminated varied from 11 
percent in New York to 80 percent in 
Paterson, N.J. The final data set contains 
158,459 black Hispanics, which represents 
about 40 percent of the total number enumer- 
ated in the 1980 Census (which itself was a 
substantial overestimate). These SMSAs con- 
tain the largest Puerto Rican and Cuban 
populations in the United States as well as 80 
percent of Dominicans, the third major group 
containing black Hispanics (see Bean and 
Tienda 1987, p. 100). 

After correction, the 1980 black Hispanic 
percentages range from 0.8 percent in Los An- 
geles to 8.7 percent in Boston, and averages 
about 3 percent (see Table 3). These figures 
represent substantial] declines from the black 
Hispanic proportions reported in 1970. It is 
unlikely these differences reflect an actual de- 
crease in blacks among Caribbean Hispanics. 
Rather, they probably indicate a growing re- 
luctance on the part of Hispanics with African 
roots to identify themselves as black. Con- 
fronted with the reality of a rigid color line, on 
one side of which lies the prospect of discrim- 
ination and prejudice, Puerto Ricans, Cubans, 
and other Hispanics of mixed racial ancestry 
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seem to have increasingly opted to call them- 
selves “Spanish” cr white. 


TRENDS IN THE SEGREGATION OF 
BLACK- AND WHITE HISPANICS 


In this section, we measure the residential 
segregation of black and white Hispanics in 
order to examine the impact of racial 
self-definition on behavior. Since discrimina- 
tion is probably more severe in housing than 
in other areas of U.S. life (see Wienk, Reid, 
Simonson, and Eggers 1979; Yinger 1986), 
segregation represents a key indicator of how 
g-oups make the transition from a society 
where race is a multicategory continuum to 
one where it is a bipolar dichotomy. 

To study the spatial situation of black and 
white Hispanics, we computed indices of 
residential segregation using adjusted census 
tract data from the 1980 STF-4A data files 
(U.S. Bureau of the Census 1980) and 
unadjusted data from the 1970 Fourth Count 
Summary Tapes (U.S. Bureau of the Census 
19708). Hispanics are defined in 1980 using 
the 100-percent Spanish-origin question de- 
scribed above, and in 1970 by the "Spanish 
American" definition, which is based on a 15 
percent sample item. The latter definition 
includes people who speak Spanish or who 
grew up in a home where Spanish was 
spcken, and in the southwest, all people of 
Spanish surname (U.S. Bureau of the Census 
1970b). Earlier work we have done indicates 
this definition is roughly comparable to the 
Spanish-origin definition used in 1980 (Mas- 
sey and Denton 1987). 

Race was defined in 1970 and 1980 using 
similar census questions, but as described 
above, a major change occured in how 
persons of "Spanish race" were coded. In 
1970 they were coded as white, but in 1980 
they were coded into their own racial 
category. In order to study trends between 
1970 and 1980, we recoded persons of 
"Spanish race" to be whites in 1980!; and in 





! This coding change also has important implica- 
tions for analyses studying trends in the white pop- 
ulation between the two censuses. In 1970 the white 
population included nearly all Hispanics, whereas in 
1980 :t included only about 50 percent of them. Thus 
in SVSAs where Hispanics are numerous, like San 
Antonio, the *white" population was 50 percent His- 
panic in 1970 but only 42 percent Hispanic in 1980, 
even though the proportion of the overall population 
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Table 4. Trends in the Residential Segregation of White and Black Hispanics from Anglos in 10 Metropolitan Areas, 











1970-1980 
White Hispanics Black Hispanics 

SMSA and Index 1970 1980 Change 1970 1980 Change 
Dissimilarity from Anglos 

Boston 462 559 097 .687 851 164 
Chicago .582 .636 054 .799 852 053 
Jersey City '.550 .489 ~ 061 .658 .681 023 
Los Angeles .466 571 105 .788 .T70 -.018 
Miami .505 522 017 169 715 — 054 
Nassau-Suffolk 3 .282 347 065 .644 794 150 
New York .636 .649 013 .801 811 010 
Newark .599 653 054 -760 842 082 
Paterson 606 720 114 .660 838 178 
Philadelphia .533 623 090 .728 844 116 
Mean 522 577 055 .729 800 071 
Interaction with Anglos 

Boston .831 .679 —-.152 .554 .439 -.116 
Chicago .664 .505 —.159 .331 .304 — ,026 
Jersey City .604 457 — 147 596 .373 -.223 
Los Angeles .529 .348 —.181 265 .245 —.019 
Miami .491 .343 —.148 297 .256 —.041 
Nassau-Suffolk .889 794 —.095 632 519 —.113 
New York .408 .344 ~ 064 258 .193 — .065 
Newark .599 .496 —.103 392 262 —.130 
Paterson .585 .372 —.213 557 .288 —.269 
Philadelphia .672 .520 —.132 454 .338 —.116 
Mean 627 489 —.141 434 322 —.112 


Note: Entire table employs 1970 definition of white Hispanics. 
Source: U.S. Census, 1980 STF4-A, 1970 Fourth Count Summary Tapes. 


order to define mutually exclusive popula- 
tions for study, white and black Hispanics 
were subtracted from the respective white and 
black populations. Non-Hispanic whites are 
called “Anglos,” and non-Hispanic blacks are 
simply called "blacks." 

Two indices were computed to measure 
segregation at the census tract level. The first 
measure is the well-known index of dissimi- 
larity, which varies between 0 and 1.0 and 
gives the proportion of minority members that 
would have to change their tract of residence 
to achieve an even residential distribution. 
The second measure: is the ,Py' interaction 
index, which also varies between 0 and 1.0 
and gives the probability of residential contact 
between two groups, X and Y. It states the 


that was Hispanic remained about constant. Thus an 
understanding of racial concepts and definitions is 


important not only for analyses that include Hispan- . 


ics but for those that exclude them as well. Research- 


_ ers should begin the practice of defining non- 


Hispanic white populations for analysis rather than 
simply relying on what the Census Bureau calls 
*white" in any given census year. 


likelihood that a member of group X shares a 
tract with a member of group Y. 

The dissimilarity index is insensitive to the 
relative number of minority members in an 
urban area, but the interaction index is 
affected by population composition. Other 
things being equal, as a minority's percentage 
of the population falls, its probability of 
contact with the majority rises. Because of 
this property, interaction indices are asymmet- 
rical and the probability of interaction be- 
tween X and Y generally does not equal that 
between Y and X. The dissimilarity index is, 
however, symmetrical so that its value is the 
same whether one considers X relative to Y or 
Y relative to X (see Lieberson 1981; James 
and Taeuber 1985; White 1986; Massey and 
Denton 1988). 

In Table 4, the dissimilarity indices dis- 
played in the upper half of the table reveal 
that black Hispanics are considerably more 
segregated than white Hispanics. A good rule 
of thumb is that dissimilarity values below 
.300 are low, values between .300 and .600 
are moderate, and values above .600 are high 
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(Kantrowitz 1973). By this criterion, white 

Hispanics display moderate levels of segrega- 

‘tion, whereas black Hispanics are highly 
segregated at both dates. The average 1970 
dissimilarity score was .522 for white Hispan- 
ics but .729 for black Hispanics. Over the 
ensuing decade, segregation levels for both 

. groups increased somewhat, but the differen- 
tial remained constant, with black Hispanics 
being about 38 percent more segregated than 
white Hispanics at both dates. By 1980, black 
Hispanic dissimilarity had reached a very 
high average of .800, which is comparable to 
the level experienced by non-Hispanic blacks 
in large metropolitan areas. 

The lowest. levels of black Hispanic 
dissimilarity from Anglos are found in cities 
were Puerto Ricans do not predominate. In 
Miami and Jersey City, Cubans constitute a 
large share of black Hispanics (see Table 2), 
and in Los Angeles most black Hispanics are 
Mexican. Outside ,of these SMSAs, Puerto 
Ricans generally predominate among black 
Hispanics, and the level of dissimilarity from 
Anglos is very high, averaging .833. Miami 
and Los Angeles were also the only two 
SMSAs that experienced a decline in the level 
of segregation from Anglos; in SMSAs where 
Puerto Ricans. concentrated, the segregation 
of: black Hispanics generally rose. These 
findings suggest that black Puerto Ricans are 
particularly segregated relative to blacks in 
other Hispanic groups. 

Findings based on the interaction indices 
parallel those using dissimilarity index. Be- 
cause both residential dissimilarity and rela- 
tive numbers of black and white Hispanics 
increased over the decade, the probabilities of 
Anglo contact fell sharply for both groups. 
But black Hispanics displayed particularly 
high levels of spatial isolation at both census 
dates. In 1980, for example, the average 
probability that a white Hispanic shared a 
neighborhood with an Anglo was .489 and 
was .500 or over in four cases. In.contrast, the 
likelihood of Anglo contact for black Hispan- 
ics averaged only .322 and was under .500 in 
all but one case. Black Hispanics were 
particularly segregated in New York, where 
the probability of Anglo contact was only 
.193. 

Thus, in terms of their residential dissimi- 
larity from the majority and their spatial 
isolation from Anglo society, black Hispanics 
are very segregated and are much more like 
U.S. blacks than white Hispanics. Table 5 
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extends the contrast between the two racial 
groups by measuring segregation from non- 
Hispanic blacks. Again patterns for black and 
white Hispanics differ markedly. As with 
whites generally, white Hispanics display 
high levels of segregation ‘from blacks. 
Although there was a slight decline over the 
1970s, the average dissimilarity between 
white Hispanics and blacks remained high at 
.686 in 1980, and levels fell into the moderate 
range in only two cases, Boston and Paterson- 
Clifton-Passaic. In contrast, the level of 


residential dissimilarity between black Hispan- ` 


ics and U.S. blacks averaged only .483 in 
1980, and all indices save one were, in the 
moderate range. New York represents a 
particularly interesting contrast since it con- 
tains the largest number of black Hispanics. 
Here the degree of dissimilarity between 
white Hispanics and blacks increased by.030, 
while that between black Hispanics and 


' blacks decreased by .079. Thus, black and 


white Hispanics were moving in different 


. directions —the former toward U.S. blacks 


and the latter away from them. 


Interaction indices remained remarkably . 


stable over the decade as declines in dissimi- 
larities were offset by rising minority propor- 
tions. But the figures for 1980 show the 
contrast between black and white Hispanics 
even more clearly than the dissimilarities. 
The probability of residential contact with 
blacks is always at least two times higher for 
blacx Hispanics than for white Hispanics, and 
the average is nearly three times greater. In 
New York, the probability of residential 
contact between white Hispanics and blacks 
was only .213, whereas the probability of 
contact between black Hispanics and blacks 
was .470; in Newark the respective figures 
were .220 and .541. 

Table 6 considers the degree of segregation 
between white and black Hispanics. Dissimi- 


. larity indices ranged from moderate to high in 


both 1970 and 1980, and averaged .577 in 
1980. The lowest scores were achieved in 
Paterson-Clifton-Passaic (.394), Jersey ‘City 
(.426), and New York (.449); in the remain- 
ing SMSAs values were in excess of .580. 
Thus, although black end white Hispanics 


. Share a common ethnicity, they are fairly | 


segregated from one another in most metro- 
politan areas. 

The interaction indices display a basic 
asymmetry in the residential experience of the 
two .Hispanic racial groups. Since black 
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Table 5. Trends in the Residential Segregation of White and Black Hispanics from Blacks in 10 Metropolitan Areas, 


1970-1980 
White Hispanics ' Black Hispanics 

SMSA and Index 1970 1980 Change 1970 1980 Change 
Dissimilarity from blacks 

Boston 687 579 —.108 491 423 — .069 
Chicago 899 859 — .040 509 574 064 
Jersey City ,816 .805 ~ Oli 764 709 —.055 
Los Angeles .860 -734 —.126 364 435 071 
Miami .882 .801 —.081 466 523 057 
Nassau-Suffolk 715 629 —~ 086 466 376 —.091 
New York 574 .604 .030 422 343 —.079 
Newark 666 .673. .007 521 492 — .030 
Paterson 494 .440 —.054 569 468 —.101 
Philadelphia 759 .733 —.026 497 488 — .009 
Mean 735 686 — .049 507 483 — .024 
Interaction with blacks 

Boston .103 .160 .057 .376 412 .036 
Chicago 064 .079 015 587 525 — .062 
Jersey City 046 .046 000 108 152 044 
Los Angeles 051 .080 .029 548 441 —.107 
Miami .038 .059 .021 .467 .346 —.121 
Nassau-Suffolk .049 .099 .050 .302 .374 í .072 
New York .217 213 —.004 .384 .470 .086 
Newark .224 .220 — .004 .498 .541 .043 
Paterson .206 .239 .033 .255 .308 .053 
Philadelphia .210 .239 .029 .489 .544 .056 
Mean .121 .143 .022 .401 All .010 


Note: Entire table employs 1970 definition of white Hispanics. 
Source: U.S. Census, 1980 STF4-A, 1970 Fourth Count Summary Tapes. 
| 


Hispanics are such a small share of all They are more likely to interact with 
Hispanics, their likelihood of contact with non-Hispanic whites, and even with U.S. 
white Hispanics is much greater than white blacks, than with black Hispanics. In con- 
Hispanics’ likelihood of contact with them. In trast, black Hispanics display a significant 
all cases, white Hispanics have a minuscule probability of sharing a tract with white 
probability of living near black Hispanics. Hispanics, averaging about .213. 


Table 6. Trends in the Residential Segregation of White and Black Hispanics from Each Other in 10 Metropolitan 





Areas 
Dissimilarity . Interaction Index Interaction Index 
between White and between White and between Black and 
Black Hispanics Black Hispanics White Hispanics 


SMSA and Index 1970 1980 Change 1970 1980 Change 1970 1980 Change 
Boston .626 .587 — 039 005 .009 004 .033 .083 .050 


Chicago 775 442 —.033 .002 .002 000 .058 125 .067 
Jersey City 498 426 — 072 .007 .011 .004 264 417 .153 
Los Angeles 749 .693 — .056 .003 .002 —.001 .120 .230 110 
Miami 677 -611 — .066 .003 .006 .003 215 345 .130 
Nassau-Suffolk .642 .622 — 020 .003 .004 .001 .045 .077 .032 
New York .391 449 .058 .035 .019 —.016 281 .263 — .018 
Newark - .633 .597 — .036 .005 .008 .003 .084 .157 .073 
Paterson .467 .394 — .073 .005 .010 .005 .169 .348 .179 
Philadelphia .674 .653 — 021 .006 .005 — .001 .038 .083 .045 


Mean .613 577 — .036 .007 .008 .001 AZ] 213 082 


Note: Entire table employs 1970 definition of white Hispanics. 
Source: U.S. Census, 1980 STF4-A, 1970 Fourth Count Summary Tapes. 
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If spatial distance indicates social distance, 
the data here show very different affinities for 
white and black Hispanics. White Hispanics 
- appear to be closest to Anglos, with whom 
they have a high probability of interaction 
(mean of .489) and a moderate level of spatial 
dissimilarity (mean of .577). In fact, they are 
more likely to share a tract with Anglos than 
with themselves (the average interaction 
index for white Hispanics with themselves, 
which we have not shown, is only .199). 
Black Hispanics, in contrast, are closest to 
U.S. blacks, with whom dissimilarity is 
lowest (mean of .483) and the probability of 
interaction is highest (mean of .411). They 
' are more likely to share a neighborhood with 
* U.S. blacks than with themselves and they are 
only moderately integrated with white Hispan- 
ics and Anglos. 
; In most SMSAs, these patterns held for 

both 1970 and 1980, suggesting the relative 
stability of Hispanic segregation patterns by 
race. There is little sign, in other words, that 
black and white Hispanics are coming to- 
gether as a common ethnic group; rather, they 
continue to be separated by the barrier of 
race. In New York, especially, the Hispanic 
population appears to be bifurcating, with 
black Hispanics moving closer to American 
blacks and away from both Anglos and white 
: Hispanics. Thus, not only does race matter in 
Hispanic segregation, but in the largest 
"mainland settlement of Caribbean Hispanics, 
.it'appears to matter increasingly. 


BEYOND THE BLACK-WHITE 
DICHOTOMY 


In this section we remove persons of 
"Spanish race" from the white racial group 
and consider them separately, viewing race as 
a tripartite continuum encompassing whites, 
blacks, and those of "Spanish race," who see 
themselves as racially in between black and 
white. If race matters, then residential 
segregation from Anglos should increase as 
one moves from white to Spanish to black 
race. Moreover, if the U.S. racial taxonomy 
predominates, then Anglos will view people 
of mixed race as black no matter how they see 
themselves, and the degree of segregation 
between persons 'of "Spanish race" and 
Anglos will be closer to that of black 
Hispanics than to white Hispanics. Finally, if 
race matters more than ethnicity among 
Hispanics themselves, then the level of 
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segregation between white Hispanics and 
black Hispanics will not differ substantially 
from the level of segregation between white 
Hispanics and those of “Spanish race.” 

In Table 7 the three leftmost columns show 
the level of segregation from Anglos, and the 
three rightmost show the extent of segregation 
from blacks. In general, the addition of the 
third racial category underscores the salience 
cf race and suggests that, no matter how 
Hispanics view themselves, Anglos perceive 
anyone of mixed race to be black and treat 
them accordingly. While white Hispanics 
have Anglo dissimilarities that lie primarily in 
the moderate range (in 8 of 10 cases, yielding 
an average of .519), the Anglo dissimilarities 
for black Hispanics and persons of “Spanish 
rece” lie almost entirely in the high range. 
The average dissimilarity for “Spanish race” 
is .719, 39 percent higher than the dissimilar- 
ity for white Hispanics, but only 10 percent 
lower than the black Hispanic average of 
.800. Similarly, the probability of Anglo 
cantact declines monotonically in every 
SMSA as one moves from white, to Spanish, 
to black race among Hispanics; the average 
interaction indices for the three racial groups 
are .551, .383, and .322, respectively. 

The data on segregation from U.S. blacks 
sbow a very different pattern, with those of 
"Spanish race" being very similar to white 
Hispanics and very distinct from black 
Hispanics. Both white Hispanics and those of 
"Spanish race" have a relatively high degree 
of segregation from blacks, with average 
dissimilarity indices of .712 and .674, 
respectively, whereas the degree of segrega- 
tion between black Hispanics and blacks is 
quite moderate, having an average dissimilar- 
ity score of only .483. Likewise the probabil- 
ity of contact with blacks rises monotonically 
as one moves from white Hispanics to 
"Spanish race" to black Hispanics, with 
average interaction indices of .111, .201, and 
.411, respectively. 

In general, these results suggest that 
persons of "Spanish race" are closer to white 
Hispanics than to black Hispanics, since both 
white Hispanics and those of "Spanish race" 
exhibit a high degree of segregation from 
U.S. blacks, but black Hispanics do not. 
Table 8 confirms this line of reasoning by 
showing the degree of spatial segregation 
between the three Hispanic racial groups 
themselves. Residential dissimilarities be- 
tween Hispanics of white and "Spanish race" 
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Table 7. Hispanic Residential Segregation from Blacks and Anglos by Race in 10 SMSAs, 1980 


Hispanic Segregation 
from Anglos 
. . White Spanish 
SMSA and Index ` Hispanics Race 
Dissimilarity index : 
Boston . 448 . 790 
' Chicago n .570 . «740 
Jersey City - .490 (543° 
Los Angeles .532 .640 
Miami.  . «4519 662 
Nassau-Suffolk 317 632 
New York .567 .768 
Newark .618 - 781 
Paterson i . 674 .800 
Philadelphia 457 834 
Mean 519 © 719 
Interaction index 
Boston |."  ,808 .495 
Chicago 572 42] 
Jersey City .468 425 
Los Angeles .383 296 
Miami ' 352 254 
Nassau-Suffolk 817 675 
New York 432 242 
Newark 559 . 04944 
Paterson - "419 | .307 
Philadelphia .698 .369 
Mean . 551 383 : 


; ` Hispanic Segregation 
: from Blacks 
Black . White Spanish Black 
Hispanics Hispanics ' Race Hispanics 
.851 .668 ` .573 ,423 
..852 .863 . "^  .865 c c 574 
681 830 "758 709 
770 760 712 435 
715 812 753 523 
794 653 582 376 
811 635 601 343 
842 724 .636, 492 
838 438 .467 468 
844 739 ' .789 488 
800 712 .674 483 
439 .083 269 412 
304 069 .092 525 
373 035 .080 152 
245 .065 102 441 
256 054 Em 346 
519 083 .178 374 
193 174 .259 470 
262 168 .346 541 
288 226 .259 308 
338 155 .310 544 
322 111 .201 411 





Source: 1980 STF4-A. 


lie enie in the moderate range and average 
only .400, suggesting closeness. On the other 
hand, both groups maintain a relatively high 
degree of spatial dissimilarity from black 
Hispanics, with averages of .567 (for those 
“Spanish race”) and .609 (for-white Hispan- 
ics). 

White Hispanics likewise ' display ex- 
tremely low probabilities of sharing tracts 
with . black Hispanics (with an average 
interaction index of .006), as do those of 
“Spanish race”. (average interaction index of 


010). At the same time, they maintain a. 


significant degree of interaction between each: 
other. The white-Spanish interaction index is 
.095, while the Spanish-white index is .192. 
Black Hispanics, however, maintain a nontriv-. 
ial interaction potential "with both groups 
owing to their small relative numbers, with 
average values of .127 relative to white 
Hispanics and .083 relative . to Hose of 
“Spanish race.’ 

A persisting theme throughout the literature 
on Caribbean race relations is that race is a 
marker for social class. If true in whole or in 
part, then intergroup differences in segrega- 


E 


tion may reflect socioeconomic status effects 
rather than prejudice, or discrimination. In 
order to account for this possibility, Table 9 
presents regression models that control di- 
rectly for levels of education, income, and 
nativity in the various racial groups. This 


analysis pools segregation indices for the 


three Hispanic races and regresses them, 
across SMSAs, on each group's median 
education, median household income, and the. 
proportion mainland born. The top panel uses : 
dissimilarity scores to measure segregation, 
and the bottom pànel employs interaction 
indices. Segregation scores involving black 
Hispanics are indicated by a dummy variable 
that equals one when Hispanics are black and 
zero otherwise. Segregation indices involving 
persons of "Spanish race" are indicated by a 
vatiable that equals one for "Spanish race” 
and zero otherwise. White Hispanics are the 
reference category. The regressions also 


: control for the proportion of each racial group 


in the metropolitan area. Since persons of, 
"Spanish race" who reside in Los Angeles 
and Chicago are likely to. be mestizo 
Mexicans rather than mulattos, these two 
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Table 8. Interracial Segregation among Hispanics in 10 SMSAs, 1980 








Segregation between 

White Hispanics and: 

Black Spanish 
SMSA and Index Hispanics Race 
Dissimilarity index 
Boston .660 598 
Chicago -750 306 
Jersey City 432 362 
Los Angeles .714 247 
Miami .627 464 
Nassau-Suffolk 650 442 
New York 509 338 
Newark 656 455 
Paterson 411 240 
Philadelphia .676 550 
Mean .609 400 
Interaction index 
Boston .006 032 
Chicago 002 149 
Jersey City 011 094 
Los Angeles .002 179 
Miami .006 047 
Nassau-Suffolk .003 017 
New York .016 153 
Newark: .006 064 
Paterson .009 140 
Philadelphia .003 076 
Mean .006 095 


Source: 1980 STF4-A. 


Segregation between Segregation between 
Black Hispanics and: Spanish Race and: 
White Spanish White Black 
Hispanics Race Hispanics Hispanics 
* .571 * »* 
* .744 * + 
* .498 * * 
* 677 » * 
* 550 * * 
* .570 * * 
+ 436 * + 
* 541 * x* 
* .394 *. * 
* .678 * * 
* 567 * * 
036 .048 045 .012 
062 .063 189 .002 
322 .095 275 .009 
121 .109 .267 .002 
284 .061 .460 .012 
057 .021 .088 .006 
115 .148 176 .023 
092 .065 155 O11 
187 .162 195 011 
023 .060 065 .007 
.130 .083 .192 .010 


* Symmetrical indices not repeated in table. Dissimilarity between white Hispanics and black Hispanics is the same 


as that between black Hispanics and white Hispanics, etc. 


SMSAs were excluded from the analysis, 
leaving 24 data points. Finally, in order to 
permit OLS estimation, the segregation mea- 
sures were transformed into logits, changing 
them from limited-range to infinite-range 
dependent variables. 

If interracial differences in segregation lev- 
els among the Hispanic racial groups truly re- 
flect socioeconomic differences, then the 
dummy variables for race should be statisti- 
cally insignificant and socioeconomic controls 
should explain. most of the variance in segre- 
gation. However, the analysis yields precisely 
the opposite results. With two exceptions, the 
socioeconomic indicators are not significant in 
explaining the level of Hispanic segregation, 
whereas black race is highly significant in ev- 
ery case. “Spanish race” is significant in ac- 
counting for the level of segregation from An- 
glos, but not blacks. Although we do not 
present the regression results here, socioeco- 
nomic differences between the various His- 
panic groups were also totally ineffective in 
accounting for levels of intergroup segrega- 


tion. Thus the addition of socioeconomic con- 
trols leaves intact the pattern of intergroup dif- 
ferences described above. 

Considering everything, the patterns of seg- 
regation uncovered using the tripartite racial 
categorization suggests that those of “Spanish 
race” are closer to white Hispanics than to 
black Hispanics. Like white Hispanics, they 
maintain high degrees of segregation from 
American blacks and from black Hispanics; 
and the relatively low degree of segregation 
between the white and “Spanish” racial groups 
themselves suggests that persons of “Spanish 
race” are accepted on the basis of their com- 
mon ethnicity by white Hispanics. Although 
people of “Spanish race” may be accepted by 
white Hispanics, however, they appear not to 
be accepted by Anglos, who maintain a high 
degree of segregation from them. Black His- 
panics, in contrast, are not accepted as neigh- 
bors by any group except American blacks, 
and in spatial terms, they appear to be on the 
way to becoming part of the general U.S. black 
population. Very clearly, it is race and not 
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Table 9. Regression of Hispanic Segregation Scores on Selected Social and Economic Variables 











»* p «x .05. 


class that is crucial to understanding patterns 
of residential segregation among Caribbean His- 
panics in U.S. cities. 


CONCLUSION AND DISCUSSION 


Caribbean Hispanics living in the United 
States face a serious conflict between two 
opposing and inconsistent definitions of a 
central self-concept, race. Their Caribbean 
heritage .defines race as a multicategory 


continuum, but after migration to the United : 


States they face strong pressures to reshape 


heir identity to conform to a bipolar racial 


categorization. This conflict is problematic 
because race is not socially neutral. Although 


' white attitudes towards blacks have moder- 


ated in recent years, significant antipathies 
still remain (Schuman et al. 1985), especially 


' with respect to integration (Farley, Schuman, | 
Bianchi, Colasanto, and Hatchett 1978; Sears ` 


f Segregation from Anglos Segregation from Blacks 
Index and Regression Standard " Regression» Standard 
"Variable * Coefficient Error ' Coefficient Emor' > 
INDEX OF DISSIMILARITY à "s 
Socioeconomic status (e 
Median education — . —0.128 0.105 0.147 0.121 
Median household income —0.027 0.029 — 0.043 . . 0.033 
Proportion mainland born ~ 1.182 0.878 -0.727 <- 1.016 
Race of Hispanics f i EC 
Black ' 1.149** 0.290 —0.759** . 0.335 
Spanish 2o 0.729** ' 0.299 —0.043 . 0.346 
Population composition dux 8s l ] 
Group's % of population 1.042 1.784 1861 2.064. 
Intercept . 1.562 . 1166 ^ -0.312 0 [2349 
Adjusted R? ` 0.629** : 0.363** 
N et 24 24 
INTERACTION INDEX ' 
Socioeconomic status ' i 
Median education ; a ' 0.158 0.105 — 0.109 . 0.139 
Median household income '  0.060* 0.029 70.027 0.038 
Proportion mainland bom 0.115 0.883 2.524** 1.166 
Race of Hispanics ' ; 
Black —1.190** 0.291 1.043** 0.385 
Spanish —0.556* 0.300 0.452 0.397 
Population composition . : 07 
Group's 9 of population —5.224** 1.794 -2.325 2.368 
Intercept -2207* 1.172 —1.103 1.548 
Adjusted R? . - 0.675** ^ 0:609** j d 
N 24 24 

* p « .IO. 


and Allen 1984); and there is a large gap in 
socioeconomic status between the races (Far- 
ley and Allen 1987). In moving to the. 
mainland and experiencing its society, there- 
fore, Caribbean Hispanics inevitably recog- 
nize that within the United States, being black 
carries a significant social stigma and there! 
are advantages to being accepted as white. , 
A second pressure that exacerbates the’ 
potential for conflict between island and 
mainland racial identities is the development ‘ 
of a Hispanic ethnic identity. Prior to . 
migration, identity is primarily national in 
character, focusing on symbols and values 
that define a sense of nationalism. After. 
migration, Hispanics and their children de- 
velop an ethnic identity defined in contradis- 
tinction to the host culture (Nelson and 


‘Tienda 1985). This identity is reinforced and 


strengthened by sharing a common location in 
the. U.S. social and economic structures 
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(Yancey, Ericksen, and Juliani 1976). Even- 
tually, distinctions among different national- 
origin groups assume less importance than the 
common feeling of separateness from the host 
culture, and pan-national ethnicities emerge. 
Among Latin Americans, common language, 
religion, and cultural roots overcome national 
differences, and a “Hispanic” identity emerges 
(Nelson and Tienda 1985; Bean and Tienda 
1987). 

Over time, therefore, Caribbean Hispanics 
are pulled in two opposing directions. On the 
one hand, they are drawn together on the 
basis of an emerging ethnic identity. On the 
other hand, they are bifurcated on the basis of 
race as they come to terms with the racial 
ideology of the United States. The resulting 
tension is clearly revealed in the data we have 
reported. The power of the North American 
racial dichotomy is revealed by the discrepant 
residential situation of black and white 
Hispanics. Black Hispanics remain highly 
segregated from non-Hispanic whites, but 
only moderately segregated from U.S. blacks, 
whereas for white Hispanics it is the opposite: 
a high degree of segregation from blacks, but 
a moderate level of segregation from Anglos. 
Black and white Hispanics are also relatively 
segregated from each other, and these pat- 
terns showed few signs of attenuating during 
the 1970s. 

The: more complex racial categorization 
made possible by the 1980 Census allows us 
to see the fundamental tension that Caribbe- 
ans face as they come to terms with a racial 
ideology that does not allow any middle 
ground between black and white. Public Use 
Microdata show that Caribbean Hispanics, 
and especially Puerto Ricans, reject the 
bipolar classification of the United States; a 
large plurality consider themselves to be 
racially intermediate between black and 
white. At the same time, the immigrant 
experience fosters an ethnic identity that 
draws Caribbean Hispanics together. Like 
white Hispanics, persons of “Spanish race” 
maintain a high degree of segregation from 
both black Hispanics and U.S. blacks; and 
“Spanish race” and white Hispanics are 
relatively integrated with each other. 

Although it is risky to impute motives to 
individuals based on aggregate data, it 
appears that Hispanics of a mixed racial 
background are distancing themselves from 
blacks, whether Hispanic or non-Hispanic, 
and drawing nearer to white Hispanics. White 
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Hispanics, for their part, appear to accept 
persons of “Spanish race” as neighbors. 
When those of mixed race encounter Anglos, 
however, they are apparently still seen as 
black. Levels of segregation between Anglos 
and those of “Spanish race” are nearly as 
high as those between Anglos and black 
Hispanics; and multivariate controls for socio- 
economic variables do not erase these differ- 
entials. These findings provide further evi- 
dence that race remains a salient dimension of 
cleavage within the United States, and that it 
is more important than ethnicity in explaining 
patterns of residential segregation. 
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. RETURNS ON HUMAN CAPITAL IN ETHIC ENCLAVES: : 
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This study addresses a recent controversy over the character of labor markets in 
enclave economies: does the enclave provide positive earnings-returns to 
educational and other human capital characteristics to immigrant minority-group 
workers? We study the case of the Chinese enclave in New York City, using three 
distinct operational definitions of the enciave—as a place of residence, place of 
work, and industrial sector. Regardless of the definition employed, there is 
considerable evidence of positive returns for earnings for male enclave workers 
from education, labor market experience, and English-language ability. By 
contrast, none of these human capital variables is positively related to income of 
female enclave workers. Implications of these results for comparative research are 


suggested. 


A key proposition in the theory of ethnic 
enclave economies is that the enclave opens 
opportunities for its members that are not 
easily accessible in the larger society. The 
enclave housing market, labor market, and 
capital market partially shelter ethnic group 
members from competition by other social 
groups, from discrimination and abuse on 
account of their ethnic origins, and from 
surveillance and regulation by government. In 
many respects these boundaries around the 
enclave provide tangible benefits to group 
members and seem to offer a positive 
alternative to assimilation. 

"This study addresses a recent controversy 
over the specific proposition that immigrant 
workers in the enclave labor market achieve 
greater returns on human capital than those 
who participate in the outside economy 
(Portes and Bach 1985; Wilson and Portes 
1980). Based on research among recent 
immigrants in Miami's Cuban community, 
Portes and his colleagues reported that those 
who work within the enclave have greater 
probability of becoming self-employed entre- 
preneurs. Compared to immigrants who are 
employed in the secondary labor market, 
those in the enclave have occupations that 
correspond more closely with their educa- 
tional attainment, and earnings that corre- 


* The authors wish to thank several colleagues 
for assistance in this study: Richard Alba, Brian 
Fisher, .Nan Lin, Victor Nee, Alejandro Portes, 
and Jimy Sanders. Data were made available 
through the Center for Social and Demographic 
Analysis at SUNY-Albany. ` 


spond more closely with their occupational 
status. 

Portes's interpretation of such findings has 
recently been questioned by Sanders and Nee 
(1987), who analyzed census data for Cuban 
immigrants in Florida and for Chinese 
immigrants in California. They reasoned that 
if human capital investments yield greater - 
returns to persons tied to the enclave, such 
returns should show up as effects of education 
and labor market experience (and possibly 
English-language ability) on earnings. Portes 
did not find such effects on earnings in his 
research on recent immigrants in Miami, or in 
parallel studies of recent Mexican immi- 
grants, but this hypothesis is certainly consis- 
tent with the strong emphasis in his theoreti- 
cal writings on the superior returns on skills 
and past human capital investments of 
enclave workers. And one might reasonably 
expect to find such relationships in a sample 
that includes immigrants from a wider variety 
of time periods. 

Sanders and Nee distinguished ethnic group 
members who could be categorized as entre- 
preneurs from those who are employed by 
others. Among Cuban immigrant entrepre- 
neurs, they found that earnings were posi- 
tively associated with college education and 
labor market experience both within and 
outside of the Miami-Hialeah enclave. Among 
Cuban employees, however, there was a 
significant earnings-return' to education and 
labor market experience only outside of.the 
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enclave economy. "The analysis for Chinese 
immigrants in California generally repro- 
duced these results. Sanders and Nee con- 
.cluded that the earnings-return to human 
capital in enclave economies is mostly limited 
to those immigrants who become entrepre- 
neur$. More generally, by calling attention to 
.this disparity in benefits of the enclave 
economy, Sanders and Nee implicitly sug- 
gested a greater focus on the exploitative 
‘aspects of employer-employee relationships 
between coethnics: 

In their published response to these find- 


ings, Portes and Jensen (1987a) criticized this ` 


study on. theoretical and methodological 
grounds, concluded that its findings are 
“irrelevant” 
‘sis, and reported results of their own analyses 
of census data that they characterized as 
_contradicting those of Sanders and Nee. Our 
.purpose here is to provide further empirical 
grounds for evaluating the validity and 
. generalizability of Sanders and Nee's results. 
We accomplish this through a replication of 
their study for yet another enclave situation, 
'New York's Chinatown. 


ALTERNATIVE DEFINITIONS OF THE 
ENCLAVE ECONOMY 


If earnings-returns from human capital char- 
acteristics such as education and labor market 
experience are limited to ethnic entrepre- 
neurs, as reported by Sanders and Nee, the 
positive function of the enclave is sharply 
circumscribed. In their critique, Portes and 
Jensen accepted the general strategy of 
comparing models of earnings for persons 
‘within and outside of the enclave (although 
Portes's previous work compared the enclave 
to the primary and secondary labor markets as 
distinct economic sectors), and comparing 


models for workers with models for entrepre-. 


neurs. But they criticized Sanders and Nee for 
incorrectly Opersnonaliomg: the concept of 
enclave. 

For Portes, "enclave entrepreneurs are 
owners of firms in an area where similar 
enterprises concentrate. Enclave workers are 
employees of these firms" (Wilson and 





In a footnote, Sanders and Nee (1987, p. 756) 
reported that when the enclave is defined as Dade 


` "County, there i is a modest return on education for 
' enclave workers. 


to the earnings-return hypothe- ` 
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Martin 1982; Portes and Manning 1986; 
Portes and Jensen 1987a). Portes and Jensen 
argued that Sanders and Nee, by defining the 
enclave in terms of place of residence, missed 
the central element in the concept. Further, 
they asserted that the Sanders and Nee 
definition is inherently biased because the 
“place of residence" procedure excluded the 
better-off segment of the enclave participants 


who may move out of the residential enclave ` 
'to more affluent neighborhoods or suburbs 


elsewhere, and overrepresented the worse-off 
segment of the population, who are more 
likely to reside in the geographic enclave 
(Portes and Jensen 1987a, p. 768). 

: Portes and Jensen proposed that census 
data of the type analyzed by Sanders and Nee 
be organized according to place of work, 
rather than place of residence. We note that 
the two methods have one thing in common: 
they are both geographically bound. Neither 
is sensitive to ethnic ownership nor to the 
ethnic. composition of the labor force of the 
firm in which one is employed. It is possible 
that only a modest proportion of persons 
identified as within the enclave actually are 
employed in minority-owned firms. This is 
particularly likely since minority-owned busi- 
nesses often have no paid employees. In the 
New York (N.Y.-N.J.) SMSA in 1982, about 
90% of Chinese-owned firms had no paid 
employees, and the remaining firms averaged 
only about four employees (U.S. Bureau of 
the Census 1982). Therefore, any study 
relying on the Census cf Population is liable 
to define the enclave in a way that includes 
many nonenclave employees. This procedure 
probably leads to an underestimation of 
differences between enclave and nonenclave 
labor markets. At the same time, it may lead 
to an overestimate of the absolute size of the 


“enclave labor market. 


The two teams of researchers Siakteed on 
whether the results of analysis depend on 
which indicator is employed. Sanders and 
Nee (1987, p. 756) ‘specifically stated that 
their results were replicated using place of 
work as the measure of enclave participation. 
But among private sector employees who 
work in the Miami-Hialeah area, Portes and 
Jensen (1987b) reported positive significant 


'effects of work experience, education, and 


English-language ability on earnings. - 
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NEW YORK CITY'S CHINATOWN: A. 
REPLICATION, i 


‘In the following analysis of New York City’s 
Chinatown, we join the debate in three ways. 
First, we add a new case study to the research 
on immigrant earnings. Second, we examine 
and compare three possible ways of defining 
the economic enclave: by place of residence, 
by place of work, and by industry. These 
three definitions require detailed discussion, 
and defense. We stress that we are not really 
interested in where people live or- work, or 


what industry they are employed in. We. 


simply use these measures as indicators of the 
likelihood that a person works in the enclave. 
Each of them partially captures the concept of 
economic enclave. - Third; we include a 
separate analysis of the labor market situation 
of immigrant women, who constitute a large 
share of the enclave labor force but who have 
been ignored in previous studies. 


The place of residence definition assumes . 


that Chinese immigrants who live in New 
York City are more likely to participate in the 
enclave economy than are those who live in 
surrounding areas. This assumption is justi- 


fied by the concentration of the region's - 


Chinese population in New York City. The 
.1980 Census shows that 84.5 percent (for a 
total of 124,372) .of New. York State's 
. Chinese live in New York City, and the 
majority, of New York City's Chinese are 
concentrated in three counties—New York 
County (41.9 percent), Kings County (21.0 
percent), and Queens County (31.8 percent). 
Further 73 percent of New York County's 
Chinese live in 14 census tracts in Lower East 
Manhattan. Chinese immigrants, particularly 
recent immigrants,.tend to seek both resi- 
dence and jobs in Manhattan's Chinatown or 
the newly developing Chinese enclaves in 
Flushing, Queens, and Sunset-Park, Brook- 
lyn. Chinese immigrants who live outside 
New York City are far less likely to work in 
the city, and those who do are more likely to 
take up higher ranking or professional jobs in 
the core economy.? This is possible because 


E 


? The PUMS data show that of those Chinese 
immigrant employees who live in New York City, 
about 25 percent work outside the City; they tend 
to be restaurant workers (61 percent). Chinese 
immigrants who live outside New York City are far 
less likely to work in the city; only 21 percent of 
the immigrant employees who live outside the City 
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New York City, especially Manhattan, is also 
a Central Business District for the metropoli- 
tan area. 

Using place of residence for a study of the 
New York Chinatown may be more defensi- 
ble than the residence definition adopted by - 
Sanders and Nee.. The City of San Francisco 
accounts for only a little more than a quarter 
of California's Chinese population, while 
other large concentrations can be found 
elsewhere in that state (for example, 18 
percent of California's Chinese live 'in 
Oakland and surrounding communities, and 
14 percent in the City of Los Angeles). Thus 
what Sanders and Nee defined as outside of 
the enclave may well include residential 
enclaves other than San Francisco.? . 

Another way to define the enclave is by 
place of work. In using this type of definition, 
we delimit the enclave as Chinese immigrants 
working in New York City. Evidence has 


. Shown that the majority of the Chinese-owned 


businesses are stationed in New York City, 
where thousands of ethnic jobs'are provided 
for the Chinese immigrants. The 1982 Survey 
of Minority-Owned Business Enterprises (U.S. 
Bureau of the Census 1982) indicated that 87. . 
percent of all Chinese-owned firms in New 
York State were located in the New York 
metropolitan area (5,413 .of 6,216 business- 
es). And of the 5,978 total entries in the 1988 
Chinese Business Directory for metropolitan . 
New York (Key publications 1988), the 
majority (59 percent) of Chinese firms were 
located in Manhattan’s Chinatown and Flush- 


‘ing, where they could be supported. by the 


large concentration of ethnic population. Our 
data from the 1980 Public Use Microdata 
Sample for New York State and adjacent 
counties in the Tristate area show that 67 
percent of the region’s immigrant Chinese 
labor force, regardless of where they lived, 
worked in New York City. 

In contrast to the two geographic defini- 
tions, we make use of a third definition by 
industrial sectors. Because ownership infor- 


commute into the City to work. About 63 percent i 
of those non-NYC residents who do work, in the 
City are in the high-ranking managerial, profes- 
sional, and technical jobs. 

3 Sanders and Nee compensated. fór this by 
including a variable in their equations, ETHNIC, 
which represents -the percentage of Chinese or 
Cuban residents in the city, town, or county in 


“which the respondent lives. . 
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mation is not provided in the PUMS data, we 
rely on information about the ethnic composi- 
tion of particular industries. We assume that 
sectors in which Chinese immigrants are 
overrepresented constitute the enclave econ- 
omy, while all others are regarded as 
nonenclave. Recent studies on New York 
City’s Chinatown indicate that the garment 
and restaurant businesses are the two basic 
Chinese industries. According to Peter Kwong 
(1987, p. 26), some 450 restaurants employed 
15,000 workers, and 500 garment factories 
employed about 20,000 immigrant women in 
Manhattan’s extended Chinatown area alone. 
The PUMS data show that 22.6 percent of the 
immigrant Chinese labor force in the entire 
region worked in the garment and textile- 
related industries (as compared to 2.9 percent 
for all workers in New York State) and 23.0 
percent worked in eating and drinking places, 
apparently restaurants (as compared to 3.9 
percent for all workers). Finally, some 14 
percent of Chinese immigrants worked in 
retail shops and services that may have an 
ethnic clientele. 

The 1988 Chinese Business Guide and 
Directory for Metropolitan New York further 
confirmed this distribution: the garment 
industry (a total of 437) and restaurant 
business (a total of 783) alone composed 20 
percent of the total number of entries. Other 
Chinese-owned services catering predomi- 
nantly for a Chinese clientele also were well 
represented, for example, barbershops and 
beauty salons (111), offices of herbal doctors 
(101) and herbal stores (53), clinic centers 
(23), doctors (300), dentists (98), small 
department stores (107), entertainment and 
video-rental stores (61), food stores (303), 
jewelry stores (97), and travel agencies (115). 
Thus the enclave niches are the garment 
industry, restaurant business, and ethnic- 
oriented retail and service industries.* This 
definition is not precise, but no doubt a large 
proportion of the Chinese immigrant popula- 
tion is engaged in those niches. 

Whether the economic enclave is defined 
by place of residence, or place of work, or 
industry, there is a large amount of overlap 
between the three. Of those Chinese immi- 


* The enclave industries include the 1980 PUMS 
standard industry codes 132 to 152, 500 to 532, 
540 to 542, 550 to 571, 580 to 691, 771 to 780, 
and 812 to 830. The nonenclave industries include 
all other industrial codes except 900 to 992. 
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grants who work in New York City, only 8 
percent reside outside of the city, while 54 
percent of those who work outside the city 
also live outside. Further, 69 percent of those 
who live and work in New York City were 
employed in what we define as enclave 
industries, compared to only 33 percent of 
those who live and work outside the city. This 
empirical overlap provides additional confi- 
dence in all three definitions of the enclave. 


METHODS OF ANALYSIS 


For the sake of comparability, our analysis 
follows exactly the procedures employed by 
Sanders and Nee (1987). The source of data is 
the 5 percent PUMS for New York State and 
counties adjacent to New York City in New 
Jersey and Connecticut (U.S. Bureau of the 
Census 1980). Our sample is limited to 
Chinese immigrants, ages 25-64, who worked 
at least 160 hours and earned a minimum of 
$500 in 1979. Persons in the sample are 
categorized as workers or entrepreneurs. We 
conduct separate analyses for males and 
females. 

The dependent variable in this study is the 
logged value of personal earnings in 1979. 
There are a large number of independent 
variables. Those which have clear meaning in 
terms of human capital include: labor market 
experience, education, and English-language 
ability. Marital status, the number of hours 
worked (logged), period of immigration, 
citizenship, and occupation (and presence of 
children for females) are included as control 
variables. 

The operationalization of several of these 
predictors requires some further explanation, 
although details can be found in Sanders and 
Nee (1987). Labor market experience is 
calculated from a person's age, but not 
counting years in which the person was in 
school; the square of this term is also included 
for males.5 Education is represented by three 


5 The squared term for labor market experience 
was included by Sanders and Nee to measure 
nonlinearity in the relationship. We also estimated 
equations without this terrm, which is collinear with 
labor market experience. In some of these 
equations, the coefficient for labor market experi- 
ence (which is typically significant when the 
squared term is included) is not significant. Its 
inclusion or exclusion does not affect the estimates 
of coefficients for other predictors. l 
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variables. showing the; number of years. up io 
` or above a given threshold: years of elemen- 
tary education (with a maximum of eight 
. years), years of high school education (with a 
minimum of zero and maximum of four), -and 
years of college education: (with a minimum 
of zero and maximum of eight). Year of 
immigration is represented by a series of five 
dummy variables, and occupation is repre- 


_ sented by nine dummy variables. 


Sanders and Nee (1987) reported that the 
sample of Cubans for whom place of work 
information is available is seriously biased in 
"favor of more educated, wealthier, and earlier 


immigrants. In such cases, it is especially . 


important to include adjustments for sample 
selection bias in the regression models (Berk 
1983). To test for a similar bias. in data for 
New York's Chinese immigrants, we esti- 


` mated logit regression models in-which. the 


‘dependent variable represented availability of 
place of work data and independent variables 
‘included income (logged), education, and 
year of immigration. We. estimated these 


` “Hazard” models separately for workers and 
:employers.. For the New York case, we found - 


no significant bias in the place of work 
sample (no coefficient was even as large as its 
standard érror), and therefore in the following 
equations ' 'we include no correction factor. 


1 


RESULTS 


We begin with results for male immigrants. 
Our procedure is to estimate OLS regression 
equations separately for workers and éntrepre- 


- neurs, categorized as within or outside of the 


ethnic enclave. We present three sets. of 
results, one for each alternative definition of 


the enclave. (The equation for entrepreneurs j 


who work outside of New York City is 


omitted due to the insufficient number of: 


cases. In equations where the industry 
definition is used, only two. occupational 
dummy variables are jincluded, due to the 
restricted range of eae represented i in 


_ enclave industries.) Significance tests in these. 


tables refer to one-tailed tests of the hypothe- 
sis that the population coefficient equals zero. 
~ In addition to these regression equations, 


' we also conducted F-tests of the significance 


of differences in coefficients between differ- 
‘ent, samples. Sanders and Nee ‘reported 
comparable tests in their samples, ‘showing 


' that the equations for workers and entrepre- 


neurs are significantly different, and equa- 
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tions for workers . inside and” outside the . 
enclave are also significantly different. We 
found significant’ differences between the 
equations for’ workers and entrepreneurs 
within the enclave, regardless of how the 
enclave was defined. We found two compar- 


_isons where the equation for workers within 


the enclave differed significantly from the 
equation for workers outside the enclave 
(using place of: work or industry to define 
enclave). To facilitate comparison with Sand- 
ers and Nee, we report all of the models 
separately iade of the’. results of ‘the 
F-tests. 000 à 


The Place of Residence Definition 
Table 1 summarizes the model that: is 


.estimated' using place of residence as the 


definition of the ‘enclave. Among workers ` 
living in New York City,” there are. significant 
positive effects on earnings.of labor market 
experience, college education, and English- 
language ability. Hours worked, U.S. citizen- 
ship, and occupation, as control variables, 
have statistically significant and strong ef- 


‘fects. Labor market experience and college 


education are similarly related to earnings for 
workers living outside New York City. It is 


~ curious, however, that English-language | abil- 


ity has no effect in this latter group. One 
would expect, from, the enclave model, that 
this variable would have greater ener 
outside the enclave than within. 
The equation for entrepreneurs living in the i 
city shows. `'no effect for ‘ labor ‘market 
experience or’ education... Unlike’ the results 


‘reported for California Chinese: ‘by Sanders 


‘and Nee, it appears that these human capital 
variables are more important for workers than ' 
for entrepreneurs in the New York enclave. 
English-language ability is an ‘important 


. predictor for enclave entrepreneurs, as are the 


hours: worked and occupation control vari- 
ables. 
Finally, the model for entrepreneurs. living |. 


.Qutside the city shows still another pattern: 


labor market experience and English-. 
language ability have no effect. College 


.education has a’ significant positive effect,. 


while high school coucanon has a. aes 
effect. ''. 
These rosiltsi differ from those ‘that might 


‘be: predicted from a. reading of the existing 


literature. Unlike Sanders and Nee, we find . 


` positive returns on human capital for workers 
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Table 1. Regression Equations for Chinese Immigrants (Male, Aged 25 to 64) Living in the Tri-State Area Who 
Worked at Least 160 Hours and Earned a Minimum of $500 in the Private Sector in 1979 (Place of 


Residence Definition) 


Live in NYC 

Dependent variable: Workers 
1979-earnings (In) Bp 
Intercept 4,353** 
Labor market experience .016* 
Labor market experience squared —.019° 
Elementary education — .002 
High school education — .003 
College education .065** 
English-language skills‘ .096** 
Married .007 
US citizen .110* 
Log-hours worked 1979 .S64»* 
Immigration 75—79 -.102 
Immigration 70—74 — .090 
Immigration 65-69 : — .006 
Immigration 60-64 —.130 
Immigration 50—59 .082 
Professional specialists .187* 
Technicians .191 
Sales —.168 
Administrative support 

including clerical —.170 
Business, protective and 

household services — .368** 
Precision production and craft —.014 
Operators —.358** 
Transportation —-.31l 
Laborers — .349* 
R squared 381 
Number of cases 975 





Live in NYC Live in Tri-State Live in Tri-State 
Entrepreneurs Workers Entrepreneurs 
B p B 
3.105* 4.014** 8.789* 
.012 .026* .032 
.00047* — .0395» —.001 
.015 .058 .082 
— .020 —.023 —,353** 

.047 "048" .123* 
.245** .046 — .004 
.065 .203* 1.500* 
—.136 .088 — .206 
.580** .601** .055 
— .056 — 429%" — 1.454* 
.302 — .226 — .973* 
.422* —.130 — .660 
.533* — .082 —.750 
.201 .063 —1.157* 
.848** .080 .699* 
—.324 — 041 — 
— 187 — .059 .033 
—.728 -.220 — 
—.166 -.192** —.596 
.225 ~ .229 .560 
47 -,111 —.369 
— .065 ~ —.114 
— —.182 — 
.383 557 611 
157 365 60 


* Significant at .05 level, one-tail (the t-ratio is greater than or equal to 1.65). 
** Significant at .01 level, one-tail (the r-ratio is greater than or equal to 2.33). 


* Unstandardized regression coeffients. 


> The decimal point to the parameter estimate is moved two places to the right. 
€ Likert scale (0-4). Large number indicate better English-language skills. 


Source: U.S. Bureau of the Census (1980). 


both inside and outside the enclave. And 
unexpectedly, college education has a posi- 
tive effect for entrepreneurs outside the 
enclave, but not inside. Finally, unlike the 
portrait of the enclave as a place where 
cultural assimilation is unnecessary, we find 
positive returns on English language and 
citizenship for workers within the enclave, 
but not outside.$ 


The Place of Work Definition 
Coefficient estimates for models in which the 


$ Sanders and Nee similarly found that English- 
language ability is a significant predictor of 
earnings for both Cuban and Chinese émployees 
who live within the enclave. 


enclave is defined by place of work are 
provided in Table 2. The equation for workers 
employed in New York City shows positive 
returns from labor market experience and 
college education, but not from English- 
language skills. The equation for workers 
employed outside the city shows a compara- 
ble effect of college education, but no effect 
of labor market experience. Again, English- 
language ability has no effect. 

Using the place of work definition, entre- 
preneurs within the enclave have positive 
returns on labor market experience and 
college education comparable to (possibly 
larger than) workers. There is a: very strong 
effect of English-language ability. As noted 
above, we do not report a model for 
entrepreneurs working outside of the enclave. 
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Table 2. Regression Equations for Chinese Immigrants (Male, Aged 25 to 64) Living in the Tri-State Area Who 
Worked at Least 160 Hours and Earned a Minimum of $500 in the Private Sector in 1979 (Place of Work 


Definition) 
Work in NYC Work in NYC Work in Tri-State 

Dependent variable: Workers Entrepreneurs Workers 
1979-earnings (In) Bp BY Bt 
Intercept i 4.287** 3.108 5.192** 
Labor market experience .023* 091* .036 
Labor market experience squared — .033* .001 — .049> 
Elementary education . — .008 — 025 — 026 
High school education 014 — 044 —.018 
College education .060** 095* 068* 
English-language skills* .042 .397* 106 
Married 143* 100 - 
Log-hours worked 1979 .628** 434 .487** 
US citizen .085 002 .201* 
Immigration 75-79 -—.303* — .063 —.197 
Immigration 70-74 — .206* 010 — 234 
Immigration 65-69 —.049 .335 —.199 
Immigration 60-64 —.225* .609 —.318 
Immigration 50-59 ` — .080 .190 .182 
Professional specialists — .049 240 .407* 
Technicians —.123 — 222 
Sales — .412** ~ 326 —.233 
Administrative support 

including clerical — .358** — 1.049* — 1.469** 
Business, protective and 

household services — .705** ~ 029 = 455%" 
Precision production and craft —.377** -—.323 .204 
Operators —.546** —-.176 — .080 
"Transportation —.594** — .307 — 
Laborers —.679** — —.174. 
R squared 515 522 ' 483 
Number of cases 421 80 264 





* Significant at .05 level, one-tail (the r-ratio is greater than or equal to 1.65). 
** Significant at .01 level, one-tail (the t-ratio is greater than or equal to 2.33). 
* Unstandardized regression coeffients. 
> The decimal point to the parameter estimate is moved two places to the right. 


* Likert scale (0-4). Large number indicate better English-language skills. 


Source: U.S. Bureau of the Census (1980). 


The Industrial Sector Definition 


The last set of equations uses industrial sector 
to categorize people as within or outside the 
enclave (see Table 3). Once again, we find 
that college education has positive returns on 
earnings for enclave workers. Labor market 
experience does not, but there is a positive 
effect of English language. For workers 
outside the enclave, all three of these 
predictors have significant positive effects. 
The same human capital variables that 
predict earnings for enclave workers also 
predict earnings for enclave entrepreneurs: 
college education, English language, and 
citizenship. College education is significant 
for entrepreneurs outside of enclave indus- 


tries, as is labor market experience; English 
language is not. , 


The Case of Women Workers 


Up to this point, the analysis has been limited 
to men. This limitation has been necessary in 
order to maintain comparability with previous 
studies. Substantively, however, it is indefen- 
sible. Women constitute a major share of the 
labor force. More important, it is well known 
that women's position in the labor market is. 
not the same as men's, and there is no reason 
to believe that mobility processes experienced 
by men are in any way applicable to women. 
In fact, separate analysis of Chinese immi- 
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Table 3. Regression Equations for Chinese Immigrants (Male, Aged 25 to 64) in the Tri-State Area Who Worked at 
Least 160 Hours and Earned a Minimum of $500 in the Private Sector in 1979 (Industrial Sector Definition) 











a In Enclave In Enclave Outside Enclave Outside Enclave 
Dependent variable: Workers Entrepreneurs Workers Entrepreneurs 
1979-earnings (In) B* Bp p* B* 
Intercept 4.509** .654** 3.312** — 2.069 
Labor market experience —.002 — .004 .050** .104* 

` Labor market experience squared .005> .009° — .O77>** — 001 
Elementary education .006 .014 —.036 .104 
High school education — 004 —.079 —.034 , ~ .155* 
College education 065** .121** .090** .133** 
English-langauge skills" .079** .234* ' ,l23** .080 
Married . . .003 249 .113 .369 
Log-hours worked 1979 .568** .471** ; .697** 1.125** 
US citizen” .016 — 186 "^ 2225 —.138 
Immigration 75-79 — .096 — .458 — .233 ~ .703 
Immigration 70-74 . —.092 .074 —.104 —.157 
Immigration 65-69 —.056 .298 — .030 -— 225 
Immigration 60-64 . —.059 275 — .169 .377 
Immigration 50-59 .081 —.351 .071 -—.013 
Business, protective and. i ; 

household services —.261** —.101 —.136 ~ 1.475* 
"Operators | — .219** —.131 — .339* ~ 1,569*- 
R squared ` 339 .280 .403 . 679 
Number of cases 729 157 611 60 


* Significant at .05 level, one-tail (the f-ratio is greater than or equal to 1.65). 
** Significant at .01 level, one-tail (the r-ratio is greater than or equal to 2.33). 


_’ Unstandardized regression coeffients. | ' 


> The decimal point to the parameter estimate is moved two places to the right. 
* Likert scale (0-4). Large number indicate better English- me skills. 


Source: U.S. Bureau of the Census (1980). 


grant women working in the New York 
metropolitan area reveals some very different 
patterns. We will.describe these briefly here; 
for a more complete analysis, see Zhou 
(1989). l 

New York’s Chinatown was ‘a bachelor’s 
society for the greater part of this century. It 
was not until after 1965, when U.S. immigra- 
tion policies were revised to favor family 
reunification, that Chinese women entered 
‘New York in large numbers. In the 1940s, 
.there were six times as many Chinese men as 
women in New York State; by 1980, the ratio 
had declined to 106 men per 100 women. 
Large-scale immigration of women has had 
two kinds of effects. First, through sheer 
numbers and by promoting a family-centered 
society in-which most adults are living with a 
spouse, ‘it has expanded the market for 
Chinese goods and services that are inacces- 
` gible in the larger society. This has stimulated 
development of a more diversified enclave 
economy, including such sectors as the food 
industry, restaurant business, and beauty 
salons. Second, by providing a large addi- 


tional pool of cheap labor at a critical time in 
the city’s overall economic restructuring, it 
has promoted the rapid development of a 
Chinese garment’ industry. The labor force 
participation of Chinese women over age 16 
is unusually high in New York State (59 
percent). Women’s concentration in the 
garment industry .is extraordinary: more than 
55 percent of all Chinese immigrant women 
who worked at least 160 hours and earned 
over $500 in 1979 were employed in the 
garment industry (compare to Glenn 1983 on 
the “dual wage earner family”). a 
The specific question here is whether 
participation in the enclave economy affects 
immigrant women’s ability to reap earnings 
returns on human capital, as hypothesized for 
men. There are too few cases of entrepreneurs 
in our female sample to support separate 
analysis. Therefore, we will deal only with 
employees. Table 4 reports equations for 
female workers both within and outside of the 
enclave, with enclave defined by.place of 
residence, place of work, and industry. The 
independent variables in | these equations are 
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Table 4. Regression Equations for Immigrant Chinese Employees (Females, Aged 25 to 64) Living in the Tri-State 
Area Who Worked at Least 160 Hours and Eamed a Minimum of $500 in the Private Sector in 1979 














Place of 
Residence 

Dependent variable: NYC Outside 
1979-earnings (In) B B 
Intercept 4.469** 2.334** 
Labor market experience ~ .004* .004 
Elementary education .007 — .013 
High school eductaion — .003 .043 
College education .016 .046* 
English langauge skills" .043 .002 
Married — .082 — .094 
Fertility .007 —.073* 
Log-hours worked 1979 -669** .951** 
U.S. citizen .018 .102 
Immigration 75—79 — 246** —.224 
Immigration 65—74 — .056 —.136 
Sales, administrative support 

and precision production —.472** —.319** 
Business, protective, and 

household services — .376** — .400** 
Operators and laborers — .768** —.328** 
R squared .456 .555 
N of cases 815 211 





Place of Industrial 
Work Sector 
NYC Outside Enclave! ^ Nonenclave? 
B* B B* B 
4.334** 2.826** 4.096** 3.786** 
—.004 — .005 -.002 -—-.003 
.012 — .016 .014 -.032 
— .002 .009 .005 .008 
.020 .093* .038 004 
013 — .058 —.036 . .123** 
—.054 — .343** — .068 ~ 111 
— .004 — .046 .003 — ,003 
.679** .940** .685** .789** 
— .004 .078 .086 — .036 
—-271** —.319 -—.172* — .282** 
— .066 — .203 002 ~~ .148* 
—.516** —.242 ~ 567** -.311** 
—.314 .029 — .440** -.350** 
—.785** — .098 — .612** — .570** 
417 .438 .364 .A3T 
400 130 637 389 





* Significant at .05 level, one-tail (the t-ratio is greater than or equal to 1.65). 
** Significant at'.01 level, one-tail (the t-ratio is greater than or equal to 2.33). 


* Unstandardized regression coeffients. 


^ Likert scale (0—4). Large numbers indicate better English-language skills. 
! Enclave industries include 1980 standard industry code 132 to 152, 500 to 532, 540 to 542, 550 to 571, 580 to 


691, 771 to 780, and 812 to 830. 


? Nonenclave industries include all other industrial codes except 900 to 992. 


Source: U.S. Bureau of the Census (1980). 


the same as those used for men, with the 
addition of a variable indicating whether the 
woman had any children. 

We note first that there are significant 
differences between the equations for men 
and women (based on F-tests similar to those 
reported above for workers vs. entrepreneurs 
and enclave vs. nonenclave). Among women 
outside the enclave, there are some significant 
positive coefficients for human capital vari- 
8bles: college education in one equation, 
English language in another. But the most 
consistent effects are for hours worked and 
occupation. Similarly, among women within 
the enclave, regardless of the definition, the 


'strongest predictors of earnings are occupa- 


tion and hours worked. The surprise here is 
the total absence of human capital effects: 
neither education, nor English-language skills, 
nor citizenship has any significant effects. In 
one equation, labor market experience has a 
significant negative coefficient. 

These findings for women workers require 


a careful reformulation of the hypothesis of 
the positive functions ofthe enclave econ- 
omy. In this case, and we suspect that New 
York's Chinatown is not an unusual case, 
female immigrant workers within the enclave 
have no measurable earnings returns on 
previous human capital. This result does not 
stem in any obvious way from personal 
characteristics of these women. They are not 
unusualy low in education; nor are they 
predominantly part-time workers (of those 
who live in New York City, for example, 26 
percent have some college education and the 
average number of hours worked is 38.3). 
What then accounts for the fact that Chinese 
immigrant women’s earnings are unrelated to 
their education, experience, language ability, 
and citizenship? Our view, based upon 
fieldwork in Chinatown, is that they face two 
sorts of disadvantages. The first, which they 
share with men, is the set of disadvantages 
associated with immigrant status. The second 
is a set of cultural obstacles within the 
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enclave:- occupational segregation by gender 
(particularly in the garment industry).and a 
triple role as wives, mothers, and wage 
workers. These women on the whole are 
expected (and expect themselves) to earn 
wages in ways that do not conflict with their 
family obligations. Sewing at piecework rates 
is a good fit to these expectations: working 
hours are flexible, and a higher income can be 
gained by working faster—even if the pay per 
piece is low. Many middle-aged women, 
those who immigrated at age 40 or 50, accept 
a short-term orientation toward work: their 
purpose is not to develop a working career 
(this applies even to many who had profes- 
sional occupations in China) but to contribute 
immediately to the household income for the 
benefit of younger members. And finally, the 
limited number of jobs with higher educa- 
tional qualifications tend to be reserved to 
men. The male supremacy that dominatés 
Chinese culture reinforces gender discrimina- 
tion in the enclave labor market. 


DISCUSSION AND CONCLUSION 


Broadening the analysis of enclave labor 
markets to include the position of immigrant 
women leads to startling results. Researchers 
must now ask to what degree, the positive 
functions of the enclave for men are derived 
‘from the subordinate position of women. The 
result may depend on the structure of the 
enclave economy and its relationship to the 
larger economy: what kinds of industries can 
prosper, with what labor requirements, and at 
what wages. It may also depend upon the 
values and motives of the immigrants them- 
selves. We encourage caution in evaluating 
women’s status. Like unpaid family labor, 
paid work in jobs incommensurate with one’s 
education and other attainments has both a 
negative and a positive side. Viewed from an 
individualistic perspective, the enclave labor 
market appears clearly exploitative of women. 
But we must remember that Chinese culture 
gives priority not to individual achievement 
but to the welfare of the family and 
community. As Glenn (1983) emphasizes, 
female labor force participation is part of a 
family strategy. It is not obvious that the 
Chinese immigrant community has better 
options in the face of limited opportunities 
and discrimination in the mainstream econ- 
omy. i 

Aside from the situátion of women work- 
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ers, how does the enclave labor. market; - 
function for men, and what differences are. - 
self- ^ 


there between ‘workers and the 
employed? One consistent finding here is that 
college education has positive returns- for 
earnings for male enclave workers, regardless 
of how the enclave is defined. Labor market 
experience and English language have posi- 
tive effects for these workers in two out of. 
three equations. We conclude that among . 
New York's Chinese immigrants, enclave 
workers are able to take advantage of human 
capital resources to increase earnings (al- 
though not consistently more than workers 
outside of the enclave). Further, there is. no 
consistent evidence that entrepreneurs in the 
enclave or outside of the enclave have greater : 
returns on human capital than do similarly. 
situated workers. ; 

There are two ways to interpret this main 
result for New York's Chinatown, and thé 
choice between them depends upon one's 
reading of the facts about San Francisco and 
Miami-Hialeah.: Portes and Jensen (19872) . 
argued that Sanders and Nee's results are an 
artifact of an incorrect specification of the 
Cuba and Chinese enclaves that they studied. . 
Sanders and Nee reported that they derived ` 
the same findings for the Miami-Hialeah and , 
San Francisco cases using the place of work 
definition as they did using the place of 
residence definition. But Portes and Jensen 
disputed this report based on their own 
reanalysis of the same data for Miami- 
Hialeah. 

Unfortunately, neither we.nor the reader 
can resolve this factual discrepancy. We note, 
however, that Portes and Jensen's (1987b) — 
unpublished table included à different age 
range (18—64 instead of 25—64) than used by 
Sanders and Nee, operationalized education, 
occupation, and year of immigration differ- 
ently, compared coefficients in the equation 
for persons working in the enclave with those 
for all persons rather than with those for: 
persons working outside of the enclave, and 
omitted a correction for sample selection bias 
which Sanders and Nee .described as.espe- 
cially important for the place of work sample 
in Miami-Hialeah. 

Let us suppose that a methodologically 
correct analysis of census data for San 
Francisco and Miami-Hialeah does reveal 
significant earnings-returns to human capital 
for enclave workers, as argued by Portes and, 
Jensen. Then, our findings for New York's | - 
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*. Chinese would be one more confirmation of 
Portes' hypothesis as a general rule about 
enclave economies. This enclave hypothesis 
. is weakened, however, by the fact that human 
` capital returns for men. are no greater within 
the enclave than outside. To make a stronger 
case would require that persons outside of the 
enclave be further classified into the primary 
or secondary labor market, which we have not 
been able to do here. A 
Alternatively, let us suppose that Sanders 
and Nee's findings are confirmed. This is 
. theoretically more interesting, because then 
our results for New York City would indicate 
. that there are differences among enclaves, 
even enclaves of the same ethnic minority, 
. concentrated in the same types of industries 


' : and occupations, in the earnings-returns to 


human capital. 

Portes and Bach (1985) have shown 
previously that the situation of Cuban immi- 
grants differs from that of Mexican immi- 
grants, who are not sheltered by a differenti- 
ated enclave economy. If in fact there are also 
important differences among enclave econo- 
mies—if male enclave workers get earnings- 
returns on human capital in New York but not 
in San Francisco and perhaps not in Miami- 
Hialeah-then there is an opportunity to 
develop a comparative theory of enclave 
economies. The most positive outcome of this 
controversy would be to stimulate other 
researchers to extend investigation to other 
minorities in other places. 

Why might the Chinese experience in New 
' York differ from that of Chinese in San 
Francisco or Cubans in Miami? We are not 
able to provide a comparative theory here, or 
to explain these particular cases. However, 
we wish to stress the thesis that enclave 
economies are neither uniform nor static. 
Consider, for example, the specific conditions 
that made possible an enclave economy in 
Miami after 1959. These include (following 
Wilson and Portes 1980, p. 314) “the 
presence of immigrants with sufficient capital 
and initial entrepreneurial skill [and] . . . 
sustained immigration.” The initial group of 
migrants included large numbers of educated 
people from entrepreneurial backgrounds, 
many with considerable assets, for whom the 
United States government made generous 
provisions for small business loans. The most 
recent wave (refugees expelled from Cuban 
jails) -are quite a different group, with 
potentially different prospects for upward 
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mobility. Similarly, Chinese immigration to 
the United States comes from two very 
different origins: one stream from rural areas 
of Southern China (principally Guangdong 
Province), and another from cities and towns 
along the East Coast of China, Hong Kong, 
and Taiwan. Mobility opportunities for the 
former group, who are less educated, and less 
likely to have access to capital resources, may 
be more’ limited. One possible reason for 
differences among Chinese enclave econo- 
mies (or for changes over time) could be 
differences in the origin of immigrants. 
Other sources of variation may include the 
kinds of jobs available in the enclave, the rate 
of immigration over time, the size of the 
enclave labor market, and the strength of the 
boundary protecting it. For example, Sanders 
and Nee (1987, p. 764) argued that the 
long-term development of enclave businesses 
is “constrained by the principle of competi- 
tive exclusion. . . . An enclave economy can 
support only so many entrepreneurs.” And 
further, "[s]uccess of small businesses de- 
pends, substantially, on the maintenance of a 
large pool of low-wage workers.” Possibly, 
then, rates of upward mobility and returns to 
human capital may be greater in the early 
phase of enclave development and may be 
limited by subsequent rates of immigration. - - 
Hypotheses of this sort require more 
comparative studies of enclave labor markets. 
In such research, our experience suggests that’ 
the PUMS data are a useful source of 
information on labor markets, particularly if 
the researcher has enough other information 
on which to base judgments about enclave 
definitions. But we emphasize that census 
data are only a proxy for more direct 
measures, which would require information 
on the ownership and labor force composition 
of firms in which respondents are employed. 
Although we have focused on the issue of 
earnings-returns on human capital, some 
other results concerning Chinese in New York 
should not be overlooked. First, the probabil- 
ity of a male immigrant's being self- 
employed is greater in thé enclave than 
outside, which supports the notion of a 
enclave’ as providing paths for upward 
mobility. (Curiously, based on the sample 
Sizes reported by Sanders and Nee, this is not 
true of the Miami and San Francisco 
enclaves.) Second, whatever the returns to 
human capital, the absolute earnings gap 
between workers (both male and female) 
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within and outside the enclave is large: 

enclave workers have worse jobs at lower 

pay. It is clear that the Chinese enclave in 

New York offers some compensations but not 
. equality with the larger economy. 
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MONEY, BUSINESS, AND THE STATE: 
MATERIAL INTERESTS, FORTUNE 500 CORPORATIONS, 
AND THE SIZE OF POLITICAL ACTION COMMITTEES* 


JOHN L. BOs 
The University of Michigan 
N * * * LA * 
c This paper uses data from the Federal Elections Commission on political action 


‘committees, sponsored by Fortune 500 industrials in 1976 and 1980, to examine ` 


the determinants of business PAC size. Of the five groups of variables suggested by 
the extant PAC literature—availability of resources, free rider problems, material 
interests, previous campaign activity, and industry categories—just the measures 
of material interests have consistent and important effects on the level of firm 
political activity. Firms with the richest history of interaction with the state, top 
defense contractors, major acquirers of other businesses, and corporations 
prosecuted for criminal acts are the most politically active of large firms. Thus, 
while material interests are an important determinant of political action, this study 
suggests only interests tied to special long-term relationships with the state serve to 
increase the amount of political action taken by large firms. These results suggest 
that the pluralist-based theory of business political involvement, so widely accepted 
among contemporary PAC researchers, is of little utility for explaining firin 


differences in political activity. 


INTRODUCTION 


Many recent studies have documented the 
substantial and disproportionate influence the 
business community has had on public 
policy-making processes, including those of 
the legislative, executive, and electoral sys- 
tems. Not surprisingly, this research has 
provided substantial evidence that businesses 
organize themselves into very effective polit- 
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ical groups. However, several analyses show 
a differential commitment to political activity 
among even the best-organized groups of 
firms. For example, only 262 of the firms on 
Fortune Magazine's 1980 list of the 500 . 
largest U.S. industrials had political action 
committees (PACs) with total receipts of 
more than one dollar during the 1979-1980 
election cycle.! Only 989 of the more than 
30,000 non-Fortune 500 public corporations 
had a PAC in 1980 (Matasar 1986, p. 26). 
The amount of money collected and disbursed 
by business political action committees varies 
dramatically from committee to committee 
(Ryan, Swanson, and Buchholz 1987; U.S. 
Federal Election Commission 1978, 1982). 
These firm differences in political action 
suggest that some businesses likely have more 
influence over political events than others. . 
Knowing which firms are more active and 
why will improve explanations and predic- 
tions of our political system. 


! [n the data set for this paper, 262 firms out of 
500 had PACs with more than one dollar in 
receipts. Other authors have found slightly differ- 
ing numbers of committees. For example, Ryan, 
Swanson, and Buchholz (1987) found 280 Fortune 
500 firms with PACs; Matasar (1986) found 286. 
Differing numbers of PACS are likely the result of 
somewhat differing criteria for determining whéther 
a firm has a PAC. 
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This research considers why individual 
businesses vary widely in their commitments 
to political activity by examining the size of 
political action committees of one of the most 
politically active groups of large businesses, 
the Fortune 500 list of U.S. industrials. 
Before significant changes in campaign fi- 
nance legislation appeared during the late 60s 
and 70s, and before businesses used political 
action committees, little systematic data was 
available on the political behavior of business 
(Epstein 1980)? Now, however, the business 
PAC phenomenon offers a heretofore unavail- 
able glimpse into tbe world of business 
political activity. 


LITERATURE REVIEW 


A single theoretical perspective, rooted in 
contemporary pluralist theory (e.g., Dahl 
1961, 1958), dominates both theory and 
research addressing the wide variation in the 
participation of business political action 
committees.3 This perspective applies a ra- 
tional decision-making theory, usually drawn 
directly from the work of Mancur Olson 
(1971), to observations of business political 
behavior. The cost/benefit calculus common 


to this position suggests there are two key 


determinants of business political action. 
Firms act politically if the costs of activism 
can be recouped by the specific action taken 
and if the benefits of the action will be reaped 
primarily by the politically active rather than 
the less active competition, the "free riders." 
Thus research in this tradition focuses on the 
narrow instrumental interests of the firm (or 
industry) such as government purchases 
(Masters and Keim 1985), or regulatory 
burden (Pittman 1977). The free rider prob- 
lem highlights aspects of the structure of the 





? For more detailed expositions on the nature 
and history of political action committees, see the 
works of Epstein (1980), Sabato (1984), Handler 
and Mulkern (1982), and Matasar (1986). 

?' Much other research also examines business- 
sponsored PAC behavior, but this research exam- 
ines the distribution of PAC contributions to 
federal candidates and is therefore not directly 
relevant to this discussion. Moreover, the majority 
of the more recent research challenges the 
dominant pluralist perspective on the level of 
consensus and cohesiveness of business PAC 
behavior (e.g.; Burris 1987; Clawson, Neustadtl, 
and Beardon 1986; Mizruchi and Koenig 1986). 
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firm's primary industry that influence the 
political activity of other firms in the industry 
as well as the distribution of expected returns 
from political action, for example, industry 
concentration and size (Andres 1985; Pittman 
1977; Epstein 1980; Siegfried 1980).* Most 
research and theory following this approach 
suggest that the availability of resources 
significantly influences the level of political 
action (Matasar 1986). In many respects this 
position also draws heavily on the resource 


mobilization perspective described by McCar- 


thy and Zald (1973). Marxist scholars also see 
material interests as an important driving 
force behind business political activity (Gold, 
Lo, and Wright 1975; Whitt 1979}. Hence, 
the most unique property of the dominant 
model is the concept of free riders. 

The vast majority of the empirical research 
examining business PAC formation flowing . 
from the rational actor perspective attempts to 
explain only why a firm does or does not 
sponsor a PAC. However, the amount of 
money PACs collect and disburse varies 
dramatically. For example, in the 1979-80 
election cycle there were 35 firms with PACs 
that collected less than 11,000 dollars, 50 
firms with PACs that collected over 90,000 
dollars, and 12 firms with PACs that collected 
over 200,000 dollars (U.S. Federal Elections 
Commission 1982). The difference in the 
potential influence of a firm with a PAC that 
has 10,000 dollars to contribute as opposed to 
one with 400,000 dollars is substantial. 
Because firms must allocate resources to 
create and administer the committees as well 
as to solicit money—Matasar (1986, p. 31) 
finds some firms spend as much as 50 cents 
for every dollar raised on behalf of the firm’s 
PAC—the wide variation in PAC size also 
suggests large differences in the commitment 
of businesses to political behavior. Thus 
differentiating the political endeavors of firms 
into just two categories, those with PACs and 
those without, conceals much information 
about individual firms’ or groups of firms’ 
political influence and level of commitment to 
political action. 

The central purpose of this paper is to 
examine not just presence or absence of 


* Good reviews of this literature can be found in 
Siegfried (1981) and in Epstein (1980). For a more 
general review of the business and politics 
literature, look to Zald and Berg (1978) and to 
Beitz (1984). 


MONEY, BUSINESS, AND THE STATE 


activity, as does previous PAC research, but 
also variation in amount of business PAC 
activity. The previous analyses consider only 
a very limited number of factors that 
influence PAC activity. The current study, 
however, investigates a wide range of forces 
which PAC research suggests affect a firm’s 
political behavior. 


VARIABLE SELECTION 
The PAC literature points to several groups of 


variables that are important for determining . 


large-firm political behavior. These groups 
include resource availability such as firm size; 
determinants of free rider problems, such as 
industrial concentration; material interests, 
such as government contracts; and political 
behavior, such as membership on the Busi- 
ness Roundtable or previous campaign contri- 
bution activity. Table 1 provides a summary 
of data sources and coding schemes for the 
variables in each group. 


823 


" Resource Availability : sh 


Most PAC researchers argue that resource 
availability, measured principally by annual 
sales volume, is strongly related to a firm 
having a PAC. Thus, in their study of the 
political behavior of Fortune 500 firms,- 
Ryan, Swanson, and Buchholz (1987, p. 130) 
found that 82 percent of the largest 125 
Fortune 500 firms had PACs during the 
1979-80 election cycle compared to only 29 
percent of the smallest 125 firms. Earlier 
researchers have found similar relationships 
(e.g., Brenner 1980, 1981; Budde 1980; Gais 
1983). 

Firm size is measured in the current study 
as the total annual sales in thousands of 
dollars and the number of people employed 
by each corporation in 1979 (Colvin and 
Knight 1980). Other measures of resources in | 
this analysis includé the five-year mean of the : 
ratio of each firm's research and development 
expenditures to sales (R and D) and the labor 


- cost to sales ratio (from Clinard and Yeager 


Table 1. Data Sources, Units, and Coding for the Independent and Dependent Variables 


Variables ' Units and Coding Data Source 
Dependent variables: ! : 
1976 PAC receipts Dollars U.S. F.E.C. 1978 
1980 PAC reeceipts Dollars U.S. F.E.C. 1982 
Resource availability: 
Annual sales in 1979 1000s of dollars Colvin and Knight 1980 
Number of employees in 1979 Persons ! 
Return to investors in 1979 10,000s of dollars 
Earnings per share in 1979 Dollars 
Growth rate for 1969-1979 Percent ? 
R&D to sales ratio Ratio Clinard and Yeager 1981 
Labor/sales ratio Ratio 
Free riders: 
Average concentration ratio Ratio Clinard and Yeager 1981 
Relative firm dominance Percent 7 
Material interests: 
Unique legal actions Legal action Clinard and Yeager 1981 
Defense contracts in 1979 1000s of dollars U.S. Sec. of Defense 1979 


Total mergers and acquisitions 
1960-1979 
Previous political action: 
Business council membership 
Campaign donations of 
top managers in 1972 
Industry effects: 
Lamber and paper firms 
Chemical firms 
Oil producing firms 
Automotive firms - 
Aerospace firms 
Pharmaceutical firms 


1,000,000s of dollars 


1=Yes 0=None. 
Dollars 


1=member of industry 
0=other industry 


Bureau of Economics 1981 


Shoup 1980 
Paul 1975 


From Fortune Industry Codes, 
See Colvin and Knight 1980 
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1981). These measures reflect resources 
beyond those necessary for day-to-day busi- 
ness functions. Because the economic health. 
of a business also affects resources available 
for politics, return to investors for 1979, 
earnings per share for 1979, and growth rate 
for 1969-1979 are also included as variables 
in this analysis (Colvin and Knight 1980). 


The Free Rider Problem 


Epstein (1980) hypothesizes that industrial 
. concentration influences the political activity 
of a firm. Both Andres (1985) and Pittman 
(1977) show that industry concentration 
Significantly affects whether or not a firm is 
‘politically active. Pittman views industrial 
concentration as an important gauge of a 
: firm's perception of the beneficiaries of its 
political activity, as a measure of the potential 
' for "free riders" to get a piece of the pie 
. without investing any resources. Andres, 
. Epstein, and Pittman argue that higher levels 
of industrial concentration increase a firm's 
political action because the resources pro- 
.vided by the state. vary with each firm's 
market share. One measure of industrial 
concentration I use is the average concentra- 
tion ratio in all the manufacturing industries 
in which a firm operates, weighted by the 
relative importance of each industry to the 
- firm based on its sales in each four-digit SIC 
.category (from Clinard and Yeager 1981). 
The other measure of concentration is the 
relative firm dominance in the primary 
. industry classification of each corporation 
(also from Clinard and Yeager 1981). 


Material Self-interest 


Most previous research has used very crude 
indicators of material interests. Regulatory 
burden, for example, has typically been 
measured by whether or not firms or 
industries are regulated (Andres 1985; Pitt- 
man 1977). The rational interest theory used 
by most PAC research argues the more 
regulatory actions a firm faces the more 
politically active it becomes. In this analysis, 
each Fortune 500 firm’s regulatory burden is 
measured by the total number of unique 
administrative, civil, and criminal legal ac- 
tions brought against it. by government 
' regulatory bodies in 1975 and 1976 (Clinard 
and Yeager 1981). The number of adverse 
regulatory actions each firm must endure 
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provides a direct and detailed measure of its 
regulatory environment. 

Pittman (1977) contends the greater the 
firm’s exposure to the risk of antitrust action, 
the more politically activé the firm will be. 
He examines this hypothesis by using the total 
number of antitrust suits filed against firms 
within each of 81 industries. The current 
study uses a more novel approach. The 
relationship to the Department of Justice 
Antitrust Division of a Fortune 500 firm is 
assessed by its history of mergers and 
acquisitions of 10 million dollars or more 
from 1960 through 1979 (Bureau of Econom- 


‘ics. 1981). The variable used is the value of 


these mergers summed over the 20-year period 
of the series. The larger the value of the merg- 
ers in which a firm has participated, the more 
visible a firm will be to the-Department of 
Justice Antitrust Division. Moreover, the larger 
the value of its acquisitions, the more a firm 
has to lose from successful antitrust suits. Firms 
with more active merger histories have to deal 
directly with the state more often than less 
active firms. Therefore, the current study hy- 
pothesizes that more mezgers and acquisitions 
will result in more firm political activity to 
counter the increased risk of adverse govern- 
ment action and to facilitate the smooth pro- 
cessing of paperwork through the Department 
of Justice. 

Etzioni (1984) and Gais (1983), among 
many others, argue that the stronger a firm's 
dependency on government legislative action 
for its sales and profits, the more politically 
active the firm will be. Pittman (1977) found 
that industries with more government con- 
tracts contribute more money to the Commit- 
tee to Re-elect the President than other 
industries. Therefore, the current study uses 
as a third measure of material interests, the 
total value of United States Department of 
Defense contracts a firm received from the 
federal government in fiscal year 1979 (U.S. 
Office of the Secretary of Defense 1979).5 


Political Action 
Many authors have noted that very few 





5 During 1979 some 41.97 billion dollars out a 
total of 63.25 billion dollars in defense procure- 
ment contracts went to the top 100 defense 
contractors. The top 100 hundred firms split about 
66.36 percent of the available contracts while the 
rest were divided among over 10,000 smaller 
contractors. : 
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companies form PACs, concluding that firms 
without PACs may not need them to exercise 
political influence (Matasar 1986, pp. 29-30; 
Sabato 1984). Sabato (1984, p. 33) writes: 


A Times Mirror executive, explaining its [Times 
Mirror] lack of a PAC, noted that “some of our 
management are extremely well connected—you 
know they can pick up the phone and get 
anybody from the President on down.” Simi- 
larly, Irving Shapiro of Dupont was perhaps able 
to do without a PAC because he was Chairman 
of the Business Roundtable. É 


To examine the impact of non-PAC political 
activity on PAC size, I test the effects of two 
measures of other political activity. A dummy 
variable representing the presence of a 
top-level manager or member of the board of 
directors of the firm on the Business Council 
during the years 1976 and 1978 assesses 
nonelectoral political activity (Shoup 1980). 
Some authors suggest previous campaign 
contribution activity by top-level management 
increases the probability of firm PAC forma- 
tion (Handler and Mulkern 1982). Such 
activity is measured here as the total value of 
gifts of $10,000 or more to any 1972 federal 
election campaign by the Chief Executive 
Officer, the Chairman of the Board, the 
members of the Board of Directors, or any 
other top-level manager of the firm (from 


` Paul 1975). 


Industry Effects 


Ryan, Swanson, and Buchholz (1987, p. 144) 
found wide variation among different Fortune 
industry groups in the probability of a firm 
having a PAC. Over 80 percent of the 
businesses in the mining, rubber ànd plastics, 
aerospace, pharmaceutical, and oil industries 


had PACs during the 1980 election cycle, ` 


compared with no firms with PACs in the 
furniture, jewelry, and leather industry cate- 
gories. Most authors regard a firm's primary 
industry category as a proxy for a firm's or 
industry's regulatory exposure, and, less 
often, as an indicator of membership in an 
interest group (Epstein 1980; Pittman 1977; 
Ryan et al. 1987; Siegfried 1981). 

I use seven dummy variables to test the 
effects of a firm's primary industry classifica- 
tion on political participation. The 49 indus- 
tries used by Fortune Magazine are grouped 


into seven categories (see Colvin and Knight . 


1980 for details of the Fortune industry 
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classifications). Firms are coded one for 
membership in an industry grouping and zero 
otherwise. The seven groups are aerospace 
firms (e.g., LTV, Boeing), lumber and paper 
producing firms (e.g., Crown-Zellerbach, 
Weyerhaeuser), chemical firms (e.g., Du- 
pont, Dow), automotive and transportation 
firms (e.g., GM, Chrysler, International 
Harvester), pharmaceutical firms (e.g., Eli 
Lilly), and another category containing the 
rest of the firms on the Fortune list. 


DATA AND METHOD 


Table 2-contains descriptive statistics for the 
variables used in this analysis. 

The large number of independent variables 
necessitated a screening step. Only variables 
correlated highly enough with either 1976 or 
.1980 PAC receipts to be statistically signifi- 
cant at the .01 level are used in the regression 
analysis. The choice of .01 as the criterion for 
inclusion in the OLS procedures is somewhat 
arbitrary. However, because of the large 
sample size (500 cases), I use conservative 
criteria to exclude variables in order to reduce 
the risk of type I error. 

I employ stepwise Ordinary Least Squares 
(OLS) regression to facilitate the selection of: 
a parsimonious model of 1976 and 1980 PAC 
receipts. Although the SPSS*"" stepwise 
procedure employs both forward and back- 
ward selection, multicollinearity can still 
confound the results of this procedure (SPSS 
Inc. 1986). However, the highest correlations 
among the independent variables are .741 
between annual sales and number of employ- 
ees and .600 between membership in the 
aerospace industry and value of defense 
contracts. No other intercorrelations are 
above .500. Therefore, the amount of mul-. 
ticollinearity in this analysis is not sufficient 
to recommend using a different model 
selection method. 


RESULTS 


Of the 19 bivariate correlations in Table 3, 
only 10 are statistically discernible from zero 
at a probability of .01. Total number of 





éI have analyzed this data extensively using a 
number of different techniques. The results of the 
stepwise regression analysis I present here are 
identical to the results of these earlier analyses. 
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Table 2. Descriptive Statistics for the Dependent and Independent Variables 





unique legal actions, average concentration 
ratio, labor to sales ratio, annual sales, 
number of employees, membership in the oil 
production industry, membership in the aero- 
space industry, value of mergers and acquisi- 
tions, value of defense contracts, and Busi- 
ness Council membership all meet the criterion 
for inclusion in the regression analysis. The 
intercorrelations among the variables without 
statistically significant associations with ei- 
ther 1976 or 1980 PAC receipts indicate that 
suppression is not affecting any of the 
relationships shown in Table 3. 

"The best models for 1976 and 1980 PAC 
receipts are shown in Table 4. For 1976 PAC 
receipts 25 percent of the variance is 
explained. For 1980 PAC receipts the se- 
lected variables explain 37 percent of the 
variance. The amount of defense contracts, 
value of mergers and acquisitions, and 
number of legal actions .are statistically 
important predictors of both 1976 and 1980 


z 
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Variable Standard 
Name Mean Deviation Minimum . Maximum 
Dependent variables: ; f 
1976 PACs ($) 6,816.3 20,080.2 0.0 183,573 
1980 PACs ($) 29,889.8 56,296.6 0.0 445,776 
Resource availability: f 
Annual sales (1,000s $) 2,890,237 6,323,718.97 409,662 79,106,471 
Employees 32,366.72 58 327.972 + 642 853,000 
Return to investors in 1979 3,872.73 4,744.05 "42 59,230 
(10,0008 $) 
Earnings/share in 1979 ($) 8.75 19.34 .04 26.22 
Growth rate 1969-1979 (96) 22.51 28.43 0.0 54.77 
- R&D/sales ratio .022 .022 0.0 .130 
Labor/sales ratio .281 .097 .060 .460 
. Free riders: 
' Concentration ratio .405 .133 .140 .8900 . 408 
Relative firm dominance .147 .109 .010 .670 408 
Material interests: 
Unique legal actions 1.289 2.015 0.0 14.00 408 
' Defense contracts (1000s $) 68,344.404 310,220.61 0.0 3,492,140.0 500 
Mergers and acquisitions 141.139 327.298 0.0 4,468.0 500 
(1,000,000s $) d 
Previous political action: 
Business council membership .152 .359 0.0 1.0 500 
Campaign donations of - 14,157.82 52,874.21 0.0 547,000.0 500 
top managers in 1972 ($) 
Industry effects: 
Lumber and paper firms .060 .238 0.0 1.0 500 
Chemical firms .078 .268 0.0 1.0 500 
Oil producing firms .072 .259 0.0 1.0 500 
Automotive firms .042 .201 0.0 1.0 500 
Aerospace firms .024 .153 0.0 1.0 500 
Pharmaceutical firms .034 .181 0.0 1.0 500 


PAC receipts. Despite ihe importance given 
to firm size in the PAC literature, annual sales 
is unimportant in its effects when compared 
with the merger and defense variables. Total 
annual sales is statistically discernible at .05 
only for 1980 PAC receipts and number of 
employees is not significant in either equa- 
tion. Every million dollars in annual sales 
increases 1980 PAC receipts by one dollar. 
Every million dollars in defense contracts 
increases 1980 PAC receipts by 66 dollars. 
Each million dollars in mergers and acquisi- 
tions increases PAC size by 51 dollars. The 
standardized regression coefficient for annual 
sales in 1980 is about half the size of the 
standardized coefficients for mergers and 
acquisitions and for defense contracts. The 
number of legal actions has the weakest effect 
on 1980 PAC receipts with a standardized 
regression coefficient of only.08, half the size 
of the standardized regression coefficient of 
annual sales and a quarter the size of the 


mi 
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Table 3. Pearson Product Moment Correlations d the Independent Variables with PAC Receipts for 1976 and for 


1980 


' Variable Name 


Resource availability: 
Annual sales 
Number of employees 
Return to investors in 1979 
Earnings per share in 1979 . 
Growth rate 1969-1979 
R&D/sales ratio 
Labor/sales ratio 
Free riders: 
Average concentration ratio 
Relative firm dominance 
Material, interests: 
Unique legal actions 
Defense contracts 
Mergers and acquisitions 
Previous political action: 
Business council membership 
Campaign donations of 
top managers in 1972 
Industry effects: ` 
Lumber and paper firms 
Chemical firms 
Oil producing firms 
Automotive firms 
Aerospace firms 
Pharmaceutical firms 


* Statistically Significant at P < .01. 


standardized coefficients for defense and 
mergers. 

The value of defense contracts and total 
value of mergers and acquisitions are also the 
most important predictors of 1976 PAC 
receipts. The standardized regression coeffi- 
cients for these variables are .235 and .249 
respectively, compared with .206 for total 


1976 1980 


PAC Receipts PAC Receipts 
2738* 3452* 
2714* 3638* 
— 0464 — 0440 
—.0272 — .0665 
.0206 0255 
— 0159 .0294 
— .1092* — 0563 
0797 .1011* 
1330* .1497* 
2895* 2015* 
3396* Am 
3257* 4238* 
1534* 1194* 
0445 0364 
0938 .0229 
0022 . —.0285 
1176* '— 4368* 
0179 .0729 
1060* .2496* 
—.0251 0581 


number of legal actions and .144 for Business 
Council membership: The influence of the 
merger and defense variables is smaller for 
1976 than for 1980 PAC receipts. Each 
million dollars in defense contracts a firm 
receives in 1979 adds 15 dollars in PAC” 
receipts. Each million dollars worth of- 
mergers and acquisitions translates into 15 


Table 4. Regression Coefficients from Stepwise Regression Equations Estimating the Size of Political Action 


Committees 
A aN a Legal Business 
Variable Adj. R? Constant Defense Acquisitions Sales Actions ‘Council 
1980 PACs, .3658 10574.20 .066371 50.90842 .001425 2697.916 N.A. 
N=500*- : (.0001)  [.365740] [.295975] [.16050] [.08149] 
(.0000) (.0000) (.0001) (.0227) 
. (.922504) {.907844} {.800301} {.871322} : 
1976 PACs .2529  — 538.388 015184 15:3056 N.A. 2274.0246 8070.2915 
N=500 (.5943)  [.234573] [.249475] [.206301] [.144436] 
: (.0000) (.0000) (.0000) (.0005) > 
{.902422} {.947165} {.916450} 


'* Means were substituted for missing values. 
Notes: Numbers in brackets are tolerance values. 


.815442) 


Numbers in parentheses are probability values from t-tests of the coefficients. 
Numbers in braces are standardized regression coefficients. 


N.A.: Not Applicable to this equation. 
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additional dollars in PAC money. Each legal 
action adds $2274 to 1976 PACs and $2697 
to 1980 PACs. Having a representative on the 
Business Council increases a firm's 1976 
receipts by $8070 but has no discernible 
influence on 1980 PAC size. 

The scatterplots. of the residuals and 
predicted values for the two models show 
evidence of heteroscedasticity. Heteroscedas- 
ticity inflates estimates of the standard errors 
(Blalock 1972). Thus the selection criterion 
used to calculate the models shown in Table 4 
is more conservative than the use of a .05 
alpha level for inclusion would normally 
indicate. I made several attempts to correct 
the heteroscedasticity using Freeman-Turkey, 
logarithmic, and square root transformations 
of the dependent and certain independent 
variables (as suggested by Weisberg 1980) 
and weighted least squares estimates employ- 
ing the reciprocal of annual sales (Weisberg 
1980) and the reciprocal of the absolute value 
of the OLS predicted values as weighting 
factors (Thiel 1971). The heteroscedastic 
condition improved only when all the firms 
without PACs were dropped from the analy- 
sis. However, when these cases were dropped, 
the tolerances for most of the variables in 
both equations plummeted dramatically, par- 
ticularly those for 1976 PACs. The low 
tolerances indicate the presence of severe 
problems with multicollinearity in the smaller 
samples (Berk 1977). The variables in the 
models estimated using the full sample of 500 
firms, on the other hand, all have tolerances 
of .8 or larger, well within the range of 
acceptability. 

The large number of zero cases in the PAC 
receipt variables indicates the models esti- 
mated here also suffer from censored depen- 
dent variables. This problem can sometimes 
lead to biased estimates and to an apparent 
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heteroscedastic condition (Maddala 1983). 
Tobit regression estimation procedures are 
recommended to correct this problem (Greene 
1987; Kennedy 1985; Maddala 1983). Table 
5 presents the tobit estimates for both the 
1976 and 1980 PAC full sample models 
shown in Table 4. The residual plots from the 
tobit regressions contained no evidence of 
heteroscedasticity. i 

However, with the exception of the defense 
contracts regression ccefficient for the 1976 
model (which is more than twice the size of 
the OLS estimate) and the annual sales 
coefficient for the 1980 model (which is half 
the size of the OLS estimate), the coefficients 
resulting from both estimation procedures are 
very similar. Despite the large absolute 
differences in the coefficients for the defense 
and sales variables, the relative magnitudes of 
the variables in the tobit equations remain 
essentially unchanged from those in the OLS 
models. Therefore, because the tobit results 
do not change the interpretation of the OLS 
data, and indeed reinforce it, I present the full 
sample OLS results here. 


DISCUSSION 


The data support only one aspect of the 
pluralist rational interest perspective that 
dominates the business political behavior 
literature, the importance of material self- 
interest as a determinant of business political 
activity. The value of defense contracts, the 
firm’s total value of acquisitions and mergers, 
and the total number of legal actions taken 
against the firm by regulatory agencies have 
important effects for both 1976 and 1980 PC 
receipts. However, the data provide consider- 
ably less support for all other propositions of 
this body of theory and research, including 
the hypothesized effects of the free rider 


Table 5. Unstandardized Regression Coefficients from Tobit Regression Equations Estimating the Size of Political 








Action Committees 





Dept. Legal Business 

Variable Adj. R? Constant Defense ^ Acquisitions Sales Actions Council 

1980 PACs N.C. 10571.0 .070421 47.8588 .000738 | 2792.17 N.A. 
N= 500* (.00001)  (.00000) (.00000) (.00133) (.00014) 

1976 PACs N.C. — 7774.75 .0388225 12.1566 N.A. 3309.05 7400.05 
N=500 (.00030)  (.00000) (.00019) (.00000) (.00000) 





* Means werc substituted for missing values. 





Notes: Numbers in parentheses are probability values from t-tests of the coefficients. 


N.A.: Not Applicable to this equation. 
N.C.: Not Calculated by statistical package. 
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‘problem, the availability of resources, the 
political activity of high corporate officials, 
and the influence of a firm’s industry on the 
level: of corporate political activity. The 
failure of the industrial concentration mea- 
sures; to significantly influence PAC size 
indicates that concern about free rider prob- 
lems by the firm forming or supporting a PAC 
is either very low or nonexistent. Considering 
that many competing perspectives, including 
both ' Pluralist and Marxist. posit material 
interest-seeking behavior by business as a 
central force behind business political action, 
the failure of this aspect of rational interest 
theory —the free rider problem —to influence 
PAC size casts doubt on the utility of this 
perspective for explaining business political 
behavior (Gold et al. 1975; Miliband 1977; 
Olson 1971; Whitt 1982). 

Nearly all theoretical perspectives posit 
firm size as a key determinant of business 
political behavior (Zald and Berg 1978; 
Siegfried 1980; Whitt 1979). Yet, this study 
finds that firm size, at least for large firms, is 
not a key variable in determining the level of 
firm political activity. Firm size is related to 
PAC activity mostly because major acquirers 
of other firms, large defense contractors, and 
corporate lawbreakers tend to be larger than 
other firms. Since the extant literature is 
heavily reliant on simple bivariate tests of the 
relationship between firm size and political 
activity, it is not surprising that firm size is 
given such importance in the literature (e.g., 
Ryan et al. 1987; Epstein 1980). 

The central reason why firm size is 
supposed to be important to PAC size is that 
larger firms have more resources to allocate to 
politics than do smaller firms (Siegfried 
1981). However, quantitative research con- 
cerning this proposition bas examined data 
about only the very largest of firms in the 
United States, Fortune 500, and other similar 
size firms. The enormous size of these 
multibillion dollar firms—Exxon's sales 
amounted to 79.1 billion dollars in 1979— 
militates against the idea that any of them 
would have insufficient resources to support 
even a very large PAC of 500,000 dollars or 
more. The fact that corporations can legally 
provide only in-kind support to their PACs 
Bives additional weight to this criticism. 
Therefore, it seems quite reasonable that both 
firm size measures as well as the other 
measures of resource availability do not have 
statistically discernible effects on PAC size. 


829 


Contrary to the prediction of Sabato (1984) 
and Matasar (1986), political activity among 
high-level managers of Fortune 500 firms 
does not reduce firm political behavior. 
Campaign donations by top managers to 1972 
national election candidates have no influence 
on PAC receipts for either 1976 or 1980. 
Membership of top managers on the Business 
Council significantly increases PAC receipts 
for 1976. 

The Business Council and other business 
policy and lobbying organizations such as the 
Business Roundtable and: the Conference 
Board are often perceived by students of 
corporate elites as groups with which only the 
most powerful business leaders are affiliated. 
Such participation is often regarded as 
indicating membership in the “power elite" 
of C. Wright Mills (1956) and the “inner 
circle” of Michael Useem (1984). The 
finding that membership on the Business 
Council increases 1976 but not later PAC 
receipts suggests that only the most powerful 
and active corporations were on the cutting 
edge of new forms of political action during 
the earliest history of business PACs. By their 
early involvement, the corporate leadership of 
America possibly gave this form of political 
participation a legitimacy for other members 
of the business community it had not had 
before. 

Previous theory and research PACs con- 
clude a firm's primary industry is an 
important determinant of political action. The 
current study indicates there are no significant 
effects on political behavior from either the 
primary industry to which a firm belongs or 
the position a firm holds within an industry or 
group of industries. Although bivariate arialy- 
sis does show different industries have 
different size PACs, M for example, the 
average Aerospace firm's PAC was 119,410 
dollars in 1980 compared with only 35,000 
dollars for timber and lumber firms, M once 
defense contracts and acquisitions and merg- 
ers are introduced into the analysis, the 
influence of a firm's industry disappears.’ 


: 7 The tratios from tobit models of 1976 and 
1980 PACs using just the oil-producing and 
aerospace firm variables are respectively 2.69 and 
2.10 for the 1976 model and 3.16 and 4.85 for the 
1980 model. When the merger and defense 
variables are added to these models, the t-ratios for 
the oil and aerospace firm variables decline to 1.79 
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Remarkably, even being a pharmaceutical 
firm, one of the most extensively regulated of 
industries, has no discernible effect on PAC 
size. 

Andres (1985), Matasar (1986), and Pitt- 
man (1977) all point to the importance of 
regulation as a determinant of a firm's 
political activity and to the substantial varia- 
tion in the amount of regulation between 
industries. The lack of significant industry 
effects in this analysis challenges the idea that 
generic material interests such as the total 
amount of regulation facing a firm or group of 
firms are important reasons why a business 
may be more or less politically active. Rather, 
the data indicate that some industries include 
firms with larger amounts of legal actions 
taken against them, defense contracts, and 
sizable mergers and acquisitions than others. 
For example, the reason aerospace firms on 
the average have larger PACS is not that they 
are aerospace firms but that aerospace firms 
on the average have larger defense contracts 
than do other types of firms. The same is also 
true for oil producing firms. In fact, the firm 
with the largest value of mergers and 
acquisitions is Mobil Oil with $4,468 billion 
worth of mergers. Motor Oil Hellas, a less 
well known oil producer, received the twelfth 
largest amount of defense contracts in 1979 
(U.S. Office of the Secretary of Defense 
1979). However, because the data set used 
here does not include some of the most 
extensively regulated industries, transporta- 
tion, utilities, and financial institutions, these 
results concerning the effects of duy 
category are not conclusive. 

The material interests found to be impor- 


tant in this research denote special ongoing . 


relationships between the most politically 
active firms and organs of the state. The top 
100 defense contractors all have been receiv- 
ing large contracts for many years, many 
since before World War I (Rosen 1973). The 
most active corporate acquirers have been 
aggressively purchasing other firms for most 
of the 20-year merger history used in this 
study. Research on the incidence of corporate 
crime (Clinard and Yeager 1979) reveals that 
the most criminally deviant corporations are 
the ones with histories of recidivism. These 
firms, with the richest history of interaction. 


and.1.61 in the 1976 model and 1. 898 and .050 in 
the 1980 model. 
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with the state, are the most politically active. 
Thus, while material interests are an impor- 
tant determinant of political action, this study 
suggests only interests tied to special long- 
term relationships with the state serve to 
increase the amount of political action taken 
by large firms. 

Another facet of the material interests this 
study finds to be important determinants of 
PAC size is their linkage to policies that, 
historically, have been differentially sup- 
ported by business and working-class political 
endeavors, the interests of business com- 
monly being antithetical to those of the 
working class (Domhoff 1970, 1983). Some 
researchers even suggest the use of PACs and 
other recently developed techniques of politi- 
cal action by business are the result of 
successful political action by nonbusiness 
classes in bringing about major changes in 
environmental laws, reducing defense expen- 
ditures, and otherwise decreasing the support 
of big business by the U.S. central govern- 
ment (Useem 1983; Meadow 1983). Some 
hint of the level of conflict between the 
business and nonbusiness classes is provided, 
by tbe fact that labor-related PACs gave a 
total of 31.9 million dollars to congressional 
campaigns in the 1980 and the 1982 election 
cycles, compared with 46.7 million dollars 
donated by corporate sponsored PACs. These 
contributions total about 57 percent of all 
money donated by PACs in the 1980 and 
1982 congressional elections (U.S. F.E.C. 
1983). 

This study reveals that the key to compre- 
hending the forces influencing a firm's or 
group of firms' level of political activity lies 
in understanding the nature and history of the 
relationship individual firms and the nonbusi- 
ness classes have with the state. The pluralist 
paradigm sees the political action of busi- 
nesses as largely determined by characteris- 
tics of the individual firm or groups of firms. 
The character of the interests that elicit 
political action by businesses and the failure 
of most of the variables measuring other 
phenomena suggested as significant by this 
perspective counsel an alternative explanation 
for corporate political action. However, it 
exceeds the scope of this paper to explore the 
ability of alternative models to explain the 
variation in corporate sponsored political ' 
action committee size. 
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CONCLUSION 


E 

This research has important implications for 
theories describing business political behavior 
and for theories purporting to explain the 
relationship of business to the state. The free 
rider problem as currently operationalized by 
PAC researchers and the availability of 
resources for political action do not seem to 
have much influence on the amount of 
political action a large firm undertakes. These 
results, coupled with the nature’ of the 
material interests found to be important for 
increasing the political. activity of large 
businesses, suggest that a strict rational 
economist interpretation of the determinants 
of business political action does‘ not ade- 
quately explain the phenomenon of business 
political action. 

Future research examining the: extent of 
business political behavior must look beyond 
the most obvious material interests of the firm 
such as tax reform and government regula- 
tion. These results suggest the focus of future 
research must include both the history. and 
types of relationships individual and groups 
of businesses have with the state. Moreover, 


-individual actions and tends in state policy- 


making should be of central interest ‘to 
scholars of business political behavior. While 
significant qualitative work has been done 
examining these facets of business political 
activity (see Domhoff 1970; Useem 1984; 
Whitt 1982 for examples), quantitative re- 
search examining business-sponsored PAC 
has yet to go significantly beyond cross- 
sectional analysis of firm-level rational interest- 
seeking behavior. > i 

The one constant indicated by all research 
approaches is that the rewards of political 
action are enormous, especially for those 
businesses which are the most active. Since 
1979 this nation has experienced the largest 
peacetime military buildup in its history. 
United States defense contractors have posted 
unprecedented growth in profits (Stubbing 
and Mendel 1986). Remarkable changes in 
government regulation of big business and, 
perhaps more importantly, the enforcement of 
regulations directly benefiting the. largest of 
firms have occurred (Edsall 1984; Tolchin 
and Tolchin 1983). More mergers ^ and 
acquisitions have taken place in the decade of 
the 1980s than in any period since the turn of 
the century. The real dollar value’ of these 


` mergers and acquisitions has been larger than 
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any time since the great merger wave at the 
end of the 1960s (Golbe and White 1988). It 
seems unlikely tbat coincidence can explain 
why those firms this study finds to be the 
most active politically—defense contractors, 
violators of government regulations, and the 
most active acquirers of other firms—are the 
ones who seem to have benefited most from 
the unprecedented changes in central govern- 
ment policies. 
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ESTIMATING SELECTION EFFECTS IN. 
,, OCCUPATIONAL MOBILITY IN A 19TH-CENTURY CITY* 


MELISSA A. HARDY 
' Florida State University 


This study examines the relationship between thé ` processes P occupational 
: mobility and persistence. Using manuscript census data from Indianapolis, 
1850-1860; I assess the importance of selection effects on the likelihood of upward 
or downward mobility and on the socioeconomic status achieved by 1860, relative. 
' 4o 1850 origin status. Alternative speculations have argued that out-migrants either . 
constituted a sort of “permanent floating proletariat” (and therefore were 


distinctly unsuccessful in terms of occupational advancement), or were particularly — 


" aggressive entrepreneurs (and therefore likely to be among the most successful). As ` 
an indirect test of these hypotheses, mobility and persistence were modeled as joint 
outcomes influenced by a common set of “unobserved”: variables., The analysis of 


' Indianapolis data shows that the sorting processes of occupational mobility and ` ` 


nonpersistence were affected by a similar set of observed characteristics, but that 


unmeasured variables'exerted no net effects. 


Beginning - in the 1960s, the "new urban 
historians" utilized federal manuscript census 
schedules and other archival resources as a 
basis for investigating occupational status and 
career patterns in 19th-century cities (e.g., 
Blumin 1969; Knights 1971; Thernstrom 
1973; Glasco 1978; Griffin and Griffin 1978). 
In general, these studies concluded that the 
adult male populations were marked by 
considerable flux both geographically and 
occupationally — although Grusky's (1986) re- 
‘cent reassessment suggests that the 19th- 
century rate of career. mobility was only about 
half as great as today’s rate. However, 
because the data bases of those studies were 
constructed by record linkage and could 
include information only for residents who 
were present and located during successive 
enumerations, migrants were neglected. Spec- 
: ulations about the out-migrants—their subse- 
quent occupational successes, their motiva- 
tions for leaving,’ and so forth—typically 
conceived of the migration as, a sorting 
process that at least tended to be equilibrating 


. * Direct all correspondence to Melissa Hardy, 


Department of Sociology, Florida State University, . 


Tallahassee, FL 32306-2011. 

The data file was constructed primarily from the 
.1850 and 1860 federal manuscript census sched- 
ules of Marion County, IN. I wish to thank Mary 
Mathis for her assistance in expanding the data file 
and Lawrence Hazelrigg, Charles Nam, Aage 
Sorénsen, and anonymous reviéwers for their 
' comments on earlier versions of this paper. 
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.of wage and price and/or other labor-market 


differentials. One of' the more extreme 
speculations reasoned that the out-migrants 
may have constituted a sort of “permanent. 
floating proletariat” (Thernstrom 1973, p. 42) 
who mostly drifted from one city to another. 
Conversely, the out-migrants may have been 
especially -"entrepreneurial in spirit” and 
among the most successful. More likely these 
characterizations are much too broadly painted; 
it may well be that, then as today, city 
differences in income, employment, and other 
labor-market characteristics were not related 
in any simple way, if at all, to the flows of. 
intercity- (or interregional) migration. ` 
The importance of the question of 19th- 
century city out-migrants (and, correspond- 
ingly, in-migrants) is twofold. In addition to 
the compositional side of the question —-who 
the out-migrants were, why they left, what 
their subsequent career records were, and so 
on—it is important to consider the selection 
effects of out-migration on estimates of career 
mobility among the persisters. If the out- 
migrants, were; part of a permanent floating 
proletariat, estimates based on only the 
persisters may overstate the’ amount of 
mobility: experienced by city residents— 
unless, of course, that “floating proletariat” 
phenomenon was sufficiently common that a 
city’s in-migrants were largely drawn from it. 
Or, if the out-migrants were unusually 
entrepreneurial and among the most success- 
ful, persister-based. estimates could paint too 


American Sociological: Review, 1989, Vol. DM (October-834 -843) 


4 


D 
Į 


SELECTION EFFECTS IN OCCUPATIONAL MOBILITY `` ` 


'dpproximation" of selection effects of out- 


pessimistic a picture of caféer advancements — 
especially if the city’s in-migrants, who 
typically exceeded in’ number the out- 
migrants during any given interval, shared 
that same “spirit” and record of success. 
The major aim of this paper i is to attempt an 
answer to that question‘of “selection. effects” 
by analyzing census data for one city, 
Indianapolis, Indiana, 1850 to 1860.1 India- 
napolis was then making the transition from a 
small isolated town to a major wholesale and 
retail commercial center, a transformation 


made possible by factors associated with the. 


introduction of the railroad. By 1855 India- 
napolis had become the railway hub of eight 
major lines: Tadiating outward froni the 
nation's first union depot to Chicago, Detroit, 
Cleveland, Cincinnati, Louisville, St. Louis, 


and points beyond. During the 1850s the . 
population doubled in'size, and changes in the’ 


occupational structure increasingly reflected 
the importance of industrial activities in an 
emerging industrial-commercial center. But 
Indianapolis was also a gateway to the great 
expanse of still unsettled area to the west, and 
to cities just beginning to take shape; it was a 
city in which the channels for both occupa- 
tional and geographic mobility were consider- 
able: (Hardy 1978). The mid-1800s were a 
time of rapid and profound social change in 
many American cities, and Indianapolis was 
not unique in these respects. However, 
relative to more established east-coast cities 
such ‘as Thernstrom’s Boston or Blumin’s 
Philadelphia, the rate of population growth 
was greater and the’ transformation from a 
craft-oriented to an industrial job structure 
was in an earlier and more rapid stage of 


; ee: 


ESTIMATING OUT-MIGRATION, 
1850 TO 1860 


This analysis can n yield at best only a a "first 


3 





'- ! The data file consists of the entire adult (i.e., 


'16--) male population of Indianapolis in 1850, as 
'enumerated in the manuscript census schedules of 


' the Seventh U.S. Census. Record linkage to the: 


1860 enumeration was by visual inspection of the 
entire set of manuscript schedules of the Eighth 
Census. Dodd, Talbott, and Parsons’. Indianapolis 
City Directory, and Business Mirror for 1862 was 
used as a supplemental source for record linkage 
(though in fewer than 10 percent of the linked 
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migration. One reason is the paucity of 
information available in the manuscript cen- 
sus schedules. Moreover, as the alternative 
scenarios depicted above indicate, adjusting 
mobility estimates for selection effects should 
attend to the in-migrant as well.as the 
out-migrant population. But historical census 
data included no retrospective information, and 
data files constructed from record linkages 
have typically been "forward looking"; only 
the selection effects of out-migration can be 
addressed. Further, since the basic operation 
of.record linkage is the identification of 
persisters, out-migrants are defined residu- 
ally, but the residual category is not homoge- 
neous. It includes cases of mortality, cases of 
enumerator error, and no doubt at least a few 
cases of error in the record-linkage process. 
There is no direct way of segregating these 
various categories in the data file. But the 
situation is far from hopeless, as we shall see. 

Nearly 7 of every 10 of the 2,337 adult 
males enumerated in' Indianapolis in 1850 
could not be located in the 1860 enumeration 
or in the 1862 city directory. To anyone not 
acquainted with studies of 19th-century cities, 
a nonpersistence rate of 69 percent may seem 


astoundingly high. It.is not. Parkerson's 


(1982, p. 102) review of published data from 
record-linkage studies of 68 separate commu- 
nities shows that in 40 of the communities the 
10-year rate of nonpersistence ranged be- 
tween 60 and 80 percent. The average rate for 
all 68 communities was 62 ‘percent. The 


.question is, how much of that 69 percent was 


actually due to out-migration? Some reason- 
able estimates can be constructed. 

First, mortality. Using the Coale-Demeny 
Parkerson 'estimated a 
decadal mortality among white males, 1850— 
60, of nearly 15 percent. But that estimate 
was based on a life expectancy at birth (37.3 


: years). Since the base population of interest 


here was aged 16 or older in 1850 (past the 
high-risk years of infancy and childhood), 15 
percent is surely too high. Barrow's (1980) 
calculations for late-century Indianapolis sug- 
gest that 5 to 8 percent of the residents did not 
survive to the next decennial courit. Assum- 
ing: from Barrows’. calculations, an outside 
limit of 10 percent, it seems likely that at 
most only one of every seven cases of 
nonpersistence was due to mortality. 

. The combined effect of.enumerator and 
record-linkage error can be estimated indi- 
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rectly with the aid of Parkerson’s study of the 
1855 New York State Census, which included 
a question asking all residents how long they 
had lived in their current community of 
residence. Parkerson concluded that the 
record-linkage studies overstated nonpersis- 
tence by about two-fifths. Whereas the 
average rate among the 68 record-linkage 
studies was 62 percent, it was 44 percent 
among the New York communities. Further- 
more, among communities that experienced 
rapid growth, the discrepancy was not .62 — 
.44 = .18 but .34. Since the population of 
Indianapolis doubled during the 1850s, this 
larger figure will be adopted as a maximum 
estimate of the extent to which nonpersistence 
was overstated in the record-linkage study of 
Indianapolis. But this figure of .34 includes 
cases of mortality as well as enumeration and 
record-linkage error. Subtracting the mortal- 
ity component (no more than .10) leaves an 
estimate of at least .24 attributable to 
enumeration and record-linkage error. 

Thus, it is reasonable to conclude that no 
more than 10 percent of the 2,337 adult males 
enumerated in 1850 died before the 1860 
count, 35 to 40 percent were out-migrants, 
and the remainder (50 to 55 percent) were 
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persisters, although as many as two-fifths of 
these persisters were missed by the census 
takers or by the record-linkage process. While 
the data source does not afford. means by 
which to segregate or reclassify these latter 
cases, that inability is not particularly signifi- 
cant to the major aim of this paper—assessing 
the effect of "sample selection" on estimates 


of occupational mobility — since all sources of- 


selection effect are relevant. It limits what can 
be said specifically about the selection effects 
of out-migration, as distinguished from the 
selection effects of undetected persisters. But 
whether out-migrant or undetected persister, 
if one or more of these categories of men 
differed in mobility-related ways from the 
men who were captured by the enumeration 
and record-linkage processes, estimates of 
career mobility based on only the latter could 
be seriously biased. 


ESTIMATING A PERSISTENCE MODEL 


As reported in Table 1, persisters and 
nonpersisters significantly differ in composi- 
tion on nearly all available measures. The 
differences are consistent with findings from 
other studies of 19th-century cities. Nonper- 


Table 1. Comparisons of Persisters and Nonpersisters in 1850 


Rate of 
Nonpersistence 
Total 69% 
Nativity: Indiana 7396 
Other U.S. 64 
Foreign 74 
Race: White 68 
Nonwhite Tl 
Age: 16-19 80 
20—29 76 
30—39 65 
40-49 57 
50+ 60 
Mean 
s.d, 
Marital: Married 58 
Not married 81 
Occupation: Professional 55 
Major P,M,O 46 
Clerk, sales 61 
Petty P,M,O 44 
Skilled 7 
Semiskilled 68 
Unskilled 78 
Duncans SEI: Mean 


s.d. 





Compositon* 
Persisters Nonpersisters 
(N=730) (N= 1607) 

14% 17% 
63 52 
23 30 
97 95 

3 5 

6 l1 
31 44 
29 24 
21 12 
13 9 
34.9 30.9 
11.5 11.3 
72 46 
28 54 

9 5 

6 2 

8 6 
14 5 
40 48 

8 9 
15 25 
32.0 24.4 
22.7 18.0 





* The compositional comparisons (columns 2 and 3) arc signficant at the .001 level for all variables except race. 
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Table 2. Rates of Nonpersistence (cell values) by Age, Marital Status, and White-Collar versus. Blue-Collar 








Occupation 
White-Collar 
Age Married Unmarried 
16-29 46 .66 
30+ 43 .69 
All ages 43 67 


sisters were more likely to be unmarried, 
foreign-born (predominantly German and 
Irish), younger, and of lower status. In 


general, men in occupational categories that- 


included property ownership had the lowest 
rates of nonpersistence, followed by . the 
professionals and then the clerks and sales- 
men. Blue-collar men, especially the un- 
skilled, had the highest rates. Marital compo- 
sition is partly confounding of those 
differences: for example, 84 percent of the 
major proprietors, managers, and officials, 
and 74 percent of their “smaller” counter- 
parts, were married, which contrasts with 
only 36 percent of the clerks and salesmen, 
54 percent of the skilled and 46 percent of the 
semiskilled and unskilled men. However, 
even after marital status is controlled, some 
notable occupational differences remain. Un- 
skilled laborers had the highest rates of 
nonpersistence regardless of marital status, 
although the rate among single skilled work- 
ers was about as high. And while the 
occupation-specific rates of nonpersistence 
among single men were generally in excess of 
70 percent, the rate for petty (but not major) 
proprietors, managers, and officials was a 
comparatively low 57 percent.? 

The age gradient of nonpersistence rates 
(column 1, Table 1) approximates the age 
gradient of rates of internal migration re- 
ported in present-day studies (e.g., Long and 
Boertlein 1976)—a high rate among those in 
their late teens and 20s, followed by rapid 
diminution of rate. (The small up-tick in 
nonpersistence among the Indianapolis men 
aged 50 and older is most likely a manifesta- 
tion of the mortality component of the 
nonpersister category.) However, as the 
nonpersistence rates (cell values) in Table 2 





^ This difference between petty and major 
proprietors, managers, and officials is partly 
compositional: a larger proportion of the “petty” 
than of the “major” category consisted of 
proprietors as opposed to managers and officials. 





Blue-Collar 
` Total Married Unmarried Total 
59 .70 .83 719 
47 61 .84 .66 
.51 : .64 .83 nB 


show, it was not age so much as marital status 
that discriminated the persisters from the 
nonpersisters. Whether young or old, unmar- 
ried blue-collar men disappeared in droves. 
Of course, single unskilled men were undoubt- 
edly among the most “invisible” to census 
takers, but it is also highly probable that a 
great many of these men had actually left 
Indianapolis. After all, married white-collar 
men aged 30 or older would have been among 
the most visible to enumerators, and 43 
percent of them were nonpersisters. Since 
there is good reason to believe that young, 
single, blue-collar workers were more likely 
than older, married, white-collar men to 
move on to another city, our estimated overall 
rate of out-migration (35 to 40 percent) 
probably means that at least half of the single. 
blue-collar men were out-migrants.’ l 

Results from probit estimations reported in 
Table 3 confirm that marital status is a 
significant and strong net predictor of persis- 
tence, as are foreign nativity and white-collar 
status. Blue-collar men in general were more 
likely to have out-migrated or to have been 
overlooked by enumerators, even after taking 
into account the effects of marital status and 
foreign nativity. This model provides the 
means of assessing selection effects of 
nonpersistence on the probability of career 
mobility. 


SELECTION EFFECTS IN 
OCCUPATIONAL MOBILITY 


Career mobility, 1850 to 1860, is measured in 
two ways. First, using Thernstrom's typol- 
ogy, I examine the probability of upward or 
downward mobility across the seven catego- 
ries previously nominated, assuming ordinal- 
ity among them. Because of the floor and 
ceiling effects inherent in this definition of 
mobility (see footnote 5), I also examine the 
distribution of status distances between 1850 
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Table 3. Probit Estimation of Persistence Model 


Eq 1 Eq 2 
Constant* —.46 -47 
(.09) (07) 
-5.02 — 6.40 
Foreign-bom —.16 —.17 
(.08) (.07) 
' -2.12 —2.29 
Not married -.51 ~ 51 
(.07) (.07) 
—7.20 -7.7 
Age: 16-29 ~ 06 ~ 07 
(.07) (.07) 
—.91 —1.02 
Professional Al i 
' (.14) 
i 2.94 
Major P,M,O .49 
(18) 
f 2.78 
Clerks, sales. .52 
(.14) 
3.58 
Petty P,M,O .69 
(13) 
5.38 
Skilled 12 
(.09) 
1.39 
Semiskilled 22 
(.13) 
1.66 
Duncan SEI .008 
(.002) 
5.179 
Log-likelihood —1107 —1080 ` 
N 1941 1894 


"The reference category for foreign-born is all 
U.S.-born; for age, 30 years and older; for occupation, 
unskilled. 


and 1860 occupations by relying on Hauser’s 
mapping of occupational titles into Duncan’s 
index.?' 


<’ The use of scales such as Duncan’s SEI with 

historical data has been criticized for alleged 
insensitivity to long-term changes in status hierar- 
chies. Hauser's.analysis argued that, even though 
the use of 20th-century SEI scores carries some 
bias, use of alternative scales developed by 
historians on the basis of in-depth analyses of 
specific urban sites also raises the question of bias; 
he concludes that. "true occupational status in the 
twentieth century is about as good an indicator of 
true occupational status in the nineteenth century 
as are any of the five-city ratings" (1982, p. 118). 
Hauser (1982, p. 122) also estimated a correlation 
of .88 between occupational status in the mid- 19th 
century and prestige in 1925. Blau and Duncan 
(1967, p. 120) had previously reported a correla- 
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To allow for correlation of the processes of 
occupational mobility and persistence, I 
estimate a series of two-equation ‘probit 
models. The first equation specifies the 
mobility process by assessing the impact of 
workers’ occupational location and demo- 
graphic characteristics on the likelihood of 
upward (or downward) mobility relative to 
occupational stability. The second equation 
specifies the selection process that defined the 
group of workers whose occupational mobil- 
ity was assessed. The two-equation model 
allows joint estimation of both selection and 
mobility equations without enforcing the 
assumption of zero correlation between un- 
measurable factors that might have been 
influencing the likelihood of both persistence 
and mobility.* 


tion of .93 between 1925 and 1963 prestige 
rankings. William Form has noted in a personal 


- communication that he “found a gross overestima- 


tion of skilled workers in the pre-1900 censuses 
and almost no way to be certain whether many 
occupations were ‘semiskilled’” (see Form 1985, 
pp. 92-93). The classification scheme used in the 
Bureau's tabulations did change a number of times 
after 1870, as did the amcunt (and no doubt the 
standardization of quality) of information solicited 
by enumerators (see Conk 1978). The present 
study is based not on the Bureau's tabulated data, 
however, but on the manuscript schedules and the 
specific occupational titles there recorded by 
enumerators, Following Thernstrom's (1973) clas- 
sification scheme, more than 85 percent of the 
cases I coded as "skilled workers" consisted of the 
following occupational titles: carpenter, brickma- 
son, plasterer, painter, blacksmith, carriage or 
wagonmaker, harnessmaker, saddler, tailor or 
hatter, shoemaker, and printer. Other titles in- 
cluded gunsmith, millwright, foundryman, baker, 
silversmith, bookbinder, and jeweler. In any 
number of the cases, the accuracy of the nominal 
attribution may be doubted, of course. A self- 
attribution of "carpenter," for instance, may have 
been an exaggeration of the respondent's "real" 
skill status of "carpenter's assistant." With such 
questions we simply face the limits of the data. 

* This technique expands specification of the 
mobility equation by including a latent variable 
that is based on unmeasured characteristics related 
to persistence. Rho provides an estimate of the 
correlation between the probabilities of mobility 
and persistence, net of the influence of specified 
independent variables. Estimating the SES models 
follows the same logic, using lambda to denote a 
hazard rate that captures the instantaneous proba- 
bility of being excluded from the persister 
population, conditional on membership in the risk 
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Table 4. Probit Estimation of Upward Mobility, with and without Selection 


without selection 
1, 2 : 3 4 5 6 

Constant — 49 -1.68 ` —3.09 —1.37 —2.07 —3.10 
: (.06) (.21) (53) (.06) (.23) (.68) 

—8.20 -1.63 —5.80 —24.46 —9.10 -4.53 

Petty P,M,O 1.58 1.61 1.58 1.61 
(.27) (.28) (.28) (.28) 

5.79 5.83 5.69 5.69 

Skilled 1.01 99 91 99 
(.23) (.24) (.25) (.25) 

436 . 4.21 3.63 3.97 

Semiskilled 1.47 1.58 1.35 1.58 
i (.29) (.31) (.33) (.32) 

5.08 5.15 4.08 4.97 

Unskilled 1.73 1.79 1.50 1.79 
(.25) (.27) (.35) (.30) 

m 6.84 6.53 4.31 5.99 

Race: White .95 .95 
: (.41) (.40) 

2.33 2.34 

Foreign-born —.15 —.15 
(16) (17) 

-.91 ~ 88 

Age: 16-29 78 8 
(.30) (.35) 

2.57 2.23 

30-49 47 46 
(.30) (34). 

1.55 1.38 

Log-likelihood —300 —264 —256 — 1285 — 1260 — 1253 

Rho .94 47 00 
(.08) (.26) (.40) 

12.18 1.80 00 





Probit estimations of models predicting the 
probability of upward and downward mobility 
are reported in Tables 4 and 5, respectively.5 
In each table, equations 1—3 describe models 


pool; the coefficient of lambda estimates the 
association of the two processes, net of specified 
factors. 

5Since none of the men in the top two 
occupational categories was upwardly mobile (for 
professionals this was by definition; for the major 


‘ proprietors, managers, and officials it was proba- 
bly an effect of career tracking and therefore 


virtually definitional), the top three categories were 
collapsed into one reference category. For down- 
ward mobility the unskilled and semiskilled 
categories were collapsed into a reference cate- 
gory. Also, initial specification of the models 


included marital status, but it: proved to be 


excessively collinear with other variables (espe- 
cially age) and was therefore deleted. 


without adjustment for selection effects. 
Equation 1 fits only a constant, showing the 
probability of mobility (upward or downward) 
relative to immobility. Thus, on average the 
relative probability of career advancement 
was .31 (associated with a Z-value of — .49), 
while the relative probability of career decline 
was about half that large, at .16 (ie., a 
Z-value of — 1.01). Equation 2 assesses the 
probability of upward or downward mobility 
for specific occupational categories; equation 
3 adds demographic variables to the model. 
These results are straightforward, but four 
specific findings can be noted. First, although 
race was not a significant predictor of 
persistence, whité men (95 percent of the 
1850 total) were more likely upwardly 
mobile, net of other factors. Second, foreign 
nativity predicted persistence but not mobil- 
ity. Third, whereas marital status was a strong 
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Table 5. Probit Estimation of Downward Mobility, with and without Selection 


























without selection with selection 
1 f 2 3 4 5 6 

Constant -1.01 —1.68 -1.23 —.09 -222 —1.87 
(.08) (.27) (.45) (.24) (.44) (.56) 

— 13.24 — 6.26 —2.70 —.38 — 5.07 —3.34 

Professional .83 .83 .99 1.05 
(34) (.40) (.33) (.39) 

2.46 2.09 2.96 2.68 

Major P,M,O 42 51 64 79 
(.41) (.46) (.43) (.49) 

1.02 1.10 1.50 1.62 

Clerks, sales 1.14 1.14 1.25 1.30 
j (.35) CA1) (.34) (.39) 

3.25 2.79 3.71 ' 337 

Petty P,M,O 1.44 1.46 1.58 1.65 
(.33) (.40) (.32) (.36) 

4.42 3.85 5.00 4.52 

Skilled .38 40 42 50 
(.30) (.36) (.29) (.36) 

1.26 1.10 1.46 1.46 

Race: White -.31 —.35 
(.53) (.52) 

-4À1 —.68 

Foreign-born 7,39 —.43 
(27) (.26) 

— 1.46 — 1.65 

Age: 50+ —.21 —12 
(.18) (.20) 

—]1.14 -.62 

Log-likelihood — 170 -152 — 150 — 1006 —992 —989 

Rho —.65 41 45 
(.13) (.36) (.40) 

—4.95 1.14 1.14 





predictor of persistence, it did not. have 
separately estimable net effects on mobility. 
Fourth, while age had no net effects in the 
model of persistence, younger men (16-29) 
were more likely upwardly mobile, net of 
other factors.$ 

Such conclusions are hardly novel (cf. the 
previously cited studies). The question of 
central interest, however, turns on the charac- 
 teristics or dispositions of workers that could 
not be specified in either the persistence or 


Because downward mobility was less frequent 
than upward mobility, it was not possible to 
discriminate among the three age-groups. The 
contrast between young and middle-aged workers 
(a combined reference group) and older workers 
was maintained to see whether old age (relative to 
the age structure of the time) increased a worker's 
vulnerability to downward mobility. 


the mobility equations and whether unmea- 
sured traits that increased the probability of 
persistence also increased (or decreased) the 
probability of career mobility. If the out- 
migrants, for example, end perhaps as well 
those overlooked by enumerators, were char- 
acterized by some unmeasured personality 
traits associated with career advancement 
(e.g., "entrepreneurial spirit"), the two- 
equation estimations should show this interre- 
latedness as a negative correlation (rho) 
between the error terms or the persistence and 
mobility equations. If, however, it is the 
persisters who possess traits conducive to 
career advancement, rho should be positive in 
the case of upward mobility and negative in 
the case of downward mobility. Equation 4 
assesses the impact of unmeasured factors 
that increase the likelihood of persistence on 
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the probability of career mobility. For upward 
mobility the value of rho is large, positive, 
and clearly significant; that is, unmeasured 
factors. associated with career advancement 
also increased the probability of persistence. 
Moreover, the estimate for the constant is a 
larger negative number than in equation 1; 
that is; the probability of career advancement, 
net of unmeasured factors related to persis- 
tence, was very small (only .09). The results 
for downward mobility correspond: unmea- 
sured traits that decreased the risk of 
nonpersistence were associated with a lower 
risk of downward mobility. 

Equations 5 and 6 successively introduce 
the occupational categories and the demo- 
graphic variables. As a result, the value of 
rho is. no longer significantly different from 
zero. What's more, the parameter estimates 
of equations 5 and 6 are virtually identical to 
those of equations 2 and 3, for both upward 
and downward mobility. In sum, if we think 
of career mobility as a process that selects 
workers with characteristics favorable to job 
performance or productivity, we can questiori 
whether workers possessing these traits were, 
on the whole, more likely to remain in 
Indianapolis during this 10-year period, or 
more likely to have been lost to out- 
migration, enumerator error, or mortality. 
The evidence favors the persisters, showing 


that unmeasured factors related to persistence 
were positively related to the likelihood of 
upward mobility and negatively related. to 
downward mobility, but that these factors, 
which were distinct from the effects of 
occupational and demographic characteristics 
in predicting persistence, were mediated 
through (or at least correlated with) origin 
occupation and demographic traits in predict- 
ing mobility. 

Since collapsing all nondiagonal elements 
of the mobility table into two categories 
(upward or downward mobility) neglects 
information on within-category differences, I 
also estimated models using Duncan's index 
to assess selection effects on the status 
distance traversed through upward or down- 
ward mobility. Average improvement in job 
status was 3.8 points; the positive value is 
consistent with the predominance of upward 
over downward mobility, but the distribu- 
tional characteristics of the distance measure 
suggest incremental rather than dramatic 
changes in job status for-workers, on average. 

Results of this analysis (Table 6) corre- 
spond to those of Table 5. The positive 
lambda coefficient in equation 3 indicates that 
unmeasured worker characteristics that in- 
creased the probability of persistence also 
predict a gain in status during the 10-year 
period. However, the estimate is significant 


Table 6. OLS Estimation of Duncan's Index with and without Selection 


without selection 


Constant 11.04 

(1.12) 
. 9.88 
Duncan 1850 -78 
(.03) 

27.68 

Race 

Foreign-born 

Older 

lambda 

R 58 


with selection 
2 1 2 

4.16 3.07 —2.89 
(3.68) (4.78) (6.03) 
1.13 .64 —.48 
7] .82 .80 
(.03) (.04) (.04) 
25.50 21.72 20.71 
8.17 8.13 
(3.76) (3.73) 
2.18 2.18 
— 1.00 — 1.04 
(1.62) (1.61) 
-.62 —.65 
—1.70 —1.19 
(1.43) (1.46) 
—1.19 —.8i 
5.93 5.20 

(3.45) (3.54) 

1.72 1.47 


58 -58 58 
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` only by a one-tailed test (.05 level), and once 
-. the additional variables are specified (i.e., 
race),? the coefficient for lambda is further 
reduced in size and precision. 


DISCUSSION 
Researchers have speculated that, because of 


compositional differences between persisters. 


and nonpersisters on traits relevant to the 
career-mobility process, studies based only on 
persisters yield biased estimates of mobility. 
. However, results of the present study leave 
` little room for doubting the mobility estimates 
on grounds of selectivity bias. Once the 
likelihood of mobility was conditioned on 
demographic traits, the two-equation esti- 
mates converged with the single-equation 
> estimates, and the error correlation across 
, equations converged to Zero. 

The general theoretical point is simple. 
Selectivity on an identifiable factor does not 
in itself necessarily entail bias in subpopula- 

_tion estimates; the nature of the selection 
' process must be considered. Nonrandom 


‘explicit selection, as in exclusion of observa- - 


‘tions because of a threshold on the dependent 
variable, leads to biased and inconsistent 
regression estimates. But when selectivity is 
incidental, as in this study, it is difficult to 
predict the correlation between disturbances 
of the -selection equation and those of the 
“primary” equation—unless one can identify 

. factors that have been omitted from both 
equations and that are orthogonal to the 
specified regressands (Berk 1983). 
"Both career mobility and nonpersistence 
(out-migration as well as enumerator error 


and mortality) are selection processes; the’ 


question of interest is whether the processes 
operated by the same or by different criteria. 
Thernstrom's hypothesis of a "permanent 
- floating proletariat” implies the processes 
were regulated by different criteria: less 
skilled workers who had little chance of 
advancement wandered from place to place, 
in a futile search for better opportunities. This 
.. thesis, buttressed by evidence of high rates of 
blue-collar nonpersistence, seems to suggest 
that whereas a city’s working class fit 





~ T Equation 4 provides further support for the . 


race effect detected in Table 5. Net of origin 
status, the 1860 SES of white workers averaged 8 
points higher than that of black workers. 
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Schumpeter's (1955, pp. 127-29)’ general 
description of "class"—"a hotel or an 
omnibus, always full, but always of different 
people”—this was partly because of the 
existence of a large, regionally based contin- 
gent of mostly permanent drifters. However, 
the pessimism of Themstrom’s speculation 
about the prospects of migrating workers may 
not have been warranted. At least the present 
Study gives reason to doubt it. 

Since nonpersisters tended to be younger 
and of lower status—traits: associated with 
career advancement— support for the hypoth- 
esis of a permanent floating proletariat would 
have to come from characteristics of persis- 
tence that were not included either in 
Thernstrom's data or in mine. But the models 
estimated here allowed for mobility effects of 
unmeasured as well as measured traits. In 
other words, the selection processes of 
persistence and mobility were modeled as 
joint outcomes produced partly by common 
unmeasured variables. The results gave no 
indication of net effects of any unmeasured 
traits. In short, there seems to have been 
nothing distinctive about the nonpersisters 
that entailed for them a competitive disadvan- 
tage in chances of career mobility. That the 


. Donpersisters were numerous and usually : 


young, unmarried, and of lower status is 
indisputable. But taken as a whole they fit 
Schumpeter's "omnibus" depiction no less 
than did the working-class persisters of 
Indianapolis. Indeed, it seems likely that the 
out-migrants, rather than being permanent 
mernbers of a floating proletariat (or lumpen- 
proletariat), settled elsewhere, probably to the 
west, and subsequently experienced career 
trajectories that, on the whole, mirrored those 
of their counterparts who remained in India- 
napolis. The latter point is speculative, of 
course. It is almost impossible to track 
out-migrants of 19th-century cities, but rele- 
vant, evidence could be gained indirectly by . 
comparing a city’s in-migrants to its nonper- 
sisters in terms of demographic composition 
and to the persisters in terms of mobility. 

It would also be useful to replicate the 
present analysis with data from one or more 


- 19th-century cities differing from Indianapo- 


lis in size, age, and stage of economic 
development in order to determine whether ` 
the results reported here are generalizable. 


MELISSA A. HARDY is Associate Professor 
of Sociology and Research Associate with the: 


P 


tá. 


Institute on Aging at Florida State University. 


“In addition to expanding her study: - of 
“historical patterns of occupational mobility, 
` she continues her. research on retirement, the . 


changing structures of opportunity and con- 
straint faced by older workers, patterns of 


individual decision making, and questions of 


intergenerational equity. 


REFERENCES 


Barrows, Robert G. 1980. "Hurryin' Hoosiers and 


the American ‘Pattern’: Geographic Mobility in 
' Indianapolis and Urban North America.” Social 
Science History 5:197—222. ae 


Berk, Richard A. 1983. “An Introduction. to 


Sample Selection Bias in Sociological Data.” 
American Sociological Review 48:386-98. 


‘Blau, Peter and Otis Dudley Duncan. 1967. The 


American Occupational: Structure. New York: 
Wiley. 

Blumin, Stuart. 1969. “Mobility and Change in 
Ante-Bellum Philadelphia.” Pp. 165-208 in Nine- 
teenth-Century Cities, edited by Stephan Thern- 
strom and Richard Sennett. New Haven: Yale 
University Press. 


"Conk, Margo. 1978. “Occupational Classification 


in the United States Census: 1870-1940.” 
- Journal of Interdisciplinary History 9:111—30. 


' Form, William. 1985. Divided We Stand: Working- 


Class Stratification in America. Urbana and 
Chicago: University of Illinois Press. 


Glasco, Laurence. 1978. “Migration and Adjust- 


ment in the Nineteenth-Century City.” Pp. 
- 123-75 in Population in Nineteenth-Century 
America, edited by Tamara K. Hareven.and 
Maris Vinovskis. Princeton: Princeton Univer- 
sity Press. '"- n 


. SELECTION EFFECTS IN, OCCUPATIONAL MOBILITY 843 


Griffin, Clyde and Sally Griffin. 1978. Natives 
and Newcomers: The Ordering of Opportunity in 
Mid-Nineteenth . Century Poughkeepsie. Cam 
bridge: Harvard University Press. i 

Grusky, David B. 1986. “American Social Mobil- 
ity in the 19th and 20th Centuries.” CDE 
Working Paper 86-28. Madison: Center for 
Demography and Ecology, University of Wis- 
consin. 

Hardy, Melissa A. 1978. “Occupational Mobility 
and Nativity-Ethnicity in Indianapolis, 1850- 
60." Social Forces 57:205-21. : 

Hauser, Robert M. 1982. "Occupational Status in 
the Nineteenth and Twentieth Centuries." His-. 
torical Methods 15:111-26. 

Knights, Peter R. 1971. The Plain People of 
. Boston, 1830-1860. New York: Oxford Univer- 
sity Press. 

Long, Larry H. and Celia G. Boertlein. 1976. 
“The Geographical Mobility of Americans.” 
Current Population Reports, Series P-23, no. 

. 64. Washington, DC: U.S. Government Printing’ 
Office. 

Mare, Robert D., ‘Christopher Winship, and 
Warren N. Kubitschek. 1984. “The Age Pattern 

- of Employment." American Journal of Sociol- 
ogy 90:326—58. ` 

Parkerson, Donald H. 1982. “How Mobile Were 
Nineteenth-Century Americans?” Historical 
Methods 15:99-109. 

Schumpeter, Joseph. 1955. Imperialism ana Sorrel: 
Classes. New York: Meridian. 

Thernstrom, Stephan. 1973. The Other ‘Bosto- 
nians. Cambridge: Harvard University Press. 
U.S.‘ Bureau of Census. 1980. “Geographical 
Mobility: March 1975 to March 1979." Current 
Population Reports; ‘Series P-20, no. 353. 
Washington, DC: U.S. Government Printing 

Office. 


ACQUIRING CAPITAL FOR COLLEGE: 
THE CONSTRAINTS OF FAMILY CONFIGURATION* 
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We investigate the influence of size of sibling group and ordinal position on 
financial arrangements for college. While this analysis has implications for the 
Status attainment and human capital perspectives, it also represents a direct test of 
the resource dilution hypothesis. Analyzing a sample of 3279 subjects from the 
National Longitudinal Survey of the High School Class of 1972, we estimate the 
association between the number of siblings and (1) the likelihood of parental 
support of students in college, (2) the amount of parental contributions, and (3) the 
proportion of total college costs borne by parents. In all cases, a strong inverse 
relationship is found. Conversely, large sibship sizes increase the dependence upon 


-funds from extrafamilial sources, such as loans, scholarships, employment, and 


savings. Effects of birth order, although less pronounced, signify a financial 
advantage to latter-born children. Moreover, sibship structure affects the 


likelihood that youths cite financial barriers for not attending college. 


The relationship between sibling structure and 
educational/occupational success has captured 
scholarly imagination since the late 1800s 
with Dumont’s “law of capillary action” 
(1890) and Galton’s commentary on the 
“superiority of the eldest” (1874). This 
research investigates the association between 
sibship configuration and parental financial 
support for their children’s postsecondary 
education. Although inquiry into college 
financial arrangements should attract social 
scientists, few have undertaken this course of 
study. Evidence on the financial constraints 
preempting qualified students from entering 
college is also limited. Research tying sibling 
structure to parental monetary inputs is 
scarcer still. In this analysis, we focus not 
only on sibship size but also on the effect of 
ordinal position. Concurrently, socioeco- 
nomic background, gender, and ability/ 
achievement are included as adjunct factors 
potentially related to collegiate funding. 
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THEORETICAL AND 
EMPIRICAL BACKGROUND 


Unlike its European counterparts, the Ameri- 
can system of higher education assumes 
contributions from the family toward meeting 
college expenses. Since parental support is 
voluntary, this assumption does not automat- 
ically translate into actual parental financial 
aid. Moreover, the ability of parents to meet 
this financial obligation has been seriously 
compromised as the inflation rate of college 
costs has surpassed that reported in other 
basic areas such as food and transportation 
(Hauptman and Hartle 1987; Leslie 1984). 
Although some commentators forecast con- 
tinued and even expanding federal interven- 
tion as a solution to spiraling college costs 
(Kaestle and Smith 1984; Keppel 1987), the 
predominant view is that the current political 
climate augurs poorly for future governmental 
involvement (Leslie 1984). Critics of antici- 
pated federal cutbacks foresee reduced col- 
lege enrollment, perpetuation of class, sex, 
and race distinctions, and ultimately dimin- 
ished productivity and research potential 
(Saurman and Riccucci 1983). Others specu- 
late that federal retrenchment will be offset by 
more vigorous participation of state govern- 
ments, postsecondary institutions, and the 
private sector (Saurman and Riccucci 1983). 
Still others forecast that finances will increas- 
ingly be shouldered by students and their 


. families, despite reservations expressed by 
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parents about their ability to subsidize such 
endeavors (Leslie 1984). l 

Curiously, social scientists have rarely 
studied the conditions under which parents 
arė able and willing to support their children’s 
postsecondary education. This gap is espe- 
cially surprising given the obvious implica- 
tions for the status attainment model, which 
accounts for educational, occupational, and 
economic success as a function of socioeco- 
nomic background, ability, and intervening 
social-psychological factors (Blau and Dun- 
can 1967; Sewell and Hauser 1976; Alwin 
and Thornton 1984). Moreover, the relevance 
to human capital theory, which has played a 
distinctive role in studies of investment in 
education, is evident (Becker 1964). Identify- 
ing the conditions under which parents 
sponsor their children’s education may be 
useful in understanding how social class 
differences are reproduced across genera- 
tions. 

In this analysis, we investigate the link 
between sibship structure and college financ- 
ing. Although frequently included in status 
attainment studies, the number of siblings 
typically is relegated a secondary role to 
socioeconomic indicators and ability. This is 
an unwise practice. The negative effect of 
sibship size on a variety of educational 
indicators is one of the most consistent 
patterns observed in the literature (Blake 
1986; Steelman and Mercy 1980; Smith 
1984), 

One path through which sibship size 
conceivably affects educational level is vis- 
a-vis financial support for college. Our 
specific argument fits under the broader 
resource dilution hypothesis first outlined by 
Anastasi (1956) and recently resurrected by 
Blake (1981). According to this hypothesis, 
the sheer number of dependents in the family 
necessarily depletes the amount of finite 
resources channeled to any given child. Curtis 
(1986) adds that appropriation of resources 
ineradicably binds the family to social 
inequality because families vary not only in 
the resources they have but also in the number 
of members among whom goods and services 
are to be divided. Although the resource 
dilution hypothesis frequently is used as an ad 
hoc explanation for the deleterious impact of 
sibship size on a wide range of educational 
phenomena, it seldom is tested directly. 

While our work concentrates on economic 
resources, other researchers stress cultural 
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and social resources, such as parental atten- 
tion directed toward, time spent reading to, 
and books available’ to children (Murnane, 
Maynard, and Ohls 1981; Teachman 1987; 
Liebowitz 1977; DiMaggio 1982). Social 
scientists also have examined the mediating 
role of social resources to account for the 
negative relationship between the size of the 
sibling group and ability. To illustrate, it has 
been documented that children in large as 
compared to small sibships spend less time 
with their parents and, furthermore, engage 
less often in intellectually profitable activities 
(Mercy and Steelman 1982; Lindert 1974). 
It is difficult to distinguish between mate- 
rial and social resources and it is equally 
problematic to disentangle the relative influ- 
encé of each. Past studies accentuate the 
social realm, . perhaps because they are 
pitched to outcomes for youths of precollege 
age. Economic and social resources may exert 
different influences on educational outcomes 
at various points over the life cycle. Social 
and cultural resources provided by parents 
may more profoundly shape educational 
output during childhood and adolescence.! ` 
College, however, may bring the salience 
of economic resources to fore. A college 
education is neither required nor subsidized 
fully by the government. At this point, when 
the prospect of parental financial assistance 
looms large, family size may further screen 
who will and who will not advance education- 
ally. College expenses are one facet of the 
family budget wherein resource drainage is 
absolute. Expenditures on other items such as 
clothes, toys, and books are not as exacting 
on the pool of familial resources because 
these items are less expensive and can be 
shared among or handed down to other 
children. Social/interactional investments also 
overlap. For example, parents may read to or 
play games with more than one child 


! Although Blake (1986) demonstrates persua- 
sively a strong effect of sibship size on educational 
attainment, her argument that sibship size effects 
are stronger on high school attainment than on 
college attainment requires updating. Blake's 
sample overrepresents men who completed their 
education before the upswing in high school 
graduation and in college attendance rates experi- 
enced in the 1970s and 1980s. With nearly 
one-third of high school graduates attending 
college, the effects of sibship size on college 
attendance may well be larger than in the past. 
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simultaneously. Moreover, enhancing ‘the 
intellectual skills of older siblings may pay 
off eventually as they provide intellectual 
stimulation and tutelage to younger siblings. 
In financing college, however, resources 
spent are not recoverable. 

In addition, ordinal position may alter 
parental support. Whether birth order affects 
academic performance is unsettled (Cicirelli 
1978; Steelman 1985; Hauser and Sewell 
~ 1985). The direct effect of ordinal position on 
the allocation .of economic resources is of 
interest here. Several patterns are plausible. If 
there is a socioemotional or normative 
preference for firstborns, this preference may 


surface in economic support. Moreover, if . 


parental decisions on educational investments 
are based in part on economic returns to the 
investment, parents may invest in firstborns 
in anticipation of an earlier financial return. 
` Although preferences or expectations may 
‘boost the odds of parental support for 
firstborns, the relative availability of funds 
may benefit latér-born children. While , the 
firstborn attends college, parents still are 
encumbered by the daily costs of supporting 


his or her younger siblings. Conversely, when | 


younger siblings reach college age, parental 
. financial obligations ordinarily decline. Out- 
lays to the youngest child may represent the 
last major financial obligation parents shoul- 
der. Moreover, parental income may peak 
when later-born children matriculate. 


© RELATED LITERATURE 


Studies positing economic support as an 
intervening mechanism to account for the 
negative impact of sibship size on’ highér 
educational outcomes fall into three catego- 
ries, In the first category, the relationship i is 
assumed but not tested. Adams and Méidam 
(1968), for example, suggest relationships 
- among 'sibship size, college attendance, and 
parental support but did not test explicitly 
whether large sibship sizes depress the 
probability of parental assistance. 

" Studies in the second category spotlight 
other characteristics (e.g., familial income) 
while including sibship structure as a back- 
ground factor. Unfortunately, these studies 
ill-advisedly have used number of siblirigs 
in college to tap sibship structure (Leslie 
1984). Not surprisingly, null or miixed effects 
are observed because the number of siblings 


in college is used in lieu of total sibship size. 
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The former measure may haye offsetting 
effects on parental support. On the one hand, 
a sibling in college could signify parental 
willingness to support offspring. On the other 
hand, the number of siblings in college may 
diminish a family's financial wherewithal. 
The more appropriate concept to evaluate is 
the total number of siblings. 

- Most pertinent is the third category, where 
studies incorporate ordinal position and sib- 
ship size to predict parental support. Taub- 
man and Behrman's work (1986) represents 
the' most successful effort in this category. ` 
They contended that as sibship size increases, 
students rely less heavily on parental contri- 
butions and more heavily on loans and 
scholarships. They attributed the absence of a 
birth-order effect to the modest sample size. 
Although thoughtfully conceived, this study 
has several shortcomings compromising gen- 
eralization. First, academic ability, which is 
related to sibship structure and may influence 


‘parental decisions to provide financial sup- 


port, was not taken into account. The 
exclusion of ability inadvertently may exag- 
gerate or underestimate the effects of sibship 
structure. Second, the-small sample consists 
of adult offspring of war veterans born 
between 1917 and 1927. Consequently, the . 
years during which the youths attended 
college vary widely. Lack -of common 
histories potentially contaminates the findings 
because college costs changed yearly (which 
Taubman and Behrman adjusted in part by 
standardizing costs to a 1975 baseline), as did 
the availability of governmental assistance 
and other social conditions influencing col- 
lege attendance (e.g., college deferments 


‘during the Vietnam, War). Third, whereas 


they examined . whether: sibship structure 
affects parental support for offspring attend- 
ing college, they did not test whether sibship 
size affects those not matriculating. Thus it 
cannot be ascertained whether sibling struc- 
ture creates .financial obstacles that thwart 
college attendance. 

Previous research identifies several factors 
that should be related to the existence and 


level of parental support. Because they may 


be confounded with sibling structure, these 
variables are incorporated in this inquiry. 
Socioeconomic factors such as income and 
education are inversely related to fertility and 
also may affect parental backing (Powell and 
Steelman 1989). Similarly, the intellectual: 
level of the youth, which is - inversely 
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associated with family size - (Zajonc “and 


..Markus 1975), may positively affect. the 
extent of economic resources allocated. That 


Taubman and Behrman find gender effects on 
parental support justifies the inclusion of 


~. gender into’ our model. Ha 


In sum, we expect that, as the number of 
siblings increases, the probability and the 
extent of familial support decline. Con- 
versely, as sibship size increases, the depen- 
dence upon alternative funding. strategies 
(e.g., loans) increases. Those not attending 
college more likely attribute their decision to 
financial barriers. if they come from large 
sibships. Despite competing influences, later- 
borns receive more financial support than 
firstborns because familial income is higher 
and burdens are fewer. These patterns should 
occur net of academic ability, parental 
income and education, and sex. 


DATA AND METHODS 


Data 


` Our data are derived from the National 


Longitudinal Survey of the High School Class 
of 1972 (NLS).. Collected for the National 
Center for Education Statistics, NLS uses a 
highly stratified two-stage probability sample 


‘of high school seniors from both public and 
private schools in the United States. While : 


the base year data in 1972 provide informa- 
tion on background characteristics, academic 
outcomes, and academic and occupational 


aspirations, the follow-up surveys report on: 


educational and occupational, attainment in 
early adulthood. Because the NLS offers 
detailed information on how students financed 
their college education as well as reasonably 
complete data on sibling configuration, aca- 
demic achievement, and family background, 
these-data are well suited to answer the questions 
posed in this research.? 

Because we focus primarily on the 'effects 
of sibling configuration on the likelihood of 


parental support in college and the implica- . 


? The sampling design of NLS could suggest the 


, need to weight the data. Yet recent work using NLS 


discoünts this possibility because, weighting yields 
negligible differences (Teachman 1987), especially 


when restricting one's analysis to college matricu- - 


lants (Rindfuss, Swicegood, and Rosenfeld 1987). 
As a result, the analysis presenien here i is based on 


i eee data. 


model by adding other p. ale EN 
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tions of this relationship, we consider subjects. 
who. entered college in the fall of 1972. We 
restrict Our analysis to students (approxi- 
mately 40 percent atterid college immediately 
upon graduation) who provided complete data 
(a reduction of 26 percent), yielding a sample | 
of 3279 cases.? Although the bulk of this | 
research evaluates college students, we also 
examine other. youths to ascertain their. 
explanations for not attending college. 


Variable Measurement dnd Analytic Strategy 


The means and standard deviations of the key 
variables are provided in Tablé 1. We 
examine two. structural parameters of the 
sibship: sibship sizé and ordinal position. 
Sibship size is the total number of brothers - 
and sisters. Ordinal ‘position is measured by 
the number of liblings older versus younger `` 


, than the subject (see Paulhus and Shaffer . 


1981).* Background variables included in the 
model are: sex (females coded as 1), students' 
éstimates of parental incorne (in thousands of 
dollars), father's and mother's education. 
(from 1, less than high-school, to 5, 
postgraduate degree), and achievement (mea- 
sured by scores on four math, verbal, and | 
analytical ability sections on a standardized 
test developed by the Educational Testing 
Service. 2:0 


! 





3 Similar to other studies using NLS, the 
reduction in the number of cases, as a result of our 


"restriction to subjects with complete 'data, is not 


trivial. However; we 'also conducted supplemen- 


‘tary analyses using pairwise techniques and 


replacing missing values with the variable mean. 
Both sets of analyses yield the same general 
patterns as those presented in this paper, thus 
attesting to the robust nature of our results. 

* Although’ the tables presented include number 
of. younger and older siblings as. exogenous 


“variables, we also have (1) added firstborn, 


last-born, and only-born status (as dummy vari- , 
ables)'to the model; (2) used number of rs 
and. number of older siblings in the model, 
practice which generates coefficients derivable 
from the models already presented in the paper; . 
and (3) added the number of siblings currently in 
college to our basic model. While these alterna- 
tives obviously produce different coefficients, - 
none of the models changes the substantive 
conclusions of this paper. 

* Although our analyses include central back- 
ground characteristics, we also have; jded:: our 
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Table 1. Variables, Descriptions, Means, and Standard Deviations 





Variable Description of Variable Mean SD 
Father's education Less than high school - 1; . 2.79 1.37 

High school degree = 2; 

Some postsecondary education = 3; 

College degree = 4, 

Postbaccalaureate degree = 5 
Mother's education Same as father's education 2.63 1.04 
Achievement Standardized test scores (in 10's) 22.34 2.59 
Family income In thousands of dollars 12.90 5.35 
Sex Female —1 : 47 .50 
Sibship-size Number of &iblings 2.67 1.97 
Older siblings Number of older siblings 1.20 1.37 
Younger siblings Number of younger siblings 1.47 1.56 
Parental contribution Yes=1 67 AT 
Dollar contribution Amount of support in dollars 826.48 1123.92 

(by all parents) i 
Proportion contribution Amount of support as a proportion 46 55 
of total costs (by all parents) . 
Continuation Continued into second year of .84 i .37 
college (Yes = 1) } i 

Graduation : Yes=1 . .39: .49 
Loans In dollars 172.86 440.40: 
Scholarships In dollars 296.74 662.35 
Savings/employment In dollars 425.79 558.64 


The sample restriction to college students 
(as well as our later sample of nonmatricu- 
lants) yields a nonrandom subset of the 
population. This in turn potentially produces 
a problem of sample selection bias (see 
Heckman 1976, 1979; Berk 1983). Because 
variables linked to the likelihood or amount of 
parental contribution may also be related to 
the likelihood of attending college, the 
uncorrected regression estimates may be over- 
or understated. This problem can be allevi- 
ated by developing a model that incorporates 


ethnicity (coded as Black, Asian-American, His- 
panic, and other), educational plans and aspira- 
tions, paternal occupation, and whether the mother 
worked. The inclusion of these variables does not 
change the patterns discussed in the paper. 
Moreover, separate analyses for males and females 
indicate no significant sex differences in the effect 
of gibship structure on parental contribution. 

It is unclear whether performance on the 
standardized test is a measure of “ability,” 
“achievement,” or “developed ability.” We have 
completed additional analyses using alternative 
measures of academic achievement/ability, namely, 
high school ranking (in percentile) and grade point 
average. Our choice of test performance is based 
on the slightly better fit obtained. As is consis- 
tently the case in our supplementary analyses, the 
use of high school ranking and/or grade point 
-average results in patterns similar to those 
discussed in the paper. 


_ the selection process into the substantive 


mode] (Heckman 1979). After comparing 
several models, we estimate the likelihood of 
attending college as a function of sibship 
structure, grade point average, educational 
expectation, sex, and income. This model 
explains over 38 percent of the variance.- 
Based on the predicted values, a hazard rate, 
which gauges the likelihood of being ex- 
cluded from..the sample, is calculated. The 
inclusion of the hazard rate in all subsequent 
equations should reduce the bias in the 
estimates. Despite our confidence that the 
model provides a reasonable correction for 
sample bias, other selection models are 
plausible. Thus, we considered alternative 
selection models by adding and substituting 
other variables as predictors of the likelihood 
of attending college, bv. calculating alterna- 
tive hazard rates, and by including these rates 
in the subsequent equations of interest. 
Additionally, models with no correction for 
sample selection bias were estimated. Our 
results are quite robust. Regardless of the 
approach taken, the strong effects of sibship 
structure as reported hereinafter emerge. 

Our investigation of the college sample in 
the NLS centers around the probability of 
parental support. First assessed is a dichoto- 
mous variable differentiating between parents 
who did and did not provide financial 
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assistance during the student's first year in 
college. This variable is based on answers in 
the 1973 NLS First Follow-up questionnaire: 
“In paying for these costs (of training and 
schooling during the first year after high 
school), how much came from each of the 
following sources?" The likelihood of paren- 
tal assistance is estimated using logistic 
regression equations. . 

We then evaluate the extent of parental 
contribution, using a tobit (censored regres- 
sion) model (Tobin 1958; Maddala 1983). 
This model is concerned not only with the 
dichotomy (i.e., contribution vs. no contribu- 
tion) but also with the amount of contribution 
when it exceeds zero. The amount of the 
contribution is measured in two ways. The 
first is the absolute amount of parental 
contribution (in dollars) whereas the second is 
the proportion of total college costs paid by 
the parents. i 

We also consider whether sibling struc- 
ture modifies the likelihood and amount of 
alternative funding sources: whether students 
used loans, scholarships, and their own 
savings and earnings. Since we are interested 
in the nonlinear effect of the sibship structure 
on the likelihood of using these other sources 
and the linear effect of sibship structure ọn 
the dollar amount used, we once again 
employ tobit analysis. Using logistic regres- 
sion, we then test whether sibling structure 
directly and indirectly (via parental contribu- 
tions) affects the likelihood of persistence in 
college beyond the first year and of gradia- 
tion from college within four years upon high 
school graduation. 

Finally, our research examines the finan- 


. cial considerations among those who do not 


attend college. Although we analyzed all 
relevant attitudinal questions on why people 
did not obtain a college degree or seek further 
education, we confine our discussion to 
subjects not attending college in 1972 (using 
the first follow-up inventory). They were 


. asked: "Here are some reasons others have 


given for not continuing their formal educa- 
tion right after leaving high school. Which of 
these reasons apply to you?" We are 
interested in whether they "needed to earn 
money before I could pay for further 
education" and whether they “could not 
afford a four-year college or university 
education." As in the case of those attending 
college, the use of this subsample also 
requires a correction for selectivity (i.e., 
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calculation of hazard rate) in the subsequent 
logistic analyses. 


RESULTS 
Sibship Structure and Funding One's 
College Education 


. Does sibling structure affect the likelihood that 


parents provide financial assistance to their chil- 
drén during the freshman year in college? The 
first column of Table 2 confirms our prediction 
of an inverse relationship between the number 
of siblings and the likelihood of parental sup- 
port. Based on the coefficient, each additional 
sibling translates into a 15 percent reduction in 
the odds (nonlogged) of parental assistance. 
However, based on the second column of Table 
2, the number of younger siblings has a greater 
negative effect (a reduction of 18 percent in the 
odds of parental support per younger sibling) 
than the number of older siblings (a reduction 
of 11 percent), suggesting an advantage to later- 


Table 2. Logistic Regression Coefficients for the Likeli- 
hood of Parental Contributions on Sibship 
Structure and Background Characteristics 


Coefficient 





Coefficient 
Variable. (Std. Error) (Std. Error) 
Father's education —.033 — —.028 
(.041) (.041) 
Mother's education |o .26* .126* 
(.051) (.051) 
Achievement .024 .028. 
(.020) (.020) 
Family income .l06*** .107*** 
(.010) (.010) 
Sex Alger "Alsen 
f (.087) (.087) 
Sibship size -.l166*** 
(.022) 
Older siblings —.120*** 
(031) 
Younger siblings — .197*** 
(027) 
Hazard rate —]1:5199«* —]1.510*** 
(.453) (.454) 
Intercept —.534 —.622 
L?-L? 10 10 
La? 1 
Proportion Yes .67 .67 
* p « .05. ` 
** p « 0l. 


*** p « 00l. 
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bom children. Family income also exerts a pow- 


erful influence on parental assistance. Compar- 


. ing one family earning $10,000 more than 


another, a youth from the first family will have’ 


nearly three (2.89) times greater odds of receiv- 
ing parental support than his or her less finan- 
cially advantaged counterpart. Income is the 
sole variable that hds’a more pronounced effect 
than sibship size. Sex and maternal education 
evince smaller, albeit significant, effects, while 
ability and father's education do not approach 
conventional levels of statistical significance. 
"While Table 2 inquires only into . the 


` likelihood of parental assistance, Table 3 also ' 


provides evidence on the: magnitude of 


* support, by parents. The tobit coefficients 


. Suggest that sibship size:has a very strong 


negative influence on the amount of parental 
aid, © whether the amount is, measured in 


. absolute dollars or as a proportion of total 


college costs. Moreover, the effect of the 


: number of younger siblings is over twice that 


of the number of older siblings. 


We: also should emphasize, the strong. 


positive effects of income and the mixed 
effects-of education and achievement. That is, 


- . test scores are positively linked to the dollar 


amount of parental aid but negatively related 


to the proportion of parental support. We 


attribute this seeming contradiction to a 
pattérn found in auxiliary analyses: students 
with bigh test scores are more likely than 


‘others to attend expensive: schools and obtain 
' sizàble scholarships. Even if parents provide 


a large absolute amount of support to their 
children at costly institutions, the. amount as a 
proportion may be relatively low. This 
explanation also reconciles the ostensibly 
inconsistent results on maternal education, 


-- which is linked positively to dollar amount of 


* 


contribution but not to proportion. 

: If parental support is contingent upon sibship 
structure, how do sibship size and ordinal po- 
sition affect alternative funding strategies? Ta- 
ble 4 presents tobit parameters predicting three 
methods of defraying college costs: loans, schol- 
arships and grants, and individual savings and 
employment.? The effect of sibling structure is 
quite clear: the number of siblings significantly 


7 We have disaggregated the last category into 


. .Savings, work during summer, and yearlong em- 
” ployment. Tobit estimates .of these: subcategories 


and logistic regression estimates of the likelihood 
of loans, scholarships, and/or money from savings 
and employment are available upon ca ae 
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increases the use of loans; however, the posi- 
tive effect is mainly realized through the num- 


_ ber of younger siblings. Indeed, the number of 


older siblings does not have a significant effect . 
on the acquisition of loans. High test scores 
also increase the probability and amount of loans 
in part becaüse students with higher scores at- 


.tend expensive colleges. In contrast, income 
. depresses the likelihood and magnitude of loans. 


Scholarships and grants .are positively 
associated with the number of siblings. 


~ Again, the effect of the number of younger 


siblings exceeds that of the number of older, . 
siblings by twofold. As ‘suggested earlier, 
standardized test performance is among the 
most powerful predictors of scholarship 
support. And, as the criteria for -obtaining . 
scholarships include economic need in combi- 
nation with academic potential,'a very strong 
negative link between familial income and 
scholarships is evident. 

As for 'self-initiative, the number of 
siblings increases the necessity of using one’s 
own resources and “working one’s way 
through college.” Further, the positive effect - 
of the number of younger siblings is three 
times that ‘of the number of older siblings. 
Interestingly, the magnitude of the signifi- , 
cance of sibling size (and, even more notably, 
the number of younger siblings) exceeds that . 


‘of achievement and income and is surpassed 


only by the influence of gender, with females 
saving and earning less than males. 

We next consider whether sibship structure . 
directly and indirectly (via parental contribu- 
tion) affects the probability of continuing in 
college beyond one’s freshman year and of grad- 
uating within a standard four-year period.’ As 
expected, data in Table 5 indicate that both 
dependent variables are based on achievement 
and probability of parental contribution.? If par- 
ents provided funds, the odds of continuing into 
the second year and of graduating are increased 
by 49 percent and 59 percent, respectively. 
While size of sibling group (and ordinal posi- 
tion) indirectly influences continuation and grad- 
uation via pae. contribution, its ante ef- 


8 Not only have we checked for the effect of 


.parental contribution on continuation beyond the 


first year, but we also have done the same analyses 
on continuation beyond the second and third years, - 
reaching similar conclusions. 

? Models using amount or percentage of parental 
contribution yield the same general findings as 
those using the likelihood of parental support. 


e 


ad. 


ve 


“ee ‘ . L 
soo * 


E 


ACQUIRING CAPITAL FOR COLLEGE 


851 


Table 3. Tobit Coefficients (Standard Errors) for Amount. of Parental Contribution or Sibship . Structure and 





*** p < 001. 


fects virtually disappear. So too does the direct ` 


effect of income, although an indirect effect of 


income is manifested via achievement and pa- 


rental contribution. 
' While the foregoing analysis demonstrates 


, that sibship structure alters funding strategies 


for college students, it does not address 
whether sibship structure affects those who do 
not attend college. Table 6 provides logistic 
regression parameters estimating the likeli- 
hood that nonmatriculants cite "the need to 
earn money before I can pay for further 
schooling" or “I could not afford a four-year 
college or university education.”!° In both 


' cases, sibship size has a powerful positive 


effect. An affirmative response for each 
question is linked positively to sibship size, 
with a stronger relation to number of younger 
siblings than to number of older siblings. ) 


t 


` 10 One question that is not included i in Table 6 is 
noteworthy. We estimated the logistic regression 


, parameters for the likelihood of using the follow- 
, ing reason. for not attending college: 


support my family." Although sibship size.is not 
significant here, being a firstborn significantly 
increases the likelihood of citing this rationale. 


"need to | 





Background Characteristics 
: DEN Dollar LOO i Proportion Contribution + 
‘Coefficient . | Coefficient ù . + Coefficient ` Coefficient 
Varable ' $ (Std. Error) (Std. Error) < (Std. Error) (Std. Error) 
.. Father's education | 25167 ..30.519 |: 00 005 
`, (25.119). 3 . (25.134) (.011) (011) 
Mother's education ^ 95.721** 94.695** © 012 zÜ CUm 
E (30.298) (30.235) (.013) (013) 
Achievement < 30.764* . 33.213" -Qes —010-,. 
l (12.523) (12.522) , (.005) (.006) 
Family income 108.486*** 108.380*** EN 0476 
. l (6.153) ` 6.137) (.003) G 003) ` 
Sex 265.7589 . . 261.128*** Moses .140*** 
0l (53.543) (53.456) (024) . (.024) 
Sibship size —122.105*** — .065*** 
(14.497) , . .(.006) 
Older siblings —16.669**» — 039*e* 
(20.867) (.009) 
Younger siblings —153.63]*** —.083*** , 
. <.” (08.633) . . (.008) 
Hazard rate — 1885. 880*** —1870.6859**  ..  —.199 . — :189 
(325.345) (324.661) (142). (.142) 
Intercept ' —1005.582 — 1082.508 119 078 
* p < .05. l 
** p< 01. y 


Would those not in college have received . 
parental aid had they chosen to attend 
college? We estimated the likelihood’ of 


` parental support by first developing à model 


to predict support for all respondents, adjust- 

ing for sample selectivity. Using.this equation - 
and substituting the means for those not 
attending college, we find that the likelihood 
of parental support would be exceptionally 
low, less than a 20 percent chance. The 
likelihood for those who needed “to earn 
more money before I can pay for further ' 
schooling" was even lower, 15 percent. In 
other words, students not attending college 
„are reasonably accurate in their appraisal. of 


' the likelihood of parental support. 


. DISCUSSION 


The above findings are clear. We observe a 
decidedly negative impact of sibship size:o: 

. parental financial assistance and a arate ve 
effect of sibship size on the use of alternative . 
funding sources. Moreover, we find a posi- 
tive relationship between number of siblings 


, and the probability that financial drawbacks 


D 
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Table 4. Tobit Coefficients for Loans, Scholarships, and Savings/Employment on Sibship Structure and Background 


are cited as reasons for not enrolling in 
college. For nearly all of the models exam- 
ined, sibship size ranks either first or second 
in relative influence, placing its effect nearly 
on par with that of familial income. 

That later-born children are favored with 
respect to parental backing must be viewed 
cautiously. These findings do not necessarily 
mean that parents prefer later-born children. 
Rather, in comparison to those born earlier, 
later-born children may simply reach college 
age at a more opportune time in the family 
life cycle when parents have assets to divest 
or when heavy investment is not seen as 
debilitating to the family budget. Neverthe- 
less, because we relied on interfamilial data, 
this finding is provisional. Intrafamilial data 
would provide a more rigorous test of 
birth-order effects. 

The other variables included in the analysis 
are related to college funding in the expected 
direction, with the possible exception of sex. 
Females acquire greater parental assistance 
than males and are less likely to use funds 
obtained from savings or employment. This 


finding is not consonant with the human’ 


Characteristics 
Loans Scholarships Savings/Employment 
Coefficient ' Coefficient Coefficient Coefficient Coefficient ' Coefficient 
Variable (Std. Error) (Std. Error) (Std. Error) (Std. Error) (Std. Error) (Std. Error) 
Father's education — 43.320 — 50.011 — 45.790 —51.643 10.326 — 12.834 
(40.163) (40.297) (30.396) - (30.433) (13.823) (13.841) 
Mother's education 4.712 3.953 47.751 47.753 —5.757 — 5.424 
(48.343) (48.421) (36.705) (36.664) (16.679) (16.663) 
Achievement 44.995* 41.985* 115.294** 112.709*** 25.367*** 24.117*** 
(19.379) (19.389) (15.256) (15.240) (6.867) (6.871) 
Family income -—86.180***.  —87.561*** .—115.856*** .—116.898***  —10.918*** .—]11.117*** 
(9.947) (9.995) (7.652) (7.665) (3.307) (3.305) 
Sex 58.601 62.803 —93.452 — 88.589 —224.813*** —222.626*** 
(83.165) (83.163) i (63.636) (63.548) (29.303) (29.279) 
Sibship size 86.222*** 96.103*** 45.262*** 
(20.706) (15.891) (7.550) 
Older siblings 25.515 41.068 21.313 
(29.919) (23.064) (10.898) 
Younger siblings 129.049*** 132.380*** 62.088*** 
(25.784) (19.739) (9.359) 
Hazard rate —2261.750*** —2245.120*** — 3024.970*** —3044.911*** —691.579*** — 699 229*** 
(510.809) (508.966) (400.194) (399.458) (169.091) (168.896) 
Intercept — 562.601 —458.413 — 825.088 — 726.965 138.668 181.804 
* p< .05. 
** p < 0l. 
*** p < .001. 


capital argument that males, because of their 
greater earnings potential, will receive a 
disproportionate share of parental support. 
Rather, if their daughter attends college, 
parents may feel more obligated to support 
her either because males have easier access to 
employment opportunities to pay for college 
or because socialization priorities require 
greater evidence of independence from males 
than females. 

These findings accentuate the need for and 
point toward new directions to enhance the 
status attainment model. Despite the spate of 
studies inspired by this model, as well as the 
claims by some critics that the model has 
been studied to the point of saturation, this 
work shows that some core relationships have 
been overlooked. If the uncharted parts of the 
status attainment model are now explored, 
much still is to be gained in terms of 
understanding educational and occupational 
attainment and their by-products. 

Finally, our results buttress the resource 
dilution explanation. No longer is it sufficient 
to pose this explanation ex post facto. Our 
results establish that sibship structure influ- 


ug 
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Table 5. Logistic Regression Coefficients for the Likelihood of Continuation in College beyond the First Year and of 
Graduation on Parental Contribution, Sibship Structure, and Background Characteristics 


Continuation . Graduation. 
Coefficient Coefficient Coefficient "Coefficient 
Variable (Std. Error) (Std. Error) (Std. Error) (Std. Error) 
Parental contribution .400*** .394*** .A63*** A56*** 
(Yes= 1) C116) : (.116) (.097) (.097) 
Father's education .021 .025 .048 .055 
(.052) (.052) (.039) (.039) 
Mother's education .002 .002 - 005 004 
(.063) (.063) : (.048) (.048) 
Achievement : .028 .031 .138*** .142*** 
(.024) (.023) (.021) (.021) 
Income — .022 —.021 . —.016 — 016 
(.012) (.012) (.010) (.010) 
Sex — .236* —.233* ` .273** .267** 
(.108) (.108) (.084) (.084) 
Sibship size — .015 —.005 
(027) (.023) 
Older siblings .025 054 
(.039) (.033) 
Younger siblings — .044 — 046 
(.033) (.028) 
Hazard rate -5.316*** —5.309*** —5.176*** —8.207*** 
(.490) (.490) (.495) (.739) 
Intercept 3.284 3.199 — 1.000 —1.091 
L2-L? 0 " 42 12 
L2 d dl 
Proportion Yes 84 84 39 39 
* p « .05 l 
** p « Ol 
*** p < 001. 


ences parental support which, in turn, deter- 
mines the likelihood of attendance at, persis- 
tence in, and graduation from college. It is 
unfortunate that research has typically ignored 


' economic resource diminution especially in 
light of the promising literature on the 


implications of sibship structure for social 
resources. At this point in our sketchy 
knowledge of what inputs affect youths, it 
would be foolhardy to argue that any type of 
resource takes precedence over others. Simi- 
larly, it is imprudent to assume that sibship 
effects on resource allocation are uniform 
regardless of the type of the resource. For 
example, whereas later-born childrén may be 
penalized by receiving less parental attention 
than their older siblings, our study suggests 
that later-born children benefit by gaining 
more economic resources, such as college 
aid, from their parents. Whether the alloca- 
tion of economic resources is skewed toward 


later-born children always, periodically, or 
only in the case of college funding also 
should be evaluated. We should consider 
whether the effects of the intellectual, eco- 
nomic, and social resources, as well as the 
dependence of resource allocation on sibship 
parameters, vary in their intensity across 
different points of the life cycle. Unti] we 
recognize the multidimensional, dynamic 
nature of resource allocation and until we 
move size and other parameters of sibship 
.Structure beyond the category of subsidiary 
background information to the forefront, 
resolution of the above issues will not 
materialize. 
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specific interests in the relationship between 
family background and life chances. and 
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Table 6. Logistic eee Coefficients for Citing Financial Ressdhs for Not Artending College on ibat 


Structure and Background Characteristics 


Not Attending College Because "I Need to 


Earn Money Before I Can Pay’ For Further 


childhood socialization. Most of her work has 
concentrated on the effects of familial struc- 
ture. ' à i 


BRIAN POWELL is Assistant Professor in 
the Department "of Sociology at Indiana 
‘University. His interests extend to the sociol- 
ogy of education, sociology of mental health/ 
illness, and sociology of the family. This 
article is part of a larger joint project 


investigating parental social, intellectual, and 


economic investments in children. 
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OVEREDUCATION AND EARNINGS: 
A STRUCTURAL APPROACH TO DIFFERENTIAL ATTAINMENT 
IN THE U.S. LABOR FORCE (1970—1982)* 


JAMES W. SHOCKEY 
University of Arizona 


This paper examines the hypothesis that “overeducated” workers receive lower 
earnings returns to a year of educational attainment than those who are not 
mismatched. Data from the Current Population Survey reveal that those who are 
mismatched actually receive higher overall earnings as well as higher absolute 
returns to a year of college. A structural model of earnings determination shows 
that the primary impact of advanced education among occupationally mismatched 
workers is through the attainment of jobs which offer higher pay. Among workers 
not mismatched, the main influence of schooling is direct rather than through job 
characteristics. However, the analysis also suggests the difference between 
mismatched and non-mismatched workers narrowed between 1972 and 1982. 


INTRODUCTION 


A number of perspectives have been used to 
explain the process by which education "pays 
off” in the labor market. While most focus on 
some combination of individual and structural 
characteristics (see Berg 1981), it should be 
useful to consider also the way they are 
linked—the process of matching workers to 
jobs (Granovetter 1981; Sorensen and Kalle- 
berg 1981). Yet research has not sufficiently 
examined the role of matching in models of 
labor market earnings. Notable exceptions 
include Freeman (1976, 1980), Rumberger 
(1980, 1987), and Verdugo and Verdugo 
(1988). Berg (1970), B. Burris (1983), V. 
Burris (1983), and Kalleberg and Sorensen 
(1973) are examples that deal with outcomes 
other than earnings. 

Although several dimensions of work can 
be used to gauge the quality or success of 
matching outcomes, the most common is 
educational attainment. Empirical evidence 
suggests that the proportion of workers who 
have completed more years of schooling than 
is either required or typical for their jobs is 
large and growing.! Between 1969 and 1980, 


* An earlier version of this paper was presented 
at the 1987 Annual Meetings of the Population 
Association of America. The author is indebted to 
Clifford Clogg, Roberto Fernandez, Herb Smith, 
Michael Sobel, and two anonymous reviewers for 
helpful comments and suggestions, and to Irene 
Browne for programming assistance. 

! Various terminology is used to describe 
essentially equivalent conditions, including overed- 
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the proportion "occupationally mismatched" 
nearly doubled, rising from 8 percent to 14 
percent (Clogg and Shockey 1984; Clogg and 
Sullivan 1983; Clogg, Sullivan, and Mutchler 
1986). With different indicators, estimates 
range from 22 percent of the labor force for 
1977-1978 (V. Burris 1983) to well over 30 
percent for 1973 (Rumberger 1987). 
Documenting the growing prevalence of 
occupationally mismatched workers, how- 
ever, does not directly address the relation- 
ship between earnings and labor market 
matching. The present paper examines the 
hypothesis that outcomes of the matching 
process are a central part of the larger process 
by which earnings are determined. I further 
argue that the “well-known” relationship 
between education and earnings itself may 
depend upon the success of the matching 
process. This implies that earnings determina- 
tion models cannot simply be updated by 
including indicators of mismatch status. 
Instead, models that allow interactions or that 
estimate and compare properly matched and 
mismatched workers may lead to different 
conclusions. ua 
This paper is organized as follows. First, I 
review the role of education in determining 
eamings levels from several perspectives. 


ucation, occupational mismatch, underemploy- 
ment, skill underutilization, and overtraining. In 
this paper overeducation and occupational mis- 
match are considered to be indicators of the same 
concept regardless of any differences in measure- 
ment procedure. See Shockey (1987). 
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OVEREDUCATION AND EARNINGS 


Second, I consider the impact of improperly 
matching workers to jobs. The next section 
expands upon the hypothesis that earnings 
returns to post-high school education de- 
clined through the early 1970s to include a 
more complete set of demographic groups 
tban is typically examined. Building on these 
results, I present a model of earnings 
determination including both individual and 
structural factors, and estimate it for both 
mismatch statuses at two points in time. 


EDUCATIONAL ATTAINMENT 
AND EARNINGS 


Human capital theory proposes that each 
additional year of schooling completed repre- 
sents an increase in future earnings potential. 
However, the key factor determining earnings 
is not education per se but productivity. An 
employer will set each individual’s earnings 
based upon overall contributions to the firm’s 
productivity. Thus an additional year of 
schooling will "pay off” insofar as it 
increases the employee's productive capacity. 

This view of education's value to the 
worker has come under substantial criticism. 
Many authors claim, for example, that the 
U.S. educational system does not provide 
workers with the skills needed on the job 
(Bills 1988; Bowles and Gintis 1976; Walters 
1984). Rather, schooling encourages such 
secondary traits as perseverance, competitive- 
ness, and bureaucratic obedience which aid in 
socializing students to the behavioral expecta- 
tions of most occupational roles. If the link 
between school and skill has been weakened 
over time, the human capital perspective 
cannot provide an adequate understanding of 
why education continues to be related to 
earnings. 

A second problem with the human capital 
viewpoint is the assumption that the capabili- 
ties of individual workers determine their 
productivity. Clearly, ceteris paribus an 
employee with more skill can produce more 
for a firm, but all else may not be equal. 
Structural features of the job, such as 
equipment, management style, and physical 
workplace layout, may have as much to do 
with productivity as individual characteristics 
(Sorensen and Kalleberg 1981). 

If education does not directly increase the 
level of relevant skills and there is more to 
productivity than simply being a competent 
worker, how does schooling increase earn- 
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ings? The answer lies in the fact that if jobs 
even partly determine potential earnings 
levels, obtaining one of the "good" jobs is 
essential for those who are qualified. Employ- 
ers, on the other hand, need to be certain that 
qualified employees are supplied with the 
tools and environment necessary to be 
productive. In short, workers and jobs should 
be properly matched (Granovetter 1981). 

For the moment, I define occupational ` 
mismatch to exist where a worker’s level of 
formal schooling is greater than that typical of 
workers in the same occupation. From a 
human capital perspective, surplus education 
should not be rewarded to the same degree as 
typical levels of schooling. Each increment to 
education beyond that typical or required 
should have a decreasing impact on individual 
productivity (Rumberger 1987). If the educa- . 
tional system produces workers with relevant 
skills, those who are inadequately matched to 
jobs will, on average, obtain lower dollar 
returns to their education than adequately 
matched workers. 

During the late 1960s and early 1970s, the 
supply of highly educated workers apparently 
exceeded the number of "appropriately com- 
plex" job opportunities (Easterlin 1978). 
Given a large pool of qualified applicants, 
employers should have chosen those with the 
greatest levels of schooling provided that 
additional education does not inhibit produc- 
tivity (see Rumberger 1987). Education 
comes to be treated as a relative indicator, not 
necessarily of skill (Thurow 1975). Regard- 
less of its ability to impart skills or forecast 
trainability, education becomes necessary in 
order to obtain jobs with high productive or 
earnings potential. 


LABOR MARKET MATCHING AND THE 
RETURNS TO SCHOOLING 


In this section I compare the annual earnings 
and returns to post-high school education of 
mismatched and non-mismatched workers. If 
structural position is important, persons with 
greater opportunities of being hired into 
well-paying positions (e.g., occupationally 
mismatched) should have higher overall 
earnings and returns to schooling. If individ- 
ual characteristics predominate, we should 
observe lower returns for the mismatched. 
The literature provides little precedent for 
this comparison. Freeman (1976) finds evi- 
dence of overeducation in the declining value 


_ only for white males. Verdugo and Verdugo . 
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'of college versus high school edücation 


between 1969 and 1973.. He assumed, 
however, that declining returns necessarily 
signal overeducation, and documented this 


(1988) present a model of earnings determina- 
tion with a variety of predictors including 
mismatch status, but they do not consider 
whether the returns to schooling differ 


“between mismatched and other workers. 


Rumberger (1987) did make this comparison 
and found that schooling obtained beyond 
required levels generally yields greater re- 
turns. His analysis assumes, however, that 
returns to required schooling are the same for 
adequately employed and overeducated work- 
ers, whereas in reality employers are unlikely 
to clearly distinguish ‘among required and 
surplus attainments. 

This study uses the measure of occupa- 
tional mismatch proposed by Sullivan (1978) 
and discussed in detail by Clogg and Shockey 


. (1984). Workers are considered mismatched 


if their educational. attainment is greater than 
one standard deviation above the mean 
education among workers in similar occupa- 


tional positions.? This definition centers on 


the upper tail of the educational distribution 


'(ie., the "overeducated"). Given that the 


“success of a match should range from good to 


poor, it may be important in some cases to 
focus on persons in the opposite tail (i.e., the 
“undereducated”). However, the earnings 
determination process is quite reasonably 
different in these opposing positions. I focus 
here only on mismatching that corresponds to 
overeducation. . 
. Mean earnings and returns to education. 


-"Table'1 presents the mean annual earnings (in 
.constant 1970 dollars) of full-time civilian, 


workers aged 25 to 64 by year. The data are 
from the March Current Population Survey 
(CPS) for all even years between 1970 and 
1982 inclusive.) These data show that mis- 


? The occupation-specific mean and standard 
deviation for educational attainment are obtained 
from the Public Use Sample of the 1970 Census of 
Population. The use of a single time point is 
necessary as a benchmark from which to determine 
the extent of growth’ or decline in mismatching. 
See Clogg and Shockey (1984). 

_ 3 Due to CPS procedures, there is a 50. percent 
overlap in households included in the samples for ‘ 


, two consecutive years. To ‘avoid a lack of 


os 
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matched workers earn significantly more-on 
average than those who are not, and, with few 
exceptions, this is true regardless of gender or 
color (p « .05). As expected, nonblack males 
have the highest average earnings of any. 
group regardless of mismatch status. The 
pattern of decline among nonblack males, 
noted by Freeman and others for the pre-1976 
period, is replicated only: among the occupa- - 
tionally mismatched. Overall, declines are 
found more often among mismatched than 


. non-mismatched workers, which could be 


taken as support for the human capital notion 


'. that overeducation and earnings declines are 


tightly linked. On the other hand, there are 
many significant increases in earnings .as 
well; however, this should not be used: as : 
direct evidence against the human capital 
approach. The premise is that overeducation 


necessarily implies a decline in returns rather 


than raw earnings levels. ` 

Table 2 presents the returns to education 
estimated by regressing earnings (in 1970 
dollars) on age, age-squared, and a spline of 
education which separates the incremental 
benefit of each year of “graded” schooling 
(0-12 years) from that for each year of 
“postsecondary” schooling (13 + years). Only 
the coefficient for postsecondary education is 


reported.* Every gender-color-mismatch sta- 


tus category excluding occupationally mis- 
matched black males shows evidence of a 


significant decline in returns. And with very 


few. exceptions the returns are greater among 
the mismatched. However, the mismatched 
also exhibit greater percentage declines in 
their returns over the period. For nonblack 
males, as an example, returns fell 26 percent 
among those not occupationally mismatched, 
but fell 37 percent among the mismatched. 
The anomalous group, black males, experi- 
enced an increase in returns from 1970 to 
1982 for those who are mismatched, but a ` 
decline of 57 percent for those not mis- 
matched. This partly results from an extreme 


observation in 1970 and fails to eliminate the 


basic conclusion that mismatched workers | 


independence between files, I examine only those 
for even years. 7 

" Freeman's work centers on the ratio of 
earnings obtained by college graduates to those of 
high School graduates rather than absolute changes 
in returns to college. His findings (and those of 
others) imply that absolute declines should also be. 
observed, 


hy 


we 
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Table 1. Mean Annual Bémings of the Full-Time Civilian Labor Force, by Gender, Color, Year, and Mismatct 








Status* "NP . 
" Nonblack Males Ner Females” Black Males Black Females 
Survey Not _ ~ Not Not. Not 
Year Mismatched Mismatched Mismatched Mismatched Mismatched Mismatched Mismatched Mismatche 
1970 $9308° $12622  . $4735 $5856 $5874 $7177 .— $308 $4320 
5602 , 8397 2854 3968 3417 3296 ^ . 2364 3015 
22464 ` 3051 9771 972 2100 187, 1350 186. 
1972 9335 12141] -'47600 © 5917 6010 7616 . , 4074 4563 
5680 7650 . "2844 3777 :3360 4333 2623 | 3059 
20772 3441 9489 1097 1909 211 1305 166 | 
1974 10030 12148 5045 5575 "6603 ` 7586. ` 4352 4326 
'-— «6019 7399 3019 3499 3193 3721 2672 2996 
19524 3824 9035 1313 1839 225 1273 204 
1976 . 9298 11087 -` 4810 '5719 6342 7435 4396 4863 
5533 6690 2861 3503 3181 3935 2490 2955 
18460 4325 9312 1624 _ 1617 248 1342 217 
1978 ` 9515 “11248 4924 5596 - 6458 . $358 ` 4569 5715 
5433 6669 3037 3457 3874 4361 2788 2948 
21330 5625 11543: , 2201 1968 435 1618 : 326 
1980 9354 10805 4904 5758 — 6498 7169 4598 | 5425 
5137 5987 2884 3610 ^ 372 4436 ' 2663 3215 
23715 7118 13903 3114 2159 512 1889 455 
1982 8787 . 10200 4804 5558 6052 5,6915 4366 5414 
5184 6198 - 2848 ` 3306 3373 3526 2422 2696 
26221 8907 15993 3979 : 2338 711 2233 ` 464 





* Data are from the March CPS files; universe includes persoris 25-64, with earings from the previous year, an 
who are currently working full-time 
> Earnings are for the 12-month period prior to the survey, reported i in constant 1970 dollars. Cell entries from tor 
to bottom are the mean, standard deviation; and sample size. 


receive greater returns its — 


education than do those ‘who are properly 
matched to their jobs. 


EARNINGS, JOB COMPLEXITY, AND 
POST-HIGH: SCHOOL EDUCATION 


Structural Model of Earnings Determination 


The above analyses show that differences in 
earnings returns do exist depending upon 
matching outcomes. But they do not distin- 
guish among the various pathways from 


. education to earnings. This section proposes a 


structural model that permits the relationship 
between postsecondary schooling and earn- 
ings to be decomposed into two parts. The 
first, or indirect effect, is interpreted as the 
impact of schooling on earnings resulting 
from improved job positions' attributable to 


schooling net of geríder, color, and age.. 


Second, the direct effect represents ‘the 
change in earnings levels associated with 
formal education beyond high school that 
cannot be accounted for by simple dérno- 


EH 


graphic or job characteristics, as with : 
human capital interpretation. The model treat: 
race, gender, and age (linear and quadratic) a: 


. exogenous variables. Three sets of endoge- 


nous factors are included: .education, jot 
characteristics, and earnings. The specifica 
tion of job .characteristics utilizes three 
indicators of job complexity from the Dictio- 
nary of Occupational Titles (DOT). The first, 
DATA, considers the required usage oj 


information (e.g., written and numeric mate- 


rial); the -second, PEOPLE, stresses the 
frequency and types of interactions with othe: 
people; and finally, THINGS indicates the 
dégree to which tools or objects are utilized 
(Miller, Treiman, . Cain, -and Roos 1980).' 


"Ihe. model is constructed with all three 


complexity, measures intervening between 
education and earnings and includes the 


errorterm correlations among these > three 


TES TES 

* DATA i$ coded .to integer values" ranging 
between a low complexity of 1 to a high of 7, with 
both PEOPLE and THINGS coded from 0 to 8. 


0 
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Table 2. Increment to Annual Earnings Obtained for Each Year of Post-High School Education, by Gender, Color, 


Year, and Mismatch Status* 





Nonblack Males Nonblack Females Black Males Black Females 
Survey Not ; Not Not Not 
Year? Mismatched Mismatched Mismatched Mismatched Mismatched Mismatched Mismatched Mismatched 
1970 $1085° $1565 $695 $957 $952 $563 $785 $833 
23 86 75 126 20 60 45 84 
1972 1073 1354 672 1044 695 1320 872 833 ` 
23 69 73 130 19 57 50 90 
1974 961 1346 705 681 828 762 787 871 
25 63 85 129 20 49 51 101 
1976 982 1195 626 805 644 856 595 924 
22 53 69 131 18 46 44 89 
1978 890 1131 612 761 785 947 684 828 
19 47 69 107 18 40 48 71 
1980 792 985 558 752 850 1057 730 ^ 752 
17 38 65 99 15 36 39 74 
1982 801 982 566 574 411 1023 533 551 
' 16 33 54 68 13 29 34 68 


3 Data are from the March CPS files; universe includes persons 25—64, with earnings from the previous year, and 


who are currently working full-time. 


> Earnings are for the 12-month period prior to the survey; figures are in constant 197C dollars. 


and 


Two types of comparisons are derived from 
the model. The first is between mismatched 
and non-mismatched workers. As noted, it is 
anticipated that the indirect influence of 
- education through jobs will be greater than its 
direct impact for those who are mismatched 
but not among workers who are adequately 
matched. Second, the discussion implies that 
earnings returns to a year of formal post-high 
School education should decline over time, 
particularly for the indirect influence of 
schooling. To facilitate these comparisons, 
the model is estimated at two points in time : 
(1972 and 1982) for each mismatch status 
group. Data from the 1972 and 1982 March 
CPS files are used to obtain full-information 
maximum-likelihood estimates for the mod- 
el'S parameters. The final sample includes 
only persons aged 25 to 64 with more than 12 
years of completed schooling in order to 
reflect the present interest in postsecondary 
education. Further restrictions exclude per- 
sons without earnings, those who are not 
currently employed, and those who did not 
work full-time for the entire previous year.$ 


6 An additional restriction, excluding persons 
with full-time earnings below the official poverty 
level (Clogg 1979), produces equivalent results to 


s nae to post-high school education and standard error from a regression controlling for graded schooling, age 


Each CPS cross-section is weighted to reflect 
sampling procedures and nonresponse, then 
adjusted to maintain the original size of the 
CPS sample. The resulting sample contains 
3862 mismatched and 5840 non—mismatched 
persons in 1972, with 8075 mismatched and 
8399 non-mismatched in 1982.7 


Model Estimates and Effect Decompositions 


Comparing mismatched and non—mismatched 
workers in 1972, the mismatched appear 
more likely on average :o be nonblack, male, 
somewhat older, with slightly more education 
and higher earnings (see Table 3). In 1982 the 
mismatched are more likely than those not 
mismatched to be black, male, and younger, 
still with more education but essentially no 
difference in annual earnings. The occupation- 
ally mismatched in 1972 appear to have more 
complex jobs with respect to physical 


those reported here. Results may be obtained upon 
uest. 
. * Note that mismatch rates are higher here than 
elsewhere (Clogg and Shockey 1984; 
Sullivan 1978) because of the inclusion only of 
persons with some post-high school education. 
This is also reflected in the smaller sample size. 


—— 
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THINGS, but less complex in their relation- 
ship to DATA and other PEOPLE. The same 
is true. in 1982. This is surprising, although 
the expectation that occupationally mis- 
matched workers will hold more complex 
jobs is based on complexity returns to 
schooling rather than absolute levels . of 
complexity. 

Schooling effects on job complexity. After 
accounting for demographic factors in 1972, 
important differences remain in the relation- 
ship between post-high school education and 
jób complexity (see Table 4). As a general 
rule, education leads to jobs with greater 
complexity in terms of DATA use and 
relations with PEOPLE, but to less complex 
jobs regarding physical THINGS. Central to 
this paper is the fact that these "complexity 


returns" are greater (in absolute value) for 
mismatched workers. A simplistic profile of a 


Table 3. Means and Standard Deviations by Mismatch Status and Year 20 f 2 
1972 . 1982 
Not Mismatched — | Mismatched Not Mismatched ` Mismatched 
. Variables x s H so z s i T 
Color (&) .953 211 .939 239 .938 241 .919 272 
Gender (E) ^ 734 442 187 .410 .668 ATI 707 .455 
Age. (Es) 40.772 ` 11.035 41.177 10.822 39.814 10.570 35.537 10.198 
Age? (E) .1784.064 944.439 1812.616 927,961 1696.865 904.423 1589.087, 861.981 
Schooling (mı) ~ 15.288-: 1.726 15.800 1.543. 15.264 1.748 15.878 1.547 
Data (m) 4.420 .913 3.656 1.460 4.406 .942 3.623 1.474 
People (n3) 4.486 2.370 . 3391 2.580 4.406 ` 2442 3.205 2.529 
Things (n4) 2.635 2.282 ` 2.956 2.268 2.846 2.475 3.125 2.292 
Earnings (ns) _ 12.652 7.904  ' 12.890 7.955 4.830 2.831 4.748 2.835 
UN l 5840 3862 


“good” job, based solely upon its s complex- ; 
ity, might include the extensive use of data 
resources, frequent interaction with cowork- 
ers (typically subordinates), but very little use 
‘of physical objects which would be more 
common in manual labor positions. Given 
this profile, it is not surprising that the link 
between the better labor market positions and 
education beyond high school is stronger 
among those who are mismatched. This 
pattern of findings is replicated for the 1982 
sample, with only minor attenuation in 
magnitude (see Table 5). © 

Education, complexity, ind earnings. For 
each subgroup the effect of postsecondary 
education on earnings is again considered net 
of the impacts-of color, gender, age, and 
age-squared. This effect is décomposed into 
direct and indirect effects. The total impact of 
cious estimated for 1972 is about’ 44 


"Table 4. Maximum-Likelihood Estimates of Earnings Model Parameters: _By Mismatch Status, 1972 








Dependent Variable : : =. Gk 


Non-Mismatched Workers: dz = .276**; ty, = 1191 
Mismatched Workers: 23 = 1.380**; (4, = —.065; 
Significance Levels: 

* p< .05. 
** p< OL, 





I on ideni . Non-Mismatched Workers Mismatched Workers ' 
Variables un m w- n T5... w H m å 7 — 0 05 
Color (£i) —.102 .098  —.328**  .409** 1.410** .628** — .523**  .540** ^ 043 — 1.598**- 
^ Gender (Ej) 046 .687** —.007 .486** 5.128** 239** .416**  .688** —.886** 5.000** 
Age. ($) .010 .013* 002 .040 .723** .059** ' 051**  110**  .064** .780** 
' Age? (E) .000 .000 .000 .000  —.007**  —.001** —.001** —.001**  .001** —.008** 
Schooling (m) — — .152** .582** —.122** — 732** — 510**  .861** —.573** — .517** 
Data (Mm) ^. — — — — .447** — z = pr 734** 
People (n3) = — Š -— .404** Z e — ~ .526** 
Things (n4) = -* = oe 479% s. — -— = — 119 
Earnings (n) |— = — - — ~ — so — — o 
R 001 .199° 185 021 .220 .020 . .338 .308 .195 283 





*t* du. = —2.260**, 
ju = -2. sip. 
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Table 5. Maximum-Likelihood Estimates of Earnings Model Parameters: By Mismatch Status, 1982 





Dependent Variable 

Independent Non Mismatched ORARE Io c a 
Variables Tı "m Tis w -ms un "m T3 nM Tis 
Color (e;) .428**  .152** — .070 .183 .739** -751**  .445**  .302**  250*-  .486** 
Gender (€2) .403**  .568**  .192**  .358** l.653** .044 .360** — 494** — 439*- 1.820** 
Age (€3) ` .069** — .011 .130** —.091**  .294** .149**  ,056**  072**  .017 .295** 
Age? (ea) —.001** 000 Á —.001** .001** —.003** . —.002** —,001** —.001** .000  —.003** 
Schooling (mı) | — .159**  .549** —.030 .230** — .484**  772** —,565**  .]70** 
Data (n2) — — — — .235** — — — — .293** 
People (na) — — — — .157** — — — — .107** 
Things (n4) — — — — .226** — — — — —.082** 
Eamings (m) — =- | -=  - L- - - - = = 
R? .021 .194 .172 .007 .280 .038 311 .265 .158 .304 

Non-Mismatched Workers: yn = .395**; to, = .038**; W354 = —2.491**. 

Mismatched Workers: oz = 1.428**; jp, = —.086**; aq = —2.641**. 

Significance Levels: 

* p « .05. 
** p< 0l. 


percent greater for workers who are mis- 
matched than for those who are not (1.441/ 
0.997 = 1.44). This advantage fell to 27 
percent in 1982, although the magnitude of 
the total effect fell by nearly one-third for 
each group over the decade. Among mis- 
matched workers in 1972, the estimated 
indirect effect of schooling through job 
complexity is nearly 75 percent greater than 
the direct effect (.895/.517 = 1.73). For 
workers not mismatched, the direct effect is 
nearly three times as great as the indirect 
effect (.732/.245 = 2.95). Similar results are 
found in 1982, although absolute as well as 
relative magnitudes are diminished. The 
indirect effect is 59 percent greater than the 
direct effect among the mismatched (.271/ 
.170 = 1.59), whereas the direct effect is 
twice as large as the indirect effect for those 
not occupationally mismatched (.230/.116 = 
1.98). 

In sum, the effects of postsecondary 
schooling on the attainment of job-related 
earnings, net of demographic characteristics, 
(a) are greater among mismatched workers, 
(b) are mediated by the influence of job 
complexity to a much greater extent among 
the occupationally mismatched, and (c) have 
diminished over the period from 1972 to 
1982. To examine these results more thor- 
oughly, several modifications were imposed 
on the model, and in no case are the 
conclusions significantly altered. This is true 
when additional control variables are added 
(i.e., region, metro./nonmetro., industrial 


sector), when the ordinality assumption im- 
posed on each complexity measure is relaxed, 
and when alternative measures of the match- 
ing status are used.? 


DISCUSSION 


The literature of the past two decades contains 
numerous references to the declining value of 
a college education experienced by members 
of the “baby boom” generation. Even with a 
drop in the relative advantage of higher 
education, schooling beyond minimal require- 
ments has become beneficial in competing for 
access to better jobs. This means that each 
additional year of education results in smaller 
returns as the decade progresses, but that the 
returns themselves will be greater for mis- 
matched workers. Furthermore, mismatched 
or overeducated workers will be more success- 
ful on average at obtaining better occupa- 
tional positions by virtue of their “excess” 
educational attainment. As a result, jobs 
should be associated with higher levels of 
earnings for those who are overeducated than 
for those who are not. Given two workers in 
equivalent occupational positions (in terms of 
earnings, job complexity, authority, etc.), but 
with different levels of schooling, the one 
with fewer years of schooling will have 
greater net returns to each year completed. 
This implies that the direct effect of education 


8 Results available on request. 
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on earnings, net of the job held, should be 
greater for persons who are not overeducated. 
Finally, as expected with the growing supply 
of labor in the 1970s, the net dollar benefits 
of post-high school education declined to the 
end of the decade for both mismatched and 
non-mismatched workers, but the disadvan- 
tage in returns for education experienced by 
those mismatched also declined. In short, the 
distinction between those who are overedu- 
cated and those who are not has become less 
pronounced over time. 


These results taken together suggest that 


the value of additional schooling as a 
credential may supersede its worth as a 
transmitter of skills in providing access to 


good, "productive" jobs (Thurow 1975), and : 


provide one explanation as to why schooling 
is routinely obtained beyond necessary levels 
(Smith 1986). Even so, the paths to overedu- 
cation are many. An individual may obtain 
precisely the level of education perceived as 
being required in a chosen occupation, 
although the actual level is lower than that 
perceived. Or education may be obtained with 
at most a casual awareness of labor market 
requirements. On the other hand, surplus 
schooling may be obtained knowingly be- 
cause of an awareness that an educational 
credential exists either explicitly or implicitly. 
Whereas the political implications of these 
paths may differ, and each is likely to lead to 
job-worker mismatching, the economic con- 


sequences for the individual appear to be 


minimal. 


JAMES W. SHOCKEY is Assistant Professor 
of Sociology at the University of Arizona. His 
current research deals with underemployment 
in thé U.S. workforce, especially overeduca- 
tion, and with the development of techniques 
for latent class analysis. 
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DEMOCRACY AND INCOME 
INEQUALITY RECONSIDERED* 


(Comment on Muller, ASR, February 1988) 


ERICH WEEDE 
University of Cologne 


While cross-national studies on the relationship 
between the level of democracy and income 
inequality produced contradictory results, Muller 
(1988) has recently argued that there is a 
theoretically and statistically significant relation- 
ship between the age of democracy and income 
redistribution. Here, it is to be demonstrated that 


. Muller's finding is neither sustainable for his own 


data set, once a single case is added to some 
equations, or once literacy rates are controlled for, 
nor replicable with a different inequality data set. 

According to Muller (1988, pp. 58 and 60) the 
natural logarithm of years of democracy had some 
equalizing impact on top 20 percent income shares 
as well as on Gini, where measures of income in- 
equality refer to the 1965-1976 period. This result 
held in bivariate as well as in multivariate analysis. 
It survived the replacement of the continuous “years” 
variable by some qualitative variables and a variety 
of specifications of multiple regression equations. 
While Muller did control for the effects of the level 
of economic development, age of constitution, per- 
centage of very young people, and world-system sta- 
tus, he neglected some variables which previous re- 
searchers found to be rather strongly related to income 
inequality, in particular literacy (Ahluwalia 1976). 
Without entering literacy into the regression equa- 
tion, we cannot exclude the possibility that Muller 
(1988) produced spurious findings by neglecting some 
important determinant of the size distribution of in- 
come. d 

Muller (1988, p. 54) reports top 20 percent in- 
come shares and Gini coefficients for 55 nations 
where the inequality data refer to some year in be- 
tween 1965 and 1976. Since his analyses of the re- 
lationship between democracy and inequality pro- 
duce extremely similar results for regression equations 
explaining Gini and upper quintile income shares, I 
can limit this study to either one of these dependent 
variables. Since another inequality data set which I 
shall use below reports only top 20 percent income 


* Direct all correspondence to Erich Weede, 
Forschungsinstitut fuer Soziologie, Lindenburger 
Allee 15, 5000 Koeln 41, West Germany. The 
correlations between variables included in Table 1 
are available from the author upon request. I 
appreciate the computational assistance of Ulrich 
Albrecht and Juergen Schwuchow. 


shares, but not Gini, it is advisable to choose Mull- 
er's top 20 percent income shares as the dependent 
variable for this paper. It is important to remember, 
however, that nothing in Muller's (1988) findings 
provides any reason to believe that this should bias 
the findings below.! 

Although I am not fully convinced that Muller's 
(1988, p. 55) data on the date of inauguration of 
democracy and therefore his “years of democracy” 
variable are valid (see Weede 1989 for an 
alternative operationalization), I shall here accept 
them without modification. Since Muller's qualita- 
tive measures of democracy again produce similar 
results as his quantitative measure, since the 
qualitative measures are merely discontinuous 
transformations of the qualitative measure, I shall 
neglect them here.? 

In order to defend his findings against the charge 
of spuriousness, Muller applied various control 
variables. Some of them are never significant in his 
equations: individual (as distinguished from house- 
hold) income and the world system dummies, that 
is, periphery and semiphery. I shall eliminate these 
from the very beginning. 

But there are also some significant control variables: 
The level of economic development as measured by 
energy consumption per capita in 1965. Like Muller, 
I shall use Ballmer-Cao and Scheidegger (1979) as 
the main source of data and Taylor and Hudson (1972) 
as a supplementary source for Barbados and Gabon. 
Since the relationship between the level of economic 
development and income inequality is non-mono- 
tonic and curvilinear, I shall—again like Muller 
(1988), or Ahluwalia (1976), or Weede (1980)— 
regress income shares on the logged development 
term and on its square. f 

Then there is the youth of the population, that is, 
the percentage of persons younger than 15 (World 
Bank 1983, vol. 2). Unfortunately, this variable 
made Muller (1988, p. 59, note 16) lose the case of 
Taiwan; so there are at most 54 cases of 
observation. Finally, there is (logged) constitu- 
tional age (Taylor and Hudson 1972, pp. 26-29) 
which is sometimes significant and sometimes not 
in Muller’s regressions. 

The first column of Table 1 provides as close a 
replication of Muller’s (1988, p. 60) Table 4 as a 
single equation (instead of 10) can deliver. Using 


! Moreover, I also did the regressions for Gini 
insofar as possible and found very similar results 
for those equations. 

? As Blalock (1970) has demonstrated, making 
qualitative out of quantitative measures may easily 
create nonrandom measurement error. It seems 
preferable to me to avoid this. 
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Table 1. Regressions* of Top 20% Income Shares on the Level of Economic "Development, Youth of Population, 


~ Political Characteristics of Nations, and Literacy 


i i Muller’s Inequality Data 





(1). (2) 





percentage of population . 0.51 . 0.49 


ON i : 54 55 l 


Intercept ` -8.68 | -1.54 
TEER 14.26 12.33 

; phon] |. @.01) , (0.03) 
(In energy consumption p.c) poc ' 2 
younger than 15 (0.00) (0.00). 
In years of democracy (0.05) pns 
In constitutional age por pu | 
literacy : ; © = - 
adjusted percentage of 59.9 52.5 


variance explained 
a Significance levels in two-tailed tests In parenthesis., 


t 


only those independent variables that have been 
significant in Muller's equation 4.2a or 4.3a, I get 
_ extremely similar regression coefficients to Mul- 


ler's. Like Muller's findings, the level of economic : 
development is non-monotonically related to top . 


20 percent income shares; a large percentage of 
young persons increases inequality; the age of 
democracy deceases inequality; and the constitu- 
tional age has little effect on top 20 percent income 
shares. Since the latter finding looks more different 
from Muller’s findings than it really is, I want to 
underline that my coefficient of —1.00 is very 
“close. to Muller's (1988, p. 60, equation 4.32) 


'—— 1.05, and that Muller himself found this 


independent' variable insignificant in equations 
4.4a and 4.4b and therefore deleted it in his 
equations 4,5a and 4.5b. In essence, Muller's 
findings and my own. agree on the marginal 
importance of constitutional age. 

As an ASR reviewer has pointed out to me, 
Muller need not have eliminated Taiwan from 
those equations where age structure was to be 
controlled. An’ older World Bank (1976, p. 512) 


publication provides information for 1960 and. 


1970, although not for 1965. Taking the mean of 
the available values as a substitute, that is, 42.5 
percent younger than 15, one may add the case of 


Taiwan, as has been done in the second column of 


Táble.1. Thereafter, the fit of the equation 
deteriorates somewhat and logged years of democ- 
racy is no longer a significant explanatory variable. 
The second equation highlights the lack of 


robustness of Muller's previous findings and the , 
tentative character of any conclusions that we ` 


might draw from accidental samples, that is, from 


E: 


World Bank Data 
(3) (4) (5) (6) 
—24.08- | —22.38 —43.02 — 27.10 
20.82 20.66 ` 19.39 14.71 
(0.00) (0.00) (0.03) (0.00) 
—1.47. — 1.45 —138 : — 1.05 
(0.00) (0.00) (0.00 (0.00 
0.43 0.42 0.71 0.75 
(0.00) (0.00) (0.00) . (0.00) 
—0.22 -0.18 0.29 F-level-in- 
(0.71) (0.77) (0.72) sufficient 

—0.64 : 
= (0.34) i = 
-0.17 —0.16 
(0.00) (0.00) X m 
54 54 36 46 
68.8 68.8 68.1 |. "4 


; samples which are necessarily limited by data 


availability. In order to demonstrate that the 
inclusion of Taiwan is not essential in order to call 
Muller’s findings into question, I shall not include 
it in the following equations. 

The third equation (and the third colima of Table 
1) differs from the first by eliminating constitutional 


' age (which in itself has little effect on other coeffi- 


cients) and by including literacy rates as an addi- 
tional independent variable.? This new explanatory 
variable is significant beyord reasonable doubt; it 
slightly modifies the previous estimates of the rela- 
tionships between the level of economic develop- 
ment or the percentage of young people and inequal- 
ity; and it comes close to reducing the impact of 
logged years of democracy to zero. So, adding a 
control for literacy rates in itself is enough to call 
Muller's (1988) relationship between democratic ex- 
perience and (less) inequality into question. But you 
do not absolutely depend on literacy rates to arrive at 


this conclusion as has been demonstrated in the sec- - 


ond equation and will be demonstrated again in the 
fifth and sixth equations of Table 1. The fourth equa- 
tion merely reconfirms that the negative conclusion 


on the explanatory value of logged years of democ- : 


racy is not dependent on excluding logged constitu- 
tional age. 

Muller (1988) compiled his own inequality data 
set, because he was not satisfied with any single one 
of the previously existing compilations. Muller's pe- 


3 [n oider to avoid missing values on literacy, I. 
chose Taylor and Jodice (1983 and data tape) as 
my source and 1960 as the reference year. 
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riod of inequality- assessment was 1965-76. The ` 
World Bank (1987, pp. 252-53) has recently re- ` 


ported inequality data for 46 nations, the earliest of 
which refer to 1970 and the most recent ones to 
1986. The time spread between early and late mea- 
surements is a little bit longer than for. Muller's sam- 


ple. Most of the data refer to some later. point of ` 


time. Insofar as there is some overlap in timing—of 
course, there also is some actual identity of data, 

since Muller used the World Development Reports, 
too—this should bias my analysis below in favor of 
reproducing Muller’s findings. It is also to be re- 
membered that my acceptance of Muller’s data of 
inauguration of democracy also should make a rep- 
lication of his findings easier. 

'Given the typically later measurements of 
income inequality, energy consumption per capita 
now. refers to 1975. The data source is Taylor and 
Jodice (1983, pp. 114-17). Likewise, the percent- 
age of population younger than 15 now also refers 
to 1975. As in Muller’s study the data source is ite 
World Bank (1983, vol. 2). 

Muller (1988) reports the inauguration of 
democracy for merely 55 nations. Unfortunately, 
the overlap between his 55 nations and the World 
Bank's (1987) 46 nations is merely 36. But all of 
these 36 nations have been represented in Muller's 
(1988) study, in. the first equation of Table 1 
above, and now in its fifth equation. 

For the 36 nations where it is possible to 
substitute later World Bank inequality data for 
Muller's earlier data, but to rely on his own 
measurement of the inauguration of democracy and 
therefore years of democracy, democratic experi- 
ence has no significant impact on top 20 percent 
income shares. Worse still, even the sign of the 
insignificant coefficient turns out wrong by 
pointing to some (very weak) inegalitarian effect of 
logged years of democracy. A comparison of the 
fit of the fifth equation with the first column of 
Table. 1 ' reinforces the point that years of 
democracy do not contribute to the explanation of 
income inequality in the World Bank data set, even 
where literacy is neglected. 

The discrepancy between Muller's (1988) find- 
ings which are summarized in the first equation 


and my own fifth equation may result either from ' 


the difference in the inequality data or from the 
loss of cases in the fifth column as compared with 
the first one. Rerunning the first equation only for 
those 36 cases which are also included in the fifth 
equation reduces the size of the effect of logged 
years of democracy from previously — 1.22 to 
—0.36 which is nowhere near to statistical 


: significance. So, the reduction in sample size 


ratlier than different numerical estimates of top 20 
percent income shares looks largely mespouaible for 
the discrepancy in findings. 

On the one hand, one may argue that larger sam- 


ples are always preferable to smaller ones and there-` ` 


fore prefer Muller's findings and my first equation 
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"over the fifth equation. Followiüg this logic, one 


"should also remember that the inclusion of Taiwan- 


ese data on age stricture in the second equation gen~“ 
erates the largest number of cases and also calls the 


‘impact of logged years of democracy on top 20 per- 


cent income shares into question. On the other hand, | 
Muller accepted some older publications by World . 
Bank-affiliated authors, that is, Jain (1975) and Ahl- ` 
uwalia (1976), as sources for his own compilation of 
inequality data, but the World Bank (1987) itself 


: prefers no longer to include some of these data in the 


World Development’ Reports. Obviously, there is 
room for disagreement about the acceptability of some 


of the inequality data. . 
Since the World Bank supplies -inequality ‘data 


for another 10 nations where Muller (1988, p. 55) 
does not give us the data of inauguration of democ- 
racy, I also added my own éstimates for these na- 
tions.^ Thereafter, logged years of democracy still 
does not affect top 20 percent income shares, as can 
be seen in the sixth column of Table 1. i 

- In sum, Muller’s (1988) finding on the relation- 
ship between democratic experience and income: 
equalization is not robust. It does not survive the 
inclusion of a single additional case, (Taiwan)* or 
control variable (literacy rates), Moreover, it does. 


` not hold, if you substitute World Bank (1987) data 


on income shares for Muller's (1988) own data. 
The firmest conclusion of this comment is nega- 
tive. Currently available inequality data do not per- 


mit a clear judgment on the relationship between 


years of democracy and income inequality, If Mull- 


'er's (1988) sample of nations should prove more. 
- representative than the World Bank sample, that is, 


if the inclusion or exclusion of literacy rates were to 
determine the size and significance of the democracy- 
inequality relationship in the larger-and better data 
sets that hopefully become available in future, then 
we would face the issue of indirect relationships be- 
tween democracy, literacy, and inequality, Conceiv- 
ably, democracy might improve. literacy rates, and 
thereby indirectly reduce inequality. Jt is also con- ' 


_ ceivable, however, that higher literacy rates contrib- 


ute to the survival of democracy (i.e., to years of 


‘democracy) as well as to less inequality. While these 


are fascinating issues, it is questionable whether cur- 
rently available data permit a NE investiga- 
tion of them. 


1 


* Relying on Banks (1975), Gastil (1983-84), 


` and Hewitt (1977), I scored Hungary, Yugoslavia, 


Zambia, Mauritius, Hongkong, and Bangladesh as 
nondemocracies at the time of measurement of 
inequality. Years of democracy was scored 106 for 
Switzerland in 1978, 64 for Finland in 1981; 60 for 
Denmark in 1981, and 32 for Israel in 1980... 

5 Strictly, this statement refers to an equation 
where age structure is controlled; as it snòuld be 
according to Muller’ s s findings: 


n 
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DEMOCRACY AND INEQUALITY* 
(Reply to Weede) 


Epwarpb N. MULLER 
University of Arizona 


I accept Weede's (1989) conclusion that my index 
of years of democratic experience (Muller 1988) 


* Direct all correspondence to Edward N. 
Muller, Department of Political Science, Univer- 
sity of Arizona, Tucson, AZ 85721. 
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does not have a robust direct negative effect on 
income equality. The reason is not the case of 
Taiwan but rather the control for literacy. The 
missing score for Taiwan’s population 0-14 years 
of age in 1965 is 44.9 percent (from the Taiwan 
Statistical Yearbook). As equations 1.1 and 1.2 of 
Table 1 show, across all 55 cases with information 
on income distribution during 1965-75, my 
experience of democracy measure, Years of 
Democracy M,! has a marginal negative effect on 
gini income inequality (ż ratio of — 1.7) controlling 
only for level of economic development and 
population age structure, the baseline socioeco- 
nomic variables in Bollen and Jackman’s (1985) 
model; but the coefficient of years of democracy is 
reduced to zero if the literacy variable that Weede 
used, World Handbook literacy (from the data tape 
of the World Handbook of Political and Social 
Indicators), is included. 

In response to a critique of my democracy 
measure by Bollen and Jackman (1989), I have 
investigated alternative indicators and found that 
the index of the age of contemporary democratic 
systems developed by Rustow (1967) shows a 
considerably stronger bivariate correlation with 
income inequality than mine (the respective 
correlations are ~67 and — .51; see Muller 1989). 
Rustow's measure is quite simple conceptually: 
Democracy is defined by the presence of popular 
competitive elections that determine the composi- 
tion of the government; three consecutive compet- 
itive elections are required far the establishment of 
a functioning or consolidated democracy; and 
among countries that have consolidated a demo- 
cratic regime, the age of democracy is defined by 
the number of years of continuous elections. The 
principal differences between the Rustow index 
and mine are (1) that the 50 percent right-to-vote 
criterion is not used for determining popular 
elections, (2) the threshold of a consolidated 
democracy is three instead of two consecutive 
competitive elections, and (3) if democracy is 
replaced by authoritarian rule, years of democratic 
experience revert to zero. 

The strong negative correlation between Rus- 
tow's measure and income inequality may reflect 
an egalitarian influence of continuous popular 
elections. But since Rustow's index of years of 
democracy correlates strongly with other possible 
causes of income inequality such as level of 


! The score of the United States on Years of 
Democracy M is adjusted from 102 to 52. The 
U.S. loses 50 years of democracy because of de 
facto exclusion of blacks, which effectively 
reduced the population eligible to vote below the 
50 percent level, despite de jure universal male 
suffrage in 1870. The inaugural date of democracy 
in the United States now is set at 1920, when 
universal suffrage was introduced. On this point 
see Bollen and Jackman (1986) and Muller (1989). 
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Years of Democracy, and eid (N=55) 


1 E - t 








ES 


Q.D. 
intercept | 024. 
, (713) 
In Energy Consumption p.c., 1965 +0.15 
. ‘ 2S (2.6) 
In Energy Consumption p.c.? —0.011 
c -2.4)  , 
% Population « 15 Years, 1965 : +0 0053 
. (3.7) 
In Years of Democracy M . —0.011 
: (71.7) 
In Years of Democracy R 
World Handbook Literacy, 1960 
World Bank Literacy, 1960-70 
Adj R= . 77 . - 468 


. * r ratio in parentheses. 


' 


economic development, youth of population, and. 


literacy, the association between it and incomé 
inequality could, of course, be spurious or indirect. 
Equations 1.3, 1.4, and 1.5 of Table | address that 
issue. . 

The third and fourth equations have the same 
explanatory variables as the first and sécond except 
that my measure of experience of democracy is 
replaced by the Rustow index, denoted Years of 
Democracy R.? Without controlling ‘for literacy, 
the coefficient of Years of Democracy R is twice as 
large as that of Years of Democracy M and its t 
ratio is. substantial. The control for World Hand- 
book Literacy cuts the size of the Years of 
Democracy R coefficient in half and its ¢ ratio of 
—1.6 is indicative of only a marginal direct 
negative effect. . 

The principle source of the World Handbook 
literacy data is a 1967 publication of the United 
States Agency for International Development (see 


?"The Rustow index i$ calculated to the ‘most 
recent date of income distribution measurement. 
See Table 5 of Rustow (1967) for the initial year of 


continuous popular elections for all democracies in . 


1967. The list of democracies subsequently was 
amended to exclude Mexico (Rustow. 1970, `p, 
349). Several small democratic states that had 
recently gained independence also were omitted, 
including Barbados and Trinidad and, Tobago in 
this sample. Continuous popular elections in 
Barbados and Trinidad and Tobago are dated froin 
1946 and 1950, respectively, when universal 
NE was introduced. 


Gini Income Inequality, 1965-75 


(1.2) (0.3.4. +f a) (1.5) 
—0.40, —0.09 ~0.28 -01; 
(-2.5) (—0:5) (~1.6) (~1.2) 
+0.22 -+0.10 +0.18 +0.15 
n (4.3) (2.0) , 8.4) 2.7) , 
` —0.015 —0.007 -0.012 —0.010 
(-3) € (717 (-2.9) (-2.3) 
-- 0.0043 +0,0044 +0.0039 +0.0042 

G.4) G2) 3.1) G2) 
` *0.001 Dj 

(9.1). 

24 —0.620 —0.010 . —0.014 

A (-3.5) (71.6) (-2.1) 

—0.0018 -0:0015 ` E 

(74.1) (-3.2) a 
" l —0.0010 

(—2.0) 

1:570 


.596 254 c 616 


Taylor and Hudson 1972, p. 234). More recent 


3 $869. 
Table 1. Regressions* à Gini Income "Aii on Economic’ Development, Youth of Population, Indicators of 


data ‘for 1960 are reported by the World Bank' 


(1983). The 1960 World Bank data for LDCs differ 


substantially in many instances from those in the. 


U.S.A.I.D. publication. For the sample of nations 
with: income inequality data between 1965 and 
1975, the major discrepancies are as follows: 
South Africa, where the World Bank reports a 
literacy rate circa 1960 of 57 percent but 
U:S.A.LD. reported 35 percent; Taiwan, 54 
percent versus 75 percent; Zimbabwe, 39 percent 
versus 20 percent; Venezuela, 63 percent versus 80 
percent; Ivory 
Tunisia, 16 percent versus 30 percent; Trinidad 
and. Tobago, 93° ‘percent versus 80 percent; 
Zambia, 29 percent versus 41' percent; Malaysia, 
53 percent versus 43 percent; ‘Tanzania, 10 percent 
versus 18 -percent; Turkey, 38 percent versus. 46 
percent; Bolivia, 39 percent versus 32 percent; 
. Iran, 16 percent versus 23 percent; and Guatemala, 


Coast, 5 percent versus 20 percent; ` 


32 percent versus 36 percent. Since the World 


Bank data are' more recent, 
preferred to the U.S.A.I.D. data. . 
The 1960~70 average literacy rate computed from 
World Bank data is substituted for the 1960 World 
Handbook data in equation ‘1.5. 3° The coefficient 
for Years of Democracy R is reduced by only one- 


they should be 


third when World Bank Literacy is.controlled for, ' 


and its t ratio is indicative of statistical significance 
at the 95 percent confidence level for a two-tailed 
test. bine I TAS that the results of EA GM 1.5 are 


fe 
i 


< 


3 " data.are available only for 1960 or only for 


1970, then that score is used. | 
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Table 2. Regressions* of World Bank Top 20% Income Inequality on Economic Development, Youth of Population, 


Literacy, and Regime Characteristics 





Upper 20% Income 
Share, 1965-77 
(2.1a) (2.1b) 
intercept +53.72 + 35.49 
(0.9) (5.1) 
In Energy Consumption p.c., —4.12 
1965/1975 (—0.3) 
In Energy Consumption p.c.? +0.23 
1965/75 (0.2) 
% Population < 15 Years, +0.38 +0.46 
1965/1975 (1.7) (2.9) 
World Bank Literacy, +0.021 
1960-70/1975 . (.02) 
In Years of Democracy R —1.54 —1.42 
{~ 1.3) (—2.0) 
Communist Regime —9.26 — 8.99 
(—1.3) (—1.4) 
adj. R?= .470 523 
N=) (28) (28) 


* t ratio in parentheses. 


more reliable than those of 1.4, I conclude that 
experience of democracy, as measured by years of 
continuous popular elections, has a direct egalitar- 
ian influence on income distribution independent of 
socioeconomic determinants. 

In compiling data on income inequality my 
principle source was the World Bank. Data from 
three other reputable sources, the International 
Labour Organization, the Organization for Eco- 
nomic Development and Cooperation, and the 
. United Nations Economic Commission for Latin 
America, also were used because I wanted to 
maximize the number of cases and minimize the 
time-spread of the measurement interval. The 
World Bank has been reporting data on income 
distribution annually in its World Development 
Report since 1979, but the number of cases 
(independent states only) has varied from 28 (in 
both the 1979 and 1980 WDR) to 49 (in the 1985 
WDR) and the measurement dates have varied 
from’ 1965 to 1986. If one wants to test the 
robustness of results with data from editions of the 
World Development Report only, I would not use 
the 1987 WDR, as Weede did, because the 
time-spread of the measurement interval is the 
longest of all (17 years —1970—86) and the number 
of cases (45, excluding Hong Kong, a dependent 
. territory) is not maximized. 

The 1980 (and 1979) WDR reports data for 
1965—77, the interval closest to mine, but only 28 
'cases are available. The largest number of cases 
closest to the 1965—75 interval is contained in the 
1983 WDR, where-income distribution is reported 
for 39 independent states during 1967—79. The 
single largest number of cases is 49, contained in 


Upper 20% Income Upper 20% Income 
Share, 1967—79 Share, 1968—82 
(2.22) (2.2b) (2.3a) (2.3b) 
431.14 434.13 + 18.96 32.92 

(3.0) (6.9) (1.7) (6.7) 
—0.08 +1.50 — 
(0) (0.4) 
+0.05 —0.02 
(0.2) (-0.1) 
+0.52 +0.47 +0.63 +0.49 
(3.8) (4.2) (3.9) (4.3) 
—0.004 — 0.003 
(—0.1) (—0.1) 
—1.58 —1.40 -1.17 —1.03 
(—-2.6) (-2.9) (-2.1) (—2.1) 
— 9.63 —9.44 — 6.56 — 6.89 
(—1.9) (—1.9) (—1.6) (—1.7) 
.665 .689 .656 .657 
(38) (38) (49) (49) 


the 1985 WDR, where the measurement interval is 
1968-82. Using the income share of the upper 20 
percent as the measure of income inequality, I will 
conduct tests of the robustness of the experience of 
democracy effect, as measured by the Rustow 
index, across these three WDR data sets.4 For the 
1965-77 and 1967—79 data, the socioeconomic 
variables are the same as in equation 1.5. For the 
1968-82 data, values of energy consumption and 
population age structure are for 1975 (from the 
same sources as used by Weede), and literacy also 
is for circa 1975 (from World Bank 1980, 1981, 
and 1982). In addition to Years of Democracy R, I 
include another political variable, Communist 
Regime, scored one for countries under communist 
rule and zero otherwise. 

Equation 2.1a shows that for the 1965—77 data, 
with only 28 cases, none of the parameter 
estimates are sharp when all explanatory variables 
are entered. The ¢ ratios for the coefficients of 
literacy and of energy consumption per capita and 
its square are nearly zero, whereas those for the 
coefficients of the other variables are above 1.0. 
When the obviously irrelevant literacy and energy 
consumption variables are removed in equation 
2.1b, the ¢ ratios for the youth of population and 
democracy effects are substantially improved. 
Cross-national variation in 1965—77 World Bank 
top 20 percent income shares is explained 
principally by a positive effect of youth of 


^ For each of the data sets, Years of Democracy 
R is calculated to the date of the income 
distribution measurement. 
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population and a negative effect of years of popular 
elections. 

The irrelevance of both energy consumption per 
capita and literacy is again apparent in the data 
from the. 1983 WDR, where information on 
income distribution during 1967-79 is available for 
38 independent states. As equations 2.2a and 2.2b 
show, only age structure, experience of democ- 
racy, and Communist government have significant 
effects on 1967-79 World Bank top 20 percent 
income shares. 

The results obtained for the largest World Bank 

data set, reported in the 1985 WDR, are very 
similar to the results found for the smaller data 
sets. In equation 2.3a energy consumption per 
capita, youth of population, and literacy are all 
updated to 1975; nevertheless, the expected 
inverted U effect of energy consumption and the 
expected negative effect of literacy fail to materi- 
alize, as the ¢ ratios for the coefficients are 
approximately equal to zero. As in the smaller data 
sets, the two most significant determinants of 
World Bank top 20 percent income shares are 
youth of population and years of popular elections, 
while there is a marginal negative effect of 
Communist government. The irrelevance of energy 
consumption and literacy in the largest data set is 
underscored by the fact that their removal, in 
equation 3b, does not reduce adjusted R. 


EDWARD N. MULLER is Profesor of 
Political Science at the University of Arizona. 
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